Large scale SNP data management and integration in breeding programs

1 Large scale SNP data management and integration in breeding programs
Eildert Groeneveld, Institute of Farm Animal Genetics

2 TheSNPpit: Who is it for?
   Needs to manage many SNPs
   Uses standardized workflows
   Uses pipelines
   Does some gblup
Eildert Groeneveld, Interbull, Tallinn, August 26/27

3 TheSNPpit
[Diagram: genotyping labs -> import -> SNPpit database -> export -> data use/analysis.
Define subsets: Maf, Nocalls, Chrom, Ind, SNP.
Exports: -E gs_055, -E gs_022, -E gs_123, -E gs_086, -E gs_009, -E gs_031.
Uses: chicken maf .05 plink; pigs genable; beef GS run week12; dairy HF gblup; dairy chrom 12 GWAS; goat imp 54 -> 850K.]

4-6 Genotype sets
[Figure: genotype sets, with an SNP vector and a sample vector]

7 TheSNPpit command line
Map info for each chip (done once):
>snppit --import panel ctl_800k 800k.map
>snppit --import panel shp_57k shp_57k.map

8 TheSNPpit command line
Import SNP data:
>snppit --import data ctl_800k wk13.ped
>snppit --import data ctl_800k wk14.ped
>snppit --import data shp_57k wk
Export SNP data:
>snppit --export gs_012
>snppit --export gs_023
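Weekly imports of this kind lend themselves to a loop. The following is a minimal sketch, assuming ped files named like the wk13.ped and wk14.ped examples on the slide. The function name build_import_cmds is hypothetical and not part of TheSNPpit; it only prints commands in the `snppit --import data <panel> <file>` form shown on the slide, so they can be inspected before anything is run.

```shell
#!/bin/bash
# Hypothetical helper: print one 'snppit --import data' command per ped file.
# Only the command form shown on the slide is used; nothing is executed.
build_import_cmds() {
    local panel=$1
    shift
    local ped
    for ped in "$@"; do
        printf 'snppit --import data %s %s\n' "$panel" "$ped"
    done
}

build_import_cmds ctl_800k wk13.ped wk14.ped
```

Piping the printed lines to `bash` would execute them; printing first keeps the import step reviewable.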

9 Breeding prg: workflow comp
Phenotype database
SNPpit database

10 Breeding prg: workflow
Phenotype database:
   Extract pedigree
   Extract phenotype data
   Extract list of ID with SNP data
   Extract list of SNPname/panel
SNPpit database:
   Provide SNP data for ID/SNPs
Run gblup

11 Genotype sets - reminder

12-17 Breeding prg: workflow cc
Phenotype database
On your table: wk21_anis.txt, wk21_snps.txt, pheno.dat, pedi.dat
SNPpit database:
>snppit --create ind_sel -p ctl_800 -i wk21_anis.txt > is_123
>snppit --create snp_sel -p ctl_800 -i wk21_snps.txt > ss_243
>snppit --create genotype_set is_123 ss_243 > gs_434
>snppit --export gs_434 -o SNPs_wk
>gblup_prg pheno.dat pedi.dat SNPs_wk

18 Breeding prg: workflow script
Put the following in file run_newexport.sh:
#!/bin/bash
# $1 is the run prefix, e.g. wk21
# capture the names snppit prints for the new selections
IS=$(snppit -q -C individual_selection -p 57K -i "${1}_ANIs.txt")
SS=$(snppit -q -C snp_selection -p 57K -i "${1}_SNPs.txt")
# combine both selections into a genotype set
GS=$(snppit -q -C genotype_set --snp $SS --individual $IS)
echo ' new genotype set: '$GS
snppit -E genotype_set --name $GS -f o $
Run as:
>bash run_newexport.sh wk21
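Before a script like this calls snppit, it may be worth checking that both input lists exist. The following is a minimal sketch, assuming the `<prefix>_ANIs.txt` and `<prefix>_SNPs.txt` naming used in the script. check_inputs is a hypothetical helper, not part of TheSNPpit, and the demonstration runs in a scratch directory with made-up file contents.

```shell
#!/bin/bash
# Hypothetical pre-flight check for run_newexport.sh: verify that the
# animal and SNP list files for a given prefix exist and are non-empty.
check_inputs() {
    local prefix=$1
    local f
    for f in "${prefix}_ANIs.txt" "${prefix}_SNPs.txt"; do
        if [ ! -s "$f" ]; then
            echo "missing or empty input file: $f" >&2
            return 1
        fi
    done
    return 0
}

# Demonstration in a scratch directory so no real data is touched.
tmpdir=$(mktemp -d)
printf 'animal_1\n' > "$tmpdir/wk21_ANIs.txt"
printf 'snp_1\n' > "$tmpdir/wk21_SNPs.txt"
check_inputs "$tmpdir/wk21" && echo "inputs ok"
rm -r "$tmpdir"
```

In the script itself, a line such as `check_inputs "$1" || exit 1` placed before the first snppit call would stop the run early when a list is missing.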

19-24 Breeding prg: workflow timings
Assumptions:
   animals with SNPs > wk21_anis.txt
   SNP from 57K panel > wk21_snps.txt
SNPpit database:
>snppit --create ind_sel -p ctl_57 -i wk21_anis.txt   : 0.57 sec
>snppit --create snp_sel -p ctl_57 -i wk21_snps.txt   : 0.56 sec
>snppit --create genotype_set is_123 ss_243           : 0.28 sec
>snppit --export gs_434 -o SNPs_wk                    : 51.07 sec
>gblup_prg pheno.dat pedi.dat SNPs_wk                 : ????? sec
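Adding up the reported times shows where the time goes: the two selections and the genotype set creation take about 1.4 seconds together, while the export takes about 51 seconds. The following is a small sketch of that arithmetic; total_seconds is a hypothetical helper name, and the gblup step is left out because its time is not given on the slide.

```shell
#!/bin/bash
# Sum per-step timings given in seconds, one per argument.
total_seconds() {
    printf '%s\n' "$@" | awk '{ s += $1 } END { printf "%.2f\n", s }'
}

# ind_sel, snp_sel, genotype_set, export
total_seconds 0.57 0.56 0.28 51.07   # prints 52.48
```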

25-27 Scaling: Storage
2 bits/SNP = 4 SNP/byte
4096 K SNP (about 4.2 mio) / 1 MB
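The storage rule can be checked with integer arithmetic: at 4 SNPs per byte, 1 MB (2^20 bytes) holds 4 x 2^20 = 4,194,304 SNPs. The following is a minimal sketch. genotype_bytes is a hypothetical helper, the padding of each sample's SNP vector to a whole byte is an assumption, and 57,000 is used as a round figure for a 57K panel.

```shell
#!/bin/bash
# Bytes needed to store genotypes at 2 bits per SNP (4 SNPs per byte).
# Assumption: each sample's SNP vector is padded to a whole byte.
genotype_bytes() {
    local n_snps=$1 n_samples=$2
    echo $(( (n_snps + 3) / 4 * n_samples ))
}

genotype_bytes 57000 1       # one sample, 57,000 SNPs: 14250
genotype_bytes 4194304 1     # 4 * 2^20 SNPs fill 1 MB: 1048576
```

Multiplying by the number of samples gives the raw genotype payload only; any database overhead comes on top of that.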

28-30 Resources
SNPpit server: Linux
TheSNPpit software: Open Source
User Manual

32 Thank you for your attention


More information

PRSice: Polygenic Risk Score software v1.22

PRSice: Polygenic Risk Score software v1.22 PRSice: Polygenic Risk Score software v1.22 Jack Euesden jack.euesden@kcl.ac.uk Cathryn M. Lewis April 30, 2015 Paul F. O Reilly Contents 1 Overview 3 2 R packages required 3 3 Quickstart 3 3.1 Input Data...................................

More information

Interbull Centre Activity Report

Interbull Centre Activity Report Interbull Centre Activity Report August 2017 January 2018 1 INTERBULL CENTRE ACTIVITY REPORT AUGUST 2017 JANUARY 2018 1 INTERBULL CENTRE Department of Animal Breeding and Genetics Swedish University of

More information

Tags Approved for the Ovine Sector

Tags Approved for the Ovine Sector Tags Approved for the Ovine Sector Livestock Identification and Traceability Program VERSION 5a 2014/01/21 The tags approved under the national livestock identification and traceability program are described

More information

Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016

Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016 Package RVS0.0 Jiafen Gong, Zeynep Baskurt, Andriy Derkach, Angelina Pesevski and Lisa Strug October, 2016 The Robust Variance Score (RVS) test is designed for association analysis for next generation

More information

ENIGMA functional processing script

ENIGMA functional processing script ENIGMA functional processing script This repository contains processing scripts for internal usage in the ENGIMA functional project. Goal is the improved reproducibility of processing steps across different

More information

More Linux Shell Scripts

More Linux Shell Scripts nit 5 ore Linux hell cripts opics for this nit ore shell commands Control statements Constraints xpressions elect command Case command unning hell cripts shell script as a standalone is an executable program

More information

Ricopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

Ricopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 Ricopili: Introdution WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 What will we offer? Practical: Sorry, no practical sessions today, please refer to the summer school, organized

More information

Using the Website. Get Me Known (

Using the Website. Get Me Known ( Using the Website Get Me Known (http://www.getmeknown.co.za) Introduction to GetMeKnown.co.za Registering a Account Register a Microchip Perform a Transfer Request Aprrove Transfer Ordering Microchips

More information

WLBP FARM RECORDS GUIDANCE SHEEP & CATTLE RECORDS

WLBP FARM RECORDS GUIDANCE SHEEP & CATTLE RECORDS Version 17:10:2017 WLBP FARM RECORDS GUIDANCE SHEEP & CATTLE RECORDS For further help and support please contact: - Welsh Lamb & Beef Producers Ltd PO Box 8, Gorseland North Road, Aberystwyth Ceredigion,

More information

Co:Z Dataset Pipes: z/os Remote Services

Co:Z Dataset Pipes: z/os Remote Services An Introduction To Co:Z Dataset Pipes: z/os Remote Services Steve Goetze Kirk Wolf Lisa Goetze steve@dovetail.com kirk@dovetail.com lisa@dovetail.com Copyright 2017 Dovetailed Technologies, LLC 1 Co:Z

More information

CLC Server Command Line Tools USER MANUAL

CLC Server Command Line Tools USER MANUAL CLC Server Command Line Tools USER MANUAL Manual for CLC Server Command Line Tools 2.2 Windows, Mac OS X and Linux August 29, 2014 This software is for research purposes only. CLC bio, a QIAGEN Company

More information

2018 State Fair 4-H Livestock Entry

2018 State Fair 4-H Livestock Entry AGAIN FOR 2018: 2018 State Fair 4-H Livestock Entry FairEntry Help Sheet ENTRY DEADLINE: July 1 @ 11:59 PM!! 4-H Livestock entries will NOT go through your County Extension Office this year All 4-H Livestock

More information

The Analysis of RAD-tag Data for Association Studies

The Analysis of RAD-tag Data for Association Studies EDEN Exchange Participant Name: Layla Freeborn Host Lab: The Kronforst Lab, The University of Chicago Dates of visit: February 15, 2013 - April 15, 2013 Title of Protocol: Rationale and Background: to

More information

Falcon Accelerated Genomics Data Analysis Solutions. User Guide

Falcon Accelerated Genomics Data Analysis Solutions. User Guide Falcon Accelerated Genomics Data Analysis Solutions User Guide Falcon Computing Solutions, Inc. Version 1.0 3/30/2018 Table of Contents Introduction... 3 System Requirements and Installation... 4 Software

More information

Computer Systems and Architecture

Computer Systems and Architecture Computer Systems and Architecture UNIX Scripting Bart Meyers University of Antwerp August 29, 2012 Outline Basics Conditionals Loops Advanced Exercises Shell scripts Grouping commands into a single file

More information

called Hadoop Distribution file System (HDFS). HDFS is designed to run on clusters of commodity hardware and is capable of handling large files. A fil

called Hadoop Distribution file System (HDFS). HDFS is designed to run on clusters of commodity hardware and is capable of handling large files. A fil Parallel Genome-Wide Analysis With Central And Graphic Processing Units Muhamad Fitra Kacamarga mkacamarga@binus.edu James W. Baurley baurley@binus.edu Bens Pardamean bpardamean@binus.edu Abstract The

More information

Genomics. Nolan C. Kane

Genomics. Nolan C. Kane Genomics Nolan C. Kane Nolan.Kane@Colorado.edu Course info http://nkane.weebly.com/genomics.html Emails let me know if you are not getting them! Email me at nolan.kane@colorado.edu Office hours by appointment

More information

Analytical Processing of Data of statistical genetics research in UNIX like Systems

Analytical Processing of Data of statistical genetics research in UNIX like Systems Survival Skills for Analytical Processing of Data of statistical genetics research in UNIX like Systems robert yu :: March 2011 anote UNIX like? Traditional/classical UNIX, e.g. System V (Solaris), BSD

More information

Package lodgwas. R topics documented: November 30, Type Package

Package lodgwas. R topics documented: November 30, Type Package Type Package Package lodgwas November 30, 2015 Title Genome-Wide Association Analysis of a Biomarker Accounting for Limit of Detection Version 1.0-7 Date 2015-11-10 Author Ahmad Vaez, Ilja M. Nolte, Peter

More information

PhD: a web database application for phenotype data management

PhD: a web database application for phenotype data management Bioinformatics Advance Access published June 28, 2005 The Author (2005). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oupjournals.org PhD:

More information