Genome-Wide Association Study Using

1 Department of Epidemiology, UT MD Anderson Cancer Center, Houston, TX. April 2, 2008. Programmers Cross Training

2 Outline
  Three parts; part 3 is "Going object-oriented"

3 Outline, part 1
  Brief introduction to the software
  The GUI interface
  The scripting interface
  Why do we need to write a script?

4 What is it?
  A powerful and flexible system for population-based SNP analysis
  Supports case/control, quantitative trait loci (QTL) and categorical analysis
  Has a GUI and a scripting interface
  An expensive genetic analysis software we've already paid for

5 The GUI interface

6 The scripting interface

7-8 Advantages of using scripts, now
  1 Programming power
    Conditions, loops, command-line arguments, ...
    Multiple analyses in one script
    Different analyses selected by command-line arguments
    Running (multiple jobs) in batch mode
  2 Extensions
    Analyses outside the software: e.g. statistical analyses and graphics in R
    Control of output: automatic annotation, filters to output selected fields
    Additional functions: mergespreadsheets(), calcindgenotypesex()
    Code reuse: function library, added menus
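
The points about several analyses in one script and choosing them from the command line can be illustrated in plain Python. This is only a sketch under stated assumptions: it does not call the software's API, `case_control` and `qtl` are hypothetical stand-in functions, and the dispatch-table layout is one possible design rather than the presenter's script.

```python
import argparse

# Hypothetical stand-ins for real analyses; each returns a status string.
def case_control(dataset):
    return "case-control analysis of %s" % dataset

def qtl(dataset):
    return "QTL analysis of %s" % dataset

# Dispatch table: analysis name on the command line -> function to run.
ANALYSES = {"case_control": case_control, "qtl": qtl}

def run(argv):
    """Parse an argument list and run every requested analysis in order."""
    parser = argparse.ArgumentParser(description="Run one or more analyses.")
    parser.add_argument("dataset", help="name of the dataset to analyse")
    parser.add_argument("analyses", nargs="+", choices=sorted(ANALYSES),
                        help="analyses to run, in order")
    args = parser.parse_args(argv)
    return [ANALYSES[name](args.dataset) for name in args.analyses]

# Example: two analyses requested in a single invocation.
results = run(["Mock_geno", "case_control", "qtl"])
for line in results:
    print(line)
```

In a real script the list passed to `run` would normally be `sys.argv[1:]`, so the same file can serve several batch jobs with different arguments.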

9-18 But more importantly, for the future
  3 What if the results look suspicious?
    AFAIR, I selected that option.
    I do not know, Qing has left.
    I can have a look and re-run the script.
  4 We have some additional data!
    Again?
    I can (modify and) re-run the script.
  5 I've got a new project.
    OK, what is the first step?
    We have a script for a similar project, don't we?

19-22 GUI is still nice to have, of course
  1 Debug
  2 Quick experimental runs
  3 View results/datasets
  4 Quick plot

23 Outline, part 2
  What do we need to know before scripting?
  Ways of scripting in the software
  How to use the Python shell in the software
  How to find command references
  A tiny example

24-26 Things to know before scripting
  1 What does the boss want me to do???
  2 Python, Python, Python
  3 The integrated functional interface

27-31 Something about Python
  1 Python is a dynamic object-oriented programming language
  2 Python is easy to get started with (hours to a few days)
  3 Python helps you become a good programmer
  4 Python is becoming more and more popular (NASA, Cisco, Google, Golden Helix, ...)
  5 Resources: Dive into Python, Thinking in Python

32-34 Scripting in the software
  In the Python shell
  From the drop-down menu
  From the command line, with the -s option:
    -s /path/to/script.py param1 param2
  Note that the current installation requires the full path name to the script.
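
As a rough illustration of the command-line route, the sketch below shows how a script might pick up `param1` and `param2`. It assumes the host hands arguments to the script the way a plain Python interpreter does (script path first, then parameters), which the slides do not confirm; `parse_params` is a hypothetical helper.

```python
import os

def parse_params(argv):
    """Split an argv-style list into (script_path, parameters).

    Assumption: the script path comes first, followed by the parameters,
    as with a plain Python interpreter.
    """
    script, params = argv[0], list(argv[1:])
    if not os.path.isabs(script):
        # The slide notes that the installation needs the full path.
        raise ValueError("script path must be absolute: %r" % script)
    return script, params

script, params = parse_params(["/path/to/script.py", "param1", "param2"])
print(script, params)
```

Inside a real script the call would be `parse_params(sys.argv)` after `import sys`.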

35 Use the Python shell in the software
  Acquire the current object:
    obj = ghi.getcurrentobject()
  Get the available methods:
    dir(obj)
  Get help:
    help(obj.associationtests)
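
The same `dir()` and docstring lookup can be tried in any Python interpreter. The sketch below uses a hypothetical `Spreadsheet` class as a stand-in for the object that `ghi.getcurrentobject()` returns; its methods and docstrings are made up for illustration and are not the software's API.

```python
import inspect

class Spreadsheet(object):
    """Hypothetical stand-in for an object returned by the host application."""

    def associationtests(self, model):
        """Run association tests under the given genetic model."""
        return "tests run with model %s" % model

    def hweplot(self, column):
        """Plot Hardy-Weinberg equilibrium statistics for a column."""
        return "plot of column %d" % column

obj = Spreadsheet()

# dir() lists every attribute; drop the underscore-prefixed internals.
methods = [name for name in dir(obj) if not name.startswith("_")]
print(methods)

# help(obj.associationtests) prints documentation interactively;
# inspect.getdoc returns the docstring as a plain string instead.
doc_text = inspect.getdoc(obj.associationtests)
print(doc_text)
```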

36 Command references
  Offline: PDF manual under directory ...
  Online: getting help from .../Manual/manual.html

37 Open a project and run a study
  Code:
    ghi.openproject('/home/yxu/research/projects/mockdataset/Mock_example/Mock_example.ghp')
    obj = ghi.getobject('Mock_geno')[0]
    obj.hweplot(1)
    ghi.saveproject()
  Available at pcprhelix:yxu/research/projects/MockDataSet/Mock_tiny.py

38 Outline, part 3: Going object-oriented
  Shortcomings of using functions directly
  The GWAS class
  An example
  Your work
  Additional tools
  Summary

39-41 Shortcomings of using commands directly
  Inefficient in some situations (the whole analysis must be re-run after any redo or modification)
  Redundant interface parameters
  No programming pattern

42-43 What is it?
  1 What is a Python class?
    A little bit object-oriented
    Everything in Python is an object
    If you still don't know... (a function library)
  2 What is the GWAS class?
    A wrapper for the software's project/spreadsheet management
    A data-processing class that enforces a few strategies for handling large GWAS datasets

44 Design idea one
  Automatically load the existing project, spreadsheets, and results when their dependencies are unchanged
  A signal is used to force a rerun of some analyses
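
One way to picture this design idea is a result cache that is reused unless an input file changed or a rerun is forced. The sketch below is an assumption about how such a check could work (file modification times plus a `force` flag); `load_or_compute` is a hypothetical helper, not a method of the GWAS class.

```python
import os
import pickle
import tempfile

def load_or_compute(result_path, dependencies, compute, force=False):
    """Return a cached result unless it is stale or a rerun is forced.

    The cache is stale when any dependency file was modified after
    the cached result was written.
    """
    if not force and os.path.exists(result_path):
        cached_time = os.path.getmtime(result_path)
        if all(os.path.getmtime(dep) <= cached_time for dep in dependencies):
            with open(result_path, "rb") as handle:
                return pickle.load(handle), "loaded"
    result = compute()
    with open(result_path, "wb") as handle:
        pickle.dump(result, handle)
    return result, "computed"

# Demonstration with throwaway files.
workdir = tempfile.mkdtemp()
data_file = os.path.join(workdir, "genotype.txt")
cache_file = os.path.join(workdir, "counts.pkl")
with open(data_file, "w") as handle:
    handle.write("AA AB BB\n")

def count_genotypes():
    with open(data_file) as handle:
        return len(handle.read().split())

first = load_or_compute(cache_file, [data_file], count_genotypes)
second = load_or_compute(cache_file, [data_file], count_genotypes)
forced = load_or_compute(cache_file, [data_file], count_genotypes, force=True)
print(first, second, forced)
```

The `force` argument plays the role of the "signal" on the slide: it bypasses the cache even when nothing has changed.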

45-50 Design idea two
  Separate stable large data (genotype) from variable small data description (meta)
  A single genotype spreadsheet without demographic information (apply the genetic map if it exists)
  Two spreadsheets, DATA_NAME_ind_info and DATA_NAME_marker_info, that keep meta information for individuals and markers
  Meta information can be changed programmatically or with outside programs such as Excel
  The Info spreadsheets are read into Python so that they can be used or changed easily
  Samples and markers are chosen according to the Info spreadsheets; a new genotype spreadsheet with case-control information is created for each analysis
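
The idea of keeping small, editable meta tables apart from the genotype data can be sketched with the standard `csv` module. Everything here is illustrative: the table contents are made up, the 0.9 call-rate cutoff is an arbitrary example value, and `read_info` and `select` are hypothetical helpers rather than GWAS class methods. Only the column names and the coding of cases (2) and controls (1) follow the slides.

```python
import csv
import io

# Made-up contents of a DATA_NAME_ind_info table.
IND_INFO_CSV = """id,aff,individual call rate,exclude
ind1,2,0.99,0
ind2,1,0.97,0
ind3,2,0.80,0
ind4,1,0.98,1
"""

def read_info(text):
    """Read an info table into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def select(rows, aff, min_call_rate):
    """Return ids with the given affection status that pass the filters."""
    return [row["id"] for row in rows
            if int(row["aff"]) == aff
            and float(row["individual call rate"]) > min_call_rate
            and not int(row["exclude"])]

rows = read_info(IND_INFO_CSV)
rows[1]["exclude"] = "1"   # meta information can be edited in place
cases = select(rows, aff=2, min_call_rate=0.9)
controls = select(rows, aff=1, min_call_rate=0.9)
print(cases, controls)
```

Because selection reads only the small info table, changing who counts as a case or control never requires touching the large genotype spreadsheet.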

51 Data structure (Genotype)

52 Data structure (indinfo)

53 Data structure (markerinfo)

54 GWAS member functions
  setoption()
  loadgenotype() / applygeneticmap()
  loadindinfo() / loadmarkerinfo()
  calcindcallrate() / calcmarkercallrate() (more powerful than the menu option)
  calcindgenotypesex() (our own addition)
  calchwe() (copied from the software)
  extractgenotype()
  mergespreadsheets() (our own addition)
  openorcreatecasecontrolspreadsheet() / getorimportspreadsheet() (time-consuming steps...)
  associationstudy()
  saveresult() (choose selected columns, add annotation from external sources)

55 MockDataSet
  Code: case_control analysis
  (In the code below, "..." marks a threshold value or a line ending that is missing in the source.)

    def case_control(prj, excludesexmisspecified=False):
        '''case control association analysis'''
        indinfo = prj.indinfo
        markerinfo = prj.markerinfo
        inds = indinfo.labels()
        SNPs = markerinfo.labels()
        # cases have affection status 2, controls have status 1
        cases = [inds[x] for x in range(len(inds)) if indinfo['aff'][x] == 2 \
            and indinfo['individual call rate'][x] > ... and not indinfo['exclude'][x] \
            and (not excludesexmisspecified or indinfo['gen...
        controls = [inds[x] for x in range(len(inds)) if indinfo['aff'][x] == 1 \
            and indinfo['individual call rate'][x] > ... and not indinfo['exclude'][x] \
            and (not excludesexmisspecified or indinfo['gen...

56 MockDataSet (cont.)

        # keep markers that pass the HWE and call-rate filters
        markers = [SNPs[x] for x in range(len(SNPs)) if
            markerinfo['HWE p-value'][x] > ... and markerinfo['marker call rate'][x] > 0.9]
        if prj.verbose:
            print('With %d cases, %d controls, and %d markers' % \
                (len(cases), len(controls), len(markers)))
        data = prj.openorcreatecasecontrolspreadsheet('case-control', cases, controls, markers)
        if prj.verbose:
            print('Performing case-control association tests')
        return prj.associationstudy(data, 3, 0, bonferroni=1, fdr=1, genocounts=1,
            allelecounts=1, usepca=1, numcomponents=10...

57 MockDataSet (cont.)
  Code: define and run project
  (Lines ending in "..." are truncated in the source.)

    prj = GWAS(projectName='Mock_example',
        projectpath=os.path.join(phome, 'MockDataSet'),
        datapath=os.path.join(phome, 'MockDataSet', 'data'), ghi=ghi)
    # These two options are turned on by default
    prj.setoption(verbose=True, cautious=True)
    # load genotype data
    prj.extractgenotype('MockData', 'Mock_geno', filename='MockD...
    prj.loadgenotype(name='Mock_geno')
    prj.applygeneticmap('HelixResult.csv', 'C', 0, markerid=1, d...
    # indinfo, markerinfo = prepareindandmarkerinfo(prj)
    # data analysis function
    eval('%s(prj)' % projname)
    # save results
    maffilter = greaterthanfilter('Minor Allele Freq.', 0.05)
    chi2filter = lessthanfilter('Chi-Squared P', 0.001)
    resultname = '%s_result' % projname

58 MockDataSet (cont.)
  Code: save results
  (Part of the second and third file names is missing in the source.)

    # all results
    prj.saveresult(resultname, os.path.join(phome, 'MockData', '%s.csv' % resultname),
        columns=resultcolumns)
    # results passing the chi-squared filter
    prj.saveresult(resultname, os.path.join(phome, 'MockData', '%s-chi csv' % resultname),
        columns=resultcolumns, filter=chi2filter)
    # results passing both the MAF and the chi-squared filter
    prj.saveresult(resultname, os.path.join(phome, 'MockData', '%s-maf-0.05-chi csv' % resultname),
        columns=resultcolumns, filter=andfilter(maffilter, chi2filter))

  Available at pcprhelix:yxu/research/projects/mockdataset/
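
The filter objects passed to saveresult() are defined in gwasutil.py, which is not shown on the slides. As a minimal sketch of how composable filters of this kind might work, the functions below are hypothetical stand-ins built from plain closures, and the result rows are made up; only the column names and the two thresholds come from the slides.

```python
def greater_than_filter(column, threshold):
    """Keep rows whose value in `column` exceeds `threshold`."""
    return lambda row: row[column] > threshold

def less_than_filter(column, threshold):
    """Keep rows whose value in `column` is below `threshold`."""
    return lambda row: row[column] < threshold

def and_filter(*filters):
    """Keep rows accepted by every given filter."""
    return lambda row: all(f(row) for f in filters)

# Made-up result rows using the column names from the slides.
results = [
    {"Marker": "rs1", "Minor Allele Freq.": 0.20, "Chi-Squared P": 0.0004},
    {"Marker": "rs2", "Minor Allele Freq.": 0.01, "Chi-Squared P": 0.0002},
    {"Marker": "rs3", "Minor Allele Freq.": 0.30, "Chi-Squared P": 0.2000},
]

maf_filter = greater_than_filter("Minor Allele Freq.", 0.05)
chi2_filter = less_than_filter("Chi-Squared P", 0.001)
both = and_filter(maf_filter, chi2_filter)

chi2_only = [row["Marker"] for row in results if chi2_filter(row)]
selected = [row["Marker"] for row in results if both(row)]
print(chi2_only, selected)
```

Treating each filter as a function of one row keeps the combinators small: a combined filter is just another function with the same shape.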

59 What do I actually need to do?
  Create a project
  Prepare datasets: indinfo, markerinfo
  Select cases/controls/markers
  Call associationstudy()
  Save results

60 gwasutil.py
  Defines classes of utility functions
  Writes screen output to a log file
  Defines filters
  Prepares info files
  Provides data sources for saving results
  Plots figures using R
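
Copying screen output to a log file is commonly done with a small "tee" object that replaces `sys.stdout`. The sketch below shows that general technique; the `Tee` class is a hypothetical illustration and not the gwasutil.py implementation, and an in-memory `io.StringIO` stands in for the log file.

```python
import io
import sys

class Tee(object):
    """File-like object that copies every write to several streams."""

    def __init__(self, *streams):
        self.streams = streams

    def write(self, text):
        for stream in self.streams:
            stream.write(text)
        return len(text)

    def flush(self):
        for stream in self.streams:
            stream.flush()

# io.StringIO stands in for an open log file, so no file is left behind.
log = io.StringIO()
original_stdout = sys.stdout
sys.stdout = Tee(original_stdout, log)
try:
    print("With 2 cases, 2 controls, and 5 markers")
finally:
    sys.stdout = original_stdout   # always restore the real stdout

logged = log.getvalue()
```

With a real file, `log` would be `open("run.log", "w")` and should be closed once stdout has been restored.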

61 Summary
  The software is a good genetic analysis package
  Python is a great programming language
  Scripting is not difficult
  Scripting is very important and valuable

62 Thank you!

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017 BICF Nano Course: GWAS GWAS Workflow Development using PLINK Julia Kozlitina Julia.Kozlitina@UTSouthwestern.edu April 28, 2017 Getting started Open the Terminal (Search -> Applications -> Terminal), and

More information

Importing and Merging Data Tutorial

Importing and Merging Data Tutorial Importing and Merging Data Tutorial Release 1.0 Golden Helix, Inc. February 17, 2012 Contents 1. Overview 2 2. Import Pedigree Data 4 3. Import Phenotypic Data 6 4. Import Genetic Data 8 5. Import and

More information

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 CTL mapping in R Danny Arends, Pjotr Prins, and Ritsert C. Jansen University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 First written: Oct 2011 Last modified: Jan 2018 Abstract: Tutorial

More information

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li

KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual Miao-Xin Li, Jiang Li Department of Psychiatry Centre for Genomic Sciences Department

More information

called Hadoop Distribution file System (HDFS). HDFS is designed to run on clusters of commodity hardware and is capable of handling large files. A fil

called Hadoop Distribution file System (HDFS). HDFS is designed to run on clusters of commodity hardware and is capable of handling large files. A fil Parallel Genome-Wide Analysis With Central And Graphic Processing Units Muhamad Fitra Kacamarga mkacamarga@binus.edu James W. Baurley baurley@binus.edu Bens Pardamean bpardamean@binus.edu Abstract The

More information

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS FVGWAS- 3.0 Manual Hongtu Zhu @ UNC BIAS Chao Huang @ UNC BIAS Nov 8, 2015 More and more large- scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical

More information

6.034 Design Assignment 2

6.034 Design Assignment 2 6.034 Design Assignment 2 April 5, 2005 Weka Script Due: Friday April 8, in recitation Paper Due: Wednesday April 13, in class Oral reports: Friday April 15, by appointment The goal of this assignment

More information

USER S MANUAL FOR THE AMaCAID PROGRAM

USER S MANUAL FOR THE AMaCAID PROGRAM USER S MANUAL FOR THE AMaCAID PROGRAM TABLE OF CONTENTS Introduction How to download and install R Folder Data The three AMaCAID models - Model 1 - Model 2 - Model 3 - Processing times Changing directory

More information

Emile R. Chimusa Division of Human Genetics Department of Pathology University of Cape Town

Emile R. Chimusa Division of Human Genetics Department of Pathology University of Cape Town Advanced Genomic data manipulation and Quality Control with plink Emile R. Chimusa (emile.chimusa@uct.ac.za) Division of Human Genetics Department of Pathology University of Cape Town Outlines: 1.Introduction

More information

Step-by-Step Guide to Advanced Genetic Analysis

Step-by-Step Guide to Advanced Genetic Analysis Step-by-Step Guide to Advanced Genetic Analysis Page 1 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options

More information

Step-by-Step Guide to Basic Genetic Analysis

Step-by-Step Guide to Basic Genetic Analysis Step-by-Step Guide to Basic Genetic Analysis Page 1 Introduction This document shows you how to clean up your genetic data, assess its statistical properties and perform simple analyses such as case-control

More information

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc.

Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Overview This script converts allelic dosage values to genotypes based on user-specified thresholds. The dosage data may be in

More information

Package REGENT. R topics documented: August 19, 2015

Package REGENT. R topics documented: August 19, 2015 Package REGENT August 19, 2015 Title Risk Estimation for Genetic and Environmental Traits Version 1.0.6 Date 2015-08-18 Author Daniel J.M. Crouch, Graham H.M. Goddard & Cathryn M. Lewis Maintainer Daniel

More information

Step-by-Step Guide to Relatedness and Association Mapping Contents

Step-by-Step Guide to Relatedness and Association Mapping Contents Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...

More information

Genetic Analysis. Page 1

Genetic Analysis. Page 1 Genetic Analysis Page 1 Genetic Analysis Objectives: 1) Set up Case-Control Association analysis and the Basic Genetics Workflow 2) Use JMP tools to interact with and explore results 3) Learn advanced

More information

The fgwas software. Version 1.0. Pennsylvannia State University

The fgwas software. Version 1.0. Pennsylvannia State University The fgwas software Version 1.0 Zhong Wang 1 and Jiahan Li 2 1 Department of Public Health Science, 2 Department of Statistics, Pennsylvannia State University 1. Introduction Genome-wide association studies

More information

A whirlwind introduction to using R for your research

A whirlwind introduction to using R for your research A whirlwind introduction to using R for your research Jeremy Chacón 1 Outline 1. Why use R? 2. The R-Studio work environment 3. The mock experimental analysis: 1. Writing and running code 2. Getting data

More information

Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual

Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual Department of Epidemiology and Biostatistics Wolstein Research Building 2103 Cornell Rd Case Western

More information

Recalling Genotypes with BEAGLECALL Tutorial

Recalling Genotypes with BEAGLECALL Tutorial Recalling Genotypes with BEAGLECALL Tutorial Release 8.1.4 Golden Helix, Inc. June 24, 2014 Contents 1. Format and Confirm Data Quality 2 A. Exclude Non-Autosomal Markers......................................

More information

Release Notes. JMP Genomics. Version 4.0

Release Notes. JMP Genomics. Version 4.0 JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive

More information

Package GWAF. March 12, 2015

Package GWAF. March 12, 2015 Type Package Package GWAF March 12, 2015 Title Genome-Wide Association/Interaction Analysis and Rare Variant Analysis with Family Data Version 2.2 Date 2015-03-12 Author Ming-Huei Chen

More information

Polymorphism and Variant Analysis Lab

Polymorphism and Variant Analysis Lab Polymorphism and Variant Analysis Lab Arian Avalos PowerPoint by Casey Hanson Polymorphism and Variant Analysis Matt Hudson 2018 1 Exercise In this exercise, we will do the following:. 1. Gain familiarity

More information

Bioinformatics - Homework 1 Q&A style

Bioinformatics - Homework 1 Q&A style Bioinformatics - Homework 1 Q&A style Instructions: in this assignment you will test your understanding of basic GWAS concepts and GenABEL functions. The materials needed for the homework (two datasets

More information

PRSice: Polygenic Risk Score software - Vignette

PRSice: Polygenic Risk Score software - Vignette PRSice: Polygenic Risk Score software - Vignette Jack Euesden, Paul O Reilly March 22, 2016 1 The Polygenic Risk Score process PRSice ( precise ) implements a pipeline that has become standard in Polygenic

More information

Package SimGbyE. July 20, 2009

Package SimGbyE. July 20, 2009 Package SimGbyE July 20, 2009 Type Package Title Simulated case/control or survival data sets with genetic and environmental interactions. Author Melanie Wilson Maintainer Melanie

More information

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing

More information

The Imprinting Model

The Imprinting Model The Imprinting Model Version 1.0 Zhong Wang 1 and Chenguang Wang 2 1 Department of Public Health Science, Pennsylvania State University 2 Office of Surveillance and Biometrics, Center for Devices and Radiological

More information

Genomics. Nolan C. Kane

Genomics. Nolan C. Kane Genomics Nolan C. Kane Nolan.Kane@Colorado.edu Course info http://nkane.weebly.com/genomics.html Emails let me know if you are not getting them! Email me at nolan.kane@colorado.edu Office hours by appointment

More information

Using IDLE for

Using IDLE for Using IDLE for 15-110 Step 1: Installing Python Download and install Python using the Resources page of the 15-110 website. Be sure to install version 3.3.2 and the correct version depending on whether

More information

GMDR User Manual. GMDR software Beta 0.9. Updated March 2011

GMDR User Manual. GMDR software Beta 0.9. Updated March 2011 GMDR User Manual GMDR software Beta 0.9 Updated March 2011 1 As an open source project, the source code of GMDR is published and made available to the public, enabling anyone to copy, modify and redistribute

More information

The H3ABioNet GWAS Pipeline

The H3ABioNet GWAS Pipeline School of Electrical and UNIVERSITY OF THE WITWATERSRAND, JOHANNESBURG Information Engineering The H3ABioNet GWAS Pipeline https://github.com/h3abionet/h3agwas Scott Hazelhurst 1 Introduction Need to build

More information

Analytical Processing of Data of statistical genetics research in UNIX like Systems

Analytical Processing of Data of statistical genetics research in UNIX like Systems Survival Skills for Analytical Processing of Data of statistical genetics research in UNIX like Systems robert yu :: March 2011 anote UNIX like? Traditional/classical UNIX, e.g. System V (Solaris), BSD

More information

HybridCheck User Manual
