Genome-Wide Association Study Using
|
|
- Edwin Mills
- 6 years ago
- Views:
Transcription
1 has to Department of Epidemiology UT MD Anderson Cancer Center Houston, TX April 2, 2008 Programmers Cross Training
2 Outline has to 1 has 2 to 3 Going object-oriented:
3 Outline has Brief introduction to The GUI interface of The interface of 1 has Brief introduction to The GUI interface of The interface of Why do we need to write a script? Why do we need to write a script? to
4 What is? has Brief introduction to The GUI interface of The interface of Why do we need to write a script? A powerful and flexible system for population-based SNP analysis Supports case/control, quantitative trait loci (QTL) and categorical analysis Has and a interface An expensive genetic analysis software we ve already paid for to
5 GUI has Brief introduction to The GUI interface of The interface of Why do we need to write a script? to
6 interface has Brief introduction to The GUI interface of The interface of Why do we need to write a script? to
7 Advantages of using scripts, now has Brief introduction to 1 Power of Conditions, loops, command line arguments,... Multiple analyses in a script Different analyses using command line arguments Running (multiple jobs) in batch mode The GUI interface of The interface of Why do we need to write a script? to
8 Advantages of using scripts, now has Brief introduction to The GUI interface of The interface of Why do we need to write a script? to 1 Power of 2 Extentions Conditions, loops, command line arguments,... Multiple analyses in a script Different analyses using command line arguments Running (multiple jobs) in batch mode Non- analyses: e.g. statistical analyses and graphics in R Control of output: automatic annotation, use filters to output selected fields Additional functions: mergespreadsheets(), calcindgenotypesex() Code reuse: function library, add menus to
9 But more importantly, for future 3 What if results look suspicious? has Brief introduction to The GUI interface of The interface of Why do we need to write a script? to
10 But more importantly, for future 3 What if results look suspicious? AFAIR, I selected that option. has Brief introduction to The GUI interface of The interface of Why do we need to write a script? to
11 But more importantly, for future has Brief introduction to 3 What if results look suspicious? AFAIR, I selected that option. I do not know, Qing has left. The GUI interface of The interface of Why do we need to write a script? to
12 But more importantly, for future has Brief introduction to 3 What if results look suspicious? AFAIR, I selected that option. I do not know, Qing has left. I can have a look and re-run script. The GUI interface of The interface of Why do we need to write a script? to
13 But more importantly, for future has Brief introduction to 3 What if results look suspicious? AFAIR, I selected that option. I do not know, Qing has left. I can have a look and re-run script. 4 We have some additional data! The GUI interface of The interface of Why do we need to write a script? to
14 But more importantly, for future has Brief introduction to The GUI interface of 3 What if results look suspicious? AFAIR, I selected that option. I do not know, Qing has left. I can have a look and re-run script. 4 We have some additional data! Again? The interface of Why do we need to write a script? to
15 But more importantly, for future has Brief introduction to The GUI interface of The interface of 3 What if results look suspicious? AFAIR, I selected that option. I do not know, Qing has left. I can have a look and re-run script. 4 We have some additional data! Again? I can (modify and) re-run script. Why do we need to write a script? to
16 But more importantly, for future has Brief introduction to The GUI interface of The interface of Why do we need to write a script? 3 What if results look suspicious? AFAIR, I selected that option. I do not know, Qing has left. I can have a look and re-run script. 4 We have some additional data! Again? I can (modify and) re-run script. 5 I ve got a new project. to
17 But more importantly, for future has Brief introduction to The GUI interface of The interface of Why do we need to write a script? to 3 What if results look suspicious? AFAIR, I selected that option. I do not know, Qing has left. I can have a look and re-run script. 4 We have some additional data! Again? I can (modify and) re-run script. 5 I ve got a new project. OK, what is first step?
18 But more importantly, for future has Brief introduction to The GUI interface of The interface of Why do we need to write a script? to 3 What if results look suspicious? AFAIR, I selected that option. I do not know, Qing has left. I can have a look and re-run script. 4 We have some additional data! Again? I can (modify and) re-run script. 5 I ve got a new project. OK, what is first step? We have a script for a similar project, do we?
19 GUI is still nice to have, of course has 1 Debug Brief introduction to The GUI interface of The interface of Why do we need to write a script? to
20 GUI is still nice to have, of course has 1 Debug 2 Quick experimental runs Brief introduction to The GUI interface of The interface of Why do we need to write a script? to
21 GUI is still nice to have, of course has Brief introduction to 1 Debug 2 Quick experimental runs 3 View results/datasets The GUI interface of The interface of Why do we need to write a script? to
22 GUI is still nice to have, of course has Brief introduction to The GUI interface of The interface of 1 Debug 2 Quick experimental runs 3 View results/datasets 4 Quick plot Why do we need to write a script? to
23 Outline has to What do we need to know before 2 to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example Ways of in? How to use Python shell in? How to find command references? A tiny example
24 Things to know before has 1 What does boss want me to do??? to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example
25 Things to know before has 1 What does boss want me to do??? 2 Python Python Python to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example
26 Things to know before has to 1 What does boss want me to do??? 2 Python Python Python 3 integrated functional interface What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example
27 Something about Python 1 Python is a dynamic object-oriented programming language that supports has to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example
28 Something about Python has 1 Python is a dynamic object-oriented programming language that supports 2 Python is easy to get started (hours - a few days) to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example
29 Something about Python has 1 Python is a dynamic object-oriented programming language that supports 2 Python is easy to get started (hours - a few days) 3 Python makes you become a good programmer to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example
30 Something about Python has to What do we need to know before 1 Python is a dynamic object-oriented programming language that supports 2 Python is easy to get started (hours - a few days) 3 Python makes you become a good programmer 4 Python is becoming more and more popular (NASA, Cisco, Google, Golden Helix...) Ways of in? How to use Python shell in? How to find command references? A tiny example
31 Something about Python has to What do we need to know before Ways of in? How to use Python shell in? How to find command references? 1 Python is a dynamic object-oriented programming language that supports 2 Python is easy to get started (hours - a few days) 3 Python makes you become a good programmer 4 Python is becoming more and more popular (NASA, Cisco, Google, Golden Helix...) 5 Resources: Dive into Python Thinking in Python A tiny example
32 Scripting in has In Python shell to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example
33 Scripting in has In Python shell From drop-down menu to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example
34 Scripting in has to What do we need to know before Ways of in? In Python shell From drop-down menu From command line -s /path/to/script.py param1 param2 Note that current installation requires full path name to script. How to use Python shell in? How to find command references? A tiny example
35 Use Python shell in has to What do we need to know before The Python shell Acquire current object: obj = ghi.getcurrentobject() Get available methods: dir(obj) Get help: help(obj.associationtests) Ways of in? How to use Python shell in? How to find command references? A tiny example
36 command references has to What do we need to know before Ways of in? Offline Online pdf manual under directory Getting help from /Manual/manual.html How to use Python shell in? How to find command references? A tiny example
37 Open a project and run a study has to What do we need to know before Ways of in? How to use Python shell in? How to find command references? A tiny example Code ghi.openproject( /home/yxu/research/projects/mockdataset /Mock_example/Mock_example.ghp ) obj = ghi.getobject( Mock_geno )[0] obj.hweplot(1) ghi.saveproject() Available at pcprhelix:yxu/research/projects/ MockDataSet/Mock_tiny.py
38 Outline has to Shortcomings of using functions directly 3 Going object-oriented: Shortcomings of using functions directly to An example Your work Additional tools Summary to An example Your work Additional tools Summary
39 Shortcomings of using commands directly has Not effective in some situations (rerun whole analysis after redo/modification of analysis) to Shortcomings of using functions directly to An example Your work Additional tools Summary
40 Shortcomings of using commands directly has to Not effective in some situations (rerun whole analysis after redo/modification of analysis) Interface parameter s redundance Shortcomings of using functions directly to An example Your work Additional tools Summary
41 Shortcomings of using commands directly has to Not effective in some situations (rerun whole analysis after redo/modification of analysis) Interface parameter s redundance No programming pattern Shortcomings of using functions directly to An example Your work Additional tools Summary
42 What is? has 1 What is a Python class A little bit object-oriented Everything in Python is object If you still don t know... (a function library) to Shortcomings of using functions directly to An example Your work Additional tools Summary
43 What is? has to Shortcomings of using functions directly 1 What is a Python class A little bit object-oriented Everything in Python is object If you still don t know... (a function library) 2 What is A wrapper for s project/spreadsheet management A data-processing class that enforces a few strategies to handle large GWAS dataset to An example Your work Additional tools Summary
44 Design idea one has to Automatically load existing project, spreadsheets, results when ir dependencies are unchanged; A signal is used to force rerun of some analyses Shortcomings of using functions directly to An example Your work Additional tools Summary
45 Design idea two Seperate stable large data (genotype) from variable small data description (meta) has to Shortcomings of using functions directly to An example Your work Additional tools Summary
46 Design idea two Seperate stable large data (genotype) from variable small data description (meta) A single genotype spreadsheet without demographic information (apply genetic map if exists) has to Shortcomings of using functions directly to An example Your work Additional tools Summary
47 Design idea two has to Seperate stable large data (genotype) from variable small data description (meta) A single genotype spreadsheet without demographic information (apply genetic map if exists) Two spreadsheets DATA_NAME_ind_info and DATA_NAME_marker_info that keeps meta information for individuals and markers Shortcomings of using functions directly to An example Your work Additional tools Summary
48 Design idea two has to Shortcomings of using functions directly Seperate stable large data (genotype) from variable small data description (meta) A single genotype spreadsheet without demographic information (apply genetic map if exists) Two spreadsheets DATA_NAME_ind_info and DATA_NAME_marker_info that keeps meta information for individuals and markers Meta information can be changed programmatically or using outside programs such as excel to An example Your work Additional tools Summary
49 Design idea two has to Shortcomings of using functions directly Seperate stable large data (genotype) from variable small data description (meta) A single genotype spreadsheet without demographic information (apply genetic map if exists) Two spreadsheets DATA_NAME_ind_info and DATA_NAME_marker_info that keeps meta information for individuals and markers Meta information can be changed programmatically or using outside programs such as excel The Info spreadsheets will be read into Python so that y can be used or changed easily to An example Your work Additional tools Summary
50 Design idea two has to Shortcomings of using functions directly to An example Your work Additional tools Summary Seperate stable large data (genotype) from variable small data description (meta) A single genotype spreadsheet without demographic information (apply genetic map if exists) Two spreadsheets DATA_NAME_ind_info and DATA_NAME_marker_info that keeps meta information for individuals and markers Meta information can be changed programmatically or using outside programs such as excel The Info spreadsheets will be read into Python so that y can be used or changed easily Samples and markers are chosen according to Info spreadsheets, a new genotype spreadsheet, with case-control information is created for each analysis
51 Data structure (Genotype) has to Shortcomings of using functions directly to An example Your work Additional tools Summary
52 Data structure (indinfo) has to Shortcomings of using functions directly to An example Your work Additional tools Summary
53 Data structure (markerinfo) has to Shortcomings of using functions directly to An example Your work Additional tools Summary
54 GWAS member functions has to Shortcomings of using functions directly to An example Your work Additional tools Summary setoption() loadgenotype()/ applygeneticmap() loadindinfo()/ loadmarkerinfo() calcindcallrate()/calcmarkercallrate() (more powerful than menu option) calcindgenotypesex() (our own addition) calchwe() (copied from ) extractgenotype() mergespreadsheets() (our own addition) openorcreatecasecontrolspreadsheet() / getorimportspreadsheet() (time consuming steps...) associationstudy() saveresult() (choose selected columns, add annotation from external sources)
55 MockDataSet has to Shortcomings of using functions directly to An example Your work Additional tools Summary Code: case_control analysis def case_control(prj, excludesexmisspecified=false): case control association analysis indinfo = prj.indinfo markerinfo = prj.markerinfo inds = indinfo.labels() SNPs = markerinfo.labels() cases = [inds[x] for x in range(len(inds)) if indinfo[ aff ][x] == 2 \ and indinfo[ individual call rate ][x] > and not indinfo[ exclude ][x] \ and (not excludesexmisspecified or indinfo[ gen controls = [inds[x] for x in range(len(inds)) if indinfo[ aff ][x] == 1 \ and indinfo[ individual call rate ][x] > and not indinfo[ exclude ][x] \ and (not excludesexmisspecified or indinfo[ gen
56 MockDataSet (cont.) has to Shortcomings of using functions directly to An example Your work Additional tools Summary markers = [SNPs[x] for x in range(len(snps)) if markerinfo[ HWE p-value ][x] > and markerinfo[ marker call rate ][x] > 0.9] if prj.verbose: print With %d cases, %d controls, and %d markers % \ (len(cases), len(controls), len(markers)) data = prj.openorcreatecasecontrolspreadsheet( case-control, cases, controls, markers) if prj.verbose: print Performing case-control association tests return prj.associationstudy(data, 3, 0, bonferroni=1, fdr=1, genocounts=1, allelecounts=1, usepca=1, numcomponents=10
57 MockDataSet (cont.) has to Shortcomings of using functions directly to An example Your work Additional tools Summary Code: define and run project prj = GWAS(projectName= Mock_example, projectpath=os.path.join(phome, MockDataSet ), datapath=os.path.join(phome, MockDataSet, data ), ghi=ghi) # se two options are turned on by default prj.setoption(verbose=true, cautious=true) # load genotype data prj.extractgenotype( MockData, Mock_geno, filename= MockD prj.loadgenotype(name= Mock_geno ) prj.applygeneticmap( HelixResult.csv, C, 0, markerid=1, d # indinfo, markerinfo = prepareindandmarkerinfo(prj) # data analysis function eval( %s(prj) % projname) # save results maffilter = greaterthanfilter( Minor Allele Freq., 0.05) chi2filter = lessthanfilter( Chi-Squared P, 0.001) resultname = %s_result % projname
58 MockDataSet (cont.) has to Shortcomings of using functions directly to Code: save results prj.saveresult(resultname, os.path.join(phome, MockData, %s.csv % resultname), columns = resultcolumns) prj.saveresult(resultname, os.path.join(phome, MockData, %s-chi csv % resultname), columns=resultcolumns, filter=chi2filter) prj.saveresult(resultname, os.path.join(phome, MockData, %s-maf-0.05-chi csv % resultname), columns=resultcolumns, filter=andfilter(maffilter, chi2filter)) An example Your work Additional tools Summary Available at pcprhelix:yxu/research/projects/mockdataset/
59 What need I to do indeed? has to Shortcomings of using functions directly Create a project Prepare datasets: indinfo, markerinfo Select cases/controls/markers Call associationstudy() Save results to An example Your work Additional tools Summary
60 gwasutil.py has to Shortcomings of using functions directly Defined classes of utility functions Write to a log file of screen output Defined filters Prepare info files Data-sources for saving results Plot figures using R to An example Your work Additional tools Summary
61 Summary has to Shortcomings of using functions directly is a good genetic analysis software Python is a great programming language Scripting is not difficult Scripting is very important and valuable to An example Your work Additional tools Summary
62 Thank you! has to Thank you! Shortcomings of using functions directly to An example Your work Additional tools Summary
BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017
BICF Nano Course: GWAS GWAS Workflow Development using PLINK Julia Kozlitina Julia.Kozlitina@UTSouthwestern.edu April 28, 2017 Getting started Open the Terminal (Search -> Applications -> Terminal), and
More informationImporting and Merging Data Tutorial
Importing and Merging Data Tutorial Release 1.0 Golden Helix, Inc. February 17, 2012 Contents 1. Overview 2 2. Import Pedigree Data 4 3. Import Phenotypic Data 6 4. Import Genetic Data 8 5. Import and
More informationCTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1
CTL mapping in R Danny Arends, Pjotr Prins, and Ritsert C. Jansen University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 First written: Oct 2011 Last modified: Jan 2018 Abstract: Tutorial
More informationKGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual. Miao-Xin Li, Jiang Li
KGG: A systematic biological Knowledge-based mining system for Genomewide Genetic studies (Version 3.5) User Manual Miao-Xin Li, Jiang Li Department of Psychiatry Centre for Genomic Sciences Department
More informationcalled Hadoop Distribution file System (HDFS). HDFS is designed to run on clusters of commodity hardware and is capable of handling large files. A fil
Parallel Genome-Wide Analysis With Central And Graphic Processing Units Muhamad Fitra Kacamarga mkacamarga@binus.edu James W. Baurley baurley@binus.edu Bens Pardamean bpardamean@binus.edu Abstract The
More informationFVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS
FVGWAS- 3.0 Manual Hongtu Zhu @ UNC BIAS Chao Huang @ UNC BIAS Nov 8, 2015 More and more large- scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical
More information6.034 Design Assignment 2
6.034 Design Assignment 2 April 5, 2005 Weka Script Due: Friday April 8, in recitation Paper Due: Wednesday April 13, in class Oral reports: Friday April 15, by appointment The goal of this assignment
More informationUSER S MANUAL FOR THE AMaCAID PROGRAM
USER S MANUAL FOR THE AMaCAID PROGRAM TABLE OF CONTENTS Introduction How to download and install R Folder Data The three AMaCAID models - Model 1 - Model 2 - Model 3 - Processing times Changing directory
More informationEmile R. Chimusa Division of Human Genetics Department of Pathology University of Cape Town
Advanced Genomic data manipulation and Quality Control with plink Emile R. Chimusa (emile.chimusa@uct.ac.za) Division of Human Genetics Department of Pathology University of Cape Town Outlines: 1.Introduction
More informationStep-by-Step Guide to Advanced Genetic Analysis
Step-by-Step Guide to Advanced Genetic Analysis Page 1 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options
More informationStep-by-Step Guide to Basic Genetic Analysis
Step-by-Step Guide to Basic Genetic Analysis Page 1 Introduction This document shows you how to clean up your genetic data, assess its statistical properties and perform simple analyses such as case-control
More informationConvert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc.
Convert Dosages to Genotypes Author: Autumn Laughbaum, Golden Helix, Inc. Overview This script converts allelic dosage values to genotypes based on user-specified thresholds. The dosage data may be in
More informationPackage REGENT. R topics documented: August 19, 2015
Package REGENT August 19, 2015 Title Risk Estimation for Genetic and Environmental Traits Version 1.0.6 Date 2015-08-18 Author Daniel J.M. Crouch, Graham H.M. Goddard & Cathryn M. Lewis Maintainer Daniel
More informationStep-by-Step Guide to Relatedness and Association Mapping Contents
Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...
More informationGenetic Analysis. Page 1
Genetic Analysis Page 1 Genetic Analysis Objectives: 1) Set up Case-Control Association analysis and the Basic Genetics Workflow 2) Use JMP tools to interact with and explore results 3) Learn advanced
More informationThe fgwas software. Version 1.0. Pennsylvannia State University
The fgwas software Version 1.0 Zhong Wang 1 and Jiahan Li 2 1 Department of Public Health Science, 2 Department of Statistics, Pennsylvannia State University 1. Introduction Genome-wide association studies
More informationA whirlwind introduction to using R for your research
A whirlwind introduction to using R for your research Jeremy Chacón 1 Outline 1. Why use R? 2. The R-Studio work environment 3. The mock experimental analysis: 1. Writing and running code 2. Getting data
More informationStatistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual
Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.4 Graphical User Interface (GUI) Manual Department of Epidemiology and Biostatistics Wolstein Research Building 2103 Cornell Rd Case Western
More informationRecalling Genotypes with BEAGLECALL Tutorial
Recalling Genotypes with BEAGLECALL Tutorial Release 8.1.4 Golden Helix, Inc. June 24, 2014 Contents 1. Format and Confirm Data Quality 2 A. Exclude Non-Autosomal Markers......................................
More informationRelease Notes. JMP Genomics. Version 4.0
JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive
More informationPackage GWAF. March 12, 2015
Type Package Package GWAF March 12, 2015 Title Genome-Wide Association/Interaction Analysis and Rare Variant Analysis with Family Data Version 2.2 Date 2015-03-12 Author Ming-Huei Chen
More informationPolymorphism and Variant Analysis Lab
Polymorphism and Variant Analysis Lab Arian Avalos PowerPoint by Casey Hanson Polymorphism and Variant Analysis Matt Hudson 2018 1 Exercise In this exercise, we will do the following:. 1. Gain familiarity
More informationBioinformatics - Homework 1 Q&A style
Bioinformatics - Homework 1 Q&A style Instructions: in this assignment you will test your understanding of basic GWAS concepts and GenABEL functions. The materials needed for the homework (two datasets
More informationPRSice: Polygenic Risk Score software - Vignette
PRSice: Polygenic Risk Score software - Vignette Jack Euesden, Paul O Reilly March 22, 2016 1 The Polygenic Risk Score process PRSice ( precise ) implements a pipeline that has become standard in Polygenic
More informationPackage SimGbyE. July 20, 2009
Package SimGbyE July 20, 2009 Type Package Title Simulated case/control or survival data sets with genetic and environmental interactions. Author Melanie Wilson Maintainer Melanie
More informationSOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie
SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing
More informationThe Imprinting Model
The Imprinting Model Version 1.0 Zhong Wang 1 and Chenguang Wang 2 1 Department of Public Health Science, Pennsylvania State University 2 Office of Surveillance and Biometrics, Center for Devices and Radiological
More informationGenomics. Nolan C. Kane
Genomics Nolan C. Kane Nolan.Kane@Colorado.edu Course info http://nkane.weebly.com/genomics.html Emails let me know if you are not getting them! Email me at nolan.kane@colorado.edu Office hours by appointment
More informationUsing IDLE for
Using IDLE for 15-110 Step 1: Installing Python Download and install Python using the Resources page of the 15-110 website. Be sure to install version 3.3.2 and the correct version depending on whether
More informationGMDR User Manual. GMDR software Beta 0.9. Updated March 2011
GMDR User Manual GMDR software Beta 0.9 Updated March 2011 1 As an open source project, the source code of GMDR is published and made available to the public, enabling anyone to copy, modify and redistribute
More informationThe H3ABioNet GWAS Pipeline
School of Electrical and UNIVERSITY OF THE WITWATERSRAND, JOHANNESBURG Information Engineering The H3ABioNet GWAS Pipeline https://github.com/h3abionet/h3agwas Scott Hazelhurst 1 Introduction Need to build
More informationAnalytical Processing of Data of statistical genetics research in UNIX like Systems
Survival Skills for Analytical Processing of Data of statistical genetics research in UNIX like Systems robert yu :: March 2011 anote UNIX like? Traditional/classical UNIX, e.g. System V (Solaris), BSD
More informationHybridCheck User Manual
HybridCheck User Manual Ben J. Ward February 2015 HybridCheck is a software package to visualise the recombination signal in assembled next generation sequence data, and it can be used to detect recombination,
More informationDiskBoss DATA MANAGEMENT
DiskBoss DATA MANAGEMENT Disk Change Monitor Version 9.3 May 2018 www.diskboss.com info@flexense.com 1 1 Product Overview DiskBoss is an automated, policy-based data management solution allowing one to
More informationA short manual for LFMM (command-line version)
A short manual for LFMM (command-line version) Eric Frichot efrichot@gmail.com April 16, 2013 Please, print this reference manual only if it is necessary. This short manual aims to help users to run LFMM
More informationDiskBoss DATA MANAGEMENT
DiskBoss DATA MANAGEMENT File Integrity Monitor Version 9.3 May 2018 www.diskboss.com info@flexense.com 1 1 Product Overview DiskBoss is an automated, policy-based data management solution allowing one
More informationUAccess ANALYTICS Next Steps: Working with Bins, Groups, and Calculated Items: Combining Data Your Way
UAccess ANALYTICS Next Steps: Working with Bins, Groups, and Calculated Items: Arizona Board of Regents, 2014 THE UNIVERSITY OF ARIZONA created 02.07.2014 v.1.00 For information and permission to use our
More informationAutomatic Programming: How Far Can Machines Go? Hila Peleg Technion
Automatic Programming: How Far Can Machines Go? Hila Peleg Technion Laziness Laziness Impatience Laziness Impatience Hubris Automatic Programming Clean up my spreadsheet! Right away, boss! Program synthesis
More informationQuick Start Guide. CodeGenerator v1.5.0
Contents Revision History... 2 Summary... 3 How It Works... 4 Database Schema... 4 Customization... 4 APIs... 4 Annotations... 4 Attributes... 5 Transformation & Output... 5 Creating a Project... 6 General
More informationWeek 8 Lecture: Getting Things Done
Week 8 Lecture: Input and Output, and Moving to Getting Things Done Introduction to Programming for GIS & Remote Sensing GEO6938-4172 GEO4938-4166 4166 Where We Are To this point we ve covered: Basics
More informationDiskBoss DATA MANAGEMENT
DiskBoss DATA MANAGEMENT File Delete and Data Wiping Version 9.3 May 2018 www.diskboss.com info@flexense.com 1 1 Product Overview DiskBoss is an automated, policy-based data management solution allowing
More informationCreating a Box-and-Whisker Graph in Excel: Step One: Step Two:
Creating a Box-and-Whisker Graph in Excel: It s not as simple as selecting Box and Whisker from the Chart Wizard. But if you ve made a few graphs in Excel before, it s not that complicated to convince
More informationSpotter Documentation Version 0.5, Released 4/12/2010
Spotter Documentation Version 0.5, Released 4/12/2010 Purpose Spotter is a program for delineating an association signal from a genome wide association study using features such as recombination rates,
More informationPackage EMLRT. August 7, 2014
Package EMLRT August 7, 2014 Type Package Title Association Studies with Imputed SNPs Using Expectation-Maximization-Likelihood-Ratio Test LazyData yes Version 1.0 Date 2014-08-01 Author Maintainer
More informationDownload PLINK from
PLINK tutorial Amended from two tutorials that the PLINK author Shaun Purcell wrote, see http://pngu.mgh.harvard.edu/~purcell/plink/tutorial.shtml and 'Teaching materials and example dataset' at http://pngu.mgh.harvard.edu/~purcell/plink/res.shtml
More information1. Summary statistics test_gwas. This file contains a set of 50K random SNPs of the Subjective Well-being GWAS of the Netherlands Twin Register
Quality Control for Genome-Wide Association Studies Bart Baselmans & Meike Bartels Boulder 2017 Setting up files and directories To perform a quality control protocol in a Genome-Wide Association Meta
More informationSucuri Webinar Q&A HOW TO IDENTIFY AND FIX A HACKED WORDPRESS WEBSITE. Ben Martin - Remediation Team Lead
Sucuri Webinar Q&A HOW TO IDENTIFY AND FIX A HACKED WORDPRESS WEBSITE. Ben Martin - Remediation Team Lead 1 Question #1: What is the benefit to spammers for using someone elses UA code and is there a way
More informationhaplo.score Score Tests for Association of Traits with Haplotypes when Linkage Phase is Ambiguous
haploscore Score Tests for Association of Traits with Haplotypes when Linkage Phase is Ambiguous Charles M Rowland, David E Tines, and Daniel J Schaid Mayo Clinic Rochester, MN E-mail contact: rowland@mayoedu
More informationSISG/SISMID Module 3
SISG/SISMID Module 3 Introduction to R Ken Rice Tim Thornton University of Washington Seattle, July 2018 Introduction: Course Aims This is a first course in R. We aim to cover; Reading in, summarizing
More informationToCatchAThief c ryan campbell & jenn coughlan 7/23/2018
ToCatchAThief c ryan campbell & jenn coughlan 7/23/2018 Welcome to the To Catch a Thief: With Data! walkthrough! https://bioconductor.org/packages/devel/ bioc/vignettes/snprelate/inst/doc/snprelatetutorial.html
More informationSNP HiTLink Manual. Yoko Fukuda 1, Hiroki Adachi 2, Eiji Nakamura 2, and Shoji Tsuji 1
SNP HiTLink Manual Yoko Fukuda 1, Hiroki Adachi 2, Eiji Nakamura 2, and Shoji Tsuji 1 1 Department of Neurology, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan 2 Dynacom Co., Ltd, Kanagawa,
More informationInformation Technology
Information Technology Operations Team Instructional Sharing Protected Data Contents Introduction... 3 Procedure... 3 How to Password Protect a MS Word Document... 3 MS Word 2010/2013/2016... 3 Word 2007...
More informationCompClustTk Manual & Tutorial
CompClustTk Manual & Tutorial Brandon King Copyright c California Institute of Technology Version 0.1.10 May 13, 2004 Contents 1 Introduction 1 1.1 Purpose.............................................
More informationDepending on the computer you find yourself in front of, here s what you ll need to do to open SPSS.
1 SPSS 11.5 for Windows Introductory Assignment Material covered: Opening an existing SPSS data file, creating new data files, generating frequency distributions and descriptive statistics, obtaining printouts
More informationSCRATCH MODULE 3: NUMBER CONVERSIONS
SCRATCH MODULE 3: NUMBER CONVERSIONS INTRODUCTION The purpose of this module is to experiment with user interactions, error checking input, and number conversion algorithms in Scratch. We will be exploring
More informationPart 6b: The effect of scale on raster calculations mean local relief and slope
Part 6b: The effect of scale on raster calculations mean local relief and slope Due: Be done with this section by class on Monday 10 Oct. Tasks: Calculate slope for three rasters and produce a decent looking
More informationR in Linguistic Analysis. Week 2 Wassink Autumn 2012
R in Linguistic Analysis Week 2 Wassink Autumn 2012 Today R fundamentals The anatomy of an R help file but first... How did you go about learning the R functions in the reading? More help learning functions
More informationsnpqc an R pipeline for quality control of Illumina SNP data
snpqc an R pipeline for quality control of Illumina SNP data 1. In a nutshell snpqc is a series of R scripts to perform quality control analysis on Illumina SNP data. The objective of the program is to
More informationMAGA: Meta-Analysis of Gene-level Associations
MAGA: Meta-Analysis of Gene-level Associations SYNOPSIS MAGA [--sfile] [--chr] OPTIONS Option Default Description --sfile specification.txt Select a specification file --chr Select a chromosome DESCRIPTION
More informationTechniques for Optimizing Reusable Content in LibGuides
University of Louisville From the SelectedWorks of Terri Holtze April 21, 2017 Techniques for Optimizing Reusable Content in LibGuides Terri Holtze, University of Louisville Available at: https://works.bepress.com/terri-holtze/4/
More informationDigital Marketing & Sales Training. Part 1: SEO, Local, & AdWords Express Leadgenix & AG 431
Digital Marketing & Sales Training Part 1: SEO, Local, & AdWords Express Leadgenix & AG 431 Introductions Andy Selcho AG Location Owner Dan Posner Partner Relationships Jamie Bates Director of Operations
More informationOur legacy archival system resides in an Access Database lovingly named The Beast. Having the data in a database provides the opportunity and ability
Our legacy archival system resides in an Access Database lovingly named The Beast. Having the data in a database provides the opportunity and ability to maintain a semblance of structure in the data However,
More informationPLATO User Guide. Current version: PLATO 2.1. Last modified: September Ritchie Lab, Geisinger Health System
PLATO User Guide Current version: PLATO 2.1 Last modified: September 2017 Ritchie Lab, Geisinger Health System Email: software@ritchielab.psu.edu 1 Table of Contents Overview... 3 PLATO Quick Reference...
More informationRicopili: Introdution. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015
Ricopili: Introdution WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 What will we offer? Practical: Sorry, no practical sessions today, please refer to the summer school, organized
More informationProperties of Data. Digging into Data: Jordan Boyd-Graber. University of Maryland. February 11, 2013
Properties of Data Digging into Data: Jordan Boyd-Graber University of Maryland February 11, 2013 Digging into Data: Jordan Boyd-Graber (UMD) Properties of Data February 11, 2013 1 / 43 Roadmap Munging
More informationGraphics Performance Benchmarking Framework ATI. Presented to: Jerry Howard. By: Drew Roberts, Nicholas Tower, Jason Underhill
Graphics Performance Benchmarking Framework ATI Presented to: Jerry Howard By: Drew Roberts, Nicholas Tower, Jason Underhill Executive Summary The goal of this project was to create a graphical benchmarking
More informationSystems Software. Recitation 1: Intro & Revision Control. Quite different from 213. Our Philosophy. Partly-free lunch
Systems Software Recitation 1: Intro & Revision Control Dave Andersen CMU Computer Science Fall 2006 Low-level (projects in C) Designed to run forever Handle every possible error condition Manage resources
More informationMaximizing Public Data Sources for Sequencing and GWAS
Maximizing Public Data Sources for Sequencing and GWAS February 4, 2014 G Bryce Christensen Director of Services Questions during the presentation Use the Questions pane in your GoToWebinar window Agenda
More informationThe first thing we ll need is some numbers. I m going to use the set of times and drug concentration levels in a patient s bloodstream given below.
Graphing in Excel featuring Excel 2007 1 A spreadsheet can be a powerful tool for analyzing and graphing data, but it works completely differently from the graphing calculator that you re used to. If you
More informationSmall example of use of OmicABEL
Small example of use of OmicABEL Yurii Aulchenko for the OmicABEL developers July 1, 2013 Contents 1 Important note on data format for OmicABEL 1 2 Outline of the example 2 3 Prepare the data for analysis
More informationEffective Recombination in Plant Breeding and Linkage Mapping Populations: Testing Models and Mating Schemes
Effective Recombination in Plant Breeding and Linkage Mapping Populations: Testing Models and Mating Schemes Raven et al., 1999 Seth C. Murray Assistant Professor of Quantitative Genetics and Maize Breeding
More informationData to App: Web,Tablet and Smart Phone Duane Griffith, Montana State University,
Data to App: Web,Tablet and Smart Phone Duane Griffith, Montana State University, griffith@montana.edu This tutorial is broken into several steps related mostly to the technology applied at each step.
More informationCreating and Using Genome Assemblies Tutorial
Creating and Using Genome Assemblies Tutorial Release 8.1 Golden Helix, Inc. March 18, 2014 Contents 1. Create a Genome Assembly for Danio rerio 2 2. Building Annotation Sources 5 A. Creating a Reference
More informationBioBin User Guide Current version: BioBin 2.3
BioBin User Guide Current version: BioBin 2.3 Last modified: April 2017 Ritchie Lab Geisinger Health System URL: http://www.ritchielab.com/software/biobin-download Email: software@ritchielab.psu.edu 1
More informationTips and Guidance for Analyzing Data. Executive Summary
Tips and Guidance for Analyzing Data Executive Summary This document has information and suggestions about three things: 1) how to quickly do a preliminary analysis of time-series data; 2) key things to
More informationHow to Set up a Budget Advanced Excel Part B
How to Set up a Budget Advanced Excel Part B A budget is probably the most important spreadsheet you can create. A good budget will keep you focused on your ultimate financial goal and help you avoid spending
More informationQTX. Tutorial for. by Kim M.Chmielewicz Kenneth F. Manly. Software for genetic mapping of Mendelian markers and quantitative trait loci.
Tutorial for QTX by Kim M.Chmielewicz Kenneth F. Manly Software for genetic mapping of Mendelian markers and quantitative trait loci. Available in versions for Mac OS and Microsoft Windows. revised for
More informationLastly, in case you don t already know this, and don t have Excel on your computers, you can get it for free through IT s website under software.
Welcome to Basic Excel, presented by STEM Gateway as part of the Essential Academic Skills Enhancement, or EASE, workshop series. Before we begin, I want to make sure we are clear that this is by no means
More informationSPSS TRAINING SPSS VIEWS
SPSS TRAINING SPSS VIEWS Dataset Data file Data View o Full data set, structured same as excel (variable = column name, row = record) Variable View o Provides details for each variable (column in Data
More informationIN-CLASS EXERCISE: INTRODUCTION TO R
NAVAL POSTGRADUATE SCHOOL IN-CLASS EXERCISE: INTRODUCTION TO R Survey Research Methods Short Course Marine Corps Combat Development Command Quantico, Virginia May 2013 In-class Exercise: Introduction to
More informationNotes on QTL Cartographer
Notes on QTL Cartographer Introduction QTL Cartographer is a suite of programs for mapping quantitative trait loci (QTLs) onto a genetic linkage map. The programs use linear regression, interval mapping
More informationfasta2genotype.py Version 1.10 Written for Python Available on request from the author 2017 Paul Maier
1 fasta2genotype.py Version 1.10 Written for Python 2.7.10 Available on request from the author 2017 Paul Maier This program takes a fasta file listing all sequence haplotypes of all individuals at all
More information0 Graphical Analysis Use of Excel
Lab 0 Graphical Analysis Use of Excel What You Need To Know: This lab is to familiarize you with the graphing ability of excels. You will be plotting data set, curve fitting and using error bars on the
More informationPRSice: Polygenic Risk Score software v1.22
PRSice: Polygenic Risk Score software v1.22 Jack Euesden jack.euesden@kcl.ac.uk Cathryn M. Lewis April 30, 2015 Paul F. O Reilly Contents 1 Overview 3 2 R packages required 3 3 Quickstart 3 3.1 Input Data...................................
More informationGWAS Exercises 3 - GWAS with a Quantiative Trait
GWAS Exercises 3 - GWAS with a Quantiative Trait Peter Castaldi January 28, 2013 PLINK can also test for genetic associations with a quantitative trait (i.e. a continuous variable). In this exercise, we
More informationBreeding Guide. Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel
Breeding Guide Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel www.phenome-netwoks.com Contents PHENOME ONE - INTRODUCTION... 3 THE PHENOME ONE LAYOUT... 4 THE JOBS ICON...
More informationIntroduction to Hail. Cotton Seed, Technical Lead Tim Poterba, Software Engineer Hail Team, Neale Lab Broad Institute and MGH
Introduction to Hail Cotton Seed, Technical Lead Tim Poterba, Software Engineer Hail Team, Neale Lab Broad Institute and MGH Why Hail? Genetic data is becoming absolutely massive Broad Genomics, by the
More informationSECTION 1: INTRODUCTION. ENGR 112 Introduction to Engineering Computing
SECTION 1: INTRODUCTION ENGR 112 Introduction to Engineering Computing 2 Course Overview What is Programming? 3 Programming The implementation of algorithms in a particular computer programming language
More informationTutorial on gene-c ancestry es-ma-on: How to use LASER. Chaolong Wang Sequence Analysis Workshop June University of Michigan
Tutorial on gene-c ancestry es-ma-on: How to use LASER Chaolong Wang Sequence Analysis Workshop June 2014 @ University of Michigan LASER: Loca-ng Ancestry from SEquence Reads Main func:ons of the so
More informationVariant calling using SAMtools
Variant calling using SAMtools Calling variants - a trivial use of an Interactive Session We are going to conduct the variant calling exercises in an interactive idev session just so you can get a feel
More informationRelease Notes and Installation Guide (Unix Version)
Release Notes and Installation Guide (Unix Version) Release 3.1 December 1997 - ii - Release Notes and Installation Guide Table of Contents 1 Changes Since Last Release... 1 1.1 Changes Since Release 2.2...
More informationMascot Insight is a new application designed to help you to organise and manage your Mascot search and quantitation results. Mascot Insight provides
1 Mascot Insight is a new application designed to help you to organise and manage your Mascot search and quantitation results. Mascot Insight provides ways to flexibly merge your Mascot search and quantitation
More informationdatabuild Documentation
databuild Documentation Release 0.0.10 Flavio Curella May 15, 2015 Contents 1 Contents 3 1.1 Installation................................................ 3 1.2 Quickstart................................................
More informationIntroduction to Python Part 2
Introduction to Python Part 2 v0.2 Brian Gregor Research Computing Services Information Services & Technology Tutorial Outline Part 2 Functions Tuples and dictionaries Modules numpy and matplotlib modules
More informationHVAC Designer Application Features
HVAC Designer Application Features My name is Adam and in just under six months I have created a project management app using the Bubble platform. I have zero programming training or experience and want
More informationQTL Analysis with QGene Tutorial
QTL Analysis with QGene Tutorial Phillip McClean 1. Getting the software. The first step is to download and install the QGene software. It can be obtained from the following WWW site: http://qgene.org
More informationJatinSir - Mastering Python
JatinSir - Mastering Python Best Python Training with Real-time Project Duration of the Training: 42-48 hours Who can learn Python? In short anyone. Automation Engineers Data analysts and scientist Quality
More informationSnakemake overview. Thomas Cokelaer. Nov 9th 2017 Snakemake and Sequana overview. Institut Pasteur
Snakemake overview Thomas Cokelaer Institut Pasteur Nov 9th 2017 Snakemake and Sequana overview Many bioinformatic pipeline frameworks available A review of bioinformatic pipeline frameworks. Jeremy Leipzig
More informationPractical Unix exercise MBV INFX410
Practical Unix exercise MBV INFX410 We will in this exercise work with a practical task that, it turns out, can easily be solved by using basic Unix. Let us pretend that an engineer in your group has spent
More informationELAI user manual. Yongtao Guan Baylor College of Medicine. Version June Copyright 2. 3 A simple example 2
ELAI user manual Yongtao Guan Baylor College of Medicine Version 1.0 25 June 2015 Contents 1 Copyright 2 2 What ELAI Can Do 2 3 A simple example 2 4 Input file formats 3 4.1 Genotype file format....................................
More information