DeltaGen: Quick start manual

Size: px
Start display at page:

Download "DeltaGen: Quick start manual"

Transcription

1 1 DeltaGen: Quick start manual Dr. Zulfi Jahufer & Dr. Dongwen Luo CONTENTS Page Main operations tab commands 2 Uploading a data file 3 Matching variable identifiers 4 Data check 5 Univariate analysis 7 Pattern analysis (within univariate model option) 11 Univariate analysis Two trait combination 14 Guide to calculation of generating a GEBV Sample cost 16 Estimation of genetic gain (G) and its simulation 17 Multivariate analysis 19 Using Plot 20 o Biplots (on raw data) 20 o Matrix plots (phenotypic correlation- on raw data) 21 MANOVA (additive variance/co-variance & correlation) 22 Smith-Hazel selection index 24 Pattern analysis for multiple traits 25 Trial design instructions (to be completed) 29 Save session and Quit 30

2 2 Main operations tab commands Main operations tab commands Clicking on any of the commands above will open dropdown menus. Introduction This window is opened when DeltaGen is started. Trial Design This command will open a screen that will provide a range of experimental designs that DeltaGen can generate. A full description of this command will be provided under experimental design, the last section in this manual. Data Input: uploading a data file Clicking on this command will open a dropdown menu with: Input: Clicking on this command will open the Data Information screen shown on the next page.

3 Uploading a data file 3 Shows data files opened Upload enables data files to be uploaded using Browse; Examples* enables practice data sets within DeltaGen to be uploaded; Clipboard enables copied data to be uploaded. * The practice data sets will also provide information on format of data matrices. STEP 2 Click to upload external data files Data file types accepted by DeltaGen! CSV files are preferred save EXCEL data files in csv format for uploading into DeltaGen STEP 1! Missing values in the data matrix should be identified before uploading any data set. The dropdown menu provides 3 options: Empty or * or! Data file names cannot have PPl gaps between words. Follow STEPS 1 & 2 to upload a data set from an external file. Files can be saved in RData or CSV formats

4 Matching variable identifiers 4 After Uploading a data set following steps 1 & 2, the column identifiers of variables in the data matrix have to be matched with those already defined as (Year, Season, Location, Replicates, Row, Column, Sample and Line) in DeltaGen. as shown in STEP 3 Traits STEP 3 - To match a DeltaGen variable with an associated column identifier in the data matrix, click on the relevant dropdown menu and choose the matching column name; e.g. by clicking on the dropdown menu for Location, column Site was selected. Similarly, Rep for Replicates. If a variable is not in the uploaded data, this is left as Null. STEP 4 - Clicking on Run will submit the data for analysis. You are now ready to check or analyse your data.

5 Data check: Graphical or tabular summary of raw data is an optional data quality check before univariate or multivariate analysis. 5 The plot-type dropdown menu provides a range of plots; Histogram, Density, Scatter, Line, Box-plot, to illustrate the data. First click on the X-variable, in this case DM (dry matter). You can arrange the plots by defining the row and column layout; in the example presented using Histograms, the Locations are presented down rows and the dry matter in each of the 3 replicates within each location as columns.

6 6 Data check continued. The heat-map option from the dropdown menu under Pivot Table, illustrates the actual values and spatial distribution of summer dry matter raw data across a field experiment based on a row-column experimental design consisting of 3 replicates. This can also be used to identify data entry errors. Clicking on these headings will show the associated data below. High value Missing data Low value All factors, e.g. Replicates, Column, Row, can be moved across; point on a factor, left click on the mouse, hold down and move. This will result in changing the configuration of the table.

7 7 Univariate analysis: Click on the Models command and select Univariate. The screen shown will open. The Data Information panel provides a summary of the contents of the uploaded data set ready for analysis. The demonstration/practice data set used, consists of 144 lines (half-sib families) of switchgrass (Panicum virgatum L.) evaluated across 2 locations over 2 years using randomized complete block designs with 3 replicates. Data on 4 traits; dry matter yield, Klason lignin (KL), ethanol (ETOH) and high heating value (HHV) are included. Data file name: Case study 3 under Examples. The default settings for the linear mixed effects model are; modelling and half sib family. If simulation of genetic gain is to be conducted, the choice of half sib or full sib family is important. Alternately if the analysis is not for estimation of genetic components of variance or is based on a fixed effects model, continue using the the default option. Simulation must be selected only after conducting the variance component analysis. On opening the univariate analysis screen, the Primary Trait box will be at Null. Clicking on this box will open the dropdown menu with all the traits in the uploaded data matrix; in the example; Yield, KL, ETOH, HHV. Click Run to begin analysis Fixed terms: clicking in the fixed terms box will open a dropdown menu that will enable you to select the appropriate factors in the data set (years, locations or lines in the example). Select Null if no fixed terms are to be included in the model. Random terms: Select the appropriate factors from dropdown menu which opens in this box. Contains theory & references Traits for BLUP or BLUE estimates can be selected from the associated dropdown menus.

8 8 Univariate analysis (continued): The linear model - One trait The demonstration/practice data set used, consists of 125 lines (half-sib families) of perennial ryegrass (Lolium perenne L.) evaluated across 2 locations over 4 years and seasons; spring, summer, autumn and winter, of each year. Experimental design: row column with 4 replicates. Data file name: MultiYearSea-Growth under Examples. Select Primary trait to be analysed, (From our example data set Spring growth has been selected), Select Fixed terms and their interactions, (if an additional term that does not appear in the dropdown menu needs to be added, double click in the fixed terms box, enter the term and click on Add that appears) Select Random terms and interactions, (if an additional term that does not appear in the dropdown menu needs to be added, double click in the random terms box, enter the term and click on Add that appears) Select Heritability if appropriate, Select BLUP if lines are random or BLUE if fixed, Click Run to begin analysis.

9 9 Mixed model analysis Output of spring growth of the 125 half sib families across years and locations. As the lines were considered as random effects in the linear model and the BLUP estimate option was selected, Clicking on the BLUP button has provided this analysis output, BLUP values for spring growth. Results for Fixed effects Half sib families (HS) Variance component Associated ± standard error Heritability was estimated as this option was selected.

10 10 Mixed model analysis of growth of the 125 half sib families across 4 years, 4 seasons and 2 locations. Results for Fixed effects The demonstration/practice data set, Data file name: AcrossYearSeaSite-Growth (1) under Examples. Half sib families (HS)

11 11 Pattern analysis (within Univariate model option) Multi-location (more than 2 locations) The demonstration/practice data set, Data file name: AcrossYearSeaSite-Growth (2) under Examples. The first step in Pattern analysis is to select Line-by-location interaction. BLUP estimates for each individual line within each location (in the example Rua, PN and KERI) will be generated. Click on pattern analysis Click Run.

12 12 Location groups: 1, 2 & 3. Clicking on Pattern Analysis Cluster will result in generating Line groups based on performance across sites and also associated dendrograms of Locations and Lines. Dendrograms of Location and Line grouping Locations: KERI, Rua, PN

13 13 Clicking on Pattern Analysis PCA (Principle Component Analysis) will result in generating a biplot based on PC1 and PC2, the line groups and individual line labels. The directional vectors are the locations. Directional vectors Line clusters

14 14 Univariate analysis (continued): The linear model - Two trait combination ETOH Analysis with the secondary trait included provides an opportunity to simultaneously estimate narrow sense heritability for each trait, and their genetic correlation. These outputs are then automatically integrated into the simulation models for estimation of Correlated Response to Selection of the primary trait based on secondary trait selection. The demonstration/practice data set used, consists of 144 lines (half-sib families) of switchgrass (Panicum virgatum L.) evaluated across 2 locations over 2 years using randomized complete block designs with 3 replicates. Data on 4 traits; dry matter yield, Klason lignin (KL), ethanol (ETOH) and high heating value (HHV) are included. Data file name: CaseStudy3 under Examples. All the initial steps with regards the fixed and random term models for the primary trait are similar to the single trait analysis. The Secondary trait: KL Tick the box for secondary trait and select the trait from the dropdown menu, (From our example data set, trait KL was selected ) Click the MANOVA box and select the terms in the dropdown menu to conduct a variance/covariance analysis, Click Run

15 15 Univariate analysis Output two trait (ETOH/KL) combination Results for Fixed effects from primary trait analysis. Variance components for Random effects from primary trait analysis. Heritability of primary trait. Results from this analysis are similar to those from the single trait analysis, but also provide information on heritability of the secondary trait as well as the genetic correlation between the two traits.

16 16 This is only a guide for calculating the cost of generating a single GEBV referred to as Sample cost in GS Step Cost/sample Genotyping $53 DNA isolation $7 Library generation $9 DNA sequencing $37 SNP genotypes $10 (bioinformatics) Prediction of GEBV's $5 (statistical model) Total $68 Other notes: Assumes GBS as the genotyping method. Sequencing uses an Illumina HiSeq 2500 with version 4 chemistry Cost is for the 96-plex level which will change with the level of multiplexing. The suggested multipliers: (48-ples 2), (192-plex 0.5), (384-plex 0.25) Dodds, K.G.; McEwan, J.C.; Brauning, R.; Anderson, R.M.; van Stijn, T.C.; Kristjánsson, T.; Clarke, S.M. (2015). Construction of relatedness matrices using genotyping-by-sequencing data. BMC Genomics 16: 1047.

17 17 Estimation of Genetic Gain (G) and its simulation After variance component analysis of half sib (HS) or full sib (FS) families, for a given trait (ETOH in our example), click on Simulation. This will open a blank Breeding Strategy and Simulation window shown below: Clicking on the Strategy dropdown window will enable selection of any of the breeding strategies below:

18 18 Estimation of Genetic Gain (G) and its simulation continued These constants cannot be changed HS family based breeding models including Genomic selection (GS). Inputs automatically transferred from linear model analysis All the simulation variables have dropdown menus which provide a range of values to select from. All estimated costs ($) should be entered manually. Gc, gain estimate per cycle Ga, gain estimate per annum % (relative to parental mean) Click Update every time a breeding strategy, simulation variable or cost ($) is changed. This will update G estimates and associated costs. If Full Sib families, HS will change to FS. If two traits like ETOH and KL, primary and secondary, respectively, are analysed, then models for Correlated response to selection will be available.

19 19 Multivariate analysis Plot gives you options to generate a Biplot or a Matrix Plot of phenotypic correlation, based on raw data. To begin: Click on the Model command and select Multivariate. MANOVA (Multivariate analysis of variance) generates a variance and covariance matrix and genotypic or genetic correlation coefficients for the traits chosen from the dropdown menu in the Multiple traits box. Clicking on Selection index activates a window that enables use of the Smith-Hazel index. Used for highlighting groups in the Plot option. Clicking in this box will show you the list of traits, in the uploaded data matrix, to be selected for multivariate analysis based on the three options; Plot, MANOVA and Selection Index. The demonstration/practice data (File name: CaseStudy3 in Examples. You need to first upload this file using: Data Input), consists of 144 lines (half-sib families) of switchgrass (Panicum virgatum L.) evaluated across 2 locations over 2 years using randomized complete block designs at each location containing 3 replicates. Data on 4 traits; biomass dry matter yield (Yld), Klason lignin (KL), ethanol (ETOH) and high heating value (HHV) are included in matrix.

20 20 Using Plot! This Biplot is based on raw data.

21 Phenotypic correlation 21

22 22 MANOVA 2 1 1) Click on MANOVA, 2) Click on Multiple traits box and choose traits, 3) Click on MANOVA terms box and chose the effects for the completely random linear model (keep this model simple by choosing main effects and only their two way interactions), 4) Click Run. Multivariate analysis Output the estimates will be genetic if the raw data are generated from HS or FS families 3 4

23 23 Smith-Hazel selection index ) Click on Selection Index, 2) Click on Multiple traits box and choose traits, 3) Manually enter the Index weightings, 4) Click on LME fixed terms box and select the fixed effects or leave as Null, 5) Click on LME random terms box and select the random effects, 6) Click on MANOVA terms box and chose the effects for the completely random linear model (keep this model simple by choosing main effects and only their two way interactions), 7) Click on the Selection pressure box and choose the intensity of selection, 8) To estimated the genetic gain for each trait under selection (YLD, ETOH, KL) tick G, 9) Click Run

24 Smith-Hazel index - Output window 24 [P] 1 The Smith-Hazel index equation [b] = [P] 1 [A][w] [A] [b] Individual trait BLUP s Smith-Hazel index: the genetic worth (breeding values) of lines. Gc, gain estimate per selection cycle in unites of measurement of each trait.

25 25 Pattern analysis for Multiple Traits Step 1, upload the Line-by-Trait mean matrix into DeltaGen using Data Input. This example is based on the data file MultiTraitMatrix.csv found in examples. Step 2, Click on the Pattern Analysis Step 3, select the variables by clicking on them, and keep the standardized data option on, Step 4, Click on Run. Cluster analysis will produce Line groups and a heat map with Line and Trait dendrograms. The PCA BiPlot option will provide a graphical summary of the Line clusters and trait association (shown by the directional vectors).

26 Pattern analysis output. 26 1, 2 & 3.

27 27 Dendrograms of Trait and Line grouping Lines Traits

28 28 Line cluster groups: Magnification and quality of the contents of the biplot can be adjusted by moving the scale controllers The entries (lines, genotypes..) shown in the biplot can be shown as dots or labels by selecting the option in the dropdown menu. Directional vectors

29 29 Trial design instructions Will be uploaded soon. However, please do try using the design generator.

30 30 Save session and Quit Analysis reports can be saved as PDF, HTML and Word documents. Click on any of the three document format options followed by selecting Download. To Quit DeltaGen, click on Quit App.

Step-by-Step Guide to Advanced Genetic Analysis

Step-by-Step Guide to Advanced Genetic Analysis Step-by-Step Guide to Advanced Genetic Analysis Page 1 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options

More information

Breeding View A visual tool for running analytical pipelines User Guide Darren Murray, Roger Payne & Zhengzheng Zhang VSN International Ltd

Breeding View A visual tool for running analytical pipelines User Guide Darren Murray, Roger Payne & Zhengzheng Zhang VSN International Ltd Breeding View A visual tool for running analytical pipelines User Guide Darren Murray, Roger Payne & Zhengzheng Zhang VSN International Ltd January 2015 1. Introduction The Breeding View is a visual tool

More information

Practical OmicsFusion

Practical OmicsFusion Practical OmicsFusion Introduction In this practical, we will analyse data, from an experiment which aim was to identify the most important metabolites that are related to potato flesh colour, from an

More information

Breeding Guide. Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel

Breeding Guide. Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel Breeding Guide Customer Services PHENOME-NETWORKS 4Ben Gurion Street, 74032, Nes-Ziona, Israel www.phenome-netwoks.com Contents PHENOME ONE - INTRODUCTION... 3 THE PHENOME ONE LAYOUT... 4 THE JOBS ICON...

More information

Step-by-Step Guide to Relatedness and Association Mapping Contents

Step-by-Step Guide to Relatedness and Association Mapping Contents Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...

More information

To finish the current project and start a new project. File Open a text data

To finish the current project and start a new project. File Open a text data GGEbiplot version 5 In addition to being the most complete, most powerful, and most user-friendly software package for biplot analysis, GGEbiplot also has powerful components for on-the-fly data manipulation,

More information

This tutorial will guide a curator/user to create the files to upload phenotype experiment annotation and data onto T3.

This tutorial will guide a curator/user to create the files to upload phenotype experiment annotation and data onto T3. This tutorial will guide a curator/user to create the files to upload phenotype experiment annotation and data onto T3. Updated December 2012. Any feedback is welcome! Sections 1. Download the Template

More information

Fast, Easy, and Publication-Quality Ecological Analyses with PC-ORD

Fast, Easy, and Publication-Quality Ecological Analyses with PC-ORD Emerging Technologies Fast, Easy, and Publication-Quality Ecological Analyses with PC-ORD JeriLynn E. Peck School of Forest Resources, Pennsylvania State University, University Park, Pennsylvania 16802

More information

STAT 311 (3 CREDITS) VARIANCE AND REGRESSION ANALYSIS ELECTIVE: ALL STUDENTS. CONTENT Introduction to Computer application of variance and regression

STAT 311 (3 CREDITS) VARIANCE AND REGRESSION ANALYSIS ELECTIVE: ALL STUDENTS. CONTENT Introduction to Computer application of variance and regression STAT 311 (3 CREDITS) VARIANCE AND REGRESSION ANALYSIS ELECTIVE: ALL STUDENTS. CONTENT Introduction to Computer application of variance and regression analysis. Analysis of Variance: one way classification,

More information

Estimating Variance Components in MMAP

Estimating Variance Components in MMAP Last update: 6/1/2014 Estimating Variance Components in MMAP MMAP implements routines to estimate variance components within the mixed model. These estimates can be used for likelihood ratio tests to compare

More information

Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018

Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018 Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018 Contents Introduction... 1 Start DIONE... 2 Load Data... 3 Missing Values... 5 Explore Data... 6 One Variable... 6 Two Variables... 7 All

More information

Supporting Figures Figure S1. FIGURE S2.

Supporting Figures Figure S1. FIGURE S2. Supporting Figures Figure S1. QTL scans for the 12 environments of the yield data for pepper simulated from the physiological genotype-to-phenotype model with seven physiological parameters (Rodrigues

More information

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1

CTL mapping in R. Danny Arends, Pjotr Prins, and Ritsert C. Jansen. University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 CTL mapping in R Danny Arends, Pjotr Prins, and Ritsert C. Jansen University of Groningen Groningen Bioinformatics Centre & GCC Revision # 1 First written: Oct 2011 Last modified: Jan 2018 Abstract: Tutorial

More information

2016 Stat-Ease, Inc. & CAMO Software

2016 Stat-Ease, Inc. & CAMO Software Multivariate Analysis and Design of Experiments in practice using The Unscrambler X Frank Westad CAMO Software fw@camo.com Pat Whitcomb Stat-Ease pat@statease.com Agenda Goal: Part 1: Part 2: Show how

More information

CHAPTER 1 GETTING STARTED

CHAPTER 1 GETTING STARTED CHAPTER 1 GETTING STARTED Configuration Requirements This design of experiment software package is written for the Windows 2000, XP and Vista environment. The following system requirements are necessary

More information

Package GWAF. March 12, 2015

Package GWAF. March 12, 2015 Type Package Package GWAF March 12, 2015 Title Genome-Wide Association/Interaction Analysis and Rare Variant Analysis with Family Data Version 2.2 Date 2015-03-12 Author Ming-Huei Chen

More information

Package MultiMeta. February 19, 2015

Package MultiMeta. February 19, 2015 Type Package Package MultiMeta February 19, 2015 Title Meta-analysis of Multivariate Genome Wide Association Studies Version 0.1 Date 2014-08-21 Author Dragana Vuckovic Maintainer Dragana Vuckovic

More information

ASReml: AN OVERVIEW. Rajender Parsad 1, Jose Crossa 2 and Juan Burgueno 2

ASReml: AN OVERVIEW. Rajender Parsad 1, Jose Crossa 2 and Juan Burgueno 2 ASReml: AN OVERVIEW Rajender Parsad 1, Jose Crossa 2 and Juan Burgueno 2 1 I.A.S.R.I., Library Avenue, New Delhi - 110 012, India 2 Biometrics and statistics Unit, CIMMYT, Mexico ASReml is a statistical

More information

Platinum Planner. School Admin Manual. ã Platinum Educational Group Phone

Platinum Planner. School Admin Manual. ã Platinum Educational Group Phone Platinum Planner School Admin Manual ã Platinum Educational Group Phone 616.818.7877 PLATINUM EDUCATIONAL GROUP Platinum Planner User Guide Platinum Planner is an online web application tool for Clinical

More information

FlowJo Software Lecture Outline:

FlowJo Software Lecture Outline: FlowJo Software Lecture Outline: Workspace Basics: 3 major components 1) The Ribbons (toolbar) The availability of buttons here can be customized. *One of the best assets of FlowJo is the help feature*

More information

Statistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings

Statistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings Statistical Good Practice Guidelines SSC home Using Excel for Statistics - Tips and Warnings On-line version 2 - March 2001 This is one in a series of guides for research and support staff involved in

More information

Microsoft Excel Pivot Tables & Pivot Table Charts

Microsoft Excel Pivot Tables & Pivot Table Charts Microsoft Excel 2013 Pivot Tables & Pivot Table Charts A pivot table report allows you to analyze and summarize a million rows of data in Excel 2013 without entering a single formula. Pivot Tables let

More information

Multiple Sequence Alignment

Multiple Sequence Alignment Introduction to Bioinformatics online course: IBT Multiple Sequence Alignment Lec3: Navigation in Cursor mode By Ahmed Mansour Alzohairy Professor (Full) at Department of Genetics, Zagazig University,

More information

Release Notes. JMP Genomics. Version 4.0

Release Notes. JMP Genomics. Version 4.0 JMP Genomics Version 4.0 Release Notes Creativity involves breaking out of established patterns in order to look at things in a different way. Edward de Bono JMP. A Business Unit of SAS SAS Campus Drive

More information

Single Subject Demo Data Instructions 1) click "New" and answer "No" to the "spatially preprocess" question.

Single Subject Demo Data Instructions 1) click New and answer No to the spatially preprocess question. (1) conn - Functional connectivity toolbox v1.0 Single Subject Demo Data Instructions 1) click "New" and answer "No" to the "spatially preprocess" question. 2) in "Basic" enter "1" subject, "6" seconds

More information

bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016

bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016 bimm vignette Matti Pirinen & Christian Benner University of Helsinki November 15, 2016 1 Introduction bimm is a software package to efficiently estimate variance parameters of a bivariate lineax mixed

More information

Progenesis CoMet User Guide

Progenesis CoMet User Guide Progenesis CoMet User Guide Analysis workflow guidelines for version 2.0 Contents Introduction... 3 How to use this document... 3 How can I analyse my own runs using CoMet?... 3 LC-MS Data used in this

More information

DesignDirector Version 1.0(E)

DesignDirector Version 1.0(E) Statistical Design Support System DesignDirector Version 1.0(E) User s Guide NHK Spring Co.,Ltd. Copyright NHK Spring Co.,Ltd. 1999 All Rights Reserved. Copyright DesignDirector is registered trademarks

More information

Veco User Guides. Grids, Views, and Grid Reports

Veco User Guides. Grids, Views, and Grid Reports Veco User Guides Grids, Views, and Grid Reports Introduction A Grid is defined as being a list of data records presented to the user. A grid is shown generally when an option is selected from the Tree

More information

Performing whole genome SNP analysis with mapping performed locally

Performing whole genome SNP analysis with mapping performed locally BioNumerics Tutorial: Performing whole genome SNP analysis with mapping performed locally 1 Introduction 1.1 An introduction to whole genome SNP analysis A Single Nucleotide Polymorphism (SNP) is a variation

More information

Differential Expression Analysis at PATRIC

Differential Expression Analysis at PATRIC Differential Expression Analysis at PATRIC The following step- by- step workflow is intended to help users learn how to upload their differential gene expression data to their private workspace using Expression

More information

Using Charts in a Presentation 6

Using Charts in a Presentation 6 Using Charts in a Presentation 6 LESSON SKILL MATRIX Skill Exam Objective Objective Number Building Charts Create a chart. Import a chart. Modifying the Chart Type and Data Change the Chart Type. 3.2.3

More information

Order Attributes. Magento Extension User Guide. Official extension page: Order Attributes

Order Attributes. Magento Extension User Guide. Official extension page: Order Attributes Order Attributes Magento Extension User Guide Official extension page: Order Attributes User Guide: Order Attributes Page 1 Table of contents: 1. Extension Settings...... 3 2. Creation of order attributes......

More information

Introduction to Modeling with Simile

Introduction to Modeling with Simile Introduction to Modeling with Simile This is an introduction to using Simile for building and analysing a model. Simile is a visual modeling tool. It allows you to draw a conceptual model diagram, and

More information

miscript mirna PCR Array Data Analysis v1.1 revision date November 2014

miscript mirna PCR Array Data Analysis v1.1 revision date November 2014 miscript mirna PCR Array Data Analysis v1.1 revision date November 2014 Overview The miscript mirna PCR Array Data Analysis Quick Reference Card contains instructions for analyzing the data returned from

More information

Calculating a PCA and a MDS on a fingerprint data set

Calculating a PCA and a MDS on a fingerprint data set BioNumerics Tutorial: Calculating a PCA and a MDS on a fingerprint data set 1 Aim Principal Components Analysis (PCA) and Multi Dimensional Scaling (MDS) are two alternative grouping techniques that can

More information

8. MINITAB COMMANDS WEEK-BY-WEEK

8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

Nature Methods: doi: /nmeth Supplementary Figure 1

Nature Methods: doi: /nmeth Supplementary Figure 1 Supplementary Figure 1 Schematic representation of the Workflow window in Perseus All data matrices uploaded in the running session of Perseus and all processing steps are displayed in the order of execution.

More information

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie

SOLOMON: Parentage Analysis 1. Corresponding author: Mark Christie SOLOMON: Parentage Analysis 1 Corresponding author: Mark Christie christim@science.oregonstate.edu SOLOMON: Parentage Analysis 2 Table of Contents: Installing SOLOMON on Windows/Linux Pg. 3 Installing

More information

You will be re-directed to the following result page.

You will be re-directed to the following result page. ENCODE Element Browser Goal: to navigate the candidate DNA elements predicted by the ENCODE consortium, including gene expression, DNase I hypersensitive sites, TF binding sites, and candidate enhancers/promoters.

More information

What is KNIME? workflows nodes standard data mining, data analysis data manipulation

What is KNIME? workflows nodes standard data mining, data analysis data manipulation KNIME TUTORIAL What is KNIME? KNIME = Konstanz Information Miner Developed at University of Konstanz in Germany Desktop version available free of charge (Open Source) Modular platform for building and

More information

Importing your Exeter NGS data into Galaxy:

Importing your Exeter NGS data into Galaxy: Importing your Exeter NGS data into Galaxy: The aim of this tutorial is to show you how to import your raw Illumina FASTQ files and/or assemblies and remapping files into Galaxy. As of 1 st July 2011 Illumina

More information

ARM : Features Added After Version Gylling Data Management, Inc. * = key features. October 2015

ARM : Features Added After Version Gylling Data Management, Inc. * = key features. October 2015 ARM 2015.6: Features Added After Version 2015.1 Gylling Data Management, Inc. October 2015 * = key features 1 Simpler Assessment Data Column Properties Removed less useful values, reduced height on screen

More information

DI TRANSFORM. The regressive analyses. identify relationships

DI TRANSFORM. The regressive analyses. identify relationships July 2, 2015 DI TRANSFORM MVstats TM Algorithm Overview Summary The DI Transform Multivariate Statistics (MVstats TM ) package includes five algorithm options that operate on most types of geologic, geophysical,

More information

Excel Tips and FAQs - MS 2010

Excel Tips and FAQs - MS 2010 BIOL 211D Excel Tips and FAQs - MS 2010 Remember to save frequently! Part I. Managing and Summarizing Data NOTE IN EXCEL 2010, THERE ARE A NUMBER OF WAYS TO DO THE CORRECT THING! FAQ1: How do I sort my

More information

Data Visualisation with Google Fusion Tables

Data Visualisation with Google Fusion Tables Data Visualisation with Google Fusion Tables Workshop Exercises Dr Luc Small 12 April 2017 1.5 Data Visualisation with Google Fusion Tables Page 1 of 33 1 Introduction Google Fusion Tables is shaping up

More information

JMP Book Descriptions

JMP Book Descriptions JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked

More information

Package OmicKriging. August 29, 2016

Package OmicKriging. August 29, 2016 Type Package Title Poly-Omic Prediction of Complex TRaits Version 1.4.0 Date 2016-03-03 Package OmicKriging August 29, 2016 Author Hae Kyung Im, Heather E. Wheeler, Keston Aquino Michaels, Vassily Trubetskoy

More information

Metabolomic Data Analysis with MetaboAnalyst

Metabolomic Data Analysis with MetaboAnalyst Metabolomic Data Analysis with MetaboAnalyst User ID: guest6522519400069885256 April 14, 2009 1 Data Processing and Normalization 1.1 Reading and Processing the Raw Data MetaboAnalyst accepts a variety

More information

Data and Function Plotting with MATLAB (Linux-10)

Data and Function Plotting with MATLAB (Linux-10) Data and Function Plotting with MATLAB (Linux-10) This tutorial describes the use of MATLAB for general plotting of experimental data and equations and for special plots like histograms. (Astronomers -

More information

Creating a custom mappings similarity matrix

Creating a custom mappings similarity matrix BioNumerics Tutorial: Creating a custom mappings similarity matrix 1 Aim In BioNumerics, character values can be mapped to categorical names according to predefined criteria (see tutorial Importing non-numerical

More information

TexRAD Research Version Client User Guide Version 3.9

TexRAD Research Version Client User Guide Version 3.9 Imaging tools for medical decision makers Cambridge Computed Imaging Ltd Grange Park Broadway Bourn Cambridge CB23 2TA UK TexRAD Research Version Client User Guide Version 3.9 Release date 23/05/2016 Number

More information

Machine Learning for Pre-emptive Identification of Performance Problems in UNIX Servers Helen Cunningham

Machine Learning for Pre-emptive Identification of Performance Problems in UNIX Servers Helen Cunningham Final Report for cs229: Machine Learning for Pre-emptive Identification of Performance Problems in UNIX Servers Helen Cunningham Abstract. The goal of this work is to use machine learning to understand

More information

Der Nutzen von teilwiederholten Versuchen bei Sonnenblumen

Der Nutzen von teilwiederholten Versuchen bei Sonnenblumen Sommertagung AG Landwirtschaftliches Versuchswesen 29. 30. Juni 2017 Die Universität Hohenheim Der Nutzen von teilwiederholten Versuchen bei Sonnenblumen 29.06.2017 Christian Sponagel, Hans-Peter Piepho,

More information

Asreml-R: an R package for mixed models using residual maximum likelihood

Asreml-R: an R package for mixed models using residual maximum likelihood Asreml-R: an R package for mixed models using residual maximum likelihood David Butler 1 Brian Cullis 2 Arthur Gilmour 3 1 Queensland Department of Primary Industries Toowoomba 2 NSW Department of Primary

More information

Genetic Programming. Charles Chilaka. Department of Computational Science Memorial University of Newfoundland

Genetic Programming. Charles Chilaka. Department of Computational Science Memorial University of Newfoundland Genetic Programming Charles Chilaka Department of Computational Science Memorial University of Newfoundland Class Project for Bio 4241 March 27, 2014 Charles Chilaka (MUN) Genetic algorithms and programming

More information

Manual code: MSU_pigs.R

Manual code: MSU_pigs.R Manual code: MSU_pigs.R Authors: Jose Luis Gualdrón Duarte 1 and Juan Pedro Steibel,3 1 Departamento de Producción Animal, Facultad de Agronomía, UBA-CONICET, Buenos Aires, ARG Department of Animal Science,

More information

SMRT Link Release Notes (v6.0.0)

SMRT Link Release Notes (v6.0.0) SMRT Link Release Notes (v6.0.0) SMRT Link Server Installation SMRT Link server software is supported on English-language CentOS 6.x; 7.x and Ubuntu 14.04; 16.04 64-bit Linux distributions. (This also

More information

Session One: MINITAB Basics

Session One: MINITAB Basics 8 Session One: MINITAB Basics, 8-2 Start MINITAB, 8-3 Open a Worksheet, 8-3 Enter Data from the Keyboard, 8-4 Enter Patterned Data, 8-4 Save Your Project, 8-5 Compute Descriptive Statistics, 8-6 Perform

More information

GBS Bioinformatics Pipeline(s) Overview

GBS Bioinformatics Pipeline(s) Overview GBS Bioinformatics Pipeline(s) Overview Getting from sequence files to genotypes. Pipeline Coding: Ed Buckler Jeff Glaubitz James Harriman Presentation: Terry Casstevens With supporting information from

More information

Performing Comparisons in BioNumerics. Erik W. Coleman May 2009

Performing Comparisons in BioNumerics. Erik W. Coleman May 2009 Performing Comparisons in BioNumerics Erik W. Coleman May 2009 Overview Create a Comparison and Perform a Cluster Analysis Cluster Analysis Parameters Position Tolerance and Optimization Dice Coefficient

More information

How to Make Graphs in EXCEL

How to Make Graphs in EXCEL How to Make Graphs in EXCEL The following instructions are how you can make the graphs that you need to have in your project.the graphs in the project cannot be hand-written, but you do not have to use

More information

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017

BICF Nano Course: GWAS GWAS Workflow Development using PLINK. Julia Kozlitina April 28, 2017 BICF Nano Course: GWAS GWAS Workflow Development using PLINK Julia Kozlitina Julia.Kozlitina@UTSouthwestern.edu April 28, 2017 Getting started Open the Terminal (Search -> Applications -> Terminal), and

More information

NUMERICAL COMPUTING For Finance Using Excel. Sorting and Displaying Data

NUMERICAL COMPUTING For Finance Using Excel. Sorting and Displaying Data NUMERICAL COMPUTING For Finance Using Excel Sorting and Displaying Data Outline 1 Sorting data Excel Sort tool (sort data in ascending or descending order) Simple filter (by ROW, COLUMN, apply a custom

More information

BRAT Manual V1.1. May 6th, Christian Göschl & Wolfgang Busch. for correspondence:

BRAT Manual V1.1. May 6th, Christian Göschl & Wolfgang Busch.  for correspondence: BRAT Manual V1.1 May 6th, 2014 Christian Göschl & Wolfgang Busch Email for correspondence: wolfgang.busch@gmi.oeaw.ac.at Gregor Mendel Institute Dr. Bohr-Gasse 3 1030 Vienna Austria 1 2 Brat Segmentation

More information

Progenesis CoMet User Guide

Progenesis CoMet User Guide Progenesis CoMet User Guide Analysis workflow guidelines for version 1.0 Contents Introduction... 3 How to use this document... 3 How can I analyse my own runs using CoMet?... 3 LC-MS Data used in this

More information

Genetic Analysis. Page 1

Genetic Analysis. Page 1 Genetic Analysis Page 1 Genetic Analysis Objectives: 1) Set up Case-Control Association analysis and the Basic Genetics Workflow 2) Use JMP tools to interact with and explore results 3) Learn advanced

More information

GRADE CENTRE BEST PRACTICE FOR A4L

GRADE CENTRE BEST PRACTICE FOR A4L GRADE CENTRE BEST PRACTICE FOR A4L Overview A large number of reports use information from the Grade Centre to draw correlations between activity and student success (see appendix). This document serves

More information

CLC Server. End User USER MANUAL

CLC Server. End User USER MANUAL CLC Server End User USER MANUAL Manual for CLC Server 10.0.1 Windows, macos and Linux March 8, 2018 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej 2 Prismet DK-8000 Aarhus C Denmark

More information

MACAU User Manual. Xiang Zhou. March 15, 2017

MACAU User Manual. Xiang Zhou. March 15, 2017 MACAU User Manual Xiang Zhou March 15, 2017 Contents 1 Introduction 2 1.1 What is MACAU...................................... 2 1.2 How to Cite MACAU................................... 2 1.3 The Model.........................................

More information

Step-by-Step Guide to Basic Genetic Analysis

Step-by-Step Guide to Basic Genetic Analysis Step-by-Step Guide to Basic Genetic Analysis Page 1 Introduction This document shows you how to clean up your genetic data, assess its statistical properties and perform simple analyses such as case-control

More information

Microsoft Excel Using Excel in the Science Classroom

Microsoft Excel Using Excel in the Science Classroom Microsoft Excel Using Excel in the Science Classroom OBJECTIVE Students will take data and use an Excel spreadsheet to manipulate the information. This will include creating graphs, manipulating data,

More information

CLAREMONT MCKENNA COLLEGE. Fletcher Jones Student Peer to Peer Technology Training Program. Basic Statistics using Stata

CLAREMONT MCKENNA COLLEGE. Fletcher Jones Student Peer to Peer Technology Training Program. Basic Statistics using Stata CLAREMONT MCKENNA COLLEGE Fletcher Jones Student Peer to Peer Technology Training Program Basic Statistics using Stata An Introduction to Stata A Comparison of Statistical Packages... 3 Opening Stata...

More information

Click on "+" button Select your VCF data files (see #Input Formats->1 above) Remove file from files list:

Click on + button Select your VCF data files (see #Input Formats->1 above) Remove file from files list: CircosVCF: CircosVCF is a web based visualization tool of genome-wide variant data described in VCF files using circos plots. The provided visualization capabilities, gives a broad overview of the genomic

More information

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS

FVGWAS- 3.0 Manual. 1. Schematic overview of FVGWAS FVGWAS- 3.0 Manual Hongtu Zhu @ UNC BIAS Chao Huang @ UNC BIAS Nov 8, 2015 More and more large- scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical

More information

renumf90 Introduction to 5/27/2016 UGA 05/2016 Data File for blupf90 family: a) Only numbers Integer or real

renumf90 Introduction to 5/27/2016 UGA 05/2016 Data File for blupf90 family: a) Only numbers Integer or real Introduction to renumf90 UGA 05/2016 Data File for blupf90 family: a) Only numbers Integer or real a) All effects need to be renumbered from 1 consecutively Controlled by the same parameter file! 1 RENUMF90

More information

Analyzing Genomic Data with NOJAH

Analyzing Genomic Data with NOJAH Analyzing Genomic Data with NOJAH TAB A) GENOME WIDE ANALYSIS Step 1: Select the example dataset or upload your own. Two example datasets are available. Genome-Wide TCGA-BRCA Expression datasets and CoMMpass

More information

CS 229: Machine Learning Final Report Identifying Driving Behavior from Data

CS 229: Machine Learning Final Report Identifying Driving Behavior from Data CS 9: Machine Learning Final Report Identifying Driving Behavior from Data Robert F. Karol Project Suggester: Danny Goodman from MetroMile December 3th 3 Problem Description For my project, I am looking

More information

Microsoft Excel Pivot Tables & Pivot Table Charts

Microsoft Excel Pivot Tables & Pivot Table Charts Microsoft Excel 2007 Pivot Tables & Pivot Table Charts A pivot table report allows you to analyze and summarize a million rows of data in Excel 2007 without entering a single formula. Pivot Tables let

More information

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Graphical Analysis of Data using Microsoft Excel [2016 Version] Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

Excel 2016: Part 2 Functions/Formulas/Charts

Excel 2016: Part 2 Functions/Formulas/Charts Excel 2016: Part 2 Functions/Formulas/Charts Updated: March 2018 Copy cost: $1.30 Getting Started This class requires a basic understanding of Microsoft Excel skills. Please take our introductory class,

More information

Sage: Symmetry and Asymmetry in Geometric Data Version 1.21 (compiled 03/1114)

Sage: Symmetry and Asymmetry in Geometric Data Version 1.21 (compiled 03/1114) Sage: Symmetry and Asymmetry in Geometric Data Version 1.21 (compiled 03/1114) Eladio Marquez 2012-2014 Mammals Division University of Michigan Museum of Zoology http://www-personal.umich.edu/~emarquez/morph/

More information

CircosVCF workshop, TAU, 9/11/2017

CircosVCF workshop, TAU, 9/11/2017 CircosVCF exercise In this exercise, we will create and design circos plots using CircosVCF. We will use vcf files of a published case "X-linked elliptocytosis with impaired growth is related to mutated

More information

GenomeStudio Software Release Notes

GenomeStudio Software Release Notes GenomeStudio Software 2009.2 Release Notes 1. GenomeStudio Software 2009.2 Framework... 1 2. Illumina Genome Viewer v1.5...2 3. Genotyping Module v1.5... 4 4. Gene Expression Module v1.5... 6 5. Methylation

More information

SPSS INSTRUCTION CHAPTER 9

SPSS INSTRUCTION CHAPTER 9 SPSS INSTRUCTION CHAPTER 9 Chapter 9 does no more than introduce the repeated-measures ANOVA, the MANOVA, and the ANCOVA, and discriminant analysis. But, you can likely envision how complicated it can

More information

PATRISTICv1.0. Patristic runs on Windows, Mac and Linux in a graphic mode. Make sure you have a version a Java.

PATRISTICv1.0. Patristic runs on Windows, Mac and Linux in a graphic mode. Make sure you have a version a Java. PATRISTICv1.0 Introduction: Patristic is a Java program that uses as input different tree files and computes their patristic distances. Patristic allows saving and editing in different ways those distances.

More information

All Applications Release Bulletin January 2010

All Applications Release Bulletin January 2010 All Applications Release Bulletin January 2010 In this bulletin... Online Enrollment: HTML Forms for Contracts 2 System Administration: MBP Online User Accounts 11 About Release 91_6 This release includes

More information

Data Science. Data Analyst. Data Scientist. Data Architect

Data Science. Data Analyst. Data Scientist. Data Architect Data Science Data Analyst Data Analysis in Excel Programming in R Introduction to Python/SQL/Tableau Data Visualization in R / Tableau Exploratory Data Analysis Data Scientist Inferential Statistics &

More information

Maximizing Public Data Sources for Sequencing and GWAS

Maximizing Public Data Sources for Sequencing and GWAS Maximizing Public Data Sources for Sequencing and GWAS February 4, 2014 G Bryce Christensen Director of Services Questions during the presentation Use the Questions pane in your GoToWebinar window Agenda

More information

Automated Bioinformatics Analysis System on Chip ABASOC. version 1.1

Automated Bioinformatics Analysis System on Chip ABASOC. version 1.1 Automated Bioinformatics Analysis System on Chip ABASOC version 1.1 Phillip Winston Miller, Priyam Patel, Daniel L. Johnson, PhD. University of Tennessee Health Science Center Office of Research Molecular

More information

EcoStudy Software User Guide

EcoStudy Software User Guide EcoStudy Software User Guide FOR RESEARCH USE ONLY What is EcoStudy? 3 Setting Up a Study 4 Specifying Analysis Settings for your Study 6 Reviewing the Data in your Study 8 Exporting Study Data to a Report

More information

Release Notes for April StatCrunch Updates

Release Notes for April StatCrunch Updates Release Notes for April 2018 - StatCrunch Updates Major additions Introducing accessibility features that support full keyboard functionality including new keyboard shortcuts. [Go to page 2] Measures to

More information

Excel 2013 Intermediate

Excel 2013 Intermediate Excel 2013 Intermediate Quick Access Toolbar... 1 Customizing Excel... 2 Keyboard Shortcuts... 2 Navigating the Spreadsheet... 2 Status Bar... 3 Worksheets... 3 Group Column/Row Adjusments... 4 Hiding

More information

Fathom Dynamic Data TM Version 2 Specifications

Fathom Dynamic Data TM Version 2 Specifications Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other

More information

Activity Graphical Analysis with Excel and Logger Pro

Activity Graphical Analysis with Excel and Logger Pro Activity Graphical Analysis with Excel and Logger Pro Purpose Vernier s Logger Pro is a graphical analysis software that will allow you to collect, graph and manipulate data. Microsoft s Excel is a spreadsheet

More information

Package PedCNV. February 19, 2015

Package PedCNV. February 19, 2015 Type Package Package PedCNV February 19, 2015 Title An implementation for association analysis with CNV data. Version 0.1 Date 2013-08-03 Author, Sungho Won and Weicheng Zhu Maintainer

More information

How to use Excel Spreadsheets for Graphing

How to use Excel Spreadsheets for Graphing How to use Excel Spreadsheets for Graphing 1. Click on the Excel Program on the Desktop 2. You will notice that a screen similar to the above screen comes up. A spreadsheet is divided into Columns (A,

More information

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani Feature Selection CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Dimensionality reduction Feature selection vs. feature extraction Filter univariate

More information

Technology Assignment: Scatter Plots

Technology Assignment: Scatter Plots The goal of this assignment is to create a scatter plot of a set of data. You could do this with any two columns of data, but for demonstration purposes we ll work with the data in the table below. You

More information

Excel Spreadsheets and Graphs

Excel Spreadsheets and Graphs Excel Spreadsheets and Graphs Spreadsheets are useful for making tables and graphs and for doing repeated calculations on a set of data. A blank spreadsheet consists of a number of cells (just blank spaces

More information