Getting started: Analysis of Microbial Communities
|
|
- Mervyn Howard
- 5 years ago
- Views:
Transcription
1 Getting started: Analysis of Microbial Communities June 12, 2015 CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: Fax: support@clcbio.com
2 Contents Getting started: Analysis of Microbial Communities Prerequisites Results Additional statistical analyses
3 CONTENTS 3 Getting started: Analysis of Microbial Communities Mr. X is a suspect in a murder. The body was found on site 1, but Mr. X claims he was never there because he spent his entire week-end on site 2 and 3. Investigators have found a pair of rubber boots and a pair of hiking shoes at Mr. X s house. Both were dirty with soil on the soles. They took 3 samples of soil from each pair of shoes, and 2 samples of soil from each site: the crime scene (site 1) but also the 2 sites Mr. X claimed he was at (site 2 and 3). This tutorial will guide you through the different tools you can use to transform NGS data into forensic evidence. Indeed, each soil sample is characterized by a specific microbial community. To identify species present in soil samples, DNA is extracted from its microbial community, a region of the 16S gene is PCR amplified, and the resulting amplicon is sequenced using an NGS machine. The Microbial Genomics Module will then assign taxonomy to the reads by clustering them with representative sequences of pseudo-species called Operational Taxonomical Units (OTUs), and tally the occurrences of each OTU. Secondary analyses will further describe microbial communities by estimating alpha and beta diversities in the context of sample metadata. Prerequisites For this tutorial, you must be working with CLC Genomics Workbench 7.5 or higher, or Biomedical Genomics Workbench 2.1 or higher, and you must have installed the Microbial Genomics Module. You will be using a data set containing the sequences and metadata from a round robin trial of several soil types generated in a mock crime investigation as part of the MiSAFE project ( DNA was extracted, and a region of the 16S gene was PCR amplified using standard primers. The resulting amplicon was sequenced on an Illumina MiSeq machine (300 cycles, forward and reverse). The data set includes the following files: Sequence data: 12 data sets (two each for soil from locations 1, 2 and 3, and three each for soil on the suspects rubber boots GT-A and hiking shoes GT-B). The data was generated from the same MiSeq run and is composed of demultiplexed.fastq files. The original files have been down sampled to only contain 1/10th of the reads. While this improves the processing speed of the different analyses performed by the workbench, it will also impact negatively the statistical analysis. Metadata: the spreadsheet MetadataRoundRobin.csv contains metadata information. In this case we have only information about the origin of the samples. Sequencing primer sequences: 16s_primers_round_robin.clc for the 16S primers. Note that several databases are available for download in the Microbial Genomics Module. Database: 16S_97_otus_GG.clc.n Now that the prerequisites have been described, its time to start with the analyses of your samples. 1. Download the sample data from our website: MicrobialAnalysisData.zip. P. 3
4 CONTENTS 4 2. Start the workbench and go to File Import ( ) Illumina ( ) to import the 24 sequence files (ending with "fastq.gz"). Ensure that the import type under Options is set to Paired reads and that the radio button for Paired-end (forward-reverse) is selected. Minimum distance must be set to 200 and Maximum distance to 550. Click on the button labeled Next and select the location where you want to store the imported sequences. We recommend that you create a new folder called Illumina reads for example. You can check that you have now 12 files labeled as "paired". 3. Import the database sequence data by drag-and-drop the 16S_97_otus_GG.clc database and the 16s_primers_round_robin.clc primer sequences into your destination folder in your workbench. Now that all our data has been imported, you can start the workflows from the Microbial Genomics Module. The Data QC and OTU clustering workflow consists of 5 tools being executed sequentially (see a display of the workflow in figure 1). The input necessary to run the workflow are the reads you want to cluster. You can also specify a list of the primers that were used to sequence these reads. 1. Open the workflow Toolbox Workflows Data QC and OTU clustering. 2. Select the 12 sequence files form your folder called Illumina reads. 3. In the Optional Merge Paired Reads window, change the parameters to Mismatch cost 1, Minimum score 40, Gap cost 4 and Maximum unaligned end mismatches to 5 and click Next. 4. In the Trim Sequences window, select the list of primers sequences 16s_primers_round_robin.clc. Leave the other parameters as default and click on the button labeled Next. 5. In the Fixed Length Trimming window, leave the option as default as we want the length of trimming to be automatically detected by the software. 6. In the OTUclustering window, choose from the drop-down menu Reference based OTU clustering and select the file called 16S_97_otus_GG. Click on the button labeled Next. 7. Save your workflow outputs in a new folder called Data QC and OTU clustering. Click on the button labeled Finish. You can follow the progress of the workflow in the Processes bar below the toolbox. When the workflow is done, you will have 3 output files in your folder (figure 2). The file OTU(Table) is the one you will use as input for the Estimate Alpha and Beta Diversities workflow, but as these secondary analyses require metadata you will first use the Add Metadata to OTU Abundance Table tool. You can then run the Estimate Alpha and Beta Diversities workflow which consists of 5 tools and your newly generated OTU(Table) as input file (figure 3). 1. Select Toolbox OTUclustering ( ) Add Metadata to OTU Abundance Table ( ) and choose the OTU(Table) as input. P. 4
5 CONTENTS 5 Figure 1: Layout of the Data QC and OTU clustering workflow. Figure 2: Outputs of the Data QC and OTU clustering workflow. 2. Select the file describing the metadata on your local computer: MetatdataRoundRobin.csv and click on the button labeled Next. 3. Save your result in the Data QC and OTU clustering folder. It is a table that will overwrite the previous OTU(Table) file. The tables are similar to each other, but you have now the option to Aggregate samples based on the headers of the columns of your metadata file. Note: if you had previously open the OTU(Table), close it and reopen it to be able to have the aggregate option on the right side panel of the workbench. 4. Open Toolbox Workflows Estimate Alpha and Beta Diversities and select the OTU (Table). Click on the button labeled Next. P. 5
6 CONTENTS 6 Figure 3: Layout of the Alpha and Beta Diversities workflow. 5. In the Alpha analysis window, deselect everything but Number of OTUs. 6. In the Beta analysis window select D_0.5 UniFrac but deselect Bray-Curtis and Jaccard measures, unweighted and weighted UniFrac. 7. Save the data in a new folder called Estimate Alpha and Beta Diversities. Running this workflow will give at least 4 outputs (figure 4): an alignment of the OTUs, a phylogenetic tree of the OTUs, a diversity report for the alpha diversity and a PCoA for the beta diversity. Figure 4: Outputs of the Alpha and Beta Diversities workflow. Results The primary output of your analysis is an OTU abundance table annotated with the metadata. In this investigation, the metadata defines the origin of the different soil samples and allows the aggregation of the results to improve visualization of the results. In addition, the module offers several ways to look at your newly generated OTU clusters: the table itself, but also Stacked Bar Charts and Stacked Area Charts ( ) as well as Zoomable Sunbursts ( ). P. 6
7 CONTENTS 7 Open the OTU(Table) file from your Data QC and OTU clustering folder and click on the Stacked Bar Chart icon in the lower part of the workbench. In the right side panel, choose to aggregate samples by Type (figure 5). We observe a striking similarity between the GT-A profile found on the suspects rubber boots and the profile of the soil from Site 1, indicating that Mr. X was most likely lying when he pretended he had never been on Site 1. Figure 5: Aggregate samples based on metadata information. Now open the results of the alpha diversity analysis, called OTU(Table)(Filtered)(Alpha diversity): the plot contains the rarefaction results of the specified alpha diversity measure while each line corresponds to one sample. The coloring scheme can be set by double clicking on a graph and changing the color for lines one sample at a time. Note: you need to set different shades of a particular color to the different samples as the samples cannot have the exact same color. In the following graph (figure 6) we have chosen shades of blue for GT-A and shades of green for GT-B. The lines do not plateau, indicating that we would need more samples to reach a definite conclusion, but GT-A samples seem to have similar measures of alpha diversity as the sites 1, 2, and 3 while GT-B samples are clearly apart. We suspect that Mr.X was not wearing his hiking boots on any of the sites sampled. Figure 6: Results of the alpha diversity analysis measured using Number of OTUs as parameters. Finally, beta diversity estimates differences in species diversity between samples. The beta diversity analysis tool performs a Principal Component Analysis PCoA ( ) on the UniFrac P. 7
8 CONTENTS 8 distances (figure 7). Figure 7: Result of the beta diversity analysis. In a PCoA of the beta diversities, the soil samples cluster according to their origin. In this case, selecting PCoA as X axis and PCoA1 as Y axis allows for a better visualization of Site 1 and GT-A being clustered together, confirming a similarity between the 2 soils and thus confirming our suspicion that Mr. X was on site 1 with his rubber boots. Additional statistical analyses As a tool for assessing similarity between samples, a heat map and dendrogram can be helpful. In order to perform additional statistical tests, you can use the Convert to Experiment tool to assign categories to the samples from your OTU abundance table. 1. Select Toolbox OTUclustering ( ) Convert to Experiment ( ) 2. Choose OTU(Table) from the Data QC and OTU clustering folder as input, and which Metadata group to consider as factors (here Type) before saving the resulting experiment in a new folder you can call Statistics. The file is called OTU(Table)(experiment). 3. Select Toolbox Transcriptomics Analysis ( ) in CLC Genomics Workbench and Expression Analysis ( ) in Biomedical Genomics Workbench Quality control ( ) Hierarchical Clustering of Samples ( ) 4. Choose the OTU(Table)(experiment) as input. Click on the button labeled Next. 5. Select Euclidian distance and Average linkage and click on the button labeled Next. 6. Save your result in the Statistics folder. It will overwrite the OTU(Table)(experiment) file. If the OTU(Table)(experiment) table was previously opened in your workbench, close it and open it again. The Experiment table now displays a button at the bottom with a heat map icon. Selecting this view will display the heat map (figure 8). We can see that GT-A is again nested together with Site 1, confirming once more that the soil found on the rubber boots is extremely similar to the one sampled from site 1. Finally, you can assess the robustness of your results by running a PERMANOVA analysis on your samples. PERMANOVA can be used to measure effect size and significance of beta diversity. P. 8
9 CONTENTS 9 Figure 8: Heat map of the Hierarchical Clustering of Samples analysis. 1. Select Toolbox OTUclustering ( ) PERMANOVA ( ). 2. Choose OTU(Table) from the Data QC and OTU clustering folder as input and select Type as Metadata group. 3. Deselect Bray-Curtis and Jaccard measures and specify the phylogenetic tree (OTU(Table)(Filtered)align from the Estimate Alpha and Beta Diversities folder. Select D_0.5 UniFrac, deselect weighted and unweighted UniFrac, and leave the number of permutations to 99,999. Click on the button labeled Next. 4. Save the report in the Statistics folder. The result of the PERMANOVA analysis is a table (figure 9). The PERMANOVA confirms that the clusters are significant (p=0,00008), but with only two to three replicates for each sample or group, the clustering is not significant on pair-wise comparisons of the Types. The investigators will need more samples - and in particular from the rubber boots solesand site 1 - to transform this analysis into actual evidence! P. 9
10 CONTENTS 10 Figure 9: Result of the PERMANOVA analysis. P. 10
OTU Clustering Using Workflows
OTU Clustering Using Workflows June 28, 2018 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com ts-bioinformatics@qiagen.com
More informationTutorial. OTU Clustering Step by Step. Sample to Insight. March 2, 2017
OTU Clustering Step by Step March 2, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationTutorial. OTU Clustering Step by Step. Sample to Insight. June 28, 2018
OTU Clustering Step by Step June 28, 2018 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com ts-bioinformatics@qiagen.com
More informationCLC Microbial Genomics Module USER MANUAL
CLC Microbial Genomics Module USER MANUAL User manual for CLC Microbial Genomics Module 1.1 Windows, Mac OS X and Linux October 12, 2015 This software is for research purposes only. CLC bio, a QIAGEN Company
More informationFusion Detection Using QIAseq RNAscan Panels
Fusion Detection Using QIAseq RNAscan Panels June 11, 2018 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com ts-bioinformatics@qiagen.com
More informationExpression Analysis with the Advanced RNA-Seq Plugin
Expression Analysis with the Advanced RNA-Seq Plugin May 24, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com
More informationTutorial. Aligning contigs manually using the Genome Finishing. Sample to Insight. February 6, 2019
Aligning contigs manually using the Genome Finishing Module February 6, 2019 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com
More informationModification of an Existing Workflow
Modification of an Existing Workflow April 3, 2019 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com ts-bioinformatics@qiagen.com
More informationTutorial. Find Very Low Frequency Variants With QIAGEN GeneRead Panels. Sample to Insight. November 21, 2017
Find Very Low Frequency Variants With QIAGEN GeneRead Panels November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com
More informationQIAseq Targeted RNAscan Panel Analysis Plugin USER MANUAL
QIAseq Targeted RNAscan Panel Analysis Plugin USER MANUAL User manual for QIAseq Targeted RNAscan Panel Analysis 0.5.2 beta 1 Windows, Mac OS X and Linux February 5, 2018 This software is for research
More informationSmall RNA Analysis using Illumina Data
Small RNA Analysis using Illumina Data September 7, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com
More informationTutorial: De Novo Assembly of Paired Data
: De Novo Assembly of Paired Data September 20, 2013 CLC bio Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 Fax: +45 86 20 12 22 www.clcbio.com support@clcbio.com : De Novo Assembly
More informationTutorial. Small RNA Analysis using Illumina Data. Sample to Insight. October 5, 2016
Small RNA Analysis using Illumina Data October 5, 2016 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationTutorial. RNA-Seq Analysis of Breast Cancer Data. Sample to Insight. November 21, 2017
RNA-Seq Analysis of Breast Cancer Data November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationTutorial. Typing and Epidemiological Clustering of Common Pathogens (beta) Sample to Insight. November 21, 2017
Typing and Epidemiological Clustering of Common Pathogens (beta) November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com
More informationTutorial. Identification of Variants in a Tumor Sample. Sample to Insight. November 21, 2017
Identification of Variants in a Tumor Sample November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationTutorial. De Novo Assembly of Paired Data. Sample to Insight. November 21, 2017
De Novo Assembly of Paired Data November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationQIIME and the art of fungal community analysis. Greg Caporaso
QIIME and the art of fungal community analysis Greg Caporaso Sequencing output (454, Illumina, Sanger) fastq, fasta, qual, or sff/trace files Metadata mapping file www.qiime.org Pre-processing e.g., remove
More informationTutorial. Identification of Variants Using GATK. Sample to Insight. November 21, 2017
Identification of Variants Using GATK November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationTutorial. Variant Detection. Sample to Insight. November 21, 2017
Resequencing: Variant Detection November 21, 2017 Map Reads to Reference and Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com
More informationQIAseq DNA V3 Panel Analysis Plugin USER MANUAL
QIAseq DNA V3 Panel Analysis Plugin USER MANUAL User manual for QIAseq DNA V3 Panel Analysis 1.0.1 Windows, Mac OS X and Linux January 25, 2018 This software is for research purposes only. QIAGEN Aarhus
More informationCLC Server. End User USER MANUAL
CLC Server End User USER MANUAL Manual for CLC Server 10.0.1 Windows, macos and Linux March 8, 2018 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej 2 Prismet DK-8000 Aarhus C Denmark
More informationTutorial. Getting Started. Sample to Insight. November 28, 2018
Getting Started November 28, 2018 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com ts-bioinformatics@qiagen.com CONTENTS
More informationTutorial. Comparative Analysis of Three Bovine Genomes. Sample to Insight. November 21, 2017
Comparative Analysis of Three Bovine Genomes November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationTutorial: Resequencing Analysis using Tracks
: Resequencing Analysis using Tracks September 20, 2013 CLC bio Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 Fax: +45 86 20 12 22 www.clcbio.com support@clcbio.com : Resequencing
More informationTutorial. Batching of Multi-Input Workflows. Sample to Insight. November 21, 2017
Batching of Multi-Input Workflows November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationTutorial. Phylogenetic Trees and Metadata. Sample to Insight. November 21, 2017
Phylogenetic Trees and Metadata November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com AdvancedGenomicsSupport@qiagen.com
More informationTutorial: RNA-Seq analysis part I: Getting started
: RNA-Seq analysis part I: Getting started August 9, 2012 CLC bio Finlandsgade 10-12 8200 Aarhus N Denmark Telephone: +45 70 22 55 09 Fax: +45 70 22 55 19 www.clcbio.com support@clcbio.com : RNA-Seq analysis
More informationCLC Sequence Viewer USER MANUAL
CLC Sequence Viewer USER MANUAL Manual for CLC Sequence Viewer 8.0.0 Windows, macos and Linux June 1, 2018 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej 2 Prismet DK-8000 Aarhus
More informationTutorial. Identification of somatic variants in a matched tumor-normal pair. Sample to Insight. November 21, 2017
Identification of somatic variants in a matched tumor-normal pair November 21, 2017 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.qiagenbioinformatics.com
More informationCustomizable information fields (or entries) linked to each database level may be replicated and summarized to upstream and downstream levels.
Manage. Analyze. Discover. NEW FEATURES BioNumerics Seven comes with several fundamental improvements and a plethora of new analysis possibilities with a strong focus on user friendliness. Among the most
More informationBiomedical Genomics Workbench APPLICATION BASED MANUAL
Biomedical Genomics Workbench APPLICATION BASED MANUAL Manual for Biomedical Genomics Workbench 4.0 Windows, Mac OS X and Linux January 23, 2017 This software is for research purposes only. QIAGEN Aarhus
More informationResequencing Analysis. (Pseudomonas aeruginosa MAPO1 ) Sample to Insight
Resequencing Analysis (Pseudomonas aeruginosa MAPO1 ) 1 Workflow Import NGS raw data Trim reads Import Reference Sequence Reference Mapping QC on reads Variant detection Case Study Pseudomonas aeruginosa
More informationTECH NOTE Improving the Sensitivity of Ultra Low Input mrna Seq
TECH NOTE Improving the Sensitivity of Ultra Low Input mrna Seq SMART Seq v4 Ultra Low Input RNA Kit for Sequencing Powered by SMART and LNA technologies: Locked nucleic acid technology significantly improves
More informationGenboree Microbiome Toolset - Tutorial. Create Sample Meta Data. Previous Tutorials. September_2011_GMT-Tutorial_Single-Samples
Genboree Microbiome Toolset - Tutorial Previous Tutorials September_2011_GMT-Tutorial_Single-Samples We will be going through a tutorial on the Genboree Microbiome Toolset with publicly available data:
More informationPerforming whole genome SNP analysis with mapping performed locally
BioNumerics Tutorial: Performing whole genome SNP analysis with mapping performed locally 1 Introduction 1.1 An introduction to whole genome SNP analysis A Single Nucleotide Polymorphism (SNP) is a variation
More informationBaseSpace - MiSeq Reporter Software v2.4 Release Notes
Page 1 of 5 BaseSpace - MiSeq Reporter Software v2.4 Release Notes For MiSeq Systems Connected to BaseSpace June 2, 2014 Revision Date Description of Change A May 22, 2014 Initial Version Revision History
More informationPerforming a resequencing assembly
BioNumerics Tutorial: Performing a resequencing assembly 1 Aim In this tutorial, we will discuss the different options to obtain statistics about the sequence read set data and assess the quality, and
More information1 Abstract. 2 Introduction. 3 Requirements. 4 Procedure. Qiime Community Profiling University of Colorado at Boulder
1 Abstract 2 Introduction This SOP describes QIIME (Quantitative Insights Into Microbial Ecology) for community profiling using the Human Microbiome Project 16S data. The process takes users from their
More informationCommunity analysis of 16S rrna amplicon sequencing data with Chipster. Eija Korpelainen CSC IT Center for Science, Finland
Community analysis of 16S rrna amplicon sequencing data with Chipster Eija Korpelainen CSC IT Center for Science, Finland chipster@csc.fi What will I learn? How to operate the Chipster software Community
More informationData analysis of 16S rrna amplicons Computational Metagenomics Workshop University of Mauritius
Data analysis of 16S rrna amplicons Computational Metagenomics Workshop University of Mauritius Practical December 2014 Exercise options 1) We will be going through a 16S pipeline using QIIME and 454 data
More informationBioinformatics explained: BLAST. March 8, 2007
Bioinformatics Explained Bioinformatics explained: BLAST March 8, 2007 CLC bio Gustav Wieds Vej 10 8000 Aarhus C Denmark Telephone: +45 70 22 55 09 Fax: +45 70 22 55 19 www.clcbio.com info@clcbio.com Bioinformatics
More informationTutorial for Windows and Macintosh. Trimming Sequence Gene Codes Corporation
Tutorial for Windows and Macintosh Trimming Sequence 2007 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074
More informationTour Guide for Windows and Macintosh
Tour Guide for Windows and Macintosh 2011 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Suite 100A, Ann Arbor, MI 48108 USA phone 1.800.497.4939 or 1.734.769.7249 (fax) 1.734.769.7074
More informationAdvanced RNA-Seq 1.5. User manual for. Windows, Mac OS X and Linux. November 2, 2016 This software is for research purposes only.
User manual for Advanced RNA-Seq 1.5 Windows, Mac OS X and Linux November 2, 2016 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej 2 Prismet DK-8000 Aarhus C Denmark Contents 1 Introduction
More informationTutorial: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and Expression measures
: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and February 24, 2014 Sample to Insight : RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and : RNA-Seq Analysis
More informationJames Robert White, Cesar Arze, Malcolm Matalka, the CloVR team, Owen White, Samuel V. Angiuoli & W. Florian Fricke
CloVR-16S: Phylogenetic microbial community composition analysis based on 16S ribosomal RNA amplicon sequencing standard operating procedure, version 1.1 James Robert White, Cesar Arze, Malcolm Matalka,
More informationTUTORIAL: Generating diagnostic primers using the Uniqprimer Galaxy Workflow
TUTORIAL: Generating diagnostic primers using the Uniqprimer Galaxy Workflow In this tutorial, we will generate primers expected to amplify a product from Dickeya dadantii but not from other strains of
More informationOrdination (Guerrero Negro)
Ordination (Guerrero Negro) Back to Table of Contents All of the code in this page is meant to be run in R unless otherwise specified. Load data and calculate distance metrics. For more explanations of
More informationCLC Sequence Viewer 6.5 Windows, Mac OS X and Linux
CLC Sequence Viewer Manual for CLC Sequence Viewer 6.5 Windows, Mac OS X and Linux January 26, 2011 This software is for research purposes only. CLC bio Finlandsgade 10-12 DK-8200 Aarhus N Denmark Contents
More informationBioinformatics explained: Smith-Waterman
Bioinformatics Explained Bioinformatics explained: Smith-Waterman May 1, 2007 CLC bio Gustav Wieds Vej 10 8000 Aarhus C Denmark Telephone: +45 70 22 55 09 Fax: +45 70 22 55 19 www.clcbio.com info@clcbio.com
More informationIntroduction to taxonomic analysis of amplicon and shotgun data using QIIME
Practical 08 Introduction to taxonomic analysis Introduction to taxonomic analysis of amplicon and shotgun data using QIIME September 2014 Author Peter Sterk Oxford e-research Centre, University of Oxford,
More informationVector Xpression 3. Speed Tutorial: III. Creating a Script for Automating Normalization of Data
Vector Xpression 3 Speed Tutorial: III. Creating a Script for Automating Normalization of Data Table of Contents Table of Contents...1 Important: Please Read...1 Opening Data in Raw Data Viewer...2 Creating
More informationAnalyzing ChIP- Seq Data in Galaxy
Analyzing ChIP- Seq Data in Galaxy Lauren Mills RISS ABSTRACT Step- by- step guide to basic ChIP- Seq analysis using the Galaxy platform. Table of Contents Introduction... 3 Links to helpful information...
More informationManaging big biological sequence data with Biostrings and DECIPHER. Erik Wright University of Wisconsin-Madison
Managing big biological sequence data with Biostrings and DECIPHER Erik Wright University of Wisconsin-Madison What you should learn How to use the Biostrings and DECIPHER packages Creating a database
More informationImporting and processing a DGGE gel image
BioNumerics Tutorial: Importing and processing a DGGE gel image 1 Aim Comprehensive tools for the processing of electrophoresis fingerprints, both from slab gels and capillary sequencers are incorporated
More informationCLC Genomics Server. Administrator Manual
CLC Genomics Server Administrator Manual Administrator Manual for CLC Genomics Server 5.5 Windows, Mac OS X and Linux October 31, 2013 This software is for research purposes only. CLC bio Silkeborgvej
More informationGegenees genome format...7. Gegenees comparisons...8 Creating a fragmented all-all comparison...9 The alignment The analysis...
User Manual: Gegenees V 1.1.0 What is Gegenees?...1 Version system:...2 What's new...2 Installation:...2 Perspectives...4 The workspace...4 The local database...6 Populate the local database...7 Gegenees
More informationmealybugs Documentation
mealybugs Documentation Release 1.0 Thierry Gosselin June 09, 2014 Contents 1 Computer hardware requirements 3 2 Getting prepared with files 5 3 Start Mothur 7 4 Reducing sequencing and PCR errors 9 5
More informationClick Trust to launch TableView.
Visualizing Expression data using the Co-expression Tool Web service and TableView Introduction. TableView was written by James (Jim) E. Johnson and colleagues at the University of Minnesota Center for
More informationTutorial for Windows and Macintosh Assembly Strategies
Tutorial for Windows and Macintosh Assembly Strategies 2017 Gene Codes Corporation Gene Codes Corporation 525 Avis Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074
More informationCopyright 2014 Regents of the University of Minnesota
Quality Control of Illumina Data using Galaxy Contents September 16, 2014 1 Introduction 2 1.1 What is Galaxy?..................................... 2 1.2 Galaxy at MSI......................................
More informationAdditional Alignments Plugin USER MANUAL
Additional Alignments Plugin USER MANUAL User manual for Additional Alignments Plugin 1.8 Windows, Mac OS X and Linux November 7, 2017 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej
More informationOmixon PreciseAlign CLC Genomics Workbench plug-in
Omixon PreciseAlign CLC Genomics Workbench plug-in User Manual User manual for Omixon PreciseAlign plug-in CLC Genomics Workbench plug-in (all platforms) CLC Genomics Server plug-in (all platforms) January
More informationTutorial for Windows and Macintosh SNP Hunting
Tutorial for Windows and Macintosh SNP Hunting 2010 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074
More informationMiSeq Reporter TruSight Tumor 15 Workflow Guide
MiSeq Reporter TruSight Tumor 15 Workflow Guide For Research Use Only. Not for use in diagnostic procedures. Introduction 3 TruSight Tumor 15 Workflow Overview 4 Reports 8 Analysis Output Files 9 Manifest
More informationITMO Ecole de Bioinformatique Hands-on session: smallrna-seq N. Servant 21 rd November 2013
ITMO Ecole de Bioinformatique Hands-on session: smallrna-seq N. Servant 21 rd November 2013 1. Data and objectives We will use the data from GEO (GSE35368, Toedling, Servant et al. 2011). Two samples were
More informationAgilent Genomic Workbench Lite Edition 6.5
Agilent Genomic Workbench Lite Edition 6.5 SureSelect Quality Analyzer User Guide For Research Use Only. Not for use in diagnostic procedures. Agilent Technologies Notices Agilent Technologies, Inc. 2010
More informationThe software comes with 2 installers: (1) SureCall installer (2) GenAligners (contains BWA, BWA- MEM).
Release Notes Agilent SureCall 4.0 Product Number G4980AA SureCall Client 6-month named license supports installation of one client and server (to host the SureCall database) on one machine. For additional
More informationDifferential Expression Analysis at PATRIC
Differential Expression Analysis at PATRIC The following step- by- step workflow is intended to help users learn how to upload their differential gene expression data to their private workspace using Expression
More informationmiscript mirna PCR Array Data Analysis v1.1 revision date November 2014
miscript mirna PCR Array Data Analysis v1.1 revision date November 2014 Overview The miscript mirna PCR Array Data Analysis Quick Reference Card contains instructions for analyzing the data returned from
More informationCopyright 2014 Regents of the University of Minnesota
Quality Control of Illumina Data using Galaxy August 18, 2014 Contents 1 Introduction 2 1.1 What is Galaxy?..................................... 2 1.2 Galaxy at MSI......................................
More informationVisual Analytics Tools for the Global Change Assessment Model. Michael Steptoe, Ross Maciejewski, & Robert Link Arizona State University
Visual Analytics Tools for the Global Change Assessment Model Michael Steptoe, Ross Maciejewski, & Robert Link Arizona State University GCAM Simulation When exploring the impact of various conditions or
More informationStep-by-Step Guide to Relatedness and Association Mapping Contents
Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...
More informationNext generation Confirmation (NGC) module
QUICK REFERENCE Next generation Confirmation (NGC) module Catalog Number A28221 Pub. No. MAN0015891 Rev. A.0 Product description The Applied Biosystems Next generation Confirmation (NGC) module analyzes
More informationTutorial for Windows and Macintosh SNP Hunting
Tutorial for Windows and Macintosh SNP Hunting 2017 Gene Codes Corporation Gene Codes Corporation 525 Avis Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074
More informationMetAmp: a tool for Meta-Amplicon analysis User Manual
November 12, 2014 MetAmp: a tool for Meta-Amplicon analysis User Manual Ilya Y. Zhbannikov 1, Janet E. Williams 1, James A. Foster 1,2,3 3 Institute for Bioinformatics and Evolutionary Studies, University
More informationData Visualisation with Google Fusion Tables
Data Visualisation with Google Fusion Tables Workshop Exercises Dr Luc Small 12 April 2017 1.5 Data Visualisation with Google Fusion Tables Page 1 of 33 1 Introduction Google Fusion Tables is shaping up
More informationOmega: an Overlap-graph de novo Assembler for Metagenomics
Omega: an Overlap-graph de novo Assembler for Metagenomics B a h l e l H a i d e r, Ta e - H y u k A h n, B r i a n B u s h n e l l, J u a n j u a n C h a i, A l e x C o p e l a n d, C h o n g l e Pa n
More informationRelease Notes. Version Gene Codes Corporation
Version 4.10.1 Release Notes 2010 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) +1.734.769.7249 (elsewhere) +1.734.769.7074 (fax) www.genecodes.com
More informationCLC Phylogeny Module User manual
CLC Phylogeny Module User manual User manual for Phylogeny Module 1.0 Windows, Mac OS X and Linux September 13, 2013 This software is for research purposes only. CLC bio Silkeborgvej 2 Prismet DK-8000
More informationCloVR-ITS: Automated ITS amplicon sequence analysis pipeline for the characterization of fungal communities standard operating procedure, version 1.
CloVR-ITS: Automated ITS amplicon sequence analysis pipeline for the characterization of fungal communities standard operating procedure, version 1.0 James Robert White, the CloVR team, Owen White, Samuel
More informationTruSight HLA Assign 2.1 RUO Software Guide
TruSight HLA Assign 2.1 RUO Software Guide For Research Use Only. Not for use in diagnostic procedures. Introduction 3 Computing Requirements and Compatibility 4 Installation 5 Getting Started 6 Navigating
More informationAnalyzing Genomic Data with NOJAH
Analyzing Genomic Data with NOJAH TAB A) GENOME WIDE ANALYSIS Step 1: Select the example dataset or upload your own. Two example datasets are available. Genome-Wide TCGA-BRCA Expression datasets and CoMMpass
More informationMolecular Identifier (MID) Analysis for TAM-ChIP Paired-End Sequencing
Molecular Identifier (MID) Analysis for TAM-ChIP Paired-End Sequencing Catalog Nos.: 53126 & 53127 Name: TAM-ChIP antibody conjugate Description Active Motif s TAM-ChIP technology combines antibody directed
More informationProjection with Public Data (PPD)
Projection with Public Data (PPD) Goal To compare users 16S rrna data with published datasets by processing and normalization them together, and projecting into 3D PCoA plot for visual comparative analysis
More informationNGS Data Visualization and Exploration Using IGV
1 What is Galaxy Galaxy for Bioinformaticians Galaxy for Experimental Biologists Using Galaxy for NGS Analysis NGS Data Visualization and Exploration Using IGV 2 What is Galaxy Galaxy for Bioinformaticians
More informationCLC Server Command Line Tools USER MANUAL
CLC Server Command Line Tools USER MANUAL Manual for CLC Server Command Line Tools 2.2 Windows, Mac OS X and Linux August 29, 2014 This software is for research purposes only. CLC bio, a QIAGEN Company
More informationObjective 1: To simulate the rolling of a die 100 times and to build a probability distribution.
Minitab Lab #2 Math 120 Nguyen 1 of 6 Objectives: 1) Simulate games of chance that have equally likely outcomes 2) Construct a binomial probability distribution and sketch a probability histogram 3) Calculate
More informationHow Do I Choose Which Type of Graph to Use?
How Do I Choose Which Type of Graph to Use? When to Use...... a Line graph. Line graphs are used to track changes over short and long periods of time. When smaller changes exist, line graphs are better
More informationNGS NEXT GENERATION SEQUENCING
NGS NEXT GENERATION SEQUENCING Paestum (Sa) 15-16 -17 maggio 2014 Relatore Dr Cataldo Senatore Dr.ssa Emilia Vaccaro Sanger Sequencing Reactions For given template DNA, it s like PCR except: Uses only
More informationStats fest Multivariate analysis. Multivariate analyses. Aims. Multivariate analyses. Objects. Variables
Stats fest 7 Multivariate analysis murray.logan@sci.monash.edu.au Multivariate analyses ims Data reduction Reduce large numbers of variables into a smaller number that adequately summarize the patterns
More informationmtdna Variant Processor v1.0 BaseSpace App Guide
mtdna Variant Processor v1.0 BaseSpace App Guide For Research Use Only. Not for use in diagnostic procedures. Introduction 3 Workflow Diagram 4 Workflow 5 Log In to BaseSpace 6 Set Analysis Parameters
More informationTask 2 Guidance (P2, P3, P4, M1, M2)
Task 2 Guidance (P2, P3, P4, M1, M2) P2 Make sure that your spreadsheet model meets the complex criteria and exhibits some aspects of complexity such as multiple worksheets (with links), complex formulae
More informationSupplementary Figure 1. Fast read-mapping algorithm of BrowserGenome.
Supplementary Figure 1 Fast read-mapping algorithm of BrowserGenome. (a) Indexing strategy: The genome sequence of interest is divided into non-overlapping 12-mers. A Hook table is generated that contains
More informationContents. About this Book...1 Audience... 1 Prerequisites... 1 Conventions... 2
Contents About this Book...1 Audience... 1 Prerequisites... 1 Conventions... 2 1 About SAS Sentiment Analysis Workbench...3 1.1 What Is SAS Sentiment Analysis Workbench?... 3 1.2 Benefits of Using SAS
More informationm6aviewer Version Documentation
m6aviewer Version 1.6.0 Documentation Contents 1. About 2. Requirements 3. Launching m6aviewer 4. Running Time Estimates 5. Basic Peak Calling 6. Running Modes 7. Multiple Samples/Sample Replicates 8.
More informationCS313 Exercise 4 Cover Page Fall 2017
CS313 Exercise 4 Cover Page Fall 2017 Due by the start of class on Thursday, October 12, 2017. Name(s): In the TIME column, please estimate the time you spent on the parts of this exercise. Please try
More informationEcological cluster analysis with Past
Ecological cluster analysis with Past Øyvind Hammer, Natural History Museum, University of Oslo, 2011-06-26 Introduction Cluster analysis is used to identify groups of items, e.g. samples with counts or
More informationDeployment Manual CLC WORKBENCHES
Deployment Manual CLC WORKBENCHES Manual for CLC Workbenches: deployment and technical information Windows, Mac OS X and Linux June 15, 2017 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej
More information