Guide for the EFI-Database (EFI-DB)

Size: px
Start display at page:

Download "Guide for the EFI-Database (EFI-DB)"

Transcription

1 Guide for the EFI-Database (EFI-DB) Use this guide to become familiar with the information available in the EFI experimental database, the EFI-DB. Helpful annotations are in yellow. 10/2011

2 About the EFI-DB 1 What is the EFI-DB? The EFI-DB serves as the EFI s public database of experimental data. The database stores details of all cloning, purification, and structure determination experiments, as well as the results of in vivo and in vitro analyses as they become publicly available. The information is presented in a dynamic, interactive format to allow one to quickly browse through all experimental data. The EFI-DB database contains a subset of the information stored in its companion database, the LabDB LIMS (Laboratory Information Management System). LabDB is used internally by the EFI staff to record vast amounts of data describing the experiments conducted in the center; in many cases LabDB interfaces directly with the equipment used by EFI. What is the EFI-DB used for? To locate EFI targets and track their experimental status. The experimental ( wet ) data for each target in EFI is also linked to a corresponding page in the Structure-Function Linkage Database (SFLD) with the bioinformatics ( dry ) data for that target. EFI-DB also provides data to other public databases, such as TargetDB, PepCDB, and the Protein Data Bank. How can I use the EFI-DB? - Browse and search the list of EFI targets - Display experimental track of any EFI target - Check if a protein of interest has a homologue among the EFI targets - View X-ray crystallography structures determined and annotated by EFI members - Download and view interactive structure descriptions, using ICM viewer technology from Molsoft LLC - See overall progress and statistics of the EFI project Who maintains EFI-DB? EFI-DB and LabDB are developed and maintained on the behalf of the Enzyme Function Initiative by the group of Prof. Wladek Minor at the University of Virginia. Contact: labdb@enzymefunction.org

3 Getting Started Web Page: EFI-DB Home 2 Website links are in red. EFI-DB menu EFI main website menu About News Use Guides Contact

4 Search EFI Targets Web Page: Homologue Search 3 Select data set, E value and provide sequence

5 Browse EFI Targets Web Page: Targets 4 Select menu items to reformat table below to display specific superfamily targets. Select to reformat table below: - Filter Targets by organism, species, keyword, stage, or lab - Jump to Target via EFI-DB, GI number, or locus tag - Filter Columns to display EFI-ID, superfamily, GI, locus tag, organism, gene name, description, length, homologue PDB, homologue PDB % identity, stage

6 Browse EFI Targets, cont. Web Page: Targets 5 Click EFI-ID to view individual target page Click GI to go to NCBI protein page Click organism to go to NCBI taxonomy page Click SFLD to view informatics page EFI Superfamily NCBI Description Selection Most Advanced Rationale Experimental Stage

7 Download EFI Data Web Page: Targets 6 Click to download EFI data for the targets represented in the table below.

8 EFI Target Information Web Page: Specific Target View (example) 7 Experimental tree where each box represents an experiment. Dark boxes lead to the most advanced stage. Click SFLD to view Informatics page General target information

9 EFI Target Information, cont. Web Page: Specific Target View (example) 8

10 EFI Structures Web Page: Structures 9 Click title to view individual structure page 9 Structures Click to go to external databases

11 EFI Structures, cont. Web Page: Specific Deposit View (example) 10 Click to go to view experimental tree 9 Structures crystallization details

12 EFI Structures, cont. Web Page: Electronic Structure Description 11 Click to view interactive structure visualizations (requires Molsoft download) 9 Structures Interactive poses

13 Progress Web Page: Progress 12 Select to reformat table and chart below by superfamily, organism, or species

14 Statistics Web Page: Statistics 13 Select to reformat table and chart below by superfamily, organism, or species Select menu items to view superfamilyspecific and structure pipeline statistics

15 Data Sharing Plan 14 1) Gathering and grouping of sequence similarity networks and other approaches for phylogenetics by the Superfamily/Genome Core Deposited in the SFLD database and will be available immediately (prior to publication) to the scientific community. 2) Purified proteins produced by the Protein Core The identities and progress of proteins in the pipeline will be available immediately (prior to publication) to the scientific community via the TargetDB and PepCDB databases. The purified proteins will be available only to the members of the EFI; the clones for the proteins will be available to the scientific community via the PSI-MR. 3) Three dimensional structures of target proteins by x ray crystallography in the Structure Core The structures determined by the structure core will be deposited immediately (prior to publication) in the PDB. This plan parallels that used by the NIGMS PSI-2 large scale centers for structure deposition. 4) Three dimensional structures of target proteins by homology modeling, in silico docking poses of virtual libraries of metabolites and high energy intermediates, and the resulting rank-ordered hit lists of predicted substrates by the Computation Core. The homology models and hit lists will be immediately available to all members of the EFI, but will not be made available to the scientific community until functional assignments and/or improvements in computational algorithms based on these results have been published. The computation core will provide web-based portals to enable the community to use the software for homology modeling, docking, and related tasks. All of the underlying software is freely available to non-profit institutions, and new software developed by the EFI will be open-source. 5) Focused libraries of ligands and substrates by the Bridging Projects The identities of these libraries will be provided on the EFI website; samples will be made available on request as quantities permit. The procedures for their syntheses will be provided via the EFI website; clones for any EFI-specific proteins required for library synthesis will be available from the PSI MR. 6) Results of library screening and detailed kinetic analyses by the Bridging Projects The results of focus library screening and detailed kinetic analyses will be immediately available to all members of the EFI, but will not be made available to the scientific community until functional assignments and/or improvements in computational algorithms based on these results have been published. 7) Results of phenotypic and metabolomic analyses by the Microbiology Core The results of phenotypic and metabolomic analyses will be immediately available to all members of the EFI, but will not be made available to the scientific community until functional assignments and/or improvements in computational algorithms based on these results have been published.

Viewing Molecular Structures

Viewing Molecular Structures Viewing Molecular Structures Proteins fulfill a wide range of biological functions which depend upon their three dimensional structures. Therefore, deciphering the structure of proteins has been the quest

More information

Tutorial. Docking School SAnDReS Tutorial Cyclin-Dependent Kinases with K i Information (Introduction)

Tutorial. Docking School SAnDReS Tutorial Cyclin-Dependent Kinases with K i Information (Introduction) Tutorial Docking School SAnDReS Tutorial Cyclin-Dependent Kinases with K i Information (Introduction) Prof. Dr. Walter Filgueira de Azevedo Jr. Laboratory of Computational Systems Biology azevedolab.net

More information

mpmorfsdb: A database of Molecular Recognition Features (MoRFs) in membrane proteins. Introduction

mpmorfsdb: A database of Molecular Recognition Features (MoRFs) in membrane proteins. Introduction mpmorfsdb: A database of Molecular Recognition Features (MoRFs) in membrane proteins. Introduction Molecular Recognition Features (MoRFs) are short, intrinsically disordered regions in proteins that undergo

More information

Tutorial: Using the SFLD and Cytoscape to Make Hypotheses About Enzyme Function for an Isoprenoid Synthase Superfamily Sequence

Tutorial: Using the SFLD and Cytoscape to Make Hypotheses About Enzyme Function for an Isoprenoid Synthase Superfamily Sequence Tutorial: Using the SFLD and Cytoscape to Make Hypotheses About Enzyme Function for an Isoprenoid Synthase Superfamily Sequence Requirements: 1. A web browser 2. The cytoscape program (available for download

More information

Structural Bioinformatics

Structural Bioinformatics Structural Bioinformatics Elucidation of the 3D structures of biomolecules. Analysis and comparison of biomolecular structures. Prediction of biomolecular recognition. Handles three-dimensional (3-D) structures.

More information

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services Enabling Open Science: Data Discoverability, Access and Use Jo McEntyre Head of Literature Services www.ebi.ac.uk About EMBL-EBI Part of the European Molecular Biology Laboratory International, non-profit

More information

biochem480 Autumn 2016 Bioinformatics Report pdf document with the title bioinfof16lastname_initial.pdf

biochem480 Autumn 2016 Bioinformatics Report pdf document with the title bioinfof16lastname_initial.pdf biochem480 Autumn 2016 Bioinformatics Report These are the instructions of how to complete your bioinformatics project Your final report, which is to be emailed to jcorkill@ewu.edu before 3pm on Friday

More information

Computer Lab, Session 1

Computer Lab, Session 1 Computer Lab, Session 1 1 Log in Please log in with username VSDD0xy where xy is your computer number ranging from 01, 02,, 20 2 Settings Open terminal In home directory (initial directory): cp /export/home/vsdd/vsdd001/.bashrc.

More information

Geneious 5.6 Quickstart Manual. Biomatters Ltd

Geneious 5.6 Quickstart Manual. Biomatters Ltd Geneious 5.6 Quickstart Manual Biomatters Ltd October 15, 2012 2 Introduction This quickstart manual will guide you through the features of Geneious 5.6 s interface and help you orient yourself. You should

More information

PSI Data Management Workshop July 10-11, 2003

PSI Data Management Workshop July 10-11, 2003 Data Management Workshop, July 10-11, 2003 PSI Data Management Workshop July 10-11, 2003 The National Institute of General Medical Sciences organized the first workshop on data management for the Protein

More information

The PDB and experimental data

The PDB and experimental data The PDB and experimental data John Westbrook Rutgers, The State University of New Jersey www.wwpdb.org Workshop on Metadata for raw data from X-ray diffraction and other structural techniques Overview

More information

User guide for GEM-TREND

User guide for GEM-TREND User guide for GEM-TREND 1. Requirements for Using GEM-TREND GEM-TREND is implemented as a java applet which can be run in most common browsers and has been test with Internet Explorer 7.0, Internet Explorer

More information

Integrating Data with Publications: Greater Interactivity and Challenges for Long-Term Preservation of the Scientific Record

Integrating Data with Publications: Greater Interactivity and Challenges for Long-Term Preservation of the Scientific Record Integrating Data with Publications: Greater Interactivity and Challenges for Long-Term Preservation of the Scientific Record Brian McMahon International Union of Crystallography 5 Abbey Square Chester

More information

How to use KAIKObase Version 3.1.0

How to use KAIKObase Version 3.1.0 How to use KAIKObase Version 3.1.0 Version3.1.0 29/Nov/2010 http://sgp2010.dna.affrc.go.jp/kaikobase/ Copyright National Institute of Agrobiological Sciences. All rights reserved. Outline 1. System overview

More information

biochem480 Autumn 2017 Introduction to Bioinformatics (Final version) Edit out any instructional details in orange italics. As always, style counts!

biochem480 Autumn 2017 Introduction to Bioinformatics (Final version) Edit out any instructional details in orange italics. As always, style counts! biochem480 Autumn 2017 Introduction to Bioinformatics (Final version) These are the instructions of how to complete your FINAL bioinformatics project Your report, as a Word or pdf is to be handed in on

More information

2. Take a few minutes to look around the site. The goal is to familiarize yourself with a few key components of the NCBI.

2. Take a few minutes to look around the site. The goal is to familiarize yourself with a few key components of the NCBI. 2 Navigating the NCBI Instructions Aim: To become familiar with the resources available at the National Center for Bioinformatics (NCBI) and the search engine Entrez. Instructions: Write the answers to

More information

BLAST Exercise 2: Using mrna and EST Evidence in Annotation Adapted by W. Leung and SCR Elgin from Annotation Using mrna and ESTs by Dr. J.

BLAST Exercise 2: Using mrna and EST Evidence in Annotation Adapted by W. Leung and SCR Elgin from Annotation Using mrna and ESTs by Dr. J. BLAST Exercise 2: Using mrna and EST Evidence in Annotation Adapted by W. Leung and SCR Elgin from Annotation Using mrna and ESTs by Dr. J. Buhler Prerequisites: BLAST Exercise: Detecting and Interpreting

More information

INTRODUCTION TO BIOINFORMATICS

INTRODUCTION TO BIOINFORMATICS Molecular Biology-2019 1 INTRODUCTION TO BIOINFORMATICS In this section, we want to provide a simple introduction to using the web site of the National Center for Biotechnology Information NCBI) to obtain

More information

Protein Data Bank: An open access resource enabling basic and applied research and education in biology and medicine

Protein Data Bank: An open access resource enabling basic and applied research and education in biology and medicine Protein Data Bank: An open access resource enabling basic and applied research and education in biology and medicine John Westbrook, Ph.D. RCSB PDB Data & Software Architect Lead Overview A bit of background

More information

Genome Browsers - The UCSC Genome Browser

Genome Browsers - The UCSC Genome Browser Genome Browsers - The UCSC Genome Browser Background The UCSC Genome Browser is a well-curated site that provides users with a view of gene or sequence information in genomic context for a specific species,

More information

User Manual. Ver. 3.0 March 19, 2012

User Manual. Ver. 3.0 March 19, 2012 User Manual Ver. 3.0 March 19, 2012 Table of Contents 1. Introduction... 2 1.1 Rationale... 2 1.2 Software Work-Flow... 3 1.3 New in GenomeGems 3.0... 4 2. Software Description... 5 2.1 Key Features...

More information

Extra-Homework Problem Set

Extra-Homework Problem Set Extra-Homework Problem Set => Will not be graded, but might be a good idea for self-study => Solutions are posted at the end of the problem set Your adviser asks you to find out about a so far unpublished

More information

Blast2GO User Manual. Blast2GO Ortholog Group Annotation May, BioBam Bioinformatics S.L. Valencia, Spain

Blast2GO User Manual. Blast2GO Ortholog Group Annotation May, BioBam Bioinformatics S.L. Valencia, Spain Blast2GO User Manual Blast2GO Ortholog Group Annotation May, 2016 BioBam Bioinformatics S.L. Valencia, Spain Contents 1 Clusters of Orthologs 2 2 Orthologous Group Annotation Tool 2 3 Statistics for NOG

More information

The beginning of this guide offers a brief introduction to the Protein Data Bank, where users can download structure files.

The beginning of this guide offers a brief introduction to the Protein Data Bank, where users can download structure files. Structure Viewers Take a Class This guide supports the Galter Library class called Structure Viewers. See our Classes schedule for the next available offering. If this class is not on our upcoming schedule,

More information

ClinVar. Jennifer Lee, PhD, NCBI/NLM/NIH ClinVar

ClinVar. Jennifer Lee, PhD, NCBI/NLM/NIH ClinVar ClinVar What is ClinVar ClinVar is a freely available, central archive for associating observed variation with supporting clinical and experimental evidence for a wide range of disorders. The database

More information

Homology Modeling FABP

Homology Modeling FABP Homology Modeling FABP Homology modeling is a technique used to approximate the 3D structure of a protein when no experimentally determined structure exists. It operates under the principle that protein

More information

Deliverable D5.5. D5.5 VRE-integrated PDBe Search and Query API. World-wide E-infrastructure for structural biology. Grant agreement no.

Deliverable D5.5. D5.5 VRE-integrated PDBe Search and Query API. World-wide E-infrastructure for structural biology. Grant agreement no. Deliverable D5.5 Project Title: World-wide E-infrastructure for structural biology Project Acronym: West-Life Grant agreement no.: 675858 Deliverable title: D5.5 VRE-integrated PDBe Search and Query API

More information

MetScape User Manual

MetScape User Manual MetScape 2.3.2 User Manual A Plugin for Cytoscape National Center for Integrative Biomedical Informatics July 2012 2011 University of Michigan This work is supported by the National Center for Integrative

More information

Complex Query Formulation Over Diverse Information Sources Using an Ontology

Complex Query Formulation Over Diverse Information Sources Using an Ontology Complex Query Formulation Over Diverse Information Sources Using an Ontology Robert Stevens, Carole Goble, Norman Paton, Sean Bechhofer, Gary Ng, Patricia Baker and Andy Brass Department of Computer Science,

More information

Using Protein Data Bank and Astex Viewer to Study Protein Structure

Using Protein Data Bank and Astex Viewer to Study Protein Structure Helsinki University of Technology S-114.500 The Basics of Cell Bio Systems 28 February 2005 Using Protein Data Bank and Astex Viewer to Study Protein Structure Teppo Valtonen ASN 50768A Contents 1.Introduction...3

More information

CONTENTS 1. Contents

CONTENTS 1. Contents BIANA Tutorial CONTENTS 1 Contents 1 Getting Started 6 1.1 Starting BIANA......................... 6 1.2 Creating a new BIANA Database................ 8 1.3 Parsing External Databases...................

More information

Software review. Biomolecular Interaction Network Database

Software review. Biomolecular Interaction Network Database Biomolecular Interaction Network Database Keywords: protein interactions, visualisation, biology data integration, web access Abstract This software review looks at the utility of the Biomolecular Interaction

More information

Retrieving factual data and documents using IMGT-ML in the IMGT information system

Retrieving factual data and documents using IMGT-ML in the IMGT information system Retrieving factual data and documents using IMGT-ML in the IMGT information system Authors : Chaume D. *, Combres K. *, Giudicelli V. *, Lefranc M.-P. * * Laboratoire d'immunogénétique Moléculaire, LIGM,

More information

HymenopteraMine Documentation

HymenopteraMine Documentation HymenopteraMine Documentation Release 1.0 Aditi Tayal, Deepak Unni, Colin Diesh, Chris Elsik, Darren Hagen Apr 06, 2017 Contents 1 Welcome to HymenopteraMine 3 1.1 Overview of HymenopteraMine.....................................

More information

Version 1.0 November2016 Hermes V1.8.2

Version 1.0 November2016 Hermes V1.8.2 Hermes in a Nutshell Version 1.0 November2016 Hermes V1.8.2 Table of Contents Hermes in a Nutshell... 1 Introduction... 2 Example 1. Visualizing and Editing the MLL1 fusion protein... 3 Setting Your Display...

More information

What do I do if my blast searches seem to have all the top hits from the same genus or species?

What do I do if my blast searches seem to have all the top hits from the same genus or species? What do I do if my blast searches seem to have all the top hits from the same genus or species? If the bacterial species you are using to annotate is clinically significant or of great research interest,

More information

Tutorial 1: Exploring the UCSC Genome Browser

Tutorial 1: Exploring the UCSC Genome Browser Last updated: May 12, 2011 Tutorial 1: Exploring the UCSC Genome Browser Open the homepage of the UCSC Genome Browser at: http://genome.ucsc.edu/ In the blue bar at the top, click on the Genomes link.

More information

Two Examples of Datanomic. David Du Digital Technology Center Intelligent Storage Consortium University of Minnesota

Two Examples of Datanomic. David Du Digital Technology Center Intelligent Storage Consortium University of Minnesota Two Examples of Datanomic David Du Digital Technology Center Intelligent Storage Consortium University of Minnesota Datanomic Computing (Autonomic Storage) System behavior driven by characteristics of

More information

Topics of the talk. Biodatabases. Data types. Some sequence terminology...

Topics of the talk. Biodatabases. Data types. Some sequence terminology... Topics of the talk Biodatabases Jarno Tuimala / Eija Korpelainen CSC What data are stored in biological databases? What constitutes a good database? Nucleic acid sequence databases Amino acid sequence

More information

Re-dock of Roscovitine Against Human Cyclin-Dependent Kinase 2 with Molegro Virtual Docker

Re-dock of Roscovitine Against Human Cyclin-Dependent Kinase 2 with Molegro Virtual Docker Tutorial Re-dock of Roscovitine Against Human Cyclin-Dependent Kinase 2 with Molegro Virtual Docker Prof. Dr. Walter Filgueira de Azevedo Jr. walter@azevedolab.net azevedolab.net 1 Introduction In this

More information

Depositing small-angle scattering data and models to the Small-Angle Scattering Biological Data Bank (SASBDB).

Depositing small-angle scattering data and models to the Small-Angle Scattering Biological Data Bank (SASBDB). Depositing small-angle scattering data and models to the Small-Angle Scattering Biological Data Bank (SASBDB). Introduction. The following guide provides a basic outline of the minimum requirements necessary

More information

Introduction to Bioinformatics Online Course: IBT

Introduction to Bioinformatics Online Course: IBT Introduction to Bioinformatics Online Course: IBT Multiple Sequence Alignment Building Multiple Sequence Alignment Lec2 Choosing the Right Sequences Choosing the Right Sequences Before you build your alignment,

More information

Wilson Leung 01/03/2018 An Introduction to NCBI BLAST. Prerequisites: Detecting and Interpreting Genetic Homology: Lecture Notes on Alignment

Wilson Leung 01/03/2018 An Introduction to NCBI BLAST. Prerequisites: Detecting and Interpreting Genetic Homology: Lecture Notes on Alignment An Introduction to NCBI BLAST Prerequisites: Detecting and Interpreting Genetic Homology: Lecture Notes on Alignment Resources: The BLAST web server is available at https://blast.ncbi.nlm.nih.gov/blast.cgi

More information

KaPPA-View 4. Manual for Beginners. ver Kazusa DNA Research Institute. The Kazusa Plant Pathway Viewer, Version 4.0

KaPPA-View 4. Manual for Beginners. ver Kazusa DNA Research Institute. The Kazusa Plant Pathway Viewer, Version 4.0 KaPPA-View 4 The Kazusa Plant Pathway Viewer, Version 4.0 Manual for Beginners ver. 1.2 Kazusa DNA Research Institute Table of Contents Table of Contents 1. Introduction... 1 1-1. Overview of KaPPA-View4...

More information

Petroleum User Group Meeting, April 2006 Houston, TX. Leveraging Semantic Technology for Improved Enterprise Search and Knowledge Discovery

Petroleum User Group Meeting, April 2006 Houston, TX. Leveraging Semantic Technology for Improved Enterprise Search and Knowledge Discovery Petroleum User Group Meeting, April 2006 Houston, TX Leveraging Semantic Technology for Improved Enterprise Search and Knowledge Discovery Petroleum User Group Meeting, April 2006 Houston, TX OR GIS as

More information

INTRODUCTION TO BIOINFORMATICS

INTRODUCTION TO BIOINFORMATICS Molecular Biology-2017 1 INTRODUCTION TO BIOINFORMATICS In this section, we want to provide a simple introduction to using the web site of the National Center for Biotechnology Information NCBI) to obtain

More information

Bioinformatics explained: BLAST. March 8, 2007

Bioinformatics explained: BLAST. March 8, 2007 Bioinformatics Explained Bioinformatics explained: BLAST March 8, 2007 CLC bio Gustav Wieds Vej 10 8000 Aarhus C Denmark Telephone: +45 70 22 55 09 Fax: +45 70 22 55 19 www.clcbio.com info@clcbio.com Bioinformatics

More information

RDF friendly Chemical Taxonomies for Semantic Web (Using ORACLE/MySQL

RDF friendly Chemical Taxonomies for Semantic Web (Using ORACLE/MySQL RDF friendly Chemical Taxonomies for Semantic Web (Using ORACLE/MySQL MySQL) Downloads T.N.Bhat Bhat*, J. Barkley NIST, Gaithersburg USA bhat@nist.gov Query 3-D data Query 2-D data Prasanna MD, Vondrasek

More information

Lezione 13. Bioinformatica. Mauro Ceccanti e Alberto Paoluzzi

Lezione 13. Bioinformatica. Mauro Ceccanti e Alberto Paoluzzi Lezione 13 Bioinformatica Mauro Ceccanti e Alberto Paoluzzi Dip. Informatica e Automazione Università Roma Tre Dip. Medicina Clinica Università La Sapienza Lecture 13: Alignment of sequences Sequence alignment

More information

CAP BIOINFORMATICS Su-Shing Chen CISE. 8/19/2005 Su-Shing Chen, CISE 1

CAP BIOINFORMATICS Su-Shing Chen CISE. 8/19/2005 Su-Shing Chen, CISE 1 CAP 5510-2 BIOINFORMATICS Su-Shing Chen CISE 8/19/2005 Su-Shing Chen, CISE 1 Building Local Genomic Databases Genomic research integrates sequence data with gene function knowledge. Gene ontology to represent

More information

Virginia Bioinformatics Institute. Extension / Research IT Network DIVERSE API

Virginia Bioinformatics Institute. Extension / Research IT Network DIVERSE API Homeland Security Workshop on Information Analysis, Synthesis, and Cybersecurity Florida State University and Oak Ridge National Laboratories July 18 th, 2003 Tallahassee, FL Presentation contacts: Peter

More information

Protein Data Bank Japan

Protein Data Bank Japan Protein Data Bank Japan http://www.pdbj.org/ PDBj Today gene information for many species is just at the point of being revealed. To make use of this information, it is necessary to look at the proteins

More information

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology 9/9/ I9 Introduction to Bioinformatics, Clustering algorithms Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Outline Data mining tasks Predictive tasks vs descriptive tasks Example

More information

The data explosion. and the need to manage diverse data sources in scientific research. Simon Coles

The data explosion. and the need to manage diverse data sources in scientific research. Simon Coles The data explosion and the need to manage diverse data sources in scientific research Simon Coles (s.j.coles@soton.ac.uk) Director, UK National Crystallography Service Why manage? Volume Day to day coping

More information

BHSAI Biotechnology HPC Software Applications Institute

BHSAI Biotechnology HPC Software Applications Institute BHSAI Biotechnology HPC Software Applications Institute QuartetS-DB An Orthology Database for Species User s Guide May 0 The QuartetS database (QuartetS-DB) contains orthology predictions for species (

More information

ESG: Extended Similarity Group Job Submission

ESG: Extended Similarity Group Job Submission ESG: Extended Similarity Group Job Submission Cite: Meghana Chitale, Troy Hawkins, Changsoon Park, & Daisuke Kihara ESG: Extended similarity group method for automated protein function prediction, Bioinformatics,

More information

Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services. Patrick Wendel Imperial College, London

Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services. Patrick Wendel Imperial College, London Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services Patrick Wendel Imperial College, London Data Mining and Exploration Middleware for Distributed and Grid Computing,

More information

MetaStorm: User Manual

MetaStorm: User Manual MetaStorm: User Manual User Account: First, either log in as a guest or login to your user account. If you login as a guest, you can visualize public MetaStorm projects, but can not run any analysis. To

More information

Mismatch String Kernels for SVM Protein Classification

Mismatch String Kernels for SVM Protein Classification Mismatch String Kernels for SVM Protein Classification by C. Leslie, E. Eskin, J. Weston, W.S. Noble Athina Spiliopoulou Morfoula Fragopoulou Ioannis Konstas Outline Definitions & Background Proteins Remote

More information

Introduction to Hermes

Introduction to Hermes Introduction to Hermes Version 2.0 November 2017 Hermes v1.9 Table of Contents Introduction... 2 Visualising and Editing the MLL1 fusion protein... 2 Opening Files in Hermes... 3 Setting Style Preferences...

More information

Human Disease Models Tutorial

Human Disease Models Tutorial Mouse Genome Informatics www.informatics.jax.org The fundamental mission of the Mouse Genome Informatics resource is to facilitate the use of mouse as a model system for understanding human biology and

More information

E. coli functional genotyping: predicting phenotypic traits from whole genome sequences

E. coli functional genotyping: predicting phenotypic traits from whole genome sequences BioNumerics Tutorial: E. coli functional genotyping: predicting phenotypic traits from whole genome sequences 1 Aim In this tutorial we will screen genome sequences of Escherichia coli samples for phenotypic

More information

Molecular docking tutorial

Molecular docking tutorial Molecular docking tutorial Sulfonamide-type D-Glu inhibitor docked into the MurD active site using ArgusLab In this tutorial [1] you will learn how to prepare and run molecular docking calculations using

More information

2) NCBI BLAST tutorial This is a users guide written by the education department at NCBI.

2) NCBI BLAST tutorial   This is a users guide written by the education department at NCBI. Web resources -- Tour. page 1 of 8 This is a guided tour. Any homework is separate. In fact, this exercise is used for multiple classes and is publicly available to everyone. The entire tour will take

More information

MDA Blast2GO Exercises

MDA Blast2GO Exercises MDA 2011 - Blast2GO Exercises Ana Conesa and Stefan Götz March 2011 Bioinformatics and Genomics Department Prince Felipe Research Center Valencia, Spain Contents 1 Annotate 10 sequences with Blast2GO 2

More information

MetaPhyler Usage Manual

MetaPhyler Usage Manual MetaPhyler Usage Manual Bo Liu boliu@umiacs.umd.edu March 13, 2012 Contents 1 What is MetaPhyler 1 2 Installation 1 3 Quick Start 2 3.1 Taxonomic profiling for metagenomic sequences.............. 2 3.2

More information

PDB-Metrics: a Web tool for exploring the PDB contents

PDB-Metrics: a Web tool for exploring the PDB contents PDB-Metrics: a Web tool for exploring the PDB contents 333 PDB-Metrics: a Web tool for exploring the PDB contents Renato Fileto, Paula R. Kuser, Michel E.B. Yamagishi, André A. Ribeiro, Thiago G. Quinalia,

More information

Introduction to the Protein Data Bank Master Chimie Info Roland Stote Page #

Introduction to the Protein Data Bank Master Chimie Info Roland Stote Page # Introduction to the Protein Data Bank Master Chimie Info - 2009 Roland Stote The purpose of the Protein Data Bank is to collect and organize 3D structures of proteins, nucleic acids, protein-nucleic acid

More information

BIOINFORMATICS A PRACTICAL GUIDE TO THE ANALYSIS OF GENES AND PROTEINS

BIOINFORMATICS A PRACTICAL GUIDE TO THE ANALYSIS OF GENES AND PROTEINS BIOINFORMATICS A PRACTICAL GUIDE TO THE ANALYSIS OF GENES AND PROTEINS EDITED BY Genome Technology Branch National Human Genome Research Institute National Institutes of Health Bethesda, Maryland B. F.

More information

CACAO Training. Jim Hu and Suzi Aleksander Spring 2016

CACAO Training. Jim Hu and Suzi Aleksander Spring 2016 CACAO Training Jim Hu and Suzi Aleksander Spring 2016 1 What is CACAO? Community Assessment of Community Annotation with Ontologies (CACAO) Annotation of gene function Competition Within a class Between

More information

Bioinformatics Database Worksheet

Bioinformatics Database Worksheet Bioinformatics Database Worksheet (based on http://www.usm.maine.edu/~rhodes/goodies/matics.html) Where are the opsin genes in the human genome? Point your browser to the NCBI Map Viewer at http://www.ncbi.nlm.nih.gov/mapview/.

More information

Build Scientific Computing Infrastructure with Rebar3 and Docker. Eric Sage

Build Scientific Computing Infrastructure with Rebar3 and Docker. Eric Sage Build Scientific Computing Infrastructure with Rebar3 and Docker Eric Sage A scientific telecommunications network Hello, I d like an automated gene ontology please! Agenda - An example biological service

More information

Bioinformatics Hubs on the Web

Bioinformatics Hubs on the Web Bioinformatics Hubs on the Web Take a class The Galter Library teaches a related class called Bioinformatics Hubs on the Web. See our Classes schedule for the next available offering. If this class is

More information

Automatic annotation in UniProtKB using UniRule, and Complete Proteomes. Wei Mun Chan

Automatic annotation in UniProtKB using UniRule, and Complete Proteomes. Wei Mun Chan Automatic annotation in UniProtKB using UniRule, and Complete Proteomes Wei Mun Chan Talk outline Introduction to UniProt UniProtKB annotation and propagation Data increase and the need for Automatic Annotation

More information

Product Application Focus LabVelocity : Online Tools for Life Science Products, Protocols, Technical Information, MEDLINE

Product Application Focus LabVelocity : Online Tools for Life Science Products, Protocols, Technical Information, MEDLINE Product Application Focus LabVelocity : Online Tools for Life Science Products, Protocols, Technical Information, MEDLINE Searches, and Laboratory Calculations BioTechniques 30:1310-1315 (June 2001) David

More information

Mapping Sequence Conservation onto Structures with Chimera

Mapping Sequence Conservation onto Structures with Chimera This page: www.rbvi.ucsf.edu/chimera/data/tutorials/systems/outline.html Chimera in BP205A BP205A syllabus Mapping Sequence Conservation onto Structures with Chimera Case 1: You already have a structure

More information

BLAST, Profile, and PSI-BLAST

BLAST, Profile, and PSI-BLAST BLAST, Profile, and PSI-BLAST Jianlin Cheng, PhD School of Electrical Engineering and Computer Science University of Central Florida 26 Free for academic use Copyright @ Jianlin Cheng & original sources

More information

Life Sciences Oracle Based Solutions. June 2004

Life Sciences Oracle Based Solutions. June 2004 Life Sciences Oracle Based Solutions June 2004 Overview of Accelrys Leading supplier of computation tools to the life science and informatics research community: Bioinformatics Cheminformatics Modeling/Simulation

More information

A First Introduction to Scientific Visualization Geoffrey Gray

A First Introduction to Scientific Visualization Geoffrey Gray Visual Molecular Dynamics A First Introduction to Scientific Visualization Geoffrey Gray VMD on CIRCE: On the lower bottom left of your screen, click on the window start-up menu. In the search box type

More information

CLC Sequence Viewer 6.5 Windows, Mac OS X and Linux

CLC Sequence Viewer 6.5 Windows, Mac OS X and Linux CLC Sequence Viewer Manual for CLC Sequence Viewer 6.5 Windows, Mac OS X and Linux January 26, 2011 This software is for research purposes only. CLC bio Finlandsgade 10-12 DK-8200 Aarhus N Denmark Contents

More information

A Protocol for Maintaining Multidatabase Referential Integrity. Articial Intelligence Center. SRI International, EJ229

A Protocol for Maintaining Multidatabase Referential Integrity. Articial Intelligence Center. SRI International, EJ229 A Protocol for Maintaining Multidatabase Referential Integrity Peter D. Karp Articial Intelligence Center SRI International, EJ229 333 Ravenswood Ave. Menlo Park, CA 94025 voice: 415-859-6375 fax: 415-859-3735

More information

Machine Learning Techniques for Bacteria Classification

Machine Learning Techniques for Bacteria Classification Machine Learning Techniques for Bacteria Classification Massimo La Rosa Riccardo Rizzo Alfonso M. Urso S. Gaglio ICAR-CNR University of Palermo Workshop on Hardware Architectures Beyond 2020: Challenges

More information

Tutorial. Docking School SAnDReS Tutorial Cyclin-Dependent Kinases with K i Information (Scoring Function Analysis)

Tutorial. Docking School SAnDReS Tutorial Cyclin-Dependent Kinases with K i Information (Scoring Function Analysis) Tutorial Docking School SAnDReS Tutorial Cyclin-Dependent Kinases with K i Information (Scoring Function Analysis) Prof. Dr. Walter Filgueira de Azevedo Jr. Laboratory of Computational Systems Biology

More information

Docking Study with HyperChem Release Notes

Docking Study with HyperChem Release Notes Docking Study with HyperChem Release Notes This document lists additional information about Docking Study with HyperChem family, Essential, Premium Essential, Professional, Advanced, Ultimat, and Cluster.

More information

Applying Parallel Computing to Quickly Find the Solution for Marginal-Quality Data

Applying Parallel Computing to Quickly Find the Solution for Marginal-Quality Data Applying Parallel Computing to Quickly Find the Solution for Marginal-Quality Data Zheng-Qing Fu SERCAT, APS, Argonne National Lab., Argonne, IL 60439 University Of Georgia, Athens, GA 30602 NECAT Workshop,

More information

Finding data. HMMER Answer key

Finding data. HMMER Answer key Finding data HMMER Answer key HMMER input is prepared using VectorBase ClustalW, which runs a Java application for the graphical representation of the results. If you get an error message that blocks this

More information

PBSI-EHR Off the Charts!

PBSI-EHR Off the Charts! Stage 2 Meaningful Use Measure #27 & 28 Timely Access A & B OBJECTIVE: MEASURE A: Provide patients the ability to view online, download and transmit their health information within four business days of

More information

Integration in the 21 st -Century Enterprise. Thomas Blackadar American Chemical Society Meeting New York, September 10, 2003

Integration in the 21 st -Century Enterprise. Thomas Blackadar American Chemical Society Meeting New York, September 10, 2003 Integration in the 21 st -Century Enterprise Thomas Blackadar American Chemical Society Meeting New York, September 10, 2003 The Integration Bill of Rights Integrate = to form, coordinate, or blend into

More information

LinkDB: A Database of Cross Links between Molecular Biology Databases

LinkDB: A Database of Cross Links between Molecular Biology Databases LinkDB: A Database of Cross Links between Molecular Biology Databases Susumu Goto, Yutaka Akiyama, Minoru Kanehisa Institute for Chemical Research, Kyoto University Introduction We have developed a molecular

More information

Genome Browsers Guide

Genome Browsers Guide Genome Browsers Guide Take a Class This guide supports the Galter Library class called Genome Browsers. See our Classes schedule for the next available offering. If this class is not on our upcoming schedule,

More information

Blast2GO PRO Plugin for Geneious User Manual

Blast2GO PRO Plugin for Geneious User Manual Blast2GO PRO Plugin for Geneious User Manual Geneious 8.0 Version 1.0 October 2015 BioBam Bioinformatics S.L. Valencia, Spain Contents Introduction 2 1.1 Blast2GO methodology................................

More information

Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data (

Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data ( Bioqueries: A Social Community Sharing Experiences while Querying Biological Linked Data (http://bioqueries.uma.es) María Jesús García-Godoy, Ismael Navas-Delgado, José Francisco Aldana Montes Computing

More information

User Guide for DNAFORM Clone Search Engine

User Guide for DNAFORM Clone Search Engine User Guide for DNAFORM Clone Search Engine Document Version: 3.0 Dated from: 1 October 2010 The document is the property of K.K. DNAFORM and may not be disclosed, distributed, or replicated without the

More information

Requirements for data catalogues within facilities

Requirements for data catalogues within facilities Requirements for data catalogues within facilities Milan Prica 1, George Kourousias 1, Alistair Mills 2, Brian Matthews 2 1 Sincrotrone Trieste S.C.p.A, Trieste, Italy 2 Scientific Computing Department,

More information

Querying Multiple Bioinformatics Information Sources: Can Semantic Web Research Help?

Querying Multiple Bioinformatics Information Sources: Can Semantic Web Research Help? Querying Multiple Bioinformatics Information Sources: Can Semantic Web Research Help? David Buttler, Matthew Coleman 1, Terence Critchlow 1, Renato Fileto, Wei Han, Ling Liu, Calton Pu, Daniel Rocco, Li

More information

CSE182 Class project: An EST database of H. medicinalis

CSE182 Class project: An EST database of H. medicinalis CSE182 Class project: An EST database of H. medicinalis October 15, 2006 1 Introduction to Hirudo Hirudo medicinalis (medicinal leech is organism with historical medical as well contemporary relvance as

More information

AMNH Gerstner Scholars in Bioinformatics & Computational Biology Application Instructions

AMNH Gerstner Scholars in Bioinformatics & Computational Biology Application Instructions PURPOSE AMNH Gerstner Scholars in Bioinformatics & Computational Biology Application Instructions The seeks highly qualified applicants for its Gerstner postdoctoral fellowship program in Bioinformatics

More information

How Can a Warehouse Manage Your Wheat-(Data)?

How Can a Warehouse Manage Your Wheat-(Data)? International Wheat Innovation Workshop 16 th - 17 th November 2015 Clermont-Ferrand, France How Can a Warehouse Manage Your Wheat-(Data)? Uwe Scholz IPK Gatersleben IPK - Leibniz Institute of Plant Genetics

More information

Wilson Leung 05/27/2008 A Simple Introduction to NCBI BLAST

Wilson Leung 05/27/2008 A Simple Introduction to NCBI BLAST A Simple Introduction to NCBI BLAST Prerequisites: Detecting and Interpreting Genetic Homology: Lecture Notes on Alignment Resources: The BLAST web server is available at http://www.ncbi.nih.gov/blast/

More information

Lecture 5 Advanced BLAST

Lecture 5 Advanced BLAST Introduction to Bioinformatics for Medical Research Gideon Greenspan gdg@cs.technion.ac.il Lecture 5 Advanced BLAST BLAST Recap Sequence Alignment Complexity and indexing BLASTN and BLASTP Basic parameters

More information