Bio wikis. Paolo Romano Bioinformatics, National Cancer Research Institute, Genova


2 Outline
o Wiki systems: aims and technologies
o Working with wikis: practical issues for setting up and contributing
o Specific aims and issues of wikis for biology
o Examples of wikis for biology and specific features and objectives

3 Aims of Wiki systems for biology
Specific aims of wikis for biology:
o Collaborative development and sharing of documentation
o Collaborative annotation of database contents (external to the database)
o Collaborative update of database contents

4 Aims: development of documentation
o One of the basic aims of wiki systems
o It allows users to collaboratively build and share:
  o procedures,
  o data,
  o experiences,
  o news,
  o various information
o Motivation: high-quality expertise and interest in special topics are distributed
o Objective: a high-quality Summa biologica on a given topic, equivalent to a sectorial encyclopedia

5 Aims: annotation of database contents
o Motivation: an extended and accurate curation of databases is extremely difficult
o Objective: allow users of resources to contribute their expertise, experiences, observations and results of experiments
o The contents of the database are therefore collaboratively annotated
o Users can control this extended curation and correct possible errors
o Databases are left unchanged

6 Aims: updating database contents
o Motivation: databases are hard to maintain and curate accurately
o Objective: update databases on the basis of users' contributions
o Problems:
  o how reliable are users' contributions?
  o how to capture annotations provided as text (wiki site)?
  o how to transfer this information into the database?
o Procedures should be implemented providing:
  o a way to assess users' contributions
  o a set of wiki pages created from the database
  o a tool to extract data from annotations
  o a tool for adding this data to the database
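The last two procedures (extracting data from annotations and adding it to the database) can be sketched as follows. This is a minimal illustration, not a description of any existing bio-wiki: the "field:: value" line convention, the function names and the sample record are all assumptions made for the example.

```python
import re

# Assumed convention: contributors write structured facts as "field:: value" lines.
FIELD_RE = re.compile(r"^\s*(\w+)::\s*(.+?)\s*$")


def extract_annotations(wiki_text):
    """Return {field: value} for every 'field:: value' line of a wiki page."""
    fields = {}
    for line in wiki_text.splitlines():
        match = FIELD_RE.match(line)
        if match:
            fields[match.group(1).lower()] = match.group(2)
    return fields


def apply_update(record, fields, trusted):
    """Return a copy of the record, updated only if the contributor is trusted."""
    updated = dict(record)
    if trusted:
        updated.update(fields)
    return updated


# Hypothetical wiki page and database record.
page = """Free text about the gene.
function:: DNA repair
Organism:: Homo sapiens
"""
record = {"id": "G1", "function": "unknown"}

fields = extract_annotations(page)
updated = apply_update(record, fields, trusted=True)
```

The `trusted` flag stands in for whatever assessment of contributions a real site would apply; the original record is never modified in place, so a rejected contribution leaves the database unchanged.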

7 Aims: a genome example
o GenBank can be thought of as an electronic library
o Submitters own the contents of their records
o GenBank cannot undergo extended annotation, re-annotation or error removal unless owners request it or agree
o As a consequence, GenBank is not up to date
o A wiki built by a community of experts could serve this aim:
  o re-annotation of records (sequence, function, ...),
  o links to documental information and sites,
  o up-to-date data
Salzberg SL: Genome re-annotation: a wiki solution? Genome Biology 2007, 8:102

8 Issues with Wiki systems for biology
o Authoritativeness of contributions and of sites: how to assess quality?
o Acknowledgement of users as a way to stimulate contributions: how to stimulate quality additions?
o Authorships management and reward: how to keep information on authors and assign these contributions a scientific production value?
o Special features for contents: how to manage the many, different data types?

9 Issues: authoritativeness
o Quality of contributions is essential
o Contributions by end users are usually considered inadequate
o Aim: achieve a quality of contributions comparable with professional annotation at service centers (EBI, ...)
o The example of Wikipedia's success may help (?)
o Possible solutions: peer evaluation of contributions, identification of users

10 Issues: acknowledgement
o Acknowledgement is needed to stimulate good contributions
o How to attract the best experts in the field?
o Which kind of reward can be assigned to the best contributors?
o Identification and citation: is this enough?
o Benefits (subscriptions to services, journals, ...)?

11 RNBIO participation
Italian Network of Oncology Bioinformatics web site
Based on Plone + Zwiki
22 members can create and edit pages
Only a small fraction did so, even for the simplest tasks
[Table: pages and users per content type (News, Events, Public docs, Newsletters, Private docs, Overall); values not recoverable]

12 Issues: authorships management
o Authorship assignment is a form of acknowledgement
o It contributes to defining the reliability of information, while giving authors due credit
o It enables peer review: authors can rate each other, and an automatic reputation system can be implemented
o It allows a knowledge base to evolve into a rigorous scientific tool via continual revision and peer review
o How to combine the collaborative (and altruistic) features of wiki systems with authorship?
Hoffmann R: A wiki for the life sciences where authorship matters, Nature Genetics
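As a rough illustration of what an automatic reputation system built on peer ratings could look like, the sketch below averages the scores each author receives from others. The 1 to 5 scale, the exclusion of self-ratings and all names are assumptions for the example, not features of any specific wiki.

```python
from collections import defaultdict


def reputation_scores(ratings):
    """Average the peer ratings received by each author.

    ratings: iterable of (rater, author, score) tuples; self-ratings are ignored.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for rater, author, score in ratings:
        if rater == author:
            continue  # an author cannot raise their own reputation
        totals[author] += score
        counts[author] += 1
    return {author: totals[author] / counts[author] for author in totals}


# Hypothetical ratings on an assumed 1-5 scale.
ratings = [
    ("ann", "bob", 4),
    ("cat", "bob", 2),
    ("bob", "bob", 5),
    ("bob", "ann", 5),
]
scores = reputation_scores(ratings)
```

A plain mean is the simplest possible choice; a real system would probably also weight ratings by the rater's own reputation or by the number of ratings received.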

13 Issues: features for different contents
o Textual information is only a small part of biological data
o Biological data types are numerous and heterogeneous, depending on the domain
o How to cope with the different data types?
  o images,
  o plots,
  o diagrams
o Adaptation of wiki systems is needed

14 Many bio-wikis available
Many sites already available:
o Brede Wiki - Results for neuroimaging studies
o Ecoliwiki - Comprehensive information on Escherichia coli
o GenWiki - Genealogy resource for German-speaking people
o GONUTS - GO Normal Usage Tracking System: a wiki-based GO term browser. Allows community curation of GO annotations for any gene.
o MetaBase - Biological databases
o Metagenes - Metagenomic DNA sequences
o PDBWiki - Macromolecular structures
From Gene Wiki portal

15 Many bio-wikis available (cont'd)
Many sites already available:
o Proteopedia - Annotation of protein structures and other biomolecules
o SNPedia - Genetic polymorphisms
o SubtiWiki - Genes of Bacillus subtilis
o TOPSAN - Annotation of protein structures
o WikiGenes - Gene wiki with authorship attribution
o WikiPathways - Curation of biological pathways
o WikiProteins - Protein wiki for structured data
o ZebrafishGenomeWiki - Community annotation of the Zebrafish genome
From Gene Wiki portal

16 Introduction to bio-wikis
A quick introduction to:
o Gene Wiki
o WikiGenes
o WikiPathways
o WikiProteins
See also tomorrow's keynote by Alex Bateman: RNA WikiProject: Community annotation of RNA families

17 Gene Wiki: references
o Gene Wiki
o Huss JW III, Orozco C, Goodale J, Wu C, Batalov S, et al. (2008) A Gene Wiki for Community Annotation of Gene Function. PLoS Biol 6(7): e

18 Gene Wiki: features and contents
o Contributions to Wikipedia as a specialized sub-section
o Aim: provide a quality article for every notable human gene in the most used online encyclopedia
o Some figures:
  o pages, visited millions of times
  o 86% of pages show up on the first page of Google (search by gene symbol)
  o 15,255 edits by 3,590 unique users in 2008
  o Average increase of 236 kb of text per month (27 research letters in Nature)


24 WikiGenes: references
o WikiGenes
o Hoffmann R: A wiki for the life sciences where authorship matters, Nature Genetics

25 WikiGenes: features
o Implements strict management of the authorship of contributions
o Even small changes (single words) are assigned to an author
o A web page exists for each user, listing expertise, publications, and contributions
o Authors can thus be evaluated and rated by peers
o Includes a friendly editor, allowing for the addition of specialized links
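Word-level authorship of this kind can be illustrated with a diff between successive revisions. The sketch below uses Python's standard difflib and is only an approximation of the idea; it is not WikiGenes' actual algorithm, and the revision history is invented.

```python
from difflib import SequenceMatcher


def attribute_words(revisions):
    """Credit each word of the latest text to the author who introduced it.

    revisions: list of (author, text) pairs, oldest first.
    Returns a list of (word, author) pairs for the latest revision.
    """
    words, owners = [], []
    for author, text in revisions:
        new_words = text.split()
        new_owners = []
        matcher = SequenceMatcher(a=words, b=new_words, autojunk=False)
        for tag, i1, i2, j1, j2 in matcher.get_opcodes():
            if tag == "equal":
                # Unchanged words keep their original author.
                new_owners.extend(owners[i1:i2])
            elif tag in ("replace", "insert"):
                # New or rewritten words are credited to the current author.
                new_owners.extend([author] * (j2 - j1))
            # Deleted words simply disappear from the attribution.
        words, owners = new_words, new_owners
    return list(zip(words, owners))


# Hypothetical revision history of one sentence.
history = [
    ("alice", "TP53 is a tumor suppressor gene"),
    ("bob", "TP53 is a key tumor suppressor gene"),
]
credited = attribute_words(history)
```

Here the single word added in the second revision is credited to its author, while every other word stays with the original contributor.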

26 WikiGenes: editor setup
o Chemical: allows annotation of terms as chemical compounds. Facilitates internal navigation and links to NCBI PubChem.
o Gene: allows annotation of terms as genes. Enables internal navigation and links to NCBI Gene, UniProt, SNPedia, ...
o MeSH: allows annotation of terms as MeSH terms. Facilitates unambiguous internal navigation.
o Reference: allows annotation of papers. Facilitates citation of articles listed in PubMed.
o Link: allows annotation of links to other pages. Enables linking of terms with articles in WikiGenes.
o External link: allows annotation of terms with URLs. Enables linking to external web sites.


38 WikiGenes: the movie
Enjoy a movie now

39 ArrayWiki: references
o ArrayWiki
o Stokes TH, Torrance JT, Li H, Wang MD: ArrayWiki: an enabling technology for sharing public m

40 ArrayWiki: features
An "intelligent" microarray repository that enables update of meta-data with the raw array data, and provides standardized archiving protocols
o provides a user-friendly knowledge management interface (MediaWiki)
o provides a user-curation capability through the familiar Wiki interface
o provides text-based searches across experiment meta-data and exposes data to search engine crawlers
o includes automated quality control processes (cacorrect) and novel visualization methods (BioPNG, Gel Plots), which provide extra information about data quality

41 ArrayWiki: statistics
Unites meta-data from multiple sources, with the following features:
o provides a user-friendly knowledge management interface (MediaWiki)
o provides a user-curation capability through the familiar Wiki interface
o provides text-based searches across experiment meta-data and exposes data to search engine crawlers
o includes automated quality control processes (cacorrect) and novel visualization methods (BioPNG, Gel Plots), which provide extra information about data quality


43 WikiPathways: references
o WikiPathways
o Pico AR, Kelder T, van Iersel MP, Hanspers K, Conklin BR, et al. (2008) WikiPathways: Pathway Editing for the People. PLoS Biol 6(7): e

44 WikiPathways: features
o A new model for pathway databases that enhances and complements ongoing efforts (KEGG, Reactome, Pathway Commons)
o Peer review, editorial curation, and maintenance assigned to the community
o Based on MediaWiki
o Additionally:
  o a graphical pathway editing tool
  o integrated databases (major gene, protein, and small-molecule systems)

45 WikiPathways: interacting with pathways
o One page per pathway, including: diagram, description, references, download options, version history, components
o Pathways can be edited (embedded pathway editor)
o The history of changes and the list of components are included, with links to external resources
o Users can monitor and undo changes
o Pathways can be searched by name, by included genes and proteins, and by text in descriptions and comments
o Pathways can be browsed by species name and by category (ontology-based)
o Pathways can be downloaded in many formats, including GPML
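Since GPML is an XML format, a downloaded pathway can be inspected with a standard XML parser. The sketch below lists the components of a pathway; the embedded fragment is a simplified, hand-written example, and the element and attribute names (Pathway, DataNode, TextLabel, Xref) are assumptions modeled on GPML that should be checked against a real download.

```python
import xml.etree.ElementTree as ET

# Simplified, hand-written GPML-like fragment (not a real WikiPathways file).
GPML = """<Pathway xmlns="http://pathvisio.org/GPML/2013a"
         Name="Example pathway" Organism="Homo sapiens">
  <DataNode TextLabel="TP53" Type="GeneProduct">
    <Xref Database="Entrez Gene" ID="7157"/>
  </DataNode>
  <DataNode TextLabel="MDM2" Type="GeneProduct">
    <Xref Database="Entrez Gene" ID="4193"/>
  </DataNode>
</Pathway>"""


def list_components(gpml_text):
    """Return (pathway name, [(label, database, identifier), ...])."""
    root = ET.fromstring(gpml_text)
    # Take the XML namespace from the root tag instead of hard-coding a version.
    ns = root.tag[: root.tag.index("}") + 1] if root.tag.startswith("{") else ""
    components = []
    for node in root.iter(ns + "DataNode"):
        xref = node.find(ns + "Xref")
        database = xref.get("Database") if xref is not None else None
        identifier = xref.get("ID") if xref is not None else None
        components.append((node.get("TextLabel"), database, identifier))
    return root.get("Name"), components


name, components = list_components(GPML)
```

Reading the namespace from the root element keeps the sketch independent of the schema version declared in a particular file.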


53 WikiProteins: references
o WikiProteins
o Mons B, Ashburner M, Chichester C, et al., Calling on a million minds for comm WikiProteins, Genome Biology 2008, 9:R

54 WikiProteins: the concept space
o Based on millions of biological concepts derived from UMLS, UniProtKB, IntAct, GO
o Implements the original knowlet technology to store biological concepts and their relationships in the concept space
o Filters make it possible to show sub-sections (semantic groups) as well as different types of relationships among concepts (strong only, all)
o The concept space can be converted into RDF/OWL and searched by SPARQL
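The idea of searching a concept space expressed as RDF triples can be illustrated without any RDF library. The sketch below is a toy pattern matcher in plain Python, not SPARQL and not the knowlet technology; the triples and the "concept:" and "rel:" prefixes are invented for the example.

```python
def match_triples(triples, subject=None, predicate=None, obj=None):
    """Return the triples matching a pattern; None acts as a wildcard,
    in the way an unbound variable does in a SPARQL triple pattern."""
    return [
        (s, p, o)
        for (s, p, o) in triples
        if subject in (None, s) and predicate in (None, p) and obj in (None, o)
    ]


# Hypothetical concept space as (subject, predicate, object) triples.
triples = [
    ("concept:TP53", "rel:interacts_with", "concept:MDM2"),
    ("concept:TP53", "rel:annotated_with", "concept:DNA_repair"),
    ("concept:MDM2", "rel:annotated_with", "concept:ubiquitination"),
]

partners = match_triples(
    triples, subject="concept:TP53", predicate="rel:interacts_with"
)
annotated = match_triples(triples, predicate="rel:annotated_with")
```

In a real deployment the triples would be loaded into a triple store and the same question would be written as a SPARQL query.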

55 WikiProteins: wiki system
o Unique wiki page per concept, with data derived from the concept space
o Concepts are identified on the fly and highlighted in the text
o Navigation is allowed through internal (concept-based) and external links
o Registered users are allowed to edit wiki pages
o Changes to data are evaluated and can be incorporated into databases
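On-the-fly identification of concepts can be approximated with a dictionary lookup over the page text. The sketch below is a minimal illustration under that assumption; the concept identifiers, the link syntax and the function name are hypothetical and do not reflect how WikiProteins is implemented.

```python
import re


def highlight_concepts(text, concepts):
    """Wrap every known concept term in a [[ID|term]] link.

    concepts: dict mapping a term to its concept identifier.
    Longer terms are tried first so multi-word concepts are not split.
    """
    terms = sorted(concepts, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(term) for term in terms) + r")\b",
        re.IGNORECASE,
    )
    lookup = {term.lower(): concept_id for term, concept_id in concepts.items()}

    def link(match):
        term = match.group(1)
        return "[[{}|{}]]".format(lookup[term.lower()], term)

    return pattern.sub(link, text)


# Hypothetical concept dictionary.
concepts = {"p53": "C001", "tumor suppressor": "C002"}
marked = highlight_concepts("p53 is a tumor suppressor.", concepts)
```

Matching is case-insensitive and keeps the original spelling of the term in the link text.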



More information

HsAgilentDesign db

HsAgilentDesign db HsAgilentDesign026652.db January 16, 2019 HsAgilentDesign026652ACCNUM Map Manufacturer identifiers to Accession Numbers HsAgilentDesign026652ACCNUM is an R object that contains mappings between a manufacturer

More information

The Role of Repositories and Journals in the Astronomy Research Lifecycle

The Role of Repositories and Journals in the Astronomy Research Lifecycle The Role of Repositories and Journals in the Astronomy Research Lifecycle Alberto Accomazzi NASA Astrophysics Data System Smithsonian Astrophysical Observatory http://ads.harvard.edu Astroinformatics 2010,

More information

Update: MIRIAM Registry and SBO

Update: MIRIAM Registry and SBO Update: MIRIAM Registry and SBO Nick Juty, EMBL-EBI 3rd Sept, 2011 Overview MIRIAM Registry MIRIAM Guidelines.. MIRIAM Registry content URIs (URN form), example Summary/current developments SBO Purpose

More information

BovineMine Documentation

BovineMine Documentation BovineMine Documentation Release 1.0 Deepak Unni, Aditi Tayal, Colin Diesh, Christine Elsik, Darren Hag Oct 06, 2017 Contents 1 Tutorial 3 1.1 Overview.................................................

More information

Structural Bioinformatics

Structural Bioinformatics Structural Bioinformatics Elucidation of the 3D structures of biomolecules. Analysis and comparison of biomolecular structures. Prediction of biomolecular recognition. Handles three-dimensional (3-D) structures.

More information

ProSystem fx Site Builder. enewsletters

ProSystem fx Site Builder. enewsletters ProSystem fx Site Builder enewsletters December 2011 Copyright 2010-2011, CCH INCORPORATED. A Wolters Kluwer business. All Rights Reserved. Material in this publication may not be reproduced or transmitted,

More information

Your Open Science and Research Publishing Platform. 1st SciShops Summer School

Your Open Science and Research Publishing Platform. 1st SciShops Summer School Your Open Science and Research Publishing Platform 1st SciShops Summer School to researchers? to Open Science? Personal / project / community profile Thematic / personal / project repositories Enriched

More information

> Semantic Web Use Cases and Case Studies

> Semantic Web Use Cases and Case Studies > Semantic Web Use Cases and Case Studies Case Study: The Semantic Web for the Agricultural Domain, Semantic Navigation of Food, Nutrition and Agriculture Journal Gauri Salokhe, Margherita Sini, and Johannes

More information

Elsevier publishing partner for the journals in Thailand

Elsevier publishing partner for the journals in Thailand Elsevier publishing partner for the journals in Thailand Monique Lamine, Senior Business Development Manager Innovation & Product Development, Elsevier journals Date: 31 th August 2012 Overview Publishing

More information

SELF-SERVICE SEMANTIC DATA FEDERATION

SELF-SERVICE SEMANTIC DATA FEDERATION SELF-SERVICE SEMANTIC DATA FEDERATION WE LL MAKE YOU A DATA SCIENTIST Contact: IPSNP Computing Inc. Chris Baker, CEO Chris.Baker@ipsnp.com (506) 721 8241 BIG VISION: SELF-SERVICE DATA FEDERATION Biomedical

More information

ArrayExpress and Expression Atlas: Mining Functional Genomics data

ArrayExpress and Expression Atlas: Mining Functional Genomics data and Expression Atlas: Mining Functional Genomics data Gabriella Rustici, PhD Functional Genomics Team EBI-EMBL gabry@ebi.ac.uk What is functional genomics (FG)? The aim of FG is to understand the function

More information

A Framework for BioCuration (part II)

A Framework for BioCuration (part II) A Framework for BioCuration (part II) Text Mining for the BioCuration Workflow Workshop, 3rd International Biocuration Conference Friday, April 17, 2009 (Berlin) Martin Krallinger Spanish National Cancer

More information

An overview of Graph Categories and Graph Primitives

An overview of Graph Categories and Graph Primitives An overview of Graph Categories and Graph Primitives Dino Ienco (dino.ienco@irstea.fr) https://sites.google.com/site/dinoienco/ Topics I m interested in: Graph Database and Graph Data Mining Social Network

More information

Supplementary Note 1: Considerations About Data Integration

Supplementary Note 1: Considerations About Data Integration Supplementary Note 1: Considerations About Data Integration Considerations about curated data integration and inferred data integration mentha integrates high confidence interaction information curated

More information

The genexplain platform. Workshop SW2: Pathway Analysis in Transcriptomics, Proteomics and Metabolomics

The genexplain platform. Workshop SW2: Pathway Analysis in Transcriptomics, Proteomics and Metabolomics The genexplain platform Workshop SW2: Pathway Analysis in Transcriptomics, Proteomics and Metabolomics Saturday, March 17, 2012 2 genexplain GmbH Am Exer 10b D-38302 Wolfenbüttel Germany E-mail: olga.kel-margoulis@genexplain.com,

More information

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide Bioinformatics Resources.

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide Bioinformatics Resources. 1 of 12 9/10/2003 11:15 AM Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide Bioinformatics Resources. When and Where---Wednesdays at 1pm Room 438

More information

BioMinT: Biological Text Mining EU FP5 Quality of Life Project

BioMinT: Biological Text Mining EU FP5 Quality of Life Project BioMinT: Biological Text Mining EU FP5 Quality of Life Project Dr. Dipl.-Ing. Österreichisches Forschungsinstitut für Artificial Intelligence Motivation Economic and business pressures are forcing drug

More information

Semantic Knowledge Discovery OntoChem IT Solutions

Semantic Knowledge Discovery OntoChem IT Solutions Semantic Knowledge Discovery OntoChem IT Solutions OntoChem IT Solutions GmbH Blücherstr. 24 06120 Halle (Saale) Germany Tel. +49 345 4780472 Fax: +49 345 4780471 mail: info(at)ontochem.com Get the Gold!

More information

Collaborative Ontology Development on the (Semantic) Web

Collaborative Ontology Development on the (Semantic) Web Collaborative Ontology Development on the (Semantic) Web Natalya F. Noy and Tania Tudorache Stanford Center for Biomedical Informatics Research Stanford University Stanford, CA 94305 {noy,tudorache}@stanford.edu

More information

Editing Pathway/Genome Databases

Editing Pathway/Genome Databases Editing Pathway/Genome Databases By Ron Caspi ron.caspi@sri.com This presentation can be found at http://bioinformatics.ai.sri.com/ptools/tutorial/sessions/ curation/curation of genes, enzymes and Pathways/

More information

A Semantic Model for Federated Queries Over a Normalized Corpus

A Semantic Model for Federated Queries Over a Normalized Corpus A Semantic Model for Federated Queries Over a Normalized Corpus Samuel Croset, Christoph Grabmüller, Dietrich Rebholz-Schuhmann 17 th March 2010, Hinxton EBI is an Outstation of the European Molecular

More information

Enhanced retrieval using semantic technologies:

Enhanced retrieval using semantic technologies: Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008

More information

This document contains information about the annotation workflow for the Full BioCreative interactive task.

This document contains information about the annotation workflow for the Full BioCreative interactive task. BioCreative IV-User Interactive Task RLIMS-P Annotation Task This document contains information about the annotation workflow for the Full BioCreative interactive task. Annotation Workflow using RLIMS-P

More information

National Centre for Text Mining NaCTeM. e-science and data mining workshop

National Centre for Text Mining NaCTeM. e-science and data mining workshop National Centre for Text Mining NaCTeM e-science and data mining workshop John Keane Co-Director, NaCTeM john.keane@manchester.ac.uk School of Informatics, University of Manchester What is text mining?

More information

Deliverable D5.5. D5.5 VRE-integrated PDBe Search and Query API. World-wide E-infrastructure for structural biology. Grant agreement no.

Deliverable D5.5. D5.5 VRE-integrated PDBe Search and Query API. World-wide E-infrastructure for structural biology. Grant agreement no. Deliverable D5.5 Project Title: World-wide E-infrastructure for structural biology Project Acronym: West-Life Grant agreement no.: 675858 Deliverable title: D5.5 VRE-integrated PDBe Search and Query API

More information

Semantic Annotation and Linking of Medical Educational Resources

Semantic Annotation and Linking of Medical Educational Resources 5 th European IFMBE MBEC, Budapest, September 14-18, 2011 Semantic Annotation and Linking of Medical Educational Resources N. Dovrolis 1, T. Stefanut 2, S. Dietze 3, H.Q. Yu 3, C. Valentine 3 & E. Kaldoudi

More information

Data Curation Profile Human Genomics

Data Curation Profile Human Genomics Data Curation Profile Human Genomics Profile Author Profile Author Institution Name Contact J. Carlson N. Brown Purdue University J. Carlson, jrcarlso@purdue.edu Date of Creation October 27, 2009 Date

More information

The Text Analytics Challenge BioCreative V - Extraction of causal network information in BEL

The Text Analytics Challenge BioCreative V - Extraction of causal network information in BEL The Text Analytics Challenge BioCreative V - Extraction of causal network information in BEL http://tinyurl.com/beltask Fabio Rinaldi Outline Biomedical text mining, motivation Competitive evaluations:

More information

Advances in Data Integration & Representation in Systems Biology

Advances in Data Integration & Representation in Systems Biology Advances in Data Integration & Representation in Systems Biology Susie Stephens Principal Product Manager, Life Sciences Oracle susie.stephens@oracle.com Outline Systems Biology Data Requirements Semantic

More information

Mendeley Help Guide. What is Mendeley? Mendeley is freemium software which is available

Mendeley Help Guide. What is Mendeley? Mendeley is freemium software which is available Mendeley Help Guide What is Mendeley? Mendeley is freemium software which is available Getting Started across a number of different platforms. You can run The first thing you ll need to do is to Mendeley

More information

Supporting Bioinformatic Experiments with A Service Query Engine

Supporting Bioinformatic Experiments with A Service Query Engine Supporting Bioinformatic Experiments with A Service Query Engine Xuan Zhou Shiping Chen Athman Bouguettaya Kai Xu CSIRO ICT Centre, Australia {xuan.zhou,shiping.chen,athman.bouguettaya,kai.xu}@csiro.au

More information

hgu133plus2.db December 11, 2017

hgu133plus2.db December 11, 2017 hgu133plus2.db December 11, 2017 hgu133plus2accnum Map Manufacturer identifiers to Accession Numbers hgu133plus2accnum is an R object that contains mappings between a manufacturer s identifiers and manufacturers

More information

Crossing the Archival Borders

Crossing the Archival Borders IST-Africa 2008 Conference Proceedings Paul Cunningham and Miriam Cunningham (Eds) IIMC International Information Management Corporation, 2008 ISBN: 978-1-905824-07-6 Crossing the Archival Borders Fredrik

More information

An Approach for Discovering and Exploring Semantic Relationships between Genes

An Approach for Discovering and Exploring Semantic Relationships between Genes An Approach for Discovering and Exploring Semantic Relationships between Genes Nicoletta Dessì, Matteo Pani, Barbara Pes, and Diego Reforgiato Recupero Università degli Studi di Cagliari, Dipartimento

More information

CACAO: literature-based functional annotation as an intercollegiate competition

CACAO: literature-based functional annotation as an intercollegiate competition CACAO: literature-based functional annotation as an intercollegiate competition ASM/JGI Functional Genomics Workshop Hiram College July 2011 Jim Hu PortEco project Texas A&M Univ. jimhu@tamu.edu ecoliwiki@gmail.com

More information