NCI Thesaurus, managing towards an ontology

Similar documents
Challenges in Deploying and Managing Large Terminologies: NCI Thesaurus

The National Cancer Institute's Thésaurus and Ontology

Collaborative Ontology Development on the (Semantic) Web

WHO ICD11 Wiki LexWiki, Semantic MediaWiki and the International Classification of Diseases

Semantic MediaWiki A Tool for Collaborative Vocabulary Development Harold Solbrig Division of Biomedical Informatics Mayo Clinic

Tania Tudorache Stanford University. - Ontolog forum invited talk04. October 2007

Acquiring Experience with Ontology and Vocabularies

Community-based ontology development, alignment, and evaluation. Natasha Noy Stanford Center for Biomedical Informatics Research Stanford University

Utilizing NCBO Tools to Develop & Use an ECG Ontology

SEMANTIC SUPPORT FOR MEDICAL IMAGE SEARCH AND RETRIEVAL

Languages and tools for building and using ontologies. Simon Jupp, James Malone

warwick.ac.uk/lib-publications

Interoperability and Semantics in Use- Application of UML, XMI and MDA to Precision Medicine and Cancer Research

WebProtégé. Protégé going Web. Tania Tudorache, Jennifer Vendetti, Natasha Noy. Stanford Center for Biomedical Informatics

Collaborative & WebProtégé

Open Ontology Repository Initiative

Ontrez Project Report National Center for Biomedical Ontology November, 2007

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

A method for recommending ontology alignment strategies

LexBIG/EVS API Overview

Prototyping a Biomedical Ontology Recommender Service

A Semantic Web-Based Approach for Harvesting Multilingual Textual. definitions from Wikipedia to support ICD-11 revision

A Framework for Ontology Evolution in Collaborative Environments

Comparing SNOMED CT and the NCI Thesaurus through Semantic Web Technologies

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria

A System for Ontology-Based Annotation of Biomedical Data

Knowledge Representations. How else can we represent knowledge in addition to formal logic?

Ontology Summit2007 Survey Response Analysis. Ken Baclawski Northeastern University

Ontology-based annotation of multiscale imaging data: Utilizing and building the Neuroscience Information Framework. Maryann E.

Using Ontologies for Data and Semantic Integration

Customisable Curation Workflows in Argo

Disease Information and Semantic Web

HL7 s Common Terminology Services Standard (CTS)

The Neuroscience Information Framework Practical experiences in using and building community ontologies

TIM: A Semantic Web Application for the Specification of Metadata Items in Clinical Research

SEEK User Manual. Introduction

Taxonomy Tools: Collaboration, Creation & Integration. Dow Jones & Company

Enabling complex queries to drug information sources through functional composition

Ontologies Growing Up: Tools for Ontology Management. Natasha Noy Stanford University

Semantic Representation and Querying of cabig Data Services

Balanced Large Scale Knowledge Matching Using LSH Forest

Protégé Plug-in Library: A Task-Oriented Tour

Linking Knowledge Organization Systems via Wikidata

Semantic Technologies and CDISC Standards. Frederik Malfait, Information Architect, IMOS Consulting Scott Bahlavooni, Independent

SELF-SERVICE SEMANTIC DATA FEDERATION

RxMix Enabling complex queries to drug information sources through functional composition

Introduction to ontologies

Analyzing user interactions with biomedical ontologies: A visual perspective

Medical Imaging on the Semantic Web: Annotation and Image Markup

ONTOLOGIES FOR BIOMEDICINE HOW TO MAKE

Terminology Harmonization

Core Technology Development Team Meeting

About the Edinburgh Pathway Editor:

SOA In Health Care Usage of Standard based Ontology Pavithra Kenjige PK Technologies July 12th, 2010

Interoperability of Protégé 2.0 beta and OilEd 3.5 in the Domain Knowledge of Osteoporosis

A Grid Middleware for Ontology Access

Semantic Web. Dr. Philip Cannata 1

A Knowledge-Based System for the Specification of Variables in Clinical Trials

LexGrid Philosophy, Model and Interfaces Harold R Solbrig Division of Biomedical Statistics and Informatics Mayo Clinic

A Protégé Ontology as The Core Component of a BioSense Message Analysis Framework

Ontology for Exploring Knowledge in C++ Language

Data Modeling and Harmonization with OWL: Opportunities and Lessons Learned

Reducing Consumer Uncertainty

Auditing Redundant Import in Reuse of a Top Level Ontology for the Drug Discovery Investigations Ontology (DDI)

Making Easy Things Easy and Lowering the Barrier to Entry in cabig with Semantic Web Technologies

ICT for Health Care and Life Sciences

Terminologies, Knowledge Organization Systems, Ontologies

Case Studies on Ontology Reuse

Learning from the Masters: Understanding Ontologies found on the Web

Semantic Knowledge Discovery OntoChem IT Solutions

Building Modular Ontologies and Specifying Ontology Joining, Binding, Localizing and Programming Interfaces in Ontologies Implemented in OWL

Protégé: Past, Present, and Future. Ray Fergerson Stanford

SKOS. COMP62342 Sean Bechhofer

UMLS-Query: A Perl Module for Querying the UMLS

Smart Open Services for European Patients. Work Package 3.5 Semantic Services Definition Appendix E - Ontology Specifications

Creating a Corporate Taxonomy. Internet Librarian November 2001 Betsy Farr Cogliano

MeSH: A Thesaurus for PubMed

Ontology Research Group Overview

Ontologies SKOS. COMP62342 Sean Bechhofer

Computing the Changes Between Ontologies

A methodology for biomedical ontology reuse

A Flexible Partitioning Tool for Large Ontologies

Mouse BIRN Data Integration. Maryann Martone Mouse All Hands Meeting

Course Microsoft Dynamics 365 Customization and Configuration with Visual Development (CRM)

Building the NNEW Weather Ontology

BIO-ONTOLOGIES: A KNOWLEDGE REPRESENTATION RESOURCE IN BIOINFORMATICS

Unstructured Text in Big Data The Elephant in the Room

Mark Musen and the NCBO Team

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Ontologies and The Earth System Grid

Apelon DTS on FHIR. John Gresh, Director of Product Development

NCBO Technology: Powering semantically aware applications

FIBO Metadata in Ontology Mapping

Ontology-Assisted Analysis of Web Queries to Determine the Knowledge Radiologists Seek

Standardization of (Imaging) Data Formats

Utilizing Semantic Web Technologies in Healthcare

The Model-Driven Semantic Web Emerging Standards & Technologies

Ontologies Guidelines for Best Practice

Ontology Engineering. CSE 595 Semantic Web Instructor: Dr. Paul Fodor Stony Brook University

Daniel L. Rubin and Kaustubh Supekar, Stanford University

Transcription:

NCI Thesaurus, managing towards an ontology CENDI/NKOS Workshop October 22, 2009 Gilberto Fragoso Outline Background on EVS The NCI Thesaurus BiomedGT Editing Plug-in for Protege Semantic Media Wiki supports collaborative development 1

NCI Enterprise Vocabulary Services Goal Integration by Meaning EVS provides services and resources that assists to: Integrate different conceptual frameworks for clinical, basic and translational research, Create terminological and taxonomic conventions across systems Controlled Terminology Products NCI Thesaurus an ontology-like cancer-centric controlled terminology NCI Metathesaurus maps biomedical vocabularies BiomedGT (Biomedical Grid Terminology - new) External vocabularies maintained and served: MedDRA, HL7, NDF-RT, LOINC, GO, Zebrafish, RadLex, etc. Further info, see: https://wiki.nci.nih.gov/display/evs/evs+wiki Products: NCI Thesaurus Reference Terminology for NCI, cabig, Partners Underpins cacore, cagrid semantics A Recommended Federal Standard Terminology for Anatomy Public domain, open content license 80,000 Concepts hierarchically organized into domains Broad coverage of the cancer research and clinical domain including prevention and treatment trials Neoplastic and other Diseases Findings and Abnormalities Anatomy, Tissues, Subcellular Structures Agents, Drugs, Chemicals Genes, Gene Products, Biological Processes Animal Models Mouse, other Research techniques and management, apparatus, clinical and lab, radiology, imagery 2

Products: NCI Thesaurus (2) Description-logic based Concept History Published Monthly Accessible via API, web browsers, downloadable files Transition to OWL begun in 03 Issues and Challenges Content many domains, users and uses Support collecting input from end users and collaborators Tracking Provenance Use of Properties standard terminology for properties (e.g. preferred_term vs term vs?) QA and Editing Consistency New use cases Query and reasoning against instance data on the Grid Federation of ontologies and subontologies 3

NCI Thesaurus Evaluation 2006-2007 Goals 1. Review and report of OBO criteria and relevant ISO standards for semantic quality and federation of terminologies, semantic quality and consistency. 2. Review content and structure for compliance 3. Document examples of how compliance would be achieved Among the Recommendations (Genesis of BiomedGT) 1) Unravel the vocabulary Partition into: Words and their definitions (Lexicon / Dictionary) Categorization and navigational nodes (Thesauri) Ontology 2) Identify external resources and: Include ability to reference general upper level ontologies Named relationships with other external resources 3) Enable collaboration: create environment where SME s can collaborate and discuss 4

Unraveling the Vocabulary owl:thing in BGT BFO BioTop BGT Ontology Nodes BGT Thesaurus Nodes BGT Word Nodes Reuse of Resources owl:thing 1) External Namespaces, Reference (GO, ChEBI, JAX) BFO BioTop BGT Thesaurus Nodes 2) External Namespaces, Modeled (CTCAE, NPO?) BGT Ontology Nodes 3) Internal Namespaces, Collaboratively Modeled (NPO?) 5

BGT Initial Development NCIt content used to seed BGT Categorization of all BGT concepts as O, N, C Eliminate NCIt-specific branches and concepts Partition concepts BFO, Thesaurus, and Words (ongoing) Import external vocabularies (ongoing) Refactor annotation properties (ongoing) name and format Refactor object properties (standby) Concept Removal and Federation Branch Removed NCI Thesaurus Entity Chemotherapy Regimen or Agent Combination Experimental Organism Anatomical Concept Initial Federation GO chebi Sequence types and features ontology Removal after Refactoring Biological Process Biochemical Pathway Organism Experimental Organism Disease Medical Device Component or Accessory Medical Product Usage and Evaluation Reportable Event UML Entity Business Rules Clinical or Research Facility Funding Information and Media Academic Degree NCBI Taxon Pathway Ontology Protein Modification PATO NCI Thesaurus Disease or Disorder Branch BFO Disease or Disorder Molecular_Abnormality Chemicals and Drugs 6

Partitioning of BGT Concepts Editing Tool Requirements Shared Data and Distributed Editing Reasoning GUI for Subject Matter Experts search and reporting facilities Editing Consistency basic content preferred and alternative terms, definition Complex Operations merge, split, retirement tied to history tracking Rule Enforcement, Edit Checks semantic type, no duplicate restrictions Support for Workflow, Editing Roles (manager, editor) 7

NCIEditTab NCIEditTab editing definitions and terms 8

NCIEditTab relations subtab NCIEditTab class expressions, editing 9

NCIEditTab other properties NCIEditTab tree panel in copy, split, and merge 10

NCIEditTab retirement NCIEditTab reporting 11

Lucene Query Tab Explanation Tab 12

Download site http://gforge.nci.nih.gov/frs/?group_id=174 SMW-based collaborative development 13

SMW- Protege Roundtrip Acknowledgements EVS Team Protégé/ NCI Protégé/SMW programmers Stanford BMIR staff Dionne Associates Clark & Parsia Mayo Clinic Production and QA staff Steve Hunter (Ekagra) M. Storey s group (UVic) Tracy Safran, Rob Wynne, John Park Editing Laura Roth Lori Whiteman Liz Hahn-Dantona and others (Lockheed Martin) NCI staff Frank Hartel Gilberto Fragoso Sherri de Coronado Margaret Haber Larry Wright 14