WheatIS: Progress report

WheatIS Annual meeting, San Diego, 9 January 2015

WheatIS data submission

DSpace
Beta version to test: http://urgi.versailles.inra.fr/xmlui/
Submission formats available at the moment:
  Genotyping
  Phenotyping
  SNP discovery
An account has been sent to each EWG member:
  You can test freely; the test data will be deleted.
Please send feedback to michael.alaux@versailles.inra.fr

DSpace
To do:
  Developments based on feedback
  Add new formats

DSpace
Phenotyping data submission demo by Thomas: http://urgi.versailles.inra.fr/xmlui/

WheatIS distributed search tool

Solr search
Beta version to test: https://urgi.versailles.inra.fr/wheatis
Relies on the transPLANT model
Data types queried at the moment:
  Sequences
  Markers
  Accessions
  Experiments
  QTL
  Genetic maps

Solr search
Nodes queried at the moment:
  URGI: GnpIS
  EBI: Ensembl Plants
  IPK: CR-EST, GBIS, MetaCrop

Solr search
To do:
  Add more nodes and data types
Please send feedback to michael.alaux@versailles.inra.fr

Solr search
Results can be filtered:
  by data type
  by wheat species
  by database («nodes»)
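
The three filters above map naturally onto Solr filter queries (the standard `fq` parameter). The following is only a sketch of how such a request could be built: the endpoint URL, the field names `data_type`, `species` and `node`, and the search term are assumptions made for illustration, since the slides do not describe the actual WheatIS index schema.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the real WheatIS Solr URL is not given in the slides.
SOLR_SELECT_URL = "https://example.org/solr/wheatis/select"


def build_search_url(term, data_type=None, species=None, node=None):
    """Build a Solr select URL with one filter query (fq) per active filter."""
    params = [("q", term), ("wt", "json")]
    # Field names are assumptions, not the real index schema.
    filters = {"data_type": data_type, "species": species, "node": node}
    for field, value in filters.items():
        if value is not None:
            # Quote the value so that multi-word names match as a phrase.
            params.append(("fq", f'{field}:"{value}"'))
    return SOLR_SELECT_URL + "?" + urlencode(params)


# Example: search markers held by one node, with no species filter.
url = build_search_url("Rht-B1", data_type="Markers", node="URGI GnpIS")
print(url)
```

Each active filter becomes its own `fq` parameter, so Solr can apply and cache the filters independently of the main query term.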

Solr search
Demo: https://urgi.versailles.inra.fr/wheatis

Report of the RDA wheat group

RDA group main objectives
Focus:
  Polymorphisms (SNPs)
  Genome annotations
  Phenotypes
  Genetic & physical maps
  Germplasm
  Expression data
Problems:
  Differences in representation formats of each data type
  Many data sources with their own data structure
  Differences in the interpretation of the meaning of the data
Answers:
  Standards
  Translations
  Interoperability framework
From Esther Dzalé Kaboré

Workflow
Survey/Interviews:
  Identify metadata, data formats and vocabularies used within/by the wheat research community
Workshops:
  Identify/agree on the use of common metadata, data formats and vocabularies
  Assess, then improve, the level of accessibility and interoperability of data formats and vocabularies
  Collect interoperability use cases
Implementation:
  Interactive cookbook: recommendations + guidelines
  Hub of linked vocabularies
  Prototype: assess the gain in interoperability based on the collected use cases
From Esther Dzalé Kaboré

Where we are
Survey/Interviews:
  A survey was launched in April 2014
  Answers from more than 200 respondents
Workshops (1-2 October 2014):
  List of recommended data formats and vocabularies for each data type
  List of follow-up actions for each data type (standardization, e.g. for traits; minimal sets of metadata, e.g. for SNP file provenance or for handling markers, QTL and maps; check for existing mapping tools; etc.)
  List of interoperability use cases
From Esther Dzalé Kaboré
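
As an illustration of what a minimal metadata check for SNP file provenance could look like, the sketch below reads the `##key=value` meta-information lines of a VCF header and reports which expected keys are absent. The set of expected keys and the header values are assumptions chosen for the example; the group's actual minimal metadata set is not defined in these slides.

```python
def read_vcf_meta(lines):
    """Collect '##key=value' meta-information lines from a VCF header."""
    meta = {}
    for line in lines:
        if not line.startswith("##"):
            break  # meta lines end at the '#CHROM' column header line
        key, _, value = line[2:].rstrip("\n").partition("=")
        meta.setdefault(key, []).append(value)
    return meta


# Hypothetical header: the source and reference values are placeholders.
header = [
    "##fileformat=VCFv4.2",
    "##source=hypothetical_snp_caller",
    "##reference=hypothetical_wheat_assembly.fa",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
]

# Assumed provenance keys, for illustration only.
expected_keys = ("fileformat", "source", "reference")

meta = read_vcf_meta(header)
missing = [key for key in expected_keys if key not in meta]
print("missing provenance keys:", missing)
```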

Recommendations

SNPs
  Standards summary: Use of the VCF data format.
  Follow-up actions: Look at a metadata set to contextualize the provenance of SNP files.

Genome annotations
  Standards summary: Use of the GFF3 data format. Use of ontologies to fill the «Attributes» column (column 9, a list of feature attributes in the format tag=value).
  Follow-up actions: Provide description guidelines for filling in the content of column 9.

Germplasms
  Standards summary: Use of the MCPD and Darwin Core Germplasm formats.
  Follow-up actions: Check how to integrate with tool-specific formats (GRIN-Global, Genesys). Provide a table-like, human-readable format for DwC Germplasm.

Gene expression
  Standards summary: Follow existing format standards laid out by repositories (NCBI GEO, EBI ArrayExpress).
  Follow-up actions: Check for mapping and conversion tools.

Physical maps
  Standards summary: Same as for genome annotations.
  Follow-up actions: Same as for genome annotations.

Genetic maps
  Standards summary: Data formats depend on the tools used; concentrate instead on metadata harmonization.
  Follow-up actions: Look at a minimal metadata set to handle markers, QTL and maps. Obtain details for linking requirements.

Phenotypes
  Standards summary: Use of the ISA-Tab data format.
  Follow-up actions: Standardize the traits metadata. Improve the reference to ontologies used for traits.

From Esther Dzalé Kaboré and Michael Alaux
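
To make the GFF3 recommendation concrete, here is a small sketch of how column 9 (tag=value pairs separated by semicolons, with comma-separated multiple values) can be parsed so that ontology terms become easy to inspect. The example feature line, its identifiers and the ontology terms are invented for illustration and do not come from any WheatIS dataset.

```python
from urllib.parse import unquote


def parse_gff3_attributes(column9):
    """Parse GFF3 column 9 ('tag=value;tag=value') into a dict of lists."""
    attributes = {}
    for pair in column9.strip().split(";"):
        if not pair:
            continue  # tolerate a trailing semicolon
        tag, _, value = pair.partition("=")
        # Multiple values are comma-separated; reserved characters
        # are percent-encoded in GFF3, hence unquote().
        attributes[unquote(tag)] = [unquote(v) for v in value.split(",")]
    return attributes


# Hypothetical feature line with an ontology annotation in column 9.
line = (
    "chr1A\texample\tgene\t1000\t2000\t.\t+\t.\t"
    "ID=gene0001;Name=example_gene;Ontology_term=GO:0046872,GO:0003677"
)
fields = line.split("\t")
attrs = parse_gff3_attributes(fields[8])
print(attrs["Ontology_term"])
```

Keeping every value as a list, even for single-valued tags, avoids special cases when a tag such as `Ontology_term` carries several terms.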

Next steps
In progress:
  A first version of the «cookbook» will be available on a website: http://ist.blogs.inra.fr/wdi/
  Work with experts to meet the identified needs:
    Metadata harmonization, minimal metadata sets
    Mapping among metadata, formats and ontologies
  Survey on wheat ontologies: https://docs.google.com/forms/d/1yozhjrwvinqsvx0e2983c9shnd8e0skhxekj_zk5T00/viewform?pli=1&edit_requested=true
Two workshops in 2015:
  Refine the cookbook
  Collect more interoperability use cases
From Esther Dzalé Kaboré

Acknowledgments

Questions
DSpace test: http://urgi.versailles.inra.fr/xmlui
Solr test: https://urgi.versailles.inra.fr/wheatis
Contact me at michael.alaux@versailles.inra.fr