Steering Committee Meeting

1 Steering Committee Meeting
To hear the meeting, you must call in.
Toll-free phone number:
Access Code:
For international call-in numbers, please visit:

2 Agenda
- Pilot Project: Data Recommendation using Machine Learning and Crowdsourcing
- bioCADDIE All-Hands Meeting
- BD2K All-Hands Meeting
- Core Development Team Updates and Demo

3 Pilot Project 2.1: Data Recommendation using Machine Learning and Crowdsourcing
Xiaoqian Jiang, PhD

4 Motivation
- Datasets
  - Health-science datasets are being created and collected at an increasing speed
- Data repositories
  - NIH has organized biomedical datasets into 50+ repositories (e.g., GEO, dbGaP, GenBank)
- Retrieval
  - Researchers often need a lot of background knowledge to find datasets of interest
- We need a "PubMed for data": integrated data indexing and searching

5 Goal
- Develop a biomedical data recommendation system to meet researchers' needs
[Figure: network of authors, network of papers (citations, references), and the datasets used]

6 Our outputs
- Two accepted papers (KDD, TBC)
  - Oh J, Han W-S, Yu H, Jiang X, Fast and Robust Parallel SGD Matrix Factorization, ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '15), Sydney, Australia, 2015
  - Iranmehr A, Wang H, Chen H, Jiang X, DataRank: A Framework for Ranking Biomedical Datasets, Translational Bioinformatics Conference (TBC '15), Tokyo, Japan, 2015
- One submission (AMIA CRI)
  - Garcia-Ulloa D, Xiong L, Luo Y, Ohno-Machado L, Jiang X, Recommendation system for biomedical datasets using Bayesian networks and tensor factorization, AMIA CRI (submitted), 2015
- A demo web service
  - Extraction of paper-data bipartite graphs

7 Outline
- DataRank: Personalized dataset recommendation service
- Alternative methods
- Integration progress

8 DataRank related work
- User-based recommendation, like Netflix
  - A user likes certain genres of movies
- Item-based recommendation, like Amazon
  - Similar items are frequently purchased together
- DataRank?
  - Researchers can switch topics over time
  - Researchers might be interested in different datasets in different contexts

9 DataRank Metadata Enrichment
- 26,685 articles
- 35,497 datasets
- 401,153 mentions

10 DataRank: Feature Representation
- A MeSH term is a tag from a controlled vocabulary; MeSH terms were assigned to all articles in the corpus.
  - MeSH (Medical Subject Headings) is the NLM controlled vocabulary thesaurus used for indexing articles for PubMed.
- Each document can be represented via a (binary) bag of words (BoW) of MeSH terms.
- Document features are aggregated to dataset features using "OR".
Query = {DNA, genetics, Female}
Feature = [0, ..., 0, 1, 0, ..., 1, 0, ..., 1, 0, ..., 0]^T
Sparse Feature = {1023:1, 9861:1, 22397:1}
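
The feature construction on this slide can be illustrated with a minimal Python sketch. It is not the DataRank code: the function names, the toy article identifiers, and the alphabetical index assignment are all assumptions made for illustration. Each article becomes a binary bag of MeSH terms, stored sparsely as a set of active indices, and a dataset's features are the "OR" (set union) of the features of the articles that mention it.

```python
from typing import Dict, Iterable, Set


def build_vocab(docs_mesh: Dict[str, Set[str]]) -> Dict[str, int]:
    """Assign a column index to every MeSH term (sorted for reproducibility)."""
    terms = sorted({term for mesh in docs_mesh.values() for term in mesh})
    return {term: index for index, term in enumerate(terms)}


def doc_features(mesh_terms: Iterable[str], vocab: Dict[str, int]) -> Set[int]:
    """Binary bag of words, kept sparse as the set of indices that equal 1."""
    return {vocab[term] for term in mesh_terms if term in vocab}


def dataset_features(citing_docs: Iterable[str],
                     docs_mesh: Dict[str, Set[str]],
                     vocab: Dict[str, int]) -> Set[int]:
    """Aggregate document features with a logical OR (set union)."""
    active: Set[int] = set()
    for doc_id in citing_docs:
        active |= doc_features(docs_mesh[doc_id], vocab)
    return active


def to_sparse(active: Set[int]) -> Dict[int, int]:
    """Render the active indices in the {index: 1} form used on the slide."""
    return {index: 1 for index in sorted(active)}


# Hypothetical toy corpus: two articles with their MeSH terms.
docs_mesh = {
    "article_A": {"DNA", "Female"},
    "article_B": {"Female", "Genetics"},
}
vocab = build_vocab(docs_mesh)
# A hypothetical dataset mentioned by both articles.
sparse = to_sparse(dataset_features(["article_A", "article_B"], docs_mesh, vocab))
print(sparse)
```

A query such as {DNA, genetics, Female} can be encoded with the same `doc_features` call, so queries and datasets live in the same feature space.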

11 DataRank: Modeling
- We take a Bayesian approach to model the posterior probability over datasets given a query, and support re-ranking
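
The slide does not spell out the model, so the following is only a sketch of what "posterior probability over datasets given a query" can look like, under assumptions that are not taken from DataRank: a naive-Bayes factorization, a prior proportional to how often a dataset is mentioned, and Laplace smoothing with a hypothetical parameter `alpha`.

```python
import math
from typing import Dict, List, Set, Tuple


def rank_datasets(query: Set[str],
                  mentions: Dict[str, List[Set[str]]],
                  alpha: float = 1.0) -> List[Tuple[str, float]]:
    """Rank datasets by P(dataset | query) under a naive-Bayes assumption.

    `mentions` maps a dataset id to a non-empty list of MeSH term sets,
    one set per article that mentions the dataset (assumed input format).
    """
    total_mentions = sum(len(articles) for articles in mentions.values())
    log_scores: Dict[str, float] = {}
    for dataset, articles in mentions.items():
        n = len(articles)
        # Prior: datasets that are mentioned more often are more likely a priori.
        log_p = math.log(n / total_mentions)
        for term in query:
            k = sum(1 for mesh in articles if term in mesh)
            # Smoothed estimate of P(term | dataset).
            log_p += math.log((k + alpha) / (n + 2 * alpha))
        log_scores[dataset] = log_p
    # Normalize in log space so the posteriors sum to 1.
    top = max(log_scores.values())
    norm = sum(math.exp(score - top) for score in log_scores.values())
    posterior = {d: math.exp(s - top) / norm for d, s in log_scores.items()}
    return sorted(posterior.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: ds1 is mentioned by two articles, ds2 by one.
mentions = {
    "ds1": [{"Genetics", "Female"}, {"Genetics"}],
    "ds2": [{"Neoplasms"}],
}
ranked = rank_datasets({"Genetics"}, mentions)
print(ranked)
```

Re-ranking could be layered on top of such a model, for example by adjusting the prior from user feedback, but that step is an assumption and is not shown here.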

12 DataRank system

13 DataRank: Evaluation
- Regret:
- Average disagreement:

14 Outline
- DataRank: Personalized dataset recommendation service
- Alternative methods
- Integration progress

15 Alternative methods
- The data can be represented with a sparse tensor whose three modes are authors, datasets, and MeSH terms

16 Alternative methods
- The question is: given an author and MeSH terms, what is the best dataset to recommend?
- These are not independent
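
To make the sparse-tensor view concrete, here is a minimal Python sketch. It stores only the non-zero (author, dataset, MeSH term) entries and answers the question above with a plain co-occurrence count. This is a naive baseline for illustration, not the Bayesian network or tensor factorization from the slides, and all names and toy records are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

Triple = Tuple[str, str, str]  # (author, dataset, MeSH term)


def build_sparse_tensor(records: Iterable[Triple]) -> Dict[Triple, int]:
    """Store only non-zero cells of the authors x datasets x MeSH tensor."""
    tensor: Dict[Triple, int] = defaultdict(int)
    for author, dataset, term in records:
        tensor[(author, dataset, term)] += 1
    return dict(tensor)


def score_datasets(tensor: Dict[Triple, int],
                   author: str,
                   mesh_terms: Iterable[str]) -> List[Tuple[str, int]]:
    """Sum the counts over the given author and MeSH terms for each dataset."""
    wanted = set(mesh_terms)
    scores: Dict[str, int] = defaultdict(int)
    for (a, dataset, term), count in tensor.items():
        if a == author and term in wanted:
            scores[dataset] += count
    # Highest score first; ties broken by dataset id for determinism.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical observations extracted from articles.
records = [
    ("alice", "GSE1", "Genetics"),
    ("alice", "GSE1", "Female"),
    ("alice", "GSE2", "Genetics"),
    ("bob", "GSE2", "Female"),
]
tensor = build_sparse_tensor(records)
recommendations = score_datasets(tensor, "alice", ["Genetics", "Female"])
print(recommendations)
```

The baseline can only recommend datasets the author has already been linked to. Filling in the unobserved cells is the job the slides assign to the Bayesian network and tensor factorization.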

17 Alternative methods: Bayesian Network and Tensor factorization

              db_1   db_2   ...   db_nd
a_1, m_1      p_11   p_12   ...   p_1nd
a_1, m_2      p_21   p_22   ...   p_2nd
...
a_na, m_nm    p_q1   p_q2   ...   p_qnd

18 Alternative methods
Based on 2,398 JAMIA articles citing 2,735 GEO datasets, which involve 16,363 different authors and 5,345 MeSH terms

19 Alternative methods
Based on 2,398 JAMIA articles citing 2,735 GEO datasets, which involve 16,363 different authors and 5,345 MeSH terms

20 Outline
- DataRank: Personalized dataset recommendation service
- Alternative methods
- Integration progress

21 Integration progress
- Attended weekly bioCADDIE core development meetings
- Presented the detailed DataRank algorithm to the team
- Uploaded DataRank code to the bioCADDIE GitHub
- Shared the paper citation data with the core team

22 Thank you! Questions?

23 bioCADDIE All-Hands Meeting
- Date: Friday, December 4, 2015
- Location: Telemedicine Building, UCSD
- Draft agenda (Google Drive > All-Hands Meeting 2015)
  - Events:
    - Steering Committee meeting
    - DDI prototype overview and demo
    - Working groups and supplement presentations
- Core Development Team meetings
  - Date: Saturday, December 5
  - CEDAR: Discuss collaboration with bioCADDIE

24 bioCADDIE All-Hands Meeting
Time | Title | Presenter(s)
8:30-9:30 AM | Steering Committee Meeting | Lucila Ohno-Machado
8:30-9:30 AM | Breakfast and Registration |
9:30 AM | Welcome and Introduction to bioCADDIE | Lucila Ohno-Machado
9:50 AM | DDI Prototype | Hua Xu/Jeffrey Grethe
10:50 AM | Break |
11:00 AM | DDI Prototype/Integration with Pilot Projects | Hua Xu
11:30 AM | Hands-on DDI Demo | All
12:00 PM | Lunch |
1:00 PM | WG2 - Data Identifiers Recommendation | Jeffrey Grethe
1:15 PM | WG3 - Descriptive Metadata for Datasets | Susanna Sansone
1:45 PM | WG4 - Use Cases and Testing Benchmarks | Hua Xu
2:00 PM | WG5 - Dataset Citation Metrics | Jeffrey Grethe/Susanna Sansone
2:15 PM | WG6 - Criteria for Being Included in the DDI | Lucila Ohno-Machado/Ian Fore
2:30 PM | WG7 - Accessibility Metadata for Datasets | George Alter
2:45 PM | WG8 - Ranking Search Results | Hua Xu
3:00 PM | Break |
3:10 PM | Supplement: Count Everything | David Haussler/Ida Sim/Isaac Kohane
3:30 PM | Supplement: OmicsDI | Eric Deutch/Henning Hermjakob/Peipei Ping/Susanna Sansone
3:50 PM | Supplement: F11 Data Citation Pilot | Maryann Martone/Tim Clark
4:10 PM | Closing Session | Lucila Ohno-Machado

25 BD2K - All-Hands Meeting
- Date: November 12-13, 2015
- Location: Bethesda, MD
- Agenda
  - Prototype Demo?
  - Posters
  - Other topics

26 Core Dev. Team - Progress
Task:
- ElasticSearch Endpoint
  - Developers have access to API keys
- ElasticSearch Endpoint Documentation
  - Available at GitHub wiki
- ElasticSearch Endpoint Enhancements
  - Additional storage for endpoint
  - Need to add additional clean-up routines (e.g., logs)
- Data Ingestion Pipeline
  - Begun discussions with ICPSR
  - Reconciliation of indices with WG3 metadata model
- Data Set Ingestion
  - PDB/GEO: stable
  - LINCS: In contact about APIs
  - BioProject, ArrayExpress, GEMMA, dbGaP: Currently in process
  - ICPSR: Will be receiving sample data files
Due (in slide order): Operational, Deployed, Ongoing, Ongoing, Ongoing, Ongoing

27 Pilot Project Integration
Columns: PP | Presented to CDT | As Integrated | Completed On
Presented to CDT: / / / /01
As Integrated:
- Specialized advanced search for GWAS datasets
- Ranking function based on citation metrics for GEO series data
- a) isee similarity metric in ElasticSearch
- b) DELVE implementation as exploratory search and visualization option: (i) for PDB, (ii) for gene expression data
- Ranking function based on citation metrics (dataset mentions) for PDB data
Completed On (in slide order): 09/22, Ongoing (10/20), 9/01, Ongoing (11/10), Ongoing (10/27)

28 Core Dev. Team - Progress
Task:
- UI Development
  - Preliminary UI (v0.1) demo
  - UI (v0.2) deployed (new design)
    - Pop-up dialog box for search result on homepage
    - Global statistics
    - Track user's search history (recent activity)
    - Similar datasets function (PDB data only)
    - Related publications from PubMed (dataset title)
    - Link to collect user's feedback
    - Error prevention
  - Improvements/additional functions based on feedback
- Ontology web services
  - 1) Imported terminologies (6) and validated them
  - 2) Created UI-Browser for TS
  - 3) Integration with SciGraph API
  - 4) Create auto-complete feature
  - 5) Integrate terminology server
Due (in slide order): 09/01, 10/09, Ongoing, 08/31, 09/30, Ongoing, Ongoing, Ongoing

29 Core Dev. Team Prototype DEMO

30 Announcements
- To access private repositories in the bioCADDIE GitHub account, send your GitHub username to Anupama.E.Gururaj@uth.tmc.edu
- Stephanie (sthagstrom@ucsd.edu) to be added to working group mailing lists
- Next Steering Committee meeting
  - November 5, 2015 at 9:00 am PST


Contextual Search using Cognitive Discovery Capabilities

Contextual Search using Cognitive Discovery Capabilities Contextual Search using Cognitive Discovery Capabilities In this exercise, you will work with a sample application that uses the Watson Discovery service API s for cognitive search use cases. Discovery

More information

Sharing Biomedical Research Data. Michael F. Huerta, Ph.D. Associate Director National Library of Medicine, NIH

Sharing Biomedical Research Data. Michael F. Huerta, Ph.D. Associate Director National Library of Medicine, NIH Sharing Biomedical Research Data Michael F. Huerta, Ph.D. Associate Director National Library of Medicine, NIH Sharing Research Data Benefits of sharing data include: Transparency Reanalysis Integration

More information

Tania Tudorache Stanford University. - Ontolog forum invited talk04. October 2007

Tania Tudorache Stanford University. - Ontolog forum invited talk04. October 2007 Collaborative Ontology Development in Protégé Tania Tudorache Stanford University - Ontolog forum invited talk04. October 2007 Outline Introduction and Background Tools for collaborative knowledge development

More information

SciVerse ScienceDirect. User Guide. October SciVerse ScienceDirect. Open to accelerate science

SciVerse ScienceDirect. User Guide. October SciVerse ScienceDirect. Open to accelerate science SciVerse ScienceDirect User Guide October 2010 SciVerse ScienceDirect Open to accelerate science Welcome to SciVerse ScienceDirect: How to get the most from your subscription SciVerse ScienceDirect is

More information

Respondent Help Guide

Respondent Help Guide Respondent Help Guide https://nqphnetender.openwindowscontracts.com Welcome to the Northern Queensland Primary Health Network etendering Portal. Using NQPHN s etendering Portal is free for all users. There

More information

Open web-based annotation: Creating a lightweight, portable knowledge layer over biomedicine

Open web-based annotation: Creating a lightweight, portable knowledge layer over biomedicine Open web-based annotation: Creating a lightweight, portable knowledge layer over biomedicine Maryann E. Martone, Director of Biosciences BioCADDIE webinar, Thurs Dec 10, 2015 Our Mission Build, deploy

More information

Some useful resources. Data-mining

Some useful resources. Data-mining Some useful resources Data-mining Data Mining? Yeah, I could use a nap What We ll Discuss Why search Who should do it Sources National Library of Medicine Includes the NLM Gateway, PubMed, ClinicalTrials.gov,

More information

LIBER Webinar: A Data Citation Roadmap for Scholarly Data Repositories

LIBER Webinar: A Data Citation Roadmap for Scholarly Data Repositories LIBER Webinar: A Data Citation Roadmap for Scholarly Data Repositories Martin Fenner (DataCite) Mercè Crosas (Institute for Quantiative Social Science, Harvard University) May 15, 2017 2014 Joint Declaration

More information

CS / Cloud Computing. Recitation 11 November 5 th and Nov 8 th, 2013

CS / Cloud Computing. Recitation 11 November 5 th and Nov 8 th, 2013 CS15-319 / 15-619 Cloud Computing Recitation 11 November 5 th and Nov 8 th, 2013 Announcements Encounter a general bug: Post on Piazza Encounter a grading bug: Post Privately on Piazza Don t ask if my

More information

Lecture October. 1 Examples of machine learning problems and basic terminology

Lecture October. 1 Examples of machine learning problems and basic terminology MLISP: Machine Learning in Signal Processing WS 2018/2019 Lecture 1 17. October Prof. Veniamin Morgenshtern Scribe: Eric Sperschneider Agenda: 1. Organizational: webpage, time, review sessions, literature,

More information

The Smithsonian/NASA Astrophysics Data System

The Smithsonian/NASA Astrophysics Data System The Smithsonian/NASA Astrophysics Data System Status Report Alberto Accomazzi Michael Kurtz Harvard-Smithsonian Center for Astrophysics http://ads.harvard.edu The ADS Project Established in 1989 (before

More information

EDIN Frequently Asked Questions

EDIN Frequently Asked Questions EDIN Frequently Asked Questions January 2017 Author: M Diaz Copyright 2017 IHS Markit Ltd Login / Access 3 User Stats 4 Software Requirements 5 Data Integrity 6 Browsers 7 Settings 8 Data Content 9 Map

More information

Web of Science. Platform Release Nina Chang Product Release Date: December 10, 2017 EXTERNAL RELEASE DOCUMENTATION

Web of Science. Platform Release Nina Chang Product Release Date: December 10, 2017 EXTERNAL RELEASE DOCUMENTATION Web of Science EXTERNAL RELEASE DOCUMENTATION Platform Release 5.27 Nina Chang Product Release Date: December 10, 2017 Document Version: 1.0 Date of issue: December 7, 2017 RELEASE OVERVIEW The following

More information

Reproducible Workflows Biomedical Research. P Berlin, Germany

Reproducible Workflows Biomedical Research. P Berlin, Germany Reproducible Workflows Biomedical Research P11 2018 Berlin, Germany Contributors Leslie McIntosh Research Data Alliance, U.S., Executive Director Oya Beyan Aachen University, Germany Anthony Juehne RDA,

More information

Homework: Building an Apache-Solr based Search Engine for DARPA XDATA Employment Data Due: November 10 th, 12pm PT

Homework: Building an Apache-Solr based Search Engine for DARPA XDATA Employment Data Due: November 10 th, 12pm PT Homework: Building an Apache-Solr based Search Engine for DARPA XDATA Employment Data Due: November 10 th, 12pm PT 1. Overview This assignment picks up where the last one left off. You will take your JSON

More information

Software Quality Assurance Plan

Software Quality Assurance Plan Software Quality Assurance Plan For KDD-Research Entity Search Tool (KREST) Version 1.2 Submitted in partial fulfillment of the Masters of Software Engineering degree. Eric Davis CIS 895 MSE Project Department

More information

CSC 261/461 Database Systems Lecture 26. Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101

CSC 261/461 Database Systems Lecture 26. Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101 CSC 261/461 Database Systems Lecture 26 Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101 Announcements Poster Presentation on May 03 (During our usual lecture time) Mandatory for all Graduate

More information

Report on CalConnect Conference XXXIX, June 14-16, 2017

Report on CalConnect Conference XXXIX, June 14-16, 2017 CALCONNECT DOCUMENT CD 1703 Type: Report Title: Report on CalConnect Conference XXXIX Version: 1.0 Date: 2017-07-18 Status: Published Source: N/A Report on CalConnect Conference XXXIX, June 14-16, 2017

More information

MyTennis Start Up Manual

MyTennis Start Up Manual MyTennis Start Up Manual (Version 2 11 November 2015) The following instructional fact sheets aim to provide MyTennis Administrators with support and assistance to start to effectively use the new MyTennis

More information

Rave Study Design and Build Essentials Training Agenda

Rave Study Design and Build Essentials Training Agenda Training Agenda Approximate Duration: 2 ½-day sessions Rave EDC Essentials 1 Time Topic Session 1 30 min Welcome & Introductions 30 min imedidata Setting up an imedidata account Navigating imedidata as

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

Notifications 4 How to set notifications for a particular item 4. s 4 How to one or more members of the committee 4

Notifications 4 How to set notifications for a particular item 4.  s 4 How to  one or more members of the committee 4 See below for a list of step-by-step guides on how to complete some common tasks on Standards Hub Committee Workspace. About Committee Workspace and Livelink Livelink is a web based collaboration and document

More information

5/13/2009. Introduction. Introduction. Introduction. Introduction. Introduction

5/13/2009. Introduction. Introduction. Introduction. Introduction. Introduction Applying Collaborative Filtering Techniques to Movie Search for Better Ranking and Browsing Seung-Taek Park and David M. Pennock (ACM SIGKDD 2007) Two types of technologies are widely used to overcome

More information

UC San Diego UC San Diego Electronic Theses and Dissertations

UC San Diego UC San Diego Electronic Theses and Dissertations UC San Diego UC San Diego Electronic Theses and Dissertations Title Information Retrieval in Biomedical Research: From Articles to Datasets Permalink https://escholarship.org/uc/item/660390nr Author Wei,

More information

LIVE SCOUTING S ADVENTURE 2017 NATIONAL JAMBOREE JULY 19-28

LIVE SCOUTING S ADVENTURE 2017 NATIONAL JAMBOREE JULY 19-28 LIVE SCOUTING S ADVENTURE 2017 NATIONAL JAMBOREE JULY 19-28 A recording of today s webinar along with the slide presentation and a written Q&A will be added to the Webinar section of the website within

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information