Core Technology Development Team Meeting

Size: px
Start display at page:

Download "Core Technology Development Team Meeting"

Transcription

1 Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: Access Code: For international call in numbers, please visit:

2 Agenda v Harvester PP presentation: Chris Mungall, Lawrence Berkeley National Laboratory: Distributed Data Discovery using GitHub, YAML and Markdown v Updates of last meeting action items v Brief updates from All Supported by the NIH grant 1U24 AI to the University of California, San Diego 2

3 Updates Action Items v Visit datamed.biocaddie.org and provide feedback for the new version, v0.2 via the prototype_issues repository in GitHub v Diagram showing biocaddie with Aztec and Repositories. CEDAR to be added to the diagram and shared with the group Jeff/Ian v Abstract submission deadline extended to 10/23, will send draft tomorrow to the group - Anu Supported by the NIH grant 1U24 AI to the University of California, San Diego 3

4 BioCADDIE Harvester Project: Distributed Data Discovery using GitHub, YAML and Markdown Chris Mungall Lawrence Berkeley Na6onal Laboratory

5 Well it could be worse, at least I m not doing metadata

6 How can we make it easier to a>ach Even fun? Idea: metadata? Target tech- savvy subset of data providers Provide guidelines and tooling that mesh with with their work habits If it works, extend to a wider variety of providers

7 Outline Background Observa6ons on metadata, code and datasets Preliminary related work: OBO Foundry Ontology Registry Global Bio6c Interac6ons (GloBI) Approach Vision Milestones (Now March 2016)

8 Coders love providing metadata Can we get dataset providers to share this love?

9

10 (1) Experience with the Open Biomedical Ontologies (OBO) Registry Background: High quality ontology metadata for ~100 ontologies Curated by OBO team (unfunded) Problem Metadata format lacked richness and extensibility Metadata could not be directly edited by ontology providers

11 (1) Experience with the Open Biomedical Ontologies (OBO) Registry Solu6on h>p://obofoundry.github.io/ Manage metadata files on GitHub Use YAML embedded in Markdown Anyone can submit a Pull Request Changes instantly visible using GitHub pages

12 Pull Requests allow for transparent distributed asynchronous edi6ng of content

13

14 (2) Experience with GloBI h>p://globalbio6cinterac6ons.org/ A search tool for finding species interac6on data What fungi infect bats in SW USA? Indexes 12m species- interac6ons from 15k studies Sources: inaturalist, Dryad, specialist databases, random websites, ad- hoc formats V1: Adding a new datasource required changing the centralized data inges6on code Problem: How to decentralize dataset contribu6ons

15 (2) Experience with GloBI Solu6on Simple metadata file specifica6on globi.json file Goes in top level of a github repository JSON(LD) Data can be in same repo, or external (e.g. dryad) Typically CSV GloBI central database periodically harvests data

16 Approach for biocaddie Harvester project Define format for a simple file that can be included in the root project directory YAML/Markdown Can be placed In the same directory tree as the data Externally, with a pointer to external data (e.g. figshare, dryad) Leverage GitHub social coding ecosystem for Distributed edi6ng of metadata files Cloud- based valida6on (Travis- CI) Pilot project Select exis6ng datasets and apply Demonstrate ability to search

17 researcher producing data (large dataset) deposits checks creates Travis-CI validation/ conversion SPARQL queries semweb user DataSet C DataSet B Cloud Storage / NIH Commons / Git LFS etc describes YAML Repo B YAML DataSet A GitHub / GitLab etc Repo A conversion JSON-LD / RDF HCLS compliant Linked Data Cloud content --- title: skin microbe analysis author: orcid: name: Carrie github: carriemicrobe types: - microbiome - computational datasets: - id: site: axilla - id: site: oral --- my notes here... Example YAML embedded in Markdown BioCaddie Harvester generates Data Index Search Service BioCADDIE creates + deposits finds via biocaddie searches researcher producing data (small dataset)

18 Current Progress and Project Current loca6on: Repository h>ps://github.com/cmungall/biocaddie- gym [Ques6on for biocaddie CDT: Should this move to a biocaddie github org?] Milestones: h>ps://github.com/cmungall/biocaddie- gym/ milestones Await project ini6a6on

19 Milestone 1: YAML Schema Goal: Define the format of a biocaddie.yml file that people can add to the top level of a directory Either the same directory that contains project data Or a pointer to it Why YAML? Simple, easy to author Extensible and nestable Can be embedded inside markdown files Superset of JSON (so can use JSON- LD) YAML- LD Automa6c equivalence to JSON- LD and to RDF Coordinate this with HCLS h>p:// dataset h>ps://github.com/cmungall/biocaddie- gym/milestones/yaml%20schema --- title: skin microbe analysis author: orcid: name: Carrie github: carriemicrobe types: - microbiome - computational datasets: - loc: ds1.tsv site: axilla - loc: ds2.tsv site: oral --- my notes here...

20 researcher producing data (large dataset) creates + deposits deposits creates researcher producing data (small dataset) DataSet C DataSet B Cloud Storage / NIH Commons / Git LFS etc describes YAML Repo B YAML DataSet A GitHub / GitLab etc Repo A content --- title: skin microbe analysis author: orcid: name: Carrie github: carriemicrobe types: - microbiome - computational datasets: - loc: ds1.tsv site: axilla - loc: ds2.tsv site: oral --- my notes here...

21 Milestone 2: biocaddie yaml validator Goal: Ensure biocaddie.yml files are syntac6cally and structurally valid Consistent with W3C HCLS Dataset descrip6on profile Prevent accidental typos that could hamper indexing efforts Approach: Write a simple python validator Integrate this with the GitHub travis con6nuous integra6on system h>ps://github.com/cmungall/biocaddie- gym/milestones/validator

22 researcher producing data (large dataset) deposits checks creates Travis-CI validation/ conversion DataSet C DataSet B Cloud Storage / NIH Commons / Git LFS etc describes YAML Repo B YAML DataSet A GitHub / GitLab etc Repo A

23

24 Goal: Milestone 3: Ultra- lite publishing framework Allow metadata providers to see the authored metadata in a less geeky format Approach: Create HTML templates for rendering dataset descrip6on YMLs using GitHub Pages / Jekyll framework h>ps://github.com/cmungall/biocaddie- gym/milestones/publishing

25 Milestone 4: Searchable index Goal: Demonstrate findability and searchability Approach: Query over git hos6ng sites for any repos that have biocaddie.{yml,md} file Index [Q: How involved should the biocaddie CDT be involved here?] h>ps://github.com/cmungall/biocaddie- gym/milestones/searchable%20index

26 Milestone 5: Pilot project Pilot project using exis6ng published datasets already in GitHub Datasets TBD Open to sugges6ons from biocaddie groups? Possibili6es: Exis6ng datasets indexed by h>p://monarchini6a6ve.org Exis6ng datasets on GitHub Evaluate h>ps://github.com/cmungall/biocaddie- gym/milestones/pilot

27 What next? Will this scale? Goals of pilot: Demonstrate feasibility with tech- savvy users Next steps, making it more accessible: [to be aligned with overall biocaddie goals] Provide authoring tools Integrate with dataset repositories (figshare, dryad) Work with git{hub,lab} to make more accessible?

28 Ongoing work Task Status 1 Metadata Ingestion Import repositories PDB,GEO 2. LINCS 3. BioProject, ArrayExpress, GEMMA, dbgap 4. ICPSR Stable API details Ongoing Sample files 1.2 Metadata mapping Ongoing 1.3 Metadata management Ongoing 1.4 Indexing Ongoing 2 Terminology server 2.1 Develop terminology server 1) Imported terminologies (6) and validated them 2) Created UI-Browser for TS 3) Integration to Scigraph API 4) Create auto complete feature 09/01 10/09 Ongoing 10/ Integrate terminology server Ongoing Supported by the NIH grant 1U24 AI to the University of California, San Diego 28

29 Pilot project integration (Task 3) PP Presented to CDT / / / /01 As Integrated Specialized advanced search for GWAS datasets Ranking function based on citation metrics for GEO series data a) isee similarity metric in ElasticSearch b) DELVE implementation as exploratory search and visualization option. (i) for PDB (ii) for gene expression data Ranking function based on citation metrics (dataset mentions) for PDB data Completed On 09/22 Ongoing (10/22) 9/01 Ongoing (11/10) Ongoing (10/27) Supported by the NIH grant 1U24 AI to the University of California, San Diego 29

30 Ongoing work Task Status 4 Interface Design 4.1 Global statistics Implemented 4.2 Design interface Ongoing 4.3 Implement new design Ongoing 4.4 Breadcrumb for website navigation Not started 4.5 Display most Accessed Datasets Not Started 5 Personalized search 5.1 Search history Implemented 5.2 Save search results Not Started 5.3 Share search results Not Started 5.4 User account - - Discussion Not Started 6 Searching/Ranking algorithms 6.1 Similar datasets Implemented 6.2 Data repositories search Ongoing 6.3 Boolean/advanced search Not Started 6.4 Refine search results based on user s selection Not Started Supported by the NIH grant 1U24 AI to the University of California, San Diego 30

31 Ongoing work Task Status 7 Display of results 7.1 Sort datasets Ongoing 7.2 Browsing (grouping facets/metadata) Not started 7.3 Accessibility information Not started 8 Link to external resources 8.1 Pubmed Ongoing 8.2 Grants Not Started 9 Feedback 9.1 GitHub Implemented 9.2 Feedback form Not Started 10 Documentation 10.1 Source code Not Started 10.2 Tutorials Not Started 11 Usability studies 11.1 UI Analysis Completed 11.2 User studies Not Started 12 Data Duplication issue Supported by the NIH grant 1U24 AI to the University of California, San Diego 31

32 Other issues v Please deposit codes in GitHub. Please contact me at Anupama.E.Gururaj@uth.tmc.edu if you need access v Any other issues? v Thank You

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Steering Committee Meeting

Steering Committee Meeting Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Executive Committee Meeting

Executive Committee Meeting Executive Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Executive Committee Meeting

Executive Committee Meeting Executive Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Metadata Ingestion and Processinng

Metadata Ingestion and Processinng biomedical and healthcare Data Discovery Index Ecosystem Ingestion and Processinng Jeffrey S. Grethe, Ph.D. 2017 BioCADDIE All Hands Meeting prototype Ingestion Indexing Repositories Ingestion ElasticSearch

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Steering Committee Meeting

Steering Committee Meeting Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Executive Committee Meeting

Executive Committee Meeting Executive Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Agenda. Clarification of issues Quarter definition Steering and Executive Committee composition Dissemination and community outreach activities

Agenda. Clarification of issues Quarter definition Steering and Executive Committee composition Dissemination and community outreach activities Agenda Clarification of issues Quarter definition Steering and Executive Committee composition Dissemination and community outreach activities Progress and updates Y1Q3 and plans for Y1Q4 Plan for the

More information

Steering Committee Meeting

Steering Committee Meeting Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please isit: https://www.readytalk.com/account-administration/international-numbers

More information

The Final Updates. Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University of Oxford, UK

The Final Updates. Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University of Oxford, UK The Final Updates Supported by the NIH grant 1U24 AI117966-01 to UCSD PI, Co-Investigators at: Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University

More information

Minutes. Date: Location: UCSD BRF2 5A03. Attendees Present

Minutes. Date: Location: UCSD BRF2 5A03. Attendees Present Executive Committee Meeting Location: UCSD BRF2 5A03 Date: 8-16-16 Start time: 10:00 am PDT End time: 11:30 am PDT Meeting Objective Attendees Present Minute Taker Executive Committee Meeting UCSD: Lucila

More information

Steering Committee Meeting

Steering Committee Meeting Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Executive Committee Meeting

Executive Committee Meeting Executive Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers

More information

Core Technology Development Team Meeting

Core Technology Development Team Meeting Core Technology Development Team Meeting Agenda v Updates regarding last meeting action items v Presentation by Ergin about Ontology Services v Brief updates from others Supported by the NIH grant 1U24

More information

Data publication and discovery with Globus

Data publication and discovery with Globus Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,

More information

Harmonizing biocaddie Metadata Schemas for Indexing Clinical Research Datasets Using Semantic Web Technologies

Harmonizing biocaddie Metadata Schemas for Indexing Clinical Research Datasets Using Semantic Web Technologies Harmonizing biocaddie Metadata Schemas for Indexing Clinical Research Datasets Using Semantic Web Technologies Harold R. Solbrig 1, Guoqian Jiang 1 1 Mayo Clinic College of Medicine, Rochester, MN [solbrig.harold,

More information

Introducing the Springer Nature Data Support Services

Introducing the Springer Nature Data Support Services Introducing the Springer Nature Data Support Services 1 What motivates researchers to share data? 97% - to accelerate research and its applications 1 96% - increased visibility and discovery of their research

More information

Markus Kaindl Senior Manager Semantic Data Business Owner SN SciGraph

Markus Kaindl Senior Manager Semantic Data Business Owner SN SciGraph Analytics Building business tools for the scholarly publishing domain using LOD and the ELK stack SEMANTiCS Vienna 2018 Markus Kaindl Senior Manager Semantic Data Business Owner SN SciGraph 1 Agenda (25

More information

Susanna-Assunta Sansone, PhD. Metadata WG3 chair.

Susanna-Assunta Sansone, PhD. Metadata WG3 chair. Susanna-Assunta Sansone, PhD Metadata WG3 chair 3-workgroup@biocaddie.org WG3 Metadata v v Full description: goals, synergies, phases, members & files Joint effort with BD2K Center for Expanded Data Annotation

More information

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond Alessia Bardi and Paolo Manghi, Institute of Information Science and Technologies CNR Katerina Iatropoulou, ATHENA, Iryna Kuchma and Gwen Franck, EIFL Pedro Príncipe, University of Minho OpenAIRE Fostering

More information

Minimal Metadata Standards and MIIDI Reports

Minimal Metadata Standards and MIIDI Reports Dryad-UK Workshop Wolfson College, Oxford 12 September 2011 Minimal Metadata Standards and MIIDI Reports David Shotton, Silvio Peroni and Tanya Gray Image BioInformatics Research Group Department of Zoology

More information

Azure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region

Azure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region Azure DevOps Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region What is DevOps? People. Process. Products. Build & Test Deploy DevOps is the union of people, process, and products to

More information

Crowdsourcing Codebook Enhancements A DDI-based Approach

Crowdsourcing Codebook Enhancements A DDI-based Approach Crowdsourcing Codebook Enhancements A DDI-based Approach FCSM, December 2 nd 2015 Lars Vilhuber (Cornell University) Benjamin Perry (Cornell University) Venkata Kambhampaty (Cornell University) Kyle Brumsted

More information

Linking data and publications the past, present, and future. Dr. Hylke Koers, Head of Content Innovation, Elsevier

Linking data and publications the past, present, and future. Dr. Hylke Koers, Head of Content Innovation, Elsevier Linking data and publications the past, present, and future Dr. Hylke Koers, Head of Content Innovation, Elsevier BioCADDIE webinar January 8, 2015 Ease of access Open Access 2 The issue: data is important,

More information

January 16, Re: Request for Comment: Data Access and Data Sharing Policy. Dear Dr. Selby:

January 16, Re: Request for Comment: Data Access and Data Sharing Policy. Dear Dr. Selby: Dr. Joe V. Selby, MD, MPH Executive Director Patient-Centered Outcomes Research Institute 1828 L Street, NW, Suite 900 Washington, DC 20036 Submitted electronically at: http://www.pcori.org/webform/data-access-and-data-sharing-policypublic-comment

More information

SATURN Update. DAML PI Meeting Dr. A. Joseph Rockmore 25 May 2004

SATURN Update. DAML PI Meeting Dr. A. Joseph Rockmore 25 May 2004 SATURN Update DAML PI Meeting Dr. A. Joseph Rockmore 25 May 2004 SATURN: Needs and Challenges [1 of 2]! SATURN = semantic access to time-ordered url s and related information! Objective: easier and more

More information

FROM VSTS TO AZURE DEVOPS

FROM VSTS TO AZURE DEVOPS #DOH18 FROM VSTS TO AZURE DEVOPS People. Process. Products. Gaetano Paternò @tanopaterno info@gaetanopaterno.it 2 VSTS #DOH18 3 Azure DevOps Azure Boards (ex Work) Deliver value to your users faster using

More information

RENKU - Reproduce, Reuse, Recycle Research. Rok Roškar and the SDSC Renku team

RENKU - Reproduce, Reuse, Recycle Research. Rok Roškar and the SDSC Renku team RENKU - Reproduce, Reuse, Recycle Research Rok Roškar and the SDSC Renku team Renku-Reana workshop @ CERN 26.06.2018 Goals of Renku 1. Provide the means to create reproducible data science 2. Facilitate

More information

Making the most of metadata with Metadata 2020

Making the most of metadata with Metadata 2020 Making the most of metadata with Metadata 2020 Patricia Feeney, Crossref and Metadata2020 CSE Annual Meeting April 2018 What is Metadata 2020? Metadata 2020 is a collaboration that advocates richer, connected,

More information

Git and GitHub. Dan Wysocki. February 12, Dan Wysocki Git and GitHub February 12, / 48

Git and GitHub. Dan Wysocki. February 12, Dan Wysocki Git and GitHub February 12, / 48 Git and GitHub Dan Wysocki February 12, 2015 Dan Wysocki Git and GitHub February 12, 2015 1 / 48 1 Version Control 2 Git 3 GitHub 4 Walkthrough Dan Wysocki Git and GitHub February 12, 2015 2 / 48 Version

More information

Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library

Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library Live poll at: https://pollev.com/ cyndyparr196 Problems with Public Ag Data Government Website

More information

DataSTORRE Deposit Guide

DataSTORRE Deposit Guide DataSTORRE Deposit Guide Introduction DataStorre is an online digital repository of multi-disciplinary research datasets produced at the University of Stirling. University of Stirling researchers who have

More information

Call for Participation in AIP-6

Call for Participation in AIP-6 Call for Participation in AIP-6 GEOSS Architecture Implementation Pilot (AIP) Issue Date of CFP: 9 February 2013 Due Date for CFP Responses: 15 March 2013 Introduction GEOSS Architecture Implementation

More information

B2SAFE metadata management

B2SAFE metadata management B2SAFE metadata management version 1.2 by Claudio Cacciari, Robert Verkerk, Adil Hasan, Elena Erastova Introduction The B2SAFE service provides a set of functions for long term bit stream data preservation:

More information

Demos: DMP Assistant and Dataverse

Demos: DMP Assistant and Dataverse Demos: DMP Assistant and Dataverse Alexandra Cooper, Data Services Coordinator, Queen s University Meghan Goodchild, RDM Systems Librarian, Queen s University/Scholars Portal Overview of session Research

More information

SharePoint 2013 End User Level II

SharePoint 2013 End User Level II Course 55052A: SharePoint 2013 End User Level II Course Details Course Outline Module 1: Overview A simple introduction module. Understand your course, classroom, classmates, facility and instructor. Module

More information

Powering Linked Open Data Applications

Powering Linked Open Data Applications Powering Linked Open Data Applications With Fedora and Islandora CLAW David Wilcox, DuraSpace dwilcox@duraspace.org About DuraSpace DuraSpace is not for profit organization that provides leadership and

More information

Data Exchange and Conversion Utilities and Tools (DExT)

Data Exchange and Conversion Utilities and Tools (DExT) Data Exchange and Conversion Utilities and Tools (DExT) Louise Corti, Angad Bhat, Herve L Hours UK Data Archive CAQDAS Conference, April 2007 An exchange format for qualitative data Data exchange models

More information

BPMN Processes for machine-actionable DMPs

BPMN Processes for machine-actionable DMPs BPMN Processes for machine-actionable DMPs Simon Oblasser & Tomasz Miksa Contents Start DMP... 2 Specify Size and Type... 3 Get Cost and Storage... 4 Storage Configuration and Cost Estimation... 4 Storage

More information

NCBI News, November 2009

NCBI News, November 2009 Peter Cooper, Ph.D. NCBI cooper@ncbi.nlm.nh.gov Dawn Lipshultz, M.S. NCBI lipshult@ncbi.nlm.nih.gov Featured Resource: New Discovery-oriented PubMed and NCBI Homepage The NCBI Site Guide A new and improved

More information

warwick.ac.uk/lib-publications

warwick.ac.uk/lib-publications Original citation: Zhao, Lei, Lim Choi Keung, Sarah Niukyun and Arvanitis, Theodoros N. (2016) A BioPortalbased terminology service for health data interoperability. In: Unifying the Applications and Foundations

More information

Semantic Web for Earth and Environmental Terminology (SWEET) Status, Future Development and Community Building

Semantic Web for Earth and Environmental Terminology (SWEET) Status, Future Development and Community Building Semantic Web for Earth and Environmental Terminology (SWEET) 2018 Status, Future Development and Community Building 2 Agenda and Purpose Current status of SWEET e.g. What has the community been doing?

More information

IAALD/2013 World Congress. VIVO Workshop. Brian J. Lowe Jon Corson-Rikert

IAALD/2013 World Congress. VIVO Workshop. Brian J. Lowe Jon Corson-Rikert IAALD/2013 World Congress VIVO Workshop Brian J. Lowe Jon Corson-Rikert July 23, 2013 Workshop options Presentation what is VIVO, the technologies it uses, and how its used at Cornell and elsewhere Self-paced

More information

Getting started with GitHub

Getting started with GitHub Getting started with GitHub A beginner s guide. (There s no code in this slide deck!) Presented by Quinn Supplee https://github.com/quinns What is GitHub? GitHub is a code hosting platform for version

More information

SeedMe Building Blocks

SeedMe Building Blocks Building data sharing cloud on Drupal 8 SeedMe Building Blocks Amit Chourasia & David Nadeau San Diego Supercomputer Center, UC San Diego Bay Area Drupal Camp Berkeley, CA Oct 20, 2017 Project website:

More information

PDS 2010 System Design Report

PDS 2010 System Design Report PDS 2010 System Design Report MC Face-to-Face St. Louis, MO August 16-17, 2010 Topics Overall Progress Test Collection Ingestion Build 1 System Deliverables Component Progress - Registry, Harvest, Security

More information

The Avalon Media System

The Avalon Media System The Avalon Media System A Next- Genera8on Solu8on for Media Management and Access Jon Dunn Mark Notess IU Digital Library Brown Bag Series 27 March 2013 About Us Jon Dunn Interim Assistant Dean for Library

More information

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria

Taking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Taking a view on bio-ontologies Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Who we are European Bioinformatics Institute one of world s largest bio data and service providers

More information

eveloping DataMed the current status

eveloping DataMed the current status eeloping DataMed the current status Hua Xu Core Deelopment Team (CDT) biocaddie AHM 2017 8/8/17 Supported by the NIH grant 1U24 AI117966-01 to the Uniersity of California, San Diego 1 Outline CDT Roles

More information

Embracing Semantic Technology for Better Metadata Authoring in Biomedicine

Embracing Semantic Technology for Better Metadata Authoring in Biomedicine Embracing Semantic Technology for Better Metadata Authoring in Biomedicine Attila L. Egyedi, Martin J. O Connor, Marcos Martínez-Romero, Debra Willrett, Josef Hardi, John Graybeal, and Mark A. Musen Stanford

More information

Sharing Archival Metadata MODULE 20. Aaron Rubinstein

Sharing Archival Metadata MODULE 20. Aaron Rubinstein Sharing Archival Metadata 297 MODULE 20 SHARING ARCHivaL METADATA Aaron Rubinstein 348 Putting Descriptive Standards to Work The Digital Public Library of America s Application Programming Interface and

More information

Oracle. Engagement Cloud Using Service Request Management. Release 12

Oracle. Engagement Cloud Using Service Request Management. Release 12 Oracle Engagement Cloud Release 12 Oracle Engagement Cloud Part Number E73284-05 Copyright 2011-2017, Oracle and/or its affiliates. All rights reserved. Author: Joseph Kolb This software and related documentation

More information

Jisc Research Data Discovery Service Project Workshop Christopher Brown

Jisc Research Data Discovery Service Project Workshop Christopher Brown 18 Feb 2016 Jisc Research Data Discovery Service Project Workshop Christopher Brown Agenda» 10:30 10:40 Welcome and Introduction - Catherine Grout» 10:40 10:45 Project status and introduction to workshop/exercise

More information

Project Tutorial. CS 143 Communication Network. Ruijia Sun Zilong Chen. Based on version 2015, authored by Ritvik Mishra

Project Tutorial. CS 143 Communication Network. Ruijia Sun Zilong Chen. Based on version 2015, authored by Ritvik Mishra Project Tutorial CS 143 Communication Network Ruijia Sun Zilong Chen Based on version 2015, authored by Ritvik Mishra Outline Project Overview Requirements Network Components Design Remarks Tools Schedule

More information

Tania Tudorache Stanford University. - Ontolog forum invited talk04. October 2007

Tania Tudorache Stanford University. - Ontolog forum invited talk04. October 2007 Collaborative Ontology Development in Protégé Tania Tudorache Stanford University - Ontolog forum invited talk04. October 2007 Outline Introduction and Background Tools for collaborative knowledge development

More information

DaMaRO (Data Management Roll-out at Oxford): DataBank & DataFinder

DaMaRO (Data Management Roll-out at Oxford): DataBank & DataFinder DaMaRO (Data Management Roll-out at Oxford): DataBank & DataFinder Thursday 25 th October, 2012 James A J Wilson James.wilson@oucs.ox.ac.uk Damaro DataBank Oxford s in-development data archive Intended

More information

Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University

Update on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University Update on Dataverse Image credit: David Bygott (CC-BY-NC-SA) 2014 Dryad-Dataverse Community Meeting Mercè Crosas, Elizabeth Quigley & Eleni Castro Data Science > IQSS > Harvard University Introduction

More information

Metadata Discovery and Integration to Support Repurposing of Heterogeneous Data using the OpenFurther Platform

Metadata Discovery and Integration to Support Repurposing of Heterogeneous Data using the OpenFurther Platform Metadata Discovery and Integration to Support Repurposing of Heterogeneous Data using the OpenFurther Platform biocaddie All Hands Meeting September 11 th, 2016 Ram Gouripeddi & Julio Facelli Department

More information

Big Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago

Big Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago Assembly Annota3on Modeling Design Big Data, Big Compute, Big Interac3on Machines for Future Biology Rick Stevens stevens@anl.gov Argonne Na3onal Laboratory The University of Chicago There are no solved

More information

Con$nuous Integra$on Development Environment. Kovács Gábor

Con$nuous Integra$on Development Environment. Kovács Gábor Con$nuous Integra$on Development Environment Kovács Gábor kovacsg@tmit.bme.hu Before we start anything Select a language Set up conven$ons Select development tools Set up development environment Set up

More information

Data Management Plans

Data Management Plans Data Management Plans JENNIFER L. THOEGERSEN, DATA CURATION LIBRARIAN NURAMP WORKSHOP SERIES MARCH 17, 2016 Jenny Thoegersen, Data Curation Librarian University of Nebraska-Lincoln Libraries jthoegersen2@unl.edu

More information

A Data Citation Roadmap for Scholarly Data Repositories

A Data Citation Roadmap for Scholarly Data Repositories A Data Citation Roadmap for Scholarly Data Repositories Tim Clark (Harvard Medical School & Massachusetts General Hospital) Martin Fenner (DataCite) Mercè Crosas (Institute for Quantiative Social Science,

More information

Webinar Annotate data in the EUDAT CDI

Webinar Annotate data in the EUDAT CDI Webinar Annotate data in the EUDAT CDI Yann Le Franc - e-science Data Factory, Paris, France March 16, 2017 This work is licensed under the Creative Commons CC-BY 4.0 licence. Attribution: Y. Le Franc

More information

ONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES

ONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES ONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES Debashis Naskar 1 and Biswanath Dutta 2 DSIC, Universitat Politècnica de València 1 DRTC, Indian Statistical Institute 2 OUTLINE Introduction

More information

SharePoint 2013 End User Level II

SharePoint 2013 End User Level II SharePoint 2013 End User Level II Course 55052A; 3 Days, Instructor-led Course Description This 3-day course explores several advanced topics of working with SharePoint 2013 sites. Topics include SharePoint

More information

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST)

Building on Existing Communities: the Virtual Astronomical Observatory (and NIST) Building on Existing Communities: the Virtual Astronomical Observatory (and NIST) Robert Hanisch Space Telescope Science Institute Director, Virtual Astronomical Observatory Data in astronomy 2 ~70 major

More information

Getting Started Guide

Getting Started Guide Getting Started Guide for education accounts Setup Manual Edition 7 Last updated: September 15th, 2016 Note: Click on File and select Make a copy to save this to your Google Drive, or select Print, to

More information

Agile Methodologies via Kanban and GitHub

Agile Methodologies via Kanban and GitHub Agile Methodologies via Kanban and GitHub Presented to IT4Innovations 2017 Participants Rinku Gupta Argonne National Laboratory VŠB - Technical University Ostrava Date : Oct 5th, 2017 Slide Courtesy: Mike

More information

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS) Institutional Repository using DSpace Yatrik Patel Scientist D (CS) yatrik@inflibnet.ac.in What is Institutional Repository? Institutional repositories [are]... digital collections capturing and preserving

More information

UC San Diego Mobile App: The Next Generation

UC San Diego Mobile App: The Next Generation UC San Diego Mobile App: The Next Generation Submitter: Alex Wu, UC San Diego, Information Technology Services, Web Technology Services Manager, a6wu@ucsd.edu Team Members: Brett Pollak, UC San Diego,

More information

Step 1 - Learning & Discovery

Step 1 - Learning & Discovery OUR PROCESS Our goal as a integrated marketing agency is to provide clients with the most creative and effective solutions for their business. Reaching that goal begins with our Learning and Discovery

More information

What is git? Distributed Version Control System (VCS); Created by Linus Torvalds, to help with Linux development;

What is git? Distributed Version Control System (VCS); Created by Linus Torvalds, to help with Linux development; What is git? Distributed Version Control System (VCS); Created by Linus Torvalds, to help with Linux development; Why should I use a VCS? Repositories Types of repositories: Private - only you and the

More information

FINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA

FINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA FINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA MODELDR & MARKLOGIC - DATA POINT MODELING MARKLOGIC WHITE PAPER JUNE 2015 CHRIS ATKINSON Contents Regulatory Satisfaction is Increasingly Difficult

More information

PHP Composer 9 Benefits of Using a Binary Repository Manager

PHP Composer 9 Benefits of Using a Binary Repository Manager PHP Composer 9 Benefits of Using a Binary Repository Manager White Paper Copyright 2017 JFrog Ltd. March 2017 www.jfrog.com Executive Summary PHP development has become one of the most popular platforms

More information

Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains

Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains Eloise Currie and Mary Parmelee SAS Institute, Cary NC About SAS: The Power to Know SAS: The Market Leader in

More information

Linked Data: Fast, low cost semantic interoperability for health care?

Linked Data: Fast, low cost semantic interoperability for health care? Linked Data: Fast, low cost semantic interoperability for health care? About the presentation Part I: Motivation Why we need semantic operability in health care Why enhancing existing systems to increase

More information

ISMTE Best Practices Around Data for Journals, and How to Follow Them" Brooks Hanson Director, Publications, AGU

ISMTE Best Practices Around Data for Journals, and How to Follow Them Brooks Hanson Director, Publications, AGU ISMTE Best Practices Around Data for Journals, and How to Follow Them" Brooks Hanson Director, Publications, AGU bhanson@agu.org 1 Recent Alignment by Publishers, Repositories, and Funders Around Data

More information

Interoperability in Science Data: Stories from the Trenches

Interoperability in Science Data: Stories from the Trenches Interoperability in Science Data: Stories from the Trenches Karen Stocks University of California San Diego Open Data for Open Science Data Interoperability Microsoft escience Workshop 2012 Interoperability

More information

CODE AND DATA MANAGEMENT. Toni Rosati Lynn Yarmey

CODE AND DATA MANAGEMENT. Toni Rosati Lynn Yarmey CODE AND DATA MANAGEMENT Toni Rosati Lynn Yarmey Data Management is Important! Because Reproducibility is the foundation of science Journals are starting to require data deposit You want to get credit

More information

W3C CASE STUDY. Teamwork on Open Standards Development Speeds Industry Adoption

W3C CASE STUDY. Teamwork on Open Standards Development Speeds Industry Adoption January 2017 W3C CASE STUDY Teamwork on Open Standards Development Speeds Industry Adoption Like driving a long stretch of open road alone, standards development work can be a lonely endeavor. But with

More information

Reproducible Workflows Biomedical Research. P Berlin, Germany

Reproducible Workflows Biomedical Research. P Berlin, Germany Reproducible Workflows Biomedical Research P11 2018 Berlin, Germany Contributors Leslie McIntosh Research Data Alliance, U.S., Executive Director Oya Beyan Aachen University, Germany Anthony Juehne RDA,

More information