Core Technology Development Team Meeting
|
|
- Barnard Rose
- 5 years ago
- Views:
Transcription
1 Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: Access Code: For international call in numbers, please visit:
2 Agenda v Harvester PP presentation: Chris Mungall, Lawrence Berkeley National Laboratory: Distributed Data Discovery using GitHub, YAML and Markdown v Updates of last meeting action items v Brief updates from All Supported by the NIH grant 1U24 AI to the University of California, San Diego 2
3 Updates Action Items v Visit datamed.biocaddie.org and provide feedback for the new version, v0.2 via the prototype_issues repository in GitHub v Diagram showing biocaddie with Aztec and Repositories. CEDAR to be added to the diagram and shared with the group Jeff/Ian v Abstract submission deadline extended to 10/23, will send draft tomorrow to the group - Anu Supported by the NIH grant 1U24 AI to the University of California, San Diego 3
4 BioCADDIE Harvester Project: Distributed Data Discovery using GitHub, YAML and Markdown Chris Mungall Lawrence Berkeley Na6onal Laboratory
5 Well it could be worse, at least I m not doing metadata
6 How can we make it easier to a>ach Even fun? Idea: metadata? Target tech- savvy subset of data providers Provide guidelines and tooling that mesh with with their work habits If it works, extend to a wider variety of providers
7 Outline Background Observa6ons on metadata, code and datasets Preliminary related work: OBO Foundry Ontology Registry Global Bio6c Interac6ons (GloBI) Approach Vision Milestones (Now March 2016)
8 Coders love providing metadata Can we get dataset providers to share this love?
9
10 (1) Experience with the Open Biomedical Ontologies (OBO) Registry Background: High quality ontology metadata for ~100 ontologies Curated by OBO team (unfunded) Problem Metadata format lacked richness and extensibility Metadata could not be directly edited by ontology providers
11 (1) Experience with the Open Biomedical Ontologies (OBO) Registry Solu6on h>p://obofoundry.github.io/ Manage metadata files on GitHub Use YAML embedded in Markdown Anyone can submit a Pull Request Changes instantly visible using GitHub pages
12 Pull Requests allow for transparent distributed asynchronous edi6ng of content
13
14 (2) Experience with GloBI h>p://globalbio6cinterac6ons.org/ A search tool for finding species interac6on data What fungi infect bats in SW USA? Indexes 12m species- interac6ons from 15k studies Sources: inaturalist, Dryad, specialist databases, random websites, ad- hoc formats V1: Adding a new datasource required changing the centralized data inges6on code Problem: How to decentralize dataset contribu6ons
15 (2) Experience with GloBI Solu6on Simple metadata file specifica6on globi.json file Goes in top level of a github repository JSON(LD) Data can be in same repo, or external (e.g. dryad) Typically CSV GloBI central database periodically harvests data
16 Approach for biocaddie Harvester project Define format for a simple file that can be included in the root project directory YAML/Markdown Can be placed In the same directory tree as the data Externally, with a pointer to external data (e.g. figshare, dryad) Leverage GitHub social coding ecosystem for Distributed edi6ng of metadata files Cloud- based valida6on (Travis- CI) Pilot project Select exis6ng datasets and apply Demonstrate ability to search
17 researcher producing data (large dataset) deposits checks creates Travis-CI validation/ conversion SPARQL queries semweb user DataSet C DataSet B Cloud Storage / NIH Commons / Git LFS etc describes YAML Repo B YAML DataSet A GitHub / GitLab etc Repo A conversion JSON-LD / RDF HCLS compliant Linked Data Cloud content --- title: skin microbe analysis author: orcid: name: Carrie github: carriemicrobe types: - microbiome - computational datasets: - id: site: axilla - id: site: oral --- my notes here... Example YAML embedded in Markdown BioCaddie Harvester generates Data Index Search Service BioCADDIE creates + deposits finds via biocaddie searches researcher producing data (small dataset)
18 Current Progress and Project Current loca6on: Repository h>ps://github.com/cmungall/biocaddie- gym [Ques6on for biocaddie CDT: Should this move to a biocaddie github org?] Milestones: h>ps://github.com/cmungall/biocaddie- gym/ milestones Await project ini6a6on
19 Milestone 1: YAML Schema Goal: Define the format of a biocaddie.yml file that people can add to the top level of a directory Either the same directory that contains project data Or a pointer to it Why YAML? Simple, easy to author Extensible and nestable Can be embedded inside markdown files Superset of JSON (so can use JSON- LD) YAML- LD Automa6c equivalence to JSON- LD and to RDF Coordinate this with HCLS h>p:// dataset h>ps://github.com/cmungall/biocaddie- gym/milestones/yaml%20schema --- title: skin microbe analysis author: orcid: name: Carrie github: carriemicrobe types: - microbiome - computational datasets: - loc: ds1.tsv site: axilla - loc: ds2.tsv site: oral --- my notes here...
20 researcher producing data (large dataset) creates + deposits deposits creates researcher producing data (small dataset) DataSet C DataSet B Cloud Storage / NIH Commons / Git LFS etc describes YAML Repo B YAML DataSet A GitHub / GitLab etc Repo A content --- title: skin microbe analysis author: orcid: name: Carrie github: carriemicrobe types: - microbiome - computational datasets: - loc: ds1.tsv site: axilla - loc: ds2.tsv site: oral --- my notes here...
21 Milestone 2: biocaddie yaml validator Goal: Ensure biocaddie.yml files are syntac6cally and structurally valid Consistent with W3C HCLS Dataset descrip6on profile Prevent accidental typos that could hamper indexing efforts Approach: Write a simple python validator Integrate this with the GitHub travis con6nuous integra6on system h>ps://github.com/cmungall/biocaddie- gym/milestones/validator
22 researcher producing data (large dataset) deposits checks creates Travis-CI validation/ conversion DataSet C DataSet B Cloud Storage / NIH Commons / Git LFS etc describes YAML Repo B YAML DataSet A GitHub / GitLab etc Repo A
23
24 Goal: Milestone 3: Ultra- lite publishing framework Allow metadata providers to see the authored metadata in a less geeky format Approach: Create HTML templates for rendering dataset descrip6on YMLs using GitHub Pages / Jekyll framework h>ps://github.com/cmungall/biocaddie- gym/milestones/publishing
25 Milestone 4: Searchable index Goal: Demonstrate findability and searchability Approach: Query over git hos6ng sites for any repos that have biocaddie.{yml,md} file Index [Q: How involved should the biocaddie CDT be involved here?] h>ps://github.com/cmungall/biocaddie- gym/milestones/searchable%20index
26 Milestone 5: Pilot project Pilot project using exis6ng published datasets already in GitHub Datasets TBD Open to sugges6ons from biocaddie groups? Possibili6es: Exis6ng datasets indexed by h>p://monarchini6a6ve.org Exis6ng datasets on GitHub Evaluate h>ps://github.com/cmungall/biocaddie- gym/milestones/pilot
27 What next? Will this scale? Goals of pilot: Demonstrate feasibility with tech- savvy users Next steps, making it more accessible: [to be aligned with overall biocaddie goals] Provide authoring tools Integrate with dataset repositories (figshare, dryad) Work with git{hub,lab} to make more accessible?
28 Ongoing work Task Status 1 Metadata Ingestion Import repositories PDB,GEO 2. LINCS 3. BioProject, ArrayExpress, GEMMA, dbgap 4. ICPSR Stable API details Ongoing Sample files 1.2 Metadata mapping Ongoing 1.3 Metadata management Ongoing 1.4 Indexing Ongoing 2 Terminology server 2.1 Develop terminology server 1) Imported terminologies (6) and validated them 2) Created UI-Browser for TS 3) Integration to Scigraph API 4) Create auto complete feature 09/01 10/09 Ongoing 10/ Integrate terminology server Ongoing Supported by the NIH grant 1U24 AI to the University of California, San Diego 28
29 Pilot project integration (Task 3) PP Presented to CDT / / / /01 As Integrated Specialized advanced search for GWAS datasets Ranking function based on citation metrics for GEO series data a) isee similarity metric in ElasticSearch b) DELVE implementation as exploratory search and visualization option. (i) for PDB (ii) for gene expression data Ranking function based on citation metrics (dataset mentions) for PDB data Completed On 09/22 Ongoing (10/22) 9/01 Ongoing (11/10) Ongoing (10/27) Supported by the NIH grant 1U24 AI to the University of California, San Diego 29
30 Ongoing work Task Status 4 Interface Design 4.1 Global statistics Implemented 4.2 Design interface Ongoing 4.3 Implement new design Ongoing 4.4 Breadcrumb for website navigation Not started 4.5 Display most Accessed Datasets Not Started 5 Personalized search 5.1 Search history Implemented 5.2 Save search results Not Started 5.3 Share search results Not Started 5.4 User account - - Discussion Not Started 6 Searching/Ranking algorithms 6.1 Similar datasets Implemented 6.2 Data repositories search Ongoing 6.3 Boolean/advanced search Not Started 6.4 Refine search results based on user s selection Not Started Supported by the NIH grant 1U24 AI to the University of California, San Diego 30
31 Ongoing work Task Status 7 Display of results 7.1 Sort datasets Ongoing 7.2 Browsing (grouping facets/metadata) Not started 7.3 Accessibility information Not started 8 Link to external resources 8.1 Pubmed Ongoing 8.2 Grants Not Started 9 Feedback 9.1 GitHub Implemented 9.2 Feedback form Not Started 10 Documentation 10.1 Source code Not Started 10.2 Tutorials Not Started 11 Usability studies 11.1 UI Analysis Completed 11.2 User studies Not Started 12 Data Duplication issue Supported by the NIH grant 1U24 AI to the University of California, San Diego 31
32 Other issues v Please deposit codes in GitHub. Please contact me at Anupama.E.Gururaj@uth.tmc.edu if you need access v Any other issues? v Thank You
Core Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationSteering Committee Meeting
Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationExecutive Committee Meeting
Executive Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationExecutive Committee Meeting
Executive Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationMetadata Ingestion and Processinng
biomedical and healthcare Data Discovery Index Ecosystem Ingestion and Processinng Jeffrey S. Grethe, Ph.D. 2017 BioCADDIE All Hands Meeting prototype Ingestion Indexing Repositories Ingestion ElasticSearch
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationSteering Committee Meeting
Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationExecutive Committee Meeting
Executive Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationAgenda. Clarification of issues Quarter definition Steering and Executive Committee composition Dissemination and community outreach activities
Agenda Clarification of issues Quarter definition Steering and Executive Committee composition Dissemination and community outreach activities Progress and updates Y1Q3 and plans for Y1Q4 Plan for the
More informationSteering Committee Meeting
Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please isit: https://www.readytalk.com/account-administration/international-numbers
More informationThe Final Updates. Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University of Oxford, UK
The Final Updates Supported by the NIH grant 1U24 AI117966-01 to UCSD PI, Co-Investigators at: Philippe Rocca-Serra Alejandra Gonzalez-Beltran, Susanna-Assunta Sansone, Oxford e-research Centre, University
More informationMinutes. Date: Location: UCSD BRF2 5A03. Attendees Present
Executive Committee Meeting Location: UCSD BRF2 5A03 Date: 8-16-16 Start time: 10:00 am PDT End time: 11:30 am PDT Meeting Objective Attendees Present Minute Taker Executive Committee Meeting UCSD: Lucila
More informationSteering Committee Meeting
Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationExecutive Committee Meeting
Executive Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationThe Materials Data Facility
The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationCore Technology Development Team Meeting
Core Technology Development Team Meeting Agenda v Updates regarding last meeting action items v Presentation by Ergin about Ontology Services v Brief updates from others Supported by the NIH grant 1U24
More informationData publication and discovery with Globus
Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,
More informationHarmonizing biocaddie Metadata Schemas for Indexing Clinical Research Datasets Using Semantic Web Technologies
Harmonizing biocaddie Metadata Schemas for Indexing Clinical Research Datasets Using Semantic Web Technologies Harold R. Solbrig 1, Guoqian Jiang 1 1 Mayo Clinic College of Medicine, Rochester, MN [solbrig.harold,
More informationIntroducing the Springer Nature Data Support Services
Introducing the Springer Nature Data Support Services 1 What motivates researchers to share data? 97% - to accelerate research and its applications 1 96% - increased visibility and discovery of their research
More informationMarkus Kaindl Senior Manager Semantic Data Business Owner SN SciGraph
Analytics Building business tools for the scholarly publishing domain using LOD and the ELK stack SEMANTiCS Vienna 2018 Markus Kaindl Senior Manager Semantic Data Business Owner SN SciGraph 1 Agenda (25
More informationSusanna-Assunta Sansone, PhD. Metadata WG3 chair.
Susanna-Assunta Sansone, PhD Metadata WG3 chair 3-workgroup@biocaddie.org WG3 Metadata v v Full description: goals, synergies, phases, members & files Joint effort with BD2K Center for Expanded Data Annotation
More informationOpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond
Alessia Bardi and Paolo Manghi, Institute of Information Science and Technologies CNR Katerina Iatropoulou, ATHENA, Iryna Kuchma and Gwen Franck, EIFL Pedro Príncipe, University of Minho OpenAIRE Fostering
More informationMinimal Metadata Standards and MIIDI Reports
Dryad-UK Workshop Wolfson College, Oxford 12 September 2011 Minimal Metadata Standards and MIIDI Reports David Shotton, Silvio Peroni and Tanya Gray Image BioInformatics Research Group Department of Zoology
More informationAzure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region
Azure DevOps Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region What is DevOps? People. Process. Products. Build & Test Deploy DevOps is the union of people, process, and products to
More informationCrowdsourcing Codebook Enhancements A DDI-based Approach
Crowdsourcing Codebook Enhancements A DDI-based Approach FCSM, December 2 nd 2015 Lars Vilhuber (Cornell University) Benjamin Perry (Cornell University) Venkata Kambhampaty (Cornell University) Kyle Brumsted
More informationLinking data and publications the past, present, and future. Dr. Hylke Koers, Head of Content Innovation, Elsevier
Linking data and publications the past, present, and future Dr. Hylke Koers, Head of Content Innovation, Elsevier BioCADDIE webinar January 8, 2015 Ease of access Open Access 2 The issue: data is important,
More informationJanuary 16, Re: Request for Comment: Data Access and Data Sharing Policy. Dear Dr. Selby:
Dr. Joe V. Selby, MD, MPH Executive Director Patient-Centered Outcomes Research Institute 1828 L Street, NW, Suite 900 Washington, DC 20036 Submitted electronically at: http://www.pcori.org/webform/data-access-and-data-sharing-policypublic-comment
More informationSATURN Update. DAML PI Meeting Dr. A. Joseph Rockmore 25 May 2004
SATURN Update DAML PI Meeting Dr. A. Joseph Rockmore 25 May 2004 SATURN: Needs and Challenges [1 of 2]! SATURN = semantic access to time-ordered url s and related information! Objective: easier and more
More informationFROM VSTS TO AZURE DEVOPS
#DOH18 FROM VSTS TO AZURE DEVOPS People. Process. Products. Gaetano Paternò @tanopaterno info@gaetanopaterno.it 2 VSTS #DOH18 3 Azure DevOps Azure Boards (ex Work) Deliver value to your users faster using
More informationRENKU - Reproduce, Reuse, Recycle Research. Rok Roškar and the SDSC Renku team
RENKU - Reproduce, Reuse, Recycle Research Rok Roškar and the SDSC Renku team Renku-Reana workshop @ CERN 26.06.2018 Goals of Renku 1. Provide the means to create reproducible data science 2. Facilitate
More informationMaking the most of metadata with Metadata 2020
Making the most of metadata with Metadata 2020 Patricia Feeney, Crossref and Metadata2020 CSE Annual Meeting April 2018 What is Metadata 2020? Metadata 2020 is a collaboration that advocates richer, connected,
More informationGit and GitHub. Dan Wysocki. February 12, Dan Wysocki Git and GitHub February 12, / 48
Git and GitHub Dan Wysocki February 12, 2015 Dan Wysocki Git and GitHub February 12, 2015 1 / 48 1 Version Control 2 Git 3 GitHub 4 Walkthrough Dan Wysocki Git and GitHub February 12, 2015 2 / 48 Version
More informationAg Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library
Ag Data Commons: Harnessing the Power of Digital Agriculture Cynthia Parr USDA ARS National Agricultural Library Live poll at: https://pollev.com/ cyndyparr196 Problems with Public Ag Data Government Website
More informationDataSTORRE Deposit Guide
DataSTORRE Deposit Guide Introduction DataStorre is an online digital repository of multi-disciplinary research datasets produced at the University of Stirling. University of Stirling researchers who have
More informationCall for Participation in AIP-6
Call for Participation in AIP-6 GEOSS Architecture Implementation Pilot (AIP) Issue Date of CFP: 9 February 2013 Due Date for CFP Responses: 15 March 2013 Introduction GEOSS Architecture Implementation
More informationB2SAFE metadata management
B2SAFE metadata management version 1.2 by Claudio Cacciari, Robert Verkerk, Adil Hasan, Elena Erastova Introduction The B2SAFE service provides a set of functions for long term bit stream data preservation:
More informationDemos: DMP Assistant and Dataverse
Demos: DMP Assistant and Dataverse Alexandra Cooper, Data Services Coordinator, Queen s University Meghan Goodchild, RDM Systems Librarian, Queen s University/Scholars Portal Overview of session Research
More informationSharePoint 2013 End User Level II
Course 55052A: SharePoint 2013 End User Level II Course Details Course Outline Module 1: Overview A simple introduction module. Understand your course, classroom, classmates, facility and instructor. Module
More informationPowering Linked Open Data Applications
Powering Linked Open Data Applications With Fedora and Islandora CLAW David Wilcox, DuraSpace dwilcox@duraspace.org About DuraSpace DuraSpace is not for profit organization that provides leadership and
More informationData Exchange and Conversion Utilities and Tools (DExT)
Data Exchange and Conversion Utilities and Tools (DExT) Louise Corti, Angad Bhat, Herve L Hours UK Data Archive CAQDAS Conference, April 2007 An exchange format for qualitative data Data exchange models
More informationBPMN Processes for machine-actionable DMPs
BPMN Processes for machine-actionable DMPs Simon Oblasser & Tomasz Miksa Contents Start DMP... 2 Specify Size and Type... 3 Get Cost and Storage... 4 Storage Configuration and Cost Estimation... 4 Storage
More informationNCBI News, November 2009
Peter Cooper, Ph.D. NCBI cooper@ncbi.nlm.nh.gov Dawn Lipshultz, M.S. NCBI lipshult@ncbi.nlm.nih.gov Featured Resource: New Discovery-oriented PubMed and NCBI Homepage The NCBI Site Guide A new and improved
More informationwarwick.ac.uk/lib-publications
Original citation: Zhao, Lei, Lim Choi Keung, Sarah Niukyun and Arvanitis, Theodoros N. (2016) A BioPortalbased terminology service for health data interoperability. In: Unifying the Applications and Foundations
More informationSemantic Web for Earth and Environmental Terminology (SWEET) Status, Future Development and Community Building
Semantic Web for Earth and Environmental Terminology (SWEET) 2018 Status, Future Development and Community Building 2 Agenda and Purpose Current status of SWEET e.g. What has the community been doing?
More informationIAALD/2013 World Congress. VIVO Workshop. Brian J. Lowe Jon Corson-Rikert
IAALD/2013 World Congress VIVO Workshop Brian J. Lowe Jon Corson-Rikert July 23, 2013 Workshop options Presentation what is VIVO, the technologies it uses, and how its used at Cornell and elsewhere Self-paced
More informationGetting started with GitHub
Getting started with GitHub A beginner s guide. (There s no code in this slide deck!) Presented by Quinn Supplee https://github.com/quinns What is GitHub? GitHub is a code hosting platform for version
More informationSeedMe Building Blocks
Building data sharing cloud on Drupal 8 SeedMe Building Blocks Amit Chourasia & David Nadeau San Diego Supercomputer Center, UC San Diego Bay Area Drupal Camp Berkeley, CA Oct 20, 2017 Project website:
More informationPDS 2010 System Design Report
PDS 2010 System Design Report MC Face-to-Face St. Louis, MO August 16-17, 2010 Topics Overall Progress Test Collection Ingestion Build 1 System Deliverables Component Progress - Registry, Harvest, Security
More informationThe Avalon Media System
The Avalon Media System A Next- Genera8on Solu8on for Media Management and Access Jon Dunn Mark Notess IU Digital Library Brown Bag Series 27 March 2013 About Us Jon Dunn Interim Assistant Dean for Library
More informationTaking a view on bio-ontologies. Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria
Taking a view on bio-ontologies Simon Jupp Functional Genomics Production Team ICBO, 2012 Graz, Austria Who we are European Bioinformatics Institute one of world s largest bio data and service providers
More informationeveloping DataMed the current status
eeloping DataMed the current status Hua Xu Core Deelopment Team (CDT) biocaddie AHM 2017 8/8/17 Supported by the NIH grant 1U24 AI117966-01 to the Uniersity of California, San Diego 1 Outline CDT Roles
More informationEmbracing Semantic Technology for Better Metadata Authoring in Biomedicine
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine Attila L. Egyedi, Martin J. O Connor, Marcos Martínez-Romero, Debra Willrett, Josef Hardi, John Graybeal, and Mark A. Musen Stanford
More informationSharing Archival Metadata MODULE 20. Aaron Rubinstein
Sharing Archival Metadata 297 MODULE 20 SHARING ARCHivaL METADATA Aaron Rubinstein 348 Putting Descriptive Standards to Work The Digital Public Library of America s Application Programming Interface and
More informationOracle. Engagement Cloud Using Service Request Management. Release 12
Oracle Engagement Cloud Release 12 Oracle Engagement Cloud Part Number E73284-05 Copyright 2011-2017, Oracle and/or its affiliates. All rights reserved. Author: Joseph Kolb This software and related documentation
More informationJisc Research Data Discovery Service Project Workshop Christopher Brown
18 Feb 2016 Jisc Research Data Discovery Service Project Workshop Christopher Brown Agenda» 10:30 10:40 Welcome and Introduction - Catherine Grout» 10:40 10:45 Project status and introduction to workshop/exercise
More informationProject Tutorial. CS 143 Communication Network. Ruijia Sun Zilong Chen. Based on version 2015, authored by Ritvik Mishra
Project Tutorial CS 143 Communication Network Ruijia Sun Zilong Chen Based on version 2015, authored by Ritvik Mishra Outline Project Overview Requirements Network Components Design Remarks Tools Schedule
More informationTania Tudorache Stanford University. - Ontolog forum invited talk04. October 2007
Collaborative Ontology Development in Protégé Tania Tudorache Stanford University - Ontolog forum invited talk04. October 2007 Outline Introduction and Background Tools for collaborative knowledge development
More informationDaMaRO (Data Management Roll-out at Oxford): DataBank & DataFinder
DaMaRO (Data Management Roll-out at Oxford): DataBank & DataFinder Thursday 25 th October, 2012 James A J Wilson James.wilson@oucs.ox.ac.uk Damaro DataBank Oxford s in-development data archive Intended
More informationUpdate on Dataverse Dryad-Dataverse Community Meeting. Mercè Crosas, Elizabeth Quigley & Eleni Castro. Data Science > IQSS > Harvard University
Update on Dataverse Image credit: David Bygott (CC-BY-NC-SA) 2014 Dryad-Dataverse Community Meeting Mercè Crosas, Elizabeth Quigley & Eleni Castro Data Science > IQSS > Harvard University Introduction
More informationMetadata Discovery and Integration to Support Repurposing of Heterogeneous Data using the OpenFurther Platform
Metadata Discovery and Integration to Support Repurposing of Heterogeneous Data using the OpenFurther Platform biocaddie All Hands Meeting September 11 th, 2016 Ram Gouripeddi & Julio Facelli Department
More informationBig Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago
Assembly Annota3on Modeling Design Big Data, Big Compute, Big Interac3on Machines for Future Biology Rick Stevens stevens@anl.gov Argonne Na3onal Laboratory The University of Chicago There are no solved
More informationCon$nuous Integra$on Development Environment. Kovács Gábor
Con$nuous Integra$on Development Environment Kovács Gábor kovacsg@tmit.bme.hu Before we start anything Select a language Set up conven$ons Select development tools Set up development environment Set up
More informationData Management Plans
Data Management Plans JENNIFER L. THOEGERSEN, DATA CURATION LIBRARIAN NURAMP WORKSHOP SERIES MARCH 17, 2016 Jenny Thoegersen, Data Curation Librarian University of Nebraska-Lincoln Libraries jthoegersen2@unl.edu
More informationA Data Citation Roadmap for Scholarly Data Repositories
A Data Citation Roadmap for Scholarly Data Repositories Tim Clark (Harvard Medical School & Massachusetts General Hospital) Martin Fenner (DataCite) Mercè Crosas (Institute for Quantiative Social Science,
More informationWebinar Annotate data in the EUDAT CDI
Webinar Annotate data in the EUDAT CDI Yann Le Franc - e-science Data Factory, Paris, France March 16, 2017 This work is licensed under the Creative Commons CC-BY 4.0 licence. Attribution: Y. Le Franc
More informationONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES
ONTOLOGY LIBRARIES: A STUDY FROM ONTOFIER AND ONTOLOGIST PERSPECTIVES Debashis Naskar 1 and Biswanath Dutta 2 DSIC, Universitat Politècnica de València 1 DRTC, Indian Statistical Institute 2 OUTLINE Introduction
More informationSharePoint 2013 End User Level II
SharePoint 2013 End User Level II Course 55052A; 3 Days, Instructor-led Course Description This 3-day course explores several advanced topics of working with SharePoint 2013 sites. Topics include SharePoint
More informationBuilding on Existing Communities: the Virtual Astronomical Observatory (and NIST)
Building on Existing Communities: the Virtual Astronomical Observatory (and NIST) Robert Hanisch Space Telescope Science Institute Director, Virtual Astronomical Observatory Data in astronomy 2 ~70 major
More informationGetting Started Guide
Getting Started Guide for education accounts Setup Manual Edition 7 Last updated: September 15th, 2016 Note: Click on File and select Make a copy to save this to your Google Drive, or select Print, to
More informationAgile Methodologies via Kanban and GitHub
Agile Methodologies via Kanban and GitHub Presented to IT4Innovations 2017 Participants Rinku Gupta Argonne National Laboratory VŠB - Technical University Ostrava Date : Oct 5th, 2017 Slide Courtesy: Mike
More informationInstitutional Repository using DSpace. Yatrik Patel Scientist D (CS)
Institutional Repository using DSpace Yatrik Patel Scientist D (CS) yatrik@inflibnet.ac.in What is Institutional Repository? Institutional repositories [are]... digital collections capturing and preserving
More informationUC San Diego Mobile App: The Next Generation
UC San Diego Mobile App: The Next Generation Submitter: Alex Wu, UC San Diego, Information Technology Services, Web Technology Services Manager, a6wu@ucsd.edu Team Members: Brett Pollak, UC San Diego,
More informationStep 1 - Learning & Discovery
OUR PROCESS Our goal as a integrated marketing agency is to provide clients with the most creative and effective solutions for their business. Reaching that goal begins with our Learning and Discovery
More informationWhat is git? Distributed Version Control System (VCS); Created by Linus Torvalds, to help with Linux development;
What is git? Distributed Version Control System (VCS); Created by Linus Torvalds, to help with Linux development; Why should I use a VCS? Repositories Types of repositories: Private - only you and the
More informationFINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA
FINANCIAL REGULATORY REPORTING ACROSS AN EVOLVING SCHEMA MODELDR & MARKLOGIC - DATA POINT MODELING MARKLOGIC WHITE PAPER JUNE 2015 CHRIS ATKINSON Contents Regulatory Satisfaction is Increasingly Difficult
More informationPHP Composer 9 Benefits of Using a Binary Repository Manager
PHP Composer 9 Benefits of Using a Binary Repository Manager White Paper Copyright 2017 JFrog Ltd. March 2017 www.jfrog.com Executive Summary PHP development has become one of the most popular platforms
More informationToward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains
Toward a Knowledge-Based Solution for Information Discovery in Complex and Dynamic Domains Eloise Currie and Mary Parmelee SAS Institute, Cary NC About SAS: The Power to Know SAS: The Market Leader in
More informationLinked Data: Fast, low cost semantic interoperability for health care?
Linked Data: Fast, low cost semantic interoperability for health care? About the presentation Part I: Motivation Why we need semantic operability in health care Why enhancing existing systems to increase
More informationISMTE Best Practices Around Data for Journals, and How to Follow Them" Brooks Hanson Director, Publications, AGU
ISMTE Best Practices Around Data for Journals, and How to Follow Them" Brooks Hanson Director, Publications, AGU bhanson@agu.org 1 Recent Alignment by Publishers, Repositories, and Funders Around Data
More informationInteroperability in Science Data: Stories from the Trenches
Interoperability in Science Data: Stories from the Trenches Karen Stocks University of California San Diego Open Data for Open Science Data Interoperability Microsoft escience Workshop 2012 Interoperability
More informationCODE AND DATA MANAGEMENT. Toni Rosati Lynn Yarmey
CODE AND DATA MANAGEMENT Toni Rosati Lynn Yarmey Data Management is Important! Because Reproducibility is the foundation of science Journals are starting to require data deposit You want to get credit
More informationW3C CASE STUDY. Teamwork on Open Standards Development Speeds Industry Adoption
January 2017 W3C CASE STUDY Teamwork on Open Standards Development Speeds Industry Adoption Like driving a long stretch of open road alone, standards development work can be a lonely endeavor. But with
More informationReproducible Workflows Biomedical Research. P Berlin, Germany
Reproducible Workflows Biomedical Research P11 2018 Berlin, Germany Contributors Leslie McIntosh Research Data Alliance, U.S., Executive Director Oya Beyan Aachen University, Germany Anthony Juehne RDA,
More information