Projects Tools BLAH proposal Conclusion. OntoGene/BioMeXT
|
|
- Marcus Lester
- 5 years ago
- Views:
Transcription
1 OntoGene/BioMeXT
2 The Bio Term Hub and OGER Lenz Furrer, Nico Colic, Fabio Rinaldi University of Zurich and Swiss Institute of Bioinformatics January 10, 2018
3 Outline Projects Tools BLAH proposal Conclusion
4 Topic Projects Tools BLAH proposal Conclusion
5 VetMine
6 VetMine
7 VetMine
8 VetMine
9 VetMine
10 PsyMine
11 Goal: discover causal interactions
12 From disorders to etiological factors
13 Creation of a reference corpus
14 An application: temporal trends
15 Author name disambiguation
16 SwissMADE: The challenge of clinical text SwissMADE (Monitoring of Adverse Drug Event) older patients (aged 65 years) antithrombotic drugs using structured and unstructured parts of the EHRs involves five hospitals [
17 MedMon Mining the web and social networks for mentions of Adverse Drug Reactions Collaboration with a major Pharma Company and another Swiss University I urgently need a junior PostDoc! PhD in Computational Linguistics, Computer Science or a related field Good programming skills, and proven expertise in Python Experience with Information Extraction and Text Mining Experience with machine learning approaches
18 Assisted curation The OntoGene/BioMeXT group has been active in assisted curation since 2010 with the SASEBio project (Semi-Automated Semantic Enrichment of the Biomedical Literature). Since 2013 we are collaborating with the RegulonDB database in a project aimed at testing and gradually introduce assisted curation techniques in their curation pipeline. RegulonDB is a database of the regulatory network of Escherichia coli K-12.
19 Example We additionally found that expression of the mntp gene is upregulated by manganese through MntR. Given: MntR [+] mntp To identify: condition [manganese]
20
21 OxyR experiment TOPIC: oxidative stress by OxyR CORPUS: 46 papers, curated in RegDB METHODS: automated annotations of entities via OntoGene, selection of sentences via ODIN filters, manual validation RESULTS: 100% of RIs retrieved, including TF, EFFECT and their TG Identified the growth conditions for 15 of the 20 Ris of OxyR checking only a limited set of sentences (about 10% of the article is read) [Gama-Castro et al., 2014]
22
23 Topic Projects Tools BLAH proposal Conclusion
24 Bio Term Hub
25
26 Bio Term Hub
27 BTH: Term Statistics
28 BTH: Term confusion matrix
29 OGER
30 OGER: annotation service The OntoGene s Biomedical Entity Recogniser (OGER) RESTful web service, using BTH terminologies Allows annotation of a collection of documents. Evaluated in the Bio Text Mining services challenge BioCreative/TIPS best results according to several of the evaluation metrics.
31 OGER: annotation service Annotates input text with entities from the BTH Except EntrezGene Can be used as a web demo (for annotation of single articles) or as a web service (batch). Input: PubMed, PubMed Central, Free Text Formats: text(i), BioC (I/O), pxml (I), tsv (O), brat (O), odin-xml (O) Note: user-provided terminologies can be used, but this is not yet supported by the interface and web service.
32 BioCreative V.5 / TIPS
33 OGER in TIPS
34
35 Term ambiguity
36 Term ambiguity
37 Term ambiguity
38
39 CRAFT corpus
40
41 Results [Anna Jancso, Lenz Furrer, Fabio Rinaldi, in preparation]
42 Previous history... [2006] BioCreative II: PPI (3rd), IMT (best) [2009] BioCreative II.5 PPI (best results); BioNLP [2010] BioCreative III: ACT, IMT, IAT [2011] CALBC (large scale entity extraction), BioNLP [2012] CTD task at BioCreative 2012 [2013] BioCreative IV: BioC, CTD, IAT
43 Topic Projects Tools BLAH proposal Conclusion
44 Use BTH/OGER through web API integration of BTH in another dictionary-based annotation platform usage of OGER web services Suggestion: integration with PubDictionaries/PubAnnotations
45 BTH: Rest API The Bio Term Hub can currently be accessed publicly through a web interface or (if installed locally) used through a command-line interface. To ease integration into automatic workflows, a REST API should be added.
46 BTH: JSON output The Bio Term Hub currently produces plain-text output (a TSV table). dense (as compared to e.g. XML), and straight-forward to parse in text-based processing environments for some applications JSON might be more suitable. We propose to evaluate different possible JSON representations and implement the best one. License: BSD 2-clause
47 OGER: BioC/JSON OGER supports BioC XML as both input and output. recently a JSON version of the BioC format has been defined. We propose to add support for this new format. While a possible approach would be to use the converter provided by the NCBI, it is preferable to use a solution with less overhead with respect to speed and memory consumption.
48 OGER: Format options in the web UI only a fraction of the API s options is exposed in the web interface only allows specifying input documents through an ID or by typing/pasting plain text into a text box. output is always an embedded HTML fragment with the annotations highlighted in color, which cannot easily be downloaded. We propose to extend the availables choices to the full range of input and output formats.
49 Topic Projects Tools BLAH proposal Conclusion
50 Conclusions Bio Term Hub: a one-stop site for obtaining up-to-date biomedical terminological resources OGER: an efficient text annotation tool using the BTH terminologies Provides spans and IDs (NER and CR) OGER-CR shown to be state-of-the-art But: disambiguation not yet included in web services
51 Thank you / どうもありがとうございます
Customisable Curation Workflows in Argo
Customisable Curation Workflows in Argo Rafal Rak*, Riza Batista-Navarro, Andrew Rowley, Jacob Carter and Sophia Ananiadou National Centre for Text Mining, University of Manchester, UK *Corresponding author:
More informationThe Text Analytics Challenge BioCreative V - Extraction of causal network information in BEL
The Text Analytics Challenge BioCreative V - Extraction of causal network information in BEL http://tinyurl.com/beltask Fabio Rinaldi Outline Biomedical text mining, motivation Competitive evaluations:
More informationImproving Interoperability of Text Mining Tools with BioC
Improving Interoperability of Text Mining Tools with BioC Ritu Khare, Chih-Hsuan Wei, Yuqing Mao, Robert Leaman, Zhiyong Lu * National Center for Biotechnology Information, 8600 Rockville Pike, Bethesda,
More informationOverview of BioCreative VI Precision Medicine Track
Overview of BioCreative VI Precision Medicine Track Mining scientific literature for protein interactions affected by mutations Organizers: Rezarta Islamaj Dogan (NCBI) Andrew Chatr-aryamontri (BioGrid)
More informationThe user interactive task (IAT) in BioCreative Challenges BioCreative Workshop on Text Mining Applications April 7, 2014
The user interactive task (IAT) in BioCreative Challenges BioCreative Workshop on Text Mining Applications April 7, 2014 N., PhD Research Associate Professor Protein Information Resource CBCB, University
More informationBioC: a minimalist approach to interoperability for biomedical text processing. Don Comeau
BioC: a minimalist approach to interoperability for biomedical text processing Don Comeau Outline Background and origin of BioC What is BioC? Available Tools and Corpora 2 BioCreative Critical Assessment
More informationA Framework for BioCuration (part II)
A Framework for BioCuration (part II) Text Mining for the BioCuration Workflow Workshop, 3rd International Biocuration Conference Friday, April 17, 2009 (Berlin) Martin Krallinger Spanish National Cancer
More informationA curation pipeline and web-services for PDF documents
A curation pipeline and web-services for PDF documents André Santos 1, Sérgio Matos 1, David Campos 2 and José Luís Oliveira 1 1 DETI/IEETA, University of Aveiro, 3810-193 Aveiro, Portugal {aleixomatos,andre.jeronimo,jlo}@ua.pt
More information@Note2 tutorial. Hugo Costa Ruben Rodrigues Miguel Rocha
@Note2 tutorial Hugo Costa (hcosta@silicolife.com) Ruben Rodrigues (pg25227@alunos.uminho.pt) Miguel Rocha (mrocha@di.uminho.pt) 23-01-2018 The document presents a typical workflow using @Note2 platform
More informationNatural Language Processing Pipelines to Annotate BioC Collections with an Application to the NCBI Disease Corpus
Natural Language Processing Pipelines to Annotate BioC Collections with an Application to the NCBI Disease Corpus Donald C. Comeau *, Haibin Liu, Rezarta Islamaj Doğan and W. John Wilbur National Center
More informationMaximizing the Value of STM Content through Semantic Enrichment. Frank Stumpf December 1, 2009
Maximizing the Value of STM Content through Semantic Enrichment Frank Stumpf December 1, 2009 What is Semantics and Semantic Processing? Content Knowledge Framework Technology Framework Search Text Images
More informationArchitecting C++ apps
Architecting C++ apps with a multi-device application platform John JT Thomas Director of Product Management jt@embarcadero.com @FireMonkeyPM blogs.embarcadero.com/jtembarcadero/ What is a multi-device
More informationOriginal article Using the OntoGene pipeline for the triage task of BioCreative 2012
Original article Using the OntoGene pipeline for the triage task of BioCreative 2012 Fabio Rinaldi 1, *, Simon Clematide 1, Simon Hafner 1, Gerold Schneider 1, Gintare_ Grigonyte_ 1, Martin Romacker 2
More informationPowering Knowledge Discovery. Insights from big data with Linguamatics I2E
Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural
More informationA few contributions of the SIFR project
A few contributions of the SIFR project Semantic Indexing of French biomedical Resources Data seminar- December 10th 2015 LIRMM University of Montpellier Clement Jonquet, Mathieu Roche, Sandra Bringay
More informationPMC text mining subset in BioC: 2.3 million full text articles and growing
PMC text mining subset in BioC: 2.3 million full text articles and growing Donald C. Comeau, Chih-Hsuan Wei, Rezarta Islamaj Doğan and Zhiyong Lu National Center for Biotechnology Information, U.S. Library
More informationText mining tools for semantically enriching the scientific literature
Text mining tools for semantically enriching the scientific literature Sophia Ananiadou Director National Centre for Text Mining School of Computer Science University of Manchester Need for enriching the
More informationSemantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.
Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...
More informationThe CALBC RDF Triple store: retrieval over large literature content
The CALBC RDF Triple store: retrieval over large literature content Samuel Croset, Christoph Grabmüller, Chen Li, Silverstras Kavaliauskas, Dietrich Rebholz-Schuhmann croset@ebi.ac.uk 10 th December 2010,
More informationEFFICIENT AUTOMATED PROCESSING OF BIOMEDICAL LITERATURE
EFFICIENT AUTOMATED PROCESSING OF BIOMEDICAL LITERATURE NICO COLIC 1. Introduction The rate at which biomedical research papers are published is ever increasing. Because of this, professionals rely on
More informationInformation Retrieval, Information Extraction, and Text Mining Applications for Biology. Slides by Suleyman Cetintas & Luo Si
Information Retrieval, Information Extraction, and Text Mining Applications for Biology Slides by Suleyman Cetintas & Luo Si 1 Outline Introduction Overview of Literature Data Sources PubMed, HighWire
More informationRLIMS-P Website Help Document
RLIMS-P Website Help Document Table of Contents Introduction... 1 RLIMS-P architecture... 2 RLIMS-P interface... 2 Login...2 Input page...3 Results Page...4 Text Evidence/Curation Page...9 URL: http://annotation.dbi.udel.edu/text_mining/rlimsp2/
More informationWhat is Text Mining? Sophia Ananiadou National Centre for Text Mining University of Manchester
National Centre for Text Mining www.nactem.ac.uk University of Manchester Outline Aims of text mining Text Mining steps Text Mining uses Applications 2 Aims Extract and discover knowledge hidden in text
More informationefip online Help Document
efip online Help Document University of Delaware Computer and Information Sciences & Center for Bioinformatics and Computational Biology Newark, DE, USA December 2013 K K S I K K Table of Contents INTRODUCTION...
More informationRanking of CTD articles and interactions using the OntoGene pipeline
Ranking of CTD articles and interactions using the OntoGene pipeline Fabio Rinaldi, Simon Clematide and Simon Hafner Institute of Computational Linguistics, University of Zurich {rinaldi,siclemat}@cl.uzh.ch,{hafnersimon@gmail.com}
More informationText Mining. Munawar, PhD. Text Mining - Munawar, PhD
10 Text Mining Munawar, PhD Definition Text mining also is known as Text Data Mining (TDM) and Knowledge Discovery in Textual Database (KDT).[1] A process of identifying novel information from a collection
More informationPrecise Medication Extraction using Agile Text Mining
Precise Medication Extraction using Agile Text Mining Chaitanya Shivade *, James Cormack, David Milward * The Ohio State University, Columbus, Ohio, USA Linguamatics Ltd, Cambridge, UK shivade@cse.ohio-state.edu,
More informationNERD workshop. Luca ALMAnaCH - Inria Paris. Berlin, 18/09/2017
NERD workshop Luca Foppiano @ ALMAnaCH - Inria Paris Berlin, 18/09/2017 Agenda Introducing the (N)ERD service NERD REST API Usages and use cases Entities Rigid textual expressions corresponding to certain
More informationExtracting patient data from tables in clinical literature Case study on extraction of BMI, weight and number of patients
Extracting patient data from tables in clinical literature Case study on extraction of BMI, weight and number of patients Nikola Milosevic 1, Cassie Gregson 2, Robert Hernandez 2 and Goran Nenadic 1,3
More informationin the Semiconductor Industry
Realizing Smart Manufacturing in the Semiconductor Industry with SEMI Standards TechXPOT Smart Manufacturing SEMICON Taiwan 2016 September 7, 2016 Alan Weber Cimetrix Incorporated Outline What is Smart
More informationWeb Programming Pre-01A Web Programming Technologies. Aryo Pinandito, ST, M.MT
Web Programming Pre-01A Web Programming Technologies Aryo Pinandito, ST, M.MT Document Formats: The evolution of HTML HTML HyperText Markup Language Primary document type for the web Transmitted using
More informationNational Centre for Text Mining NaCTeM. e-science and data mining workshop
National Centre for Text Mining NaCTeM e-science and data mining workshop John Keane Co-Director, NaCTeM john.keane@manchester.ac.uk School of Informatics, University of Manchester What is text mining?
More informationAnnotating Spatio-Temporal Information in Documents
Annotating Spatio-Temporal Information in Documents Jannik Strötgen University of Heidelberg Institute of Computer Science Database Systems Research Group http://dbs.ifi.uni-heidelberg.de stroetgen@uni-hd.de
More informationActivities of LPI-Japan for Human Resource Development on Linux/OSS
Activities of LPI-Japan for Human Resource Development on Linux/OSS 2011/10/18 Shinichi Matsuda Technology Manager Linux Professional Institute Japan, NPO LPI-Japan 2011. All rights reserved. Agenda Statistics
More informationProfiling Medical Journal Articles Using a Gene Ontology Semantic Tagger. Mahmoud El-Haj Paul Rayson Scott Piao Jo Knight
Profiling Medical Journal Articles Using a Gene Ontology Semantic Tagger Mahmoud El-Haj Paul Rayson Scott Piao Jo Knight Origin and Outcomes Currently funded through a Wellcome Trust Seed award Collaboration
More informatione-scider: A tool to retrieve, prioritize and analyze the articles from PubMed database Sujit R. Tangadpalliwar 1, Rakesh Nimbalkar 2, Prabha Garg* 3
e-scider: A tool to retrieve, prioritize and analyze the articles from PubMed database Sujit R. Tangadpalliwar 1, Rakesh Nimbalkar 2, Prabha Garg* 3 1 National Institute of Pharmaceutical Education and
More informationExploring the Generation and Integration of Publishable Scientific Facts Using the Concept of Nano-publications
Exploring the Generation and Integration of Publishable Scientific Facts Using the Concept of Nano-publications Amanda Clare 1,3, Samuel Croset 2,3 (croset@ebi.ac.uk), Christoph Grabmueller 2,3, Senay
More informationThings to consider when using Semantics in your Information Management strategy. Toby Conrad Smartlogic
Things to consider when using Semantics in your Information Management strategy Toby Conrad Smartlogic toby.conrad@smartlogic.com +1 773 251 0824 Some of Smartlogic s 250+ Customers Awards Trend Setting
More informationPROJECT PERIODIC REPORT
PROJECT PERIODIC REPORT Grant Agreement number: 257403 Project acronym: CUBIST Project title: Combining and Uniting Business Intelligence and Semantic Technologies Funding Scheme: STREP Date of latest
More informationFiltering of Unstructured Text
International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 11, Issue 12 (December 2015), PP.45-49 Filtering of Unstructured Text Sudersan Behera¹,
More informationApache UIMA and Mayo ctakes
Apache and Mayo and how it is used in the clinical domain March 16, 2012 Apache and Mayo Outline 1 Apache and Mayo Outline 1 2 Introducing Pipeline Modules Apache and Mayo What is? (You - eee - muh) Unstructured
More informationCreating Large-scale Training and Test Corpora for Extracting Structured Data from the Web
Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web Robert Meusel and Heiko Paulheim University of Mannheim, Germany Data and Web Science Group {robert,heiko}@informatik.uni-mannheim.de
More informationAn UIMA based Tool Suite for Semantic Text Processing
An UIMA based Tool Suite for Semantic Text Processing Katrin Tomanek, Ekaterina Buyko, Udo Hahn Jena University Language & Information Engineering Lab StemNet Knowledge Management for Immunology in life
More informationTools and Infrastructure for Supporting Enterprise Knowledge Graphs
Tools and Infrastructure for Supporting Enterprise Knowledge Graphs Sumit Bhatia, Nidhi Rajshree, Anshu Jain, and Nitish Aggarwal IBM Research sumitbhatia@in.ibm.com, {nidhi.rajshree,anshu.n.jain}@us.ibm.com,nitish.aggarwal@ibm.com
More informationKNOWLEDGE GRAPH: FROM METADATA TO INFORMATION VISUALIZATION AND BACK. Xia Lin College of Computing and Informatics Drexel University Philadelphia, PA
KNOWLEDGE GRAPH: FROM METADATA TO INFORMATION VISUALIZATION AND BACK Xia Lin College of Computing and Informatics Drexel University Philadelphia, PA 1 A little background of me Teach at Drexel University
More informationSemantic Knowledge Discovery OntoChem IT Solutions
Semantic Knowledge Discovery OntoChem IT Solutions OntoChem IT Solutions GmbH Blücherstr. 24 06120 Halle (Saale) Germany Tel. +49 345 4780472 Fax: +49 345 4780471 mail: info(at)ontochem.com Get the Gold!
More informationSAPIENT Automation project
Dr Maria Liakata Leverhulme Trust Early Career fellow Department of Computer Science, Aberystwyth University Visitor at EBI, Cambridge mal@aber.ac.uk 25 May 2010, London Motivation SAPIENT Automation Project
More informationIntegrated Access to Biological Data. A use case
Integrated Access to Biological Data. A use case Marta González Fundación ROBOTIKER, Parque Tecnológico Edif 202 48970 Zamudio, Vizcaya Spain marta@robotiker.es Abstract. This use case reflects the research
More informationMedLingMap: A growing resource mapping the Bio-Medical NLP field
MedLingMap: A growing resource mapping the Bio-Medical NLP field Marie Meteer, Bensiin Borukhov, Michael Crivaro, Michael Shafir, Attapol Thamrongrattanarit {mmeteer, bborukhov, mcrivaro, mshafir, tet}@brandeis.edu
More informationTowards Summarizing the Web of Entities
Towards Summarizing the Web of Entities contributors: August 15, 2012 Thomas Hofmann Director of Engineering Search Ads Quality Zurich, Google Switzerland thofmann@google.com Enrique Alfonseca Yasemin
More informationIntroduction to Text Mining. Hongning Wang
Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:
More informationText-mining-assisted biocuration workflows in Argo
Database, 2014, 1 14 doi: 10.1093/database/bau070 Original article Original article Text-mining-assisted biocuration workflows in Argo Rafal Rak 1, *, Riza Theresa Batista-Navarro 1,2, Andrew Rowley 1,
More informationNatural Language Processing with PoolParty
Natural Language Processing with PoolParty Table of Content Introduction to PoolParty 2 Resolving Language Problems 4 Key Features 5 Entity Extraction and Term Extraction 5 Shadow Concepts 6 Word Sense
More informationInterdomain Routing Security Workshop 21 BGP, 4 Bytes AS. Brocade Communications Systems, K.K.
Interdomain Routing Security Workshop 21 BGP, 4 Bytes AS Ken ichiro Hashimoto Brocade Communications Systems, K.K. September, 14 th, 2009 BGP Malformed AS_PATH そもそもうちは as0 を出せるのか? NetIron MLX-4 Router(config-bgp)#router
More informationTata Hitachi JIM (Japan-India Institute for Manufacturing)
1 Tata Hitachi JIM (Japan-India Institute for Manufacturing) Presenter : Sandeep Singh, Managing Director Tata Hitachi Construction Machinery Co. Pvt. Ltd. 1 st May 2018, New Delhi 2 About Tata-Hitachi
More informationParmenides. Semi-automatic. Ontology. construction and maintenance. Ontology. Document convertor/basic processing. Linguistic. Background knowledge
Discover hidden information from your texts! Information overload is a well known issue in the knowledge industry. At the same time most of this information becomes available in natural language which
More informationDecoupled Drupal with Angular
Decoupled Drupal with Angular Agenda Introduction Short intro on Drupal What is decoupled and choosing the right architecture Introduction to Angular Setting up Angular development environment Demo Application
More informationTeamware: A Collaborative, Web-based Annotation Environment. Kalina Bontcheva, Milan Agatonovic University of Sheffield
Teamware: A Collaborative, Web-based Annotation Environment Kalina Bontcheva, Milan Agatonovic University of Sheffield Outline Why Teamware? What s Teamware? Teamware for annotation Teamware for quality
More informationEnhancing applications with Cognitive APIs IBM Corporation
Enhancing applications with Cognitive APIs After you complete this section, you should understand: The Watson Developer Cloud offerings and APIs The benefits of commonly used Cognitive services 2 Watson
More informationTurning Text into Insight: Text Mining in the Life Sciences WHITEPAPER
Turning Text into Insight: Text Mining in the Life Sciences WHITEPAPER According to The STM Report (2015), 2.5 million peer-reviewed articles are published in scholarly journals each year. 1 PubMed contains
More informationQuickUMLS: a Fast, Unsupervised Approach for Medical Concept Extraction
QuickUMLS: a Fast, Unsupervised Approach for Medical Concept Extraction Luca Soldaini and Nazli Goharian @soldni Information Retrieval Lab Georgetown University 2016-07-21 MedIR 16 1 Task Medical Information
More informationBiquasiles, Triquasiles and Spatial Graphs
Biquasiles, Triquasiles and Spatial Graphs International Workshop on Spatial Graphs 2016 Waseda University, Japan Partially supported by Simons Foundation Joint Work With Joint Work With Deanna Needell
More informationText Mining. Representation of Text Documents
Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,
More informationAdvanced Software Features for the LA-950
Advanced Software Features for the LA-950 Ian Treviranus ian.treviranus@horiba.com www.horiba.com/us/particle What we ll talk about Measurement tools Data analysis tools Data verification tools Q&A One-button
More informationKNIME for the life sciences Cambridge Meetup
KNIME for the life sciences Cambridge Meetup Greg Landrum, Ph.D. KNIME.com AG 12 July 2016 What is KNIME? A bit of motivation: tool blending, data blending, documentation, automation, reproducibility More
More informationMedical Event Extraction using the Swedish FrameNet, a pilot study
Medical Event Extraction using the Swedish FrameNet, a pilot study DIMITRIOS KOKKINAKIS Centre for Language Technology University of Gothenburg Sweden dimitrios.kokkinakis@svenska.gu.se Overview From entities
More informationDIGIT.B4 Big Data PoC
DIGIT.B4 Big Data PoC RTD Health papers D02.02 Technological Architecture Table of contents 1 Introduction... 5 2 Methodological Approach... 6 2.1 Business understanding... 7 2.2 Data linguistic understanding...
More informationColorado PROFILES. An Introduction
Colorado PROFILES An Introduction About Profiles Profiles is a research networking and expertise mining software tool. It not only shows traditional directory information, but also illustrates how each
More informationMetadata Ingestion and Processinng
biomedical and healthcare Data Discovery Index Ecosystem Ingestion and Processinng Jeffrey S. Grethe, Ph.D. 2017 BioCADDIE All Hands Meeting prototype Ingestion Indexing Repositories Ingestion ElasticSearch
More informationDecomposing a Scene into Geometric and Semantically Consistent Regions
Decomposing a Scene into Geometric and Semantically Consistent Regions Stephen Gould sgould@stanford.edu Richard Fulton rafulton@cs.stanford.edu Daphne Koller koller@cs.stanford.edu IEEE International
More informationEmbase for biomedical searching An introduction. Presented by Sherry Winter January 27, 2015
1 Embase for biomedical searching An introduction Presented by Sherry Winter January 27, 2015 2 2 Need to know Webinar control panel: Ask a question for questions and comments Option for full screen view
More informationHistorical Text Mining:
Historical Text Mining Historical Text Mining, and Historical Text Mining: Challenges and Opportunities Dr. Robert Sanderson Dept. of Computer Science University of Liverpool azaroth@liv.ac.uk http://www.csc.liv.ac.uk/~azaroth/
More informationA Technical Introduction to the Semantic Search Engine SeMedico
Talk in the Semesterprojekt Entwicklung einer Suchmaschine für Alternativmethoden zu Tierversuchen January 12, 2018 Humboldt-Universität zu Berlin A Technical Introduction to the Semantic Search Engine
More informationA RapidMiner framework for protein interaction extraction
A RapidMiner framework for protein interaction extraction Timur Fayruzov 1, George Dittmar 2, Nicolas Spence 2, Martine De Cock 1, Ankur Teredesai 2 1 Ghent University, Ghent, Belgium 2 University of Washington,
More informationIBM Watson Application Developer Workshop. Watson Knowledge Studio: Building a Machine-learning Annotator with Watson Knowledge Studio.
IBM Watson Application Developer Workshop Lab02 Watson Knowledge Studio: Building a Machine-learning Annotator with Watson Knowledge Studio January 2017 Duration: 60 minutes Prepared by Víctor L. Fandiño
More informationMaster Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala
Master Project Various Aspects of Recommender Systems May 2nd, 2017 Master project SS17 Albert-Ludwigs-Universität Freiburg Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue
More informationText mining & Information Extraction. UA.DETI José Luis Oliveira
Text mining & Information Extraction UA.DETI José Luis Oliveira Outline v Motivation & Background What s text mining? Fields of application Biomedical text mining Social network mining v Text mining tasks
More informationVisualization and text mining of patent and non-patent data
of patent and non-patent data Anton Heijs Information Solutions Delft, The Netherlands http://www.treparel.com/ ICIC conference, Nice, France, 2008 Outline Introduction Applications on patent and non-patent
More informationA document-inspired way for tracking changes of RDF data The case of the OpenCitations Corpus
A document-inspired way for tracking changes of RDF data The case of the OpenCitations Corpus Paper: https://w3id.org/oc/paper/occ-driftalod2016.html Silvio Peroni, David Shotton, Fabio Vitali 1st Drift-a-LOD
More informationTURNING TEXT INTO INSIGHT: TEXT MINING IN THE LIFE SCIENCES
TURNING TEXT INTO INSIGHT: TEXT MINING IN THE LIFE SCIENCES According to The STM Report (2015), 2.5 million peer-reviewed articles are published in scholarly journals each year. 1 PubMed contains more
More informationCreating a Recommender System. An Elasticsearch & Apache Spark approach
Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused
More informationPosition Paper: Interoperability Challenges for Linguistic Linked Data
Position Paper: Interoperability Challenges for Linguistic Linked Data David Lewis (dave.lewis@cs.tcd.ie) Centre for Next General Localisation Trinity College Dublin Abstract: This position paper reviews
More informationSELF-SERVICE SEMANTIC DATA FEDERATION
SELF-SERVICE SEMANTIC DATA FEDERATION WE LL MAKE YOU A DATA SCIENTIST Contact: IPSNP Computing Inc. Chris Baker, CEO Chris.Baker@ipsnp.com (506) 721 8241 BIG VISION: SELF-SERVICE DATA FEDERATION Biomedical
More informationRECSM Summer School: Scraping the web. github.com/pablobarbera/big-data-upf
RECSM Summer School: Scraping the web Pablo Barberá School of International Relations University of Southern California pablobarbera.com Networked Democracy Lab www.netdem.org Course website: github.com/pablobarbera/big-data-upf
More informationSemi-Supervised Abstraction-Augmented String Kernel for bio-relationship Extraction
Semi-Supervised Abstraction-Augmented String Kernel for bio-relationship Extraction Pavel P. Kuksa, Rutgers University Yanjun Qi, Bing Bai, Ronan Collobert, NEC Labs Jason Weston, Google Research NY Vladimir
More informationSciMiner User s Manual
SciMiner User s Manual Copyright 2008 Junguk Hur. All rights reserved. Bioinformatics Program University of Michigan Ann Arbor, MI 48109, USA Email: juhur@umich.edu Homepage: http://jdrf.neurology.med.umich.edu/sciminer/
More informationDevelopment of an e-library Web Application
Development of an e-library Web Application Farrukh SHAHZAD Assistant Professor al-huda University, Houston, TX USA Email: dr.farrukh@alhudauniversity.org and Fathi M. ALWOSAIBI Information Technology
More informationEnterprise Data Catalog for Microsoft Azure Tutorial
Enterprise Data Catalog for Microsoft Azure Tutorial VERSION 10.2 JANUARY 2018 Page 1 of 45 Contents Tutorial Objectives... 4 Enterprise Data Catalog Overview... 5 Overview... 5 Objectives... 5 Enterprise
More informationSemantic MediaWiki (SMW) for Scientific Literature Management
Semantic MediaWiki (SMW) for Scientific Literature Management Bahar Sateli, René Witte Semantic Software Lab Department of Computer Science and Software Engineering Concordia University, Montréal SMWCon
More informationSteering Committee Meeting
Steering Committee Meeting To hear the meeting, you must call in Toll-free phone number: 1-866-740-1260 Access Code: 2201876 For international call in numbers, please visit: https://www.readytalk.com/account-administration/international-numbers
More informationPre-Requisites: CS2510. NU Core Designations: AD
DS4100: Data Collection, Integration and Analysis Teaches how to collect data from multiple sources and integrate them into consistent data sets. Explains how to use semi-automated and automated classification
More informationMinimal Metadata Standards and MIIDI Reports
Dryad-UK Workshop Wolfson College, Oxford 12 September 2011 Minimal Metadata Standards and MIIDI Reports David Shotton, Silvio Peroni and Tanya Gray Image BioInformatics Research Group Department of Zoology
More informationBioNav: An Ontology-Based Framework to Discover Semantic Links in the Cloud of Linked Data
BioNav: An Ontology-Based Framework to Discover Semantic Links in the Cloud of Linked Data María-Esther Vidal 1, Louiqa Raschid 2, Natalia Márquez 1, Jean Carlo Rivera 1, and Edna Ruckhaus 1 1 Universidad
More informationRetrieval of Highly Related Documents Containing Gene-Disease Association
Retrieval of Highly Related Documents Containing Gene-Disease Association K. Santhosh kumar 1, P. Sudhakar 2 Department of Computer Science & Engineering Annamalai University Annamalai Nagar, India. santhosh09539@gmail.com,
More informationA Scalable Architecture for Extracting, Aligning, Linking, and Visualizing Multi-Int Data
A Scalable Architecture for Extracting, Aligning, Linking, and Visualizing Multi-Int Data Craig Knoblock & Pedro Szekely University of Southern California Introduction Massive quantities of data available
More informationLIDER Survey. Overview. Number of participants: 24. Participant profile (organisation type, industry sector) Relevant use-cases
LIDER Survey Overview Participant profile (organisation type, industry sector) Relevant use-cases Discovering and extracting information Understanding opinion Content and data (Data Management) Monitoring
More informationEvent Registry. Gregor Leban. Jozef Stefan Institute
Event Registry Gregor Leban Jozef Stefan Institute Event registry http://eventregistry.org/ Event registry is a system for collecting news content published globally automatic identification of world events
More informationVisualizing Semantic Metadata from Biological Publications
Visualizing Semantic Metadata from Biological Publications Johannes Hellrich, Erik Faessler, Ekaterina Buyko and Udo Hahn Jena University Language and Information Engineering (JULIE) Lab Friedrich-Schiller-Universität
More informationMeasuring inter-annotator agreement in GO annotations
Measuring inter-annotator agreement in GO annotations Camon EB, Barrell DG, Dimmer EC, Lee V, Magrane M, Maslen J, Binns ns D, Apweiler R. An evaluation of GO annotation retrieval for BioCreAtIvE and GOA.
More informationREDCAP INTRODUCTION CLASS. June 7, 2016
REDCAP INTRODUCTION CLASS June 7, 2016 LEARNING OBJECTIVES What is REDCap? How do I get started in REDCap? Basic design User rights Data collection Exporting data Agenda Overview Getting access Design
More information