TEXT MINING: THE NEXT DATA FRONTIER
- Melvin Bond
Transcription
1 TEXT MINING: THE NEXT DATA FRONTIER. An Infrastructural Approach. Dr. Petr Knoth, CORE (core.ac.uk), Knowledge Media Institute, The Open University, United Kingdom
2 OpenMinTeD: Establish an open and sustainable Text and Data Mining (TDM) platform and infrastructure where researchers can collaboratively create, discover, share and re-use knowledge from a wide range of text-based scientific and scholarly sources.
3 Beyond Open Access: MAKING SENSE OF LARGE VOLUMES OF SCIENTIFIC CONTENT
4 OPENMINTED - The Open Mining Infrastructure for Text and Data. The phases of text mining, shown as four stages (STAGE 1 to STAGE 4): Information Retrieval, NLP Analysis, Entity Recognition, Information Extraction, Data Mining, Knowledge Discovery.
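The phases named on this slide can be illustrated with a deliberately small sketch. It is not OpenMinTeD code: the corpus, the entity dictionary and all function names are invented for illustration, and each stage is reduced to a few lines of standard-library Python so that the hand-over from one phase to the next is visible.

```python
import re
from collections import Counter
from itertools import combinations

# Toy corpus standing in for retrieved scholarly text (invented for illustration).
CORPUS = {
    "doc1": "Aspirin reduces fever. Aspirin may interact with warfarin.",
    "doc2": "Warfarin dosage depends on diet.",
    "doc3": "Open access repositories host research papers.",
}

# Hypothetical entity dictionary; real systems would use trained recognisers.
ENTITIES = {"aspirin", "warfarin", "fever"}


def retrieve(corpus, query):
    """Information retrieval: keep documents that mention the query term."""
    return {doc_id: text for doc_id, text in corpus.items()
            if query.lower() in text.lower()}


def split_sentences(text):
    """NLP analysis (heavily reduced): split the text on full stops."""
    return [s.strip() for s in text.split(".") if s.strip()]


def recognise_entities(sentence):
    """Entity recognition: dictionary lookup over lower-cased word tokens."""
    tokens = re.findall(r"[a-z]+", sentence.lower())
    return sorted(set(tokens) & ENTITIES)


def extract_pairs(docs):
    """Information extraction: entity pairs co-occurring in one sentence."""
    pairs = []
    for text in docs.values():
        for sentence in split_sentences(text):
            pairs.extend(combinations(recognise_entities(sentence), 2))
    return pairs


def mine(pairs):
    """Data mining: count how often each entity pair was extracted."""
    return Counter(pairs)


hits = retrieve(CORPUS, "aspirin")
pair_counts = mine(extract_pairs(hits))
print(pair_counts)
```

Each function consumes the output of the previous one, which is the property that makes the phases composable; in a real deployment every stage would be a far more capable component.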
5 TDM challenges for researchers: 1. Content challenges - Barriers and obstacles due to non-availability, technical restrictions, copyright law or licensing issues - No uniform way to search for, retrieve and access content for TDM
6 TDM challenges for researchers: 2. Services challenges - How to identify the most fitting TDM service? - How to combine it with other TDM services I have access to? - How to use them on my content?
7 TDM challenges for researchers: 3. Processing challenges - Where to deploy? - Are my machines powerful enough? - How can I get access to powerful machines? - Where to store intermediate and final results? - How to ensure persistence of storage?
8 OpenMinTeD provides solutions: an open and sustainable TDM infrastructure where researchers can collaboratively create, discover, share and re-use knowledge from a wide range of text-based, science-related sources.
9 OpenMinTeD working on many fronts
- ACCESSIBLE CONTENT: Via standardised programmatic interfaces
- DISCOVERABLE SERVICES: Well-documented, easily discoverable text mining services and workflows which process, analyse and annotate text
- EFFICIENT PROCESSING: Operate on public e-infrastructures via standardised APIs
- RESEARCH COMMUNITIES: Different scientific communities have different challenges
- VALUE ADDED APPS: Community-driven applications to illustrate the value of the infrastructure. Engage with industry.
10 The project
Started: June 2015. Duration: 3 years. Budget of: 6 million. Grant of: 5.3 million.
16 Partners:
- 6 mining research groups
- 3 content providers
- 1 data center
- 1 library association
- 2 legal experts
- 6 community related partners
- 2 SMEs
PARTNERS: Athena RIC, Univ. of Manchester (NaCTeM), Univ. of Darmstadt, INRA, EMBL-EBI, Agro-Know, LIBER, Univ. of Amsterdam, Open University UK (CORE), EPFL, CNIO, Univ. of Sheffield (GATE), GESIS, GRNET, Frontiers, Univ. of Stirling
11 The OpenMinTeD landscape
12 Infrastructural approach: OpenMinTeD does not build new services, but adopts and adapts existing services for new communities.
13 Infrastructural approach: Focuses on interoperability across text mining services and content provision outlets.
14 Infrastructural approach: Creates an open and collaborative space for researchers to use the best-fitting text mining services available, building on the cloud computing philosophy.
15 Overview
Users: researchers, curators, text miners and new service developers
Platform services: Registry, Auth2 & Policy management, Workflow Management, Annotator, Accounting
Layer 1: Interoperability of text mining services (platforms or components). Diagram elements: mining platforms, proprietary architectures
Layer 2: Interoperability of language resources & corpora. Diagram elements: language resources and corpora registry service, language resources
Layer 3: Interoperability to shared storage and computing resources. Diagram elements: publisher text corpus, OpenAIRE/CORE text corpus, PMC text corpus, other text corpora, other types of text corpora, data centres, data centre in public cloud
16 Interoperability framework: Bringing together mining tools, resources and content. 1. Content metadata & transfer standards: To document scientific literature, language resources, taxonomies and provenance, as well as transfer protocols for full-text retrieval.
17 Interoperability framework: Bringing together mining tools, resources and content. 2. Service metadata & pipelining: To document and classify text mining services, how they receive input, in what form they output their results, how they combine for workflows, and what granularity to consider.
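One way to picture service metadata of this kind is as a small record per service, plus a check that adjacent services in a workflow fit together. The sketch below is an assumption-laden illustration, not the OpenMinTeD metadata schema: the `ServiceDescriptor` fields, the service names and the format labels are all hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ServiceDescriptor:
    """Hypothetical metadata record for one text mining service."""
    name: str
    task: str           # classification of the service, e.g. "tokenisation"
    input_format: str   # form in which the service receives its input
    output_format: str  # form in which it outputs its results
    granularity: str    # unit of processing, e.g. "document" or "sentence"


def can_chain(upstream, downstream):
    """Two services combine if the first emits what the second consumes."""
    return upstream.output_format == downstream.input_format


def validate_workflow(services):
    """Return the adjacent service pairs that cannot be combined."""
    problems = []
    for upstream, downstream in zip(services, services[1:]):
        if not can_chain(upstream, downstream):
            problems.append((upstream.name, downstream.name))
    return problems


# Illustrative records; names and format labels are placeholders.
pdf_to_text = ServiceDescriptor(
    "pdf-to-text", "conversion", "application/pdf", "text/plain", "document")
tokeniser = ServiceDescriptor(
    "tokeniser", "tokenisation", "text/plain", "tokens+json", "sentence")
entity_tagger = ServiceDescriptor(
    "entity-tagger", "entity recognition", "tokens+json", "annotations+json", "sentence")

good = validate_workflow([pdf_to_text, tokeniser, entity_tagger])
bad = validate_workflow([pdf_to_text, entity_tagger])
print(good, bad)
```

Matching on exact format strings is the simplest possible rule; a real registry would likely need richer compatibility checks, for example on annotation type systems or granularity.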
18 Interoperability framework: Bringing together mining tools, resources and content. 3. IPR and licensing: To study IPR restrictions, describe license metadata for re-use of content and of TDM services & tools, and provide information on how to apply for academic and noncommercial mining research.
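Machine-readable license metadata makes it possible to filter resources by what a given mining project is allowed to do. The following sketch only illustrates that idea: the `LicenceRecord` fields, the resource names and the licence labels are invented, and it makes no statement about what any real licence permits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LicenceRecord:
    """Hypothetical licence metadata attached to a corpus or tool."""
    resource: str
    licence: str                  # placeholder label, not a real licence
    tdm_allowed: bool             # may the resource be mined at all?
    commercial_use_allowed: bool  # may the results be used commercially?


def usable_for(records, commercial):
    """List resources that may be mined for the given kind of project."""
    return [
        r.resource
        for r in records
        if r.tdm_allowed and (r.commercial_use_allowed or not commercial)
    ]


# Invented example records.
records = [
    LicenceRecord("corpus-a", "open-licence", True, True),
    LicenceRecord("corpus-b", "noncommercial-licence", True, False),
    LicenceRecord("corpus-c", "no-mining-licence", False, False),
]

academic = usable_for(records, commercial=False)
industrial = usable_for(records, commercial=True)
print(academic, industrial)
```

Two boolean flags are a strong simplification; actual licence terms usually carry conditions that need legal interpretation rather than a simple lookup.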
19 OpenMinTeD users: 1. End users - Researchers, database curators - Novice: use services to advance their science - Advanced: combine TDM services into complex workflows
20 OpenMinTeD users: 2. Content and service providers - Publishers, libraries, scientific database centres - TDM researchers - SMEs
21 Bottom-up approach: OpenMinTeD works with 4 use cases, which give their requirements and evaluate the results: RESEARCH ANALYTICS, LIFE SCIENCES, AGRICULTURE, SOCIAL SCIENCES.
22 OpenMinTeD use case 1: Scholarly communication analytics - Semantic search and discovery of open scientific outcomes - Map of academia scholarly communication network - Research monitoring and analytics. Partners: CORE/OU, OpenAIRE/ARC, Frontiers
23 OpenMinTeD use case 2: Life sciences - Assisted curation of the EMBL-EBI chemical databases for metabolomics - Curation of the neurosciences resources KnowledgeBase and Neurolex. Partners: EBI - Metabolomics, Human Brain Project
24 OpenMinTeD use case 3: Agriculture and biodiversity - Enrich agricultural databases to assist food- and water-borne disease outbreak alerts and product recalls - Image, figure and dataset discovery in AGRIS. Partners: INRA, AGRO-KNOW
25 OpenMinTeD use case 4: Social sciences - Develop and evaluate methods for the automatic detection and linking of named entities, citation traces and intentions in scientific publications in the social sciences. Partners: GESIS
26 What can OpenMinTeD do for you? Are you a content provider? Make your content available for mining: register your collections in the OpenMinTeD registry and let others discover them.
27 What can OpenMinTeD do for you? Are you a TDM service provider? Share and collaborate with other TDM services: register your TDM service in the OpenMinTeD registry and let others discover it.
28 What can OpenMinTeD do for you? Are you a text miner, or a researcher who can benefit from text mining? Use OpenMinTeD (when launched).
29 Conclusions - The ability to text-mine research literature at scale can redefine the way we do research - OpenMinTeD is laying the groundwork (interoperability) and building the cloud infrastructure for text-mining research literature - It is building an open, transparent infrastructure that enables others to participate
30 Contact us: twitter.com/openminted_eu, facebook.com/openminted, bit.do/openmintedlinkedin, vimeo.com/openminted, bit.do/openmintedplus
More informationEUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd
EUDAT Data Services & Tools for Researchers and Communities Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd CSC IT CENTER FOR SCIENCE! Founded in 1971 as a technical support
More informationEdinburgh DataShare: Tackling research data in a DSpace institutional repository
Edinburgh DataShare: Tackling research data in a DSpace institutional repository Robin Rice EDINA and Data Library, Information Services University of Edinburgh, Scotland DSpace User Group Meeting Gothenburg,
More informationLaunching the. Data Curation Network NDS/MBDH 2018
NDS/MBDH 2018 Launching the Data Curation Network Lisa Johnston University of Minnesota Jake Carlson University of Michigan Cynthia Hudson-Vitale Penn State Univ. Heidi Imker University of Illinois Wendy
More informationPaving the Rocky Road Toward Open and FAIR in the Field Sciences
Paving the Rocky Road Toward Open and FAIR Kerstin Lehnert Lamont-Doherty Earth Observatory, Columbia University IEDA (Interdisciplinary Earth Data Alliance), www.iedadata.org IGSN e.v., www.igsn.org Field
More informationData publication and discovery with Globus
Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,
More informationOpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe
Natalia Manola University of Athens Department of Informatics and Telecommunications OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe Outline Open Access in Europe Brief history
More informationHistorical Text Mining:
Historical Text Mining Historical Text Mining, and Historical Text Mining: Challenges and Opportunities Dr. Robert Sanderson Dept. of Computer Science University of Liverpool azaroth@liv.ac.uk http://www.csc.liv.ac.uk/~azaroth/
More informationN. Marusov, I. Semenov
GRID TECHNOLOGY FOR CONTROLLED FUSION: CONCEPTION OF THE UNIFIED CYBERSPACE AND ITER DATA MANAGEMENT N. Marusov, I. Semenov Project Center ITER (ITER Russian Domestic Agency N.Marusov@ITERRF.RU) Challenges
More informationTHE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I)
ENVIROfying the Future Internet THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I) INSPIRE Conference Firenze,
More informationACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development
ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development Jeremy Fischer Indiana University 9 September 2014 Citation: Fischer, J.L. 2014. ACCI Recommendations on Long Term
More informationHelix Nebula The Science Cloud
Helix Nebula The Science Cloud CERN 14 May 2014 Bob Jones (CERN) This document produced by Members of the Helix Nebula consortium is licensed under a Creative Commons Attribution 3.0 Unported License.
More information21ST century enterprise. HCL Technologies Presents. Roadmap for Data Center Transformation
21ST century enterprise HCL Technologies Presents Roadmap for Data Center Transformation june 2016 21st Century Impact on Data Centers The rising wave of digitalization has changed the way IT impacts business.
More information