TEXT MINING: THE NEXT DATA FRONTIER

Size: px
Start display at page:

Download "TEXT MINING: THE NEXT DATA FRONTIER"

Transcription

1 TEXT MINING: THE NEXT DATA FRONTIER An Infrastructural Approach Dr. Petr Knoth CORE (core.ac.uk) Knowledge Media institute, The Open University United Kingdom

2 2 OpenMinTeD Establish an open and sustainable Text and Data Mining (TDM) platform and infrastructure where researchers can collaboratively create, discover, share and re-use knowledge from a wide range of text based scientific and scholarly related sources.

3 beyond Open Access MAKING SENSE OF LARGE VOLUMES OF SCIENTIFIC CONTENT 3

4 OPENMINTED -The Open Mining Infrastructure for Text and Data The phases of text mining STAGE 1 STAGE 2 STAGE 3 STAGE 4 Information Retrieval NLP Analysis Entity Recognition Information Extraction Data Mining Knowledge Discovery

5 OPENMINTED - The Open Mining Infrastructure for Text and Data TDM challenges for researchers 1. Content challenges - Barriers and obstacles due to non-availability, technical restrictions, copyright law or licensing issues - No uniform way to search for, retrieve and access content for TDM

6 OPENMINTED - The Open Mining Infrastructure for Text and Data TDM challenges for researchers 2. Services challenges How to identify the most fitting TDM service? How to combine with other TDM services I have access to? How to use them on my content?

7 OPENMINTED - The Open Mining Infrastructure for Text and Data TDM challenges for researchers 3. Processing challenges Where to deploy? Are my machines powerful enough? How can I get access to powerful machines? Where to store intermediate and final results? How to ensure persistence of storage?

8 OPENMINTED - The Open Mining Infrastructure for Text and Data OpenMinTeD Provides solutions an open and sustainable TDM infrastructure where researchers can collaboratively create, discover, share and re-use knowledge from a wide range of text based scientific-related sources.

9 OPENMINTED - The Open Mining Infrastructure for Text and Data OpenMinTeD working on many fronts ACCESSIBLE CONTENT DISCOVERABLE SERVICES EFFICIENT PROCESSING RESEARCH COMMUNITIES Via standardised programmatic interfaces Well-documented easily discoverable text mining services and workflows which process, analyse and annotate text Operate on public e-infrastructures via standarized APIs Different scientific communities have different challenges VALUE ADDED APPS Community-driven applications to illustrate the value of the infastructure. Engage with industry. 10

10 OPENMINTED = The Open Mining Infrastructure for Text and Data The project Started: June 2015 Duration: 3 years Budget of: 6 million Grant of: 5.3 million 16 Partners: - 6 mining research groups - 3 content providers - 1 data center - 1 library association - 2 legal experts - 6 community related partners - 2 SMEs PARTNERS Athena RIC Univ. of Manchester (NacTem) Univ. of Darmstadt INRA EMBL-EBI Agro-Know LIBER Univ. of Amsterdam Open University UK (CORE) EPFL CNIO Univ. of Sheffield (GATE) GESIS GRNET Frontiers Univ. of Stirling

11 OPENMINTED = The Open Mining Infrastructure for Text and Data The OpenMinTeD landscape

12 OPENMINTED = The Open Mining Infrastructure for Text and Data Infrastructural approach OpenMinted does not build new services, but adopts and adapts existing services for new communities

13 OPENMINTED = The Open Mining Infrastructure for Text and Data Infrastructural approach Focuses on interoperability across text mining services and content provision outlets

14 OPENMINTED = The Open Mining Infrastructure for Text and Data Infrastructural approach Creates and an Open & collaborative space for researchers to use the best fitting text mining services available building on the cloud computing philosophy

15 Overview OPENMINTED = The Open Mining Infrastructure for Text and Data Users: researchers, curators, text-miners and new services developers Platform services Registry Auth2 & Policy management Workflow Management Annotator Accounting Layer 1: Interoperability of text mining services (platforms or components) Layer 2: Interoperability of language resources & corpora Mining Platforms Mining Platforms Mining Platforms Mining Platforms Proprietary architectures Language resources and corpora registry service Language resources Language resources Language resources Language resources Layer 3: Interoperability to shared storage and computing resources Publisher text corpus Other text corpora OpenAIRE/CORE text corpus Other text corpora PMC text corpus Data centre Data centre Data centre Other text corpora Other types of text corpora Data centre in public cloud

16 OPENMINTED = The Open Mining Infrastructure for Text and Data Interoperability framework Bringing together mining tools, resources and content 1. Content metadata & transfer standards To document scientific literature, language resources, taxonomies and provenance as well as transfer protocols for full text retrieval

17 OPENMINTED = The Open Mining Infrastructure for Text and Data Interoperability framework Bringing together mining tools, resources and content 2. Service metadata & pipelining To document and classify text mining services, how they receive input, in what form they output their results, how they combine for workflows, what granularity to consider.

18 OPENMINTED = The Open Mining Infrastructure for Text and Data Interoperability framework Bringing together mining tools, resources and content 3. IPR and licensing To study IPR restrictions, describe license metadata for re-use, for content and TDM services & tools, and information on how to apply for academic and noncommercial mining research

19 OPENMINTED = The Open Mining Infrastructure for Text and Data OpenMinTeD users 1. End users - Researchers, data base curators, - Novice: use services to advance their science - Advanced: use TDM services into complex workflows

20 OPENMINTED = The Open Mining Infrastructure for Text and Data OpenMinTeD users 2. Content and service providers - Publishers, libraries, scientific data base centres, - TDM researchers - SME s

21 OPENMINTED = The Open Mining Infrastructure for Text and Data Bottom-up approach OpenMinTeD works with 4 use cases, which give their requirements and evaluate the results. RESEARCH ANALYTICS LIFE SCIENCES AGRICULTURE SOCIAL SCIENCES

22 Openminted use case 1 Scholarly communication analytics Semantic search and discovery of open scientific outcomes Map of academia scholarly communication network Research monitoring and analytics Partners CORE/OU, OpenAIRE/ARC, Frontiers 2

23 Openminted use case 2 Life sciences Assisted curation of the EMBL-EBI chemical databases for metabolomics Curation of the neurosciences resources KnowledgeBase and Neurolex Partners EBI - Metabolomics, Human brain project 2

24 Openminted use case 3 Agriculture and biodiversity Enrich agricultural databases to assist food- and water-borne disease outbreak alerts and product recalls Image, figure and dataset discovery in the AGRIS Partners INRA, AGRO-KNOW 2

25 Openminted use case 4 social sciences Develop and evaluate methods for the automatic detection and linking of named entities, citation traces and intentions in social science scientific publications Partners GESIS 2

26 OPENMINTED = The Open Mining Infrastructure for Text and Data What can OpenMinTeD do for you? Are you a content provider? make your content available for mining Register your collections in the OpenMinTeD registry and let others discover it

27 OPENMINTED = The Open Mining Infrastructure for Text and Data What can OpenMinTeD do for you? Are you a TDM service provider? share and collaborate with other TDM services Register your TDM service in the OpenMinTeD registry and let others discover it.

28 OPENMINTED = The Open Mining Infrastructure for Text and Data What can OpenMinTeD do for you? Are you a text miner/research who can benefot from text-mining? Use OpenMinTeD (when launched)

29 OPENMINTED = The Open Mining Infrastructure for Text and Data Conclusions - The ability to text-mine research literature at scale can redefine the way we do research - OpenMinTeD is laying the groundwork (interoperability) and building the cloud infrastructure for text-mining research literature - Building an open, transparent infrastructure that is enabling others to participate

30 twitter.com/openminted_eu facebook.com/openminted bit.do/openmintedlinkedin vimeo.com/openminted bit.do/openmintedplus Contact us

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services

Enabling Open Science: Data Discoverability, Access and Use. Jo McEntyre Head of Literature Services Enabling Open Science: Data Discoverability, Access and Use Jo McEntyre Head of Literature Services www.ebi.ac.uk About EMBL-EBI Part of the European Molecular Biology Laboratory International, non-profit

More information

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud

European Cloud Initiative: implementation status. Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud European Cloud Initiative: implementation status Augusto BURGUEÑO ARJONA European Commission DG CNECT Unit C1: e-infrastructure and Science Cloud Political drivers for action EC Communication "European

More information

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E Powering Knowledge Discovery Insights from big data with Linguamatics I2E Gain actionable insights from unstructured data The world now generates an overwhelming amount of data, most of it written in natural

More information

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond

OpenAIRE. Fostering the social and technical links that enable Open Science in Europe and beyond Alessia Bardi and Paolo Manghi, Institute of Information Science and Technologies CNR Katerina Iatropoulou, ATHENA, Iryna Kuchma and Gwen Franck, EIFL Pedro Príncipe, University of Minho OpenAIRE Fostering

More information

National Centre for Text Mining NaCTeM. e-science and data mining workshop

National Centre for Text Mining NaCTeM. e-science and data mining workshop National Centre for Text Mining NaCTeM e-science and data mining workshop John Keane Co-Director, NaCTeM john.keane@manchester.ac.uk School of Informatics, University of Manchester What is text mining?

More information

I data set della ricerca ed il progetto EUDAT

I data set della ricerca ed il progetto EUDAT I data set della ricerca ed il progetto EUDAT Casalecchio di Reno (BO) Via Magnanelli 6/3, 40033 Casalecchio di Reno 051 6171411 www.cineca.it 1 Digital as a Global Priority 2 Focus on research data Square

More information

Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director

Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director yannick.legre@egi.eu Credit slides: T. Ferrari www.egi.eu This work by EGI.eu is licensed under a Creative

More information

DOIs for Research Data

DOIs for Research Data DOIs for Research Data Open Science Days 2017, 16.-17. Oktober 2017, Berlin Britta Dreyer, Technische Informationsbibliothek (TIB) http://orcid.org/0000-0002-0687-5460 Scope 1. DataCite Services 2. Data

More information

Using Linked Data and taxonomies to create a quick-start smart thesaurus

Using Linked Data and taxonomies to create a quick-start smart thesaurus 7) MARJORIE HLAVA Using Linked Data and taxonomies to create a quick-start smart thesaurus 1. About the Case Organization The two current applications of this approach are a large scientific publisher

More information

Digital repositories as research infrastructure: a UK perspective

Digital repositories as research infrastructure: a UK perspective Digital repositories as research infrastructure: a UK perspective Dr Liz Lyon Director This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0 UKOLN is supported by: Presentation

More information

Indiana University Research Technology and the Research Data Alliance

Indiana University Research Technology and the Research Data Alliance Indiana University Research Technology and the Research Data Alliance Rob Quick Manager High Throughput Computing Operations Officer - OSG and SWAMP Board Member - RDA Organizational Assembly RDA Mission

More information

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow

Data Management Plans. Sarah Jones Digital Curation Centre, Glasgow Data Management Plans Sarah Jones Digital Curation Centre, Glasgow sarah.jones@glasgow.ac.uk Twitter: @sjdcc Data Management Plan (DMP) workshop, e-infrastructures Austria, Vienna, 17 November 2016 What

More information

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license

Inge Van Nieuwerburgh OpenAIRE NOAD Belgium. Tools&Services. OpenAIRE EUDAT. can be reused under the CC BY license Inge Van Nieuwerburgh OpenAIRE NOAD Belgium Tools&Services OpenAIRE EUDAT can be reused under the CC BY license Open Access Infrastructure for Research in Europe www.openaire.eu Research Data Services,

More information

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment

Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment Cheshire 3 Framework White Paper: Implementing Support for Digital Repositories in a Data Grid Environment Paul Watry Univ. of Liverpool, NaCTeM pwatry@liverpool.ac.uk Ray Larson Univ. of California, Berkeley

More information

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard

Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard Mercè Crosas, Ph.D. Chief Data Science and Technology Officer Institute for Quantitative Social Science (IQSS) Harvard University @mercecrosas mercecrosas.com Open Research Cloud, May 11, 2017 Best Practices

More information

Reproducibility and FAIR Data in the Earth and Space Sciences

Reproducibility and FAIR Data in the Earth and Space Sciences Reproducibility and FAIR Data in the Earth and Space Sciences December 2017 Brooks Hanson Sr. VP, Publications, American Geophysical Union bhanson@agu.org Earth and Space Science is Essential for Society

More information

EGI federated e-infrastructure, a building block for the Open Science Commons

EGI federated e-infrastructure, a building block for the Open Science Commons EGI federated e-infrastructure, a building block for the Open Science Commons Yannick LEGRÉ Director, EGI.eu www.egi.eu EGI-Engage is co-funded by the Horizon 2020 Framework Programme of the European Union

More information

Some Big Data Challenges

Some Big Data Challenges Some Big Data Challenges 2,500,000,000,000,000,000 Bytes (2.5 x 10 18 ) of data are created every day! (2012) or 8,000,000,000,000,000,000 (8 exabytes) of new data were stored globally by enterprises in

More information

Applying Auto-Data Classification Techniques for Large Data Sets

Applying Auto-Data Classification Techniques for Large Data Sets SESSION ID: PDAC-W02 Applying Auto-Data Classification Techniques for Large Data Sets Anchit Arora Program Manager InfoSec, Cisco The proliferation of data and increase in complexity 1995 2006 2014 2020

More information

ACCELERATE YOUR SHAREPOINT ADOPTION AND ROI WITH CONTENT INTELLIGENCE

ACCELERATE YOUR SHAREPOINT ADOPTION AND ROI WITH CONTENT INTELLIGENCE June 30, 2012 San Diego Convention Center ACCELERATE YOUR SHAREPOINT ADOPTION AND ROI WITH CONTENT INTELLIGENCE Stuart Laurie, Senior Consultant #SPSSAN Agenda 1. Challenges 2. What comes out of the box

More information

Helix Nebula, the Science Cloud

Helix Nebula, the Science Cloud Helix Nebula, the Science Cloud A strategic Plan for a European Scientific Cloud Computing Infrastructure NORDUNet 2012, Oslo 18 th -20 th September Maryline Lengert, ESA Strategic Goal Helix Nebula, the

More information

GLOBAL INFRASTRUCTURES FOR SUPPORTING BIODIVERSITY RESEARCH

GLOBAL INFRASTRUCTURES FOR SUPPORTING BIODIVERSITY RESEARCH GLOBAL INFRASTRUCTURES FOR SUPPORTING BIODIVERSITY RESEARCH Main problem to solve How can we measure and calculate Essential Biodiversity Variables (EBVs) on a global scale? Which variables are most meaningful?

More information

Platform UI Specification

Platform UI Specification Platform UI Specification November 25, 2016 Deliverable Code: D6.4 Version: 1.0 Final Dissemination level: Public This report presents the OpenMinTeD platform user interface design and implementation issues

More information

CORE: Improving access and enabling re-use of open access content using aggregations

CORE: Improving access and enabling re-use of open access content using aggregations CORE: Improving access and enabling re-use of open access content using aggregations Petr Knoth CORE (Connecting REpositories) Knowledge Media institute The Open University @petrknoth 1/39 Outline 1. The

More information

Semantic MediaWiki (SMW) for Scientific Literature Management

Semantic MediaWiki (SMW) for Scientific Literature Management Semantic MediaWiki (SMW) for Scientific Literature Management Bahar Sateli, René Witte Semantic Software Lab Department of Computer Science and Software Engineering Concordia University, Montréal SMWCon

More information

Platform UI Specification (26)

Platform UI Specification (26) Platform UI Specification (26) December 20, 2017 Deliverable Code: D6.6 Version: 1.0 Final Dissemination level: Public This report presents the OpenMinTeD platform user interface design and implementation

More information

For Attribution: Developing Data Attribution and Citation Practices and Standards

For Attribution: Developing Data Attribution and Citation Practices and Standards For Attribution: Developing Data Attribution and Citation Practices and Standards Board on Research Data and Information Policy and Global Affairs Division National Research Council in collaboration with

More information

Web of Science. Platform Release Nina Chang Product Release Date: March 25, 2018 EXTERNAL RELEASE DOCUMENTATION

Web of Science. Platform Release Nina Chang Product Release Date: March 25, 2018 EXTERNAL RELEASE DOCUMENTATION Web of Science EXTERNAL RELEASE DOCUMENTATION Platform Release 5.28 Nina Chang Product Release Date: March 25, 2018 Document Version: 1.0 Date of issue: March 22, 2018 RELEASE OVERVIEW The following features

More information

Interoperability Standards and Specifications

Interoperability Standards and Specifications Interoperability Standards and Specifications June 20, 2017 Deliverable Code: D5.3 Version: 1.0 Dissemination level: Public First version of the interoperability standards and specification report that

More information

What is Text Mining? Sophia Ananiadou National Centre for Text Mining University of Manchester

What is Text Mining? Sophia Ananiadou National Centre for Text Mining   University of Manchester National Centre for Text Mining www.nactem.ac.uk University of Manchester Outline Aims of text mining Text Mining steps Text Mining uses Applications 2 Aims Extract and discover knowledge hidden in text

More information

EUDAT & SeaDataCloud

EUDAT & SeaDataCloud EUDAT & SeaDataCloud SeaDataCloud Kick-off meeting Damien Lecarpentier CSC-IT Center for Science www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-infrastructures.

More information

OpenAIRE Open Knowledge Infrastructure for Europe

OpenAIRE Open Knowledge Infrastructure for Europe Birgit Schmidt University of Göttingen State and University Library OpenAIRE Open Knowledge Infrastructure for Europe ERC Workshop, 6-7 February 2013, Brussels OpenAIRE Characteristics A policy driven

More information

OpenAIRE From Pilot to Service

OpenAIRE From Pilot to Service Natalia Manola University of Athens Department of Informatics and Telecommunications OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe Outline Open Access in Europe Brief history

More information

EUDAT - Open Data Services for Research

EUDAT - Open Data Services for Research EUDAT - Open Data Services for Research Johannes Reetz EUDAT operations Max Planck Computing & Data Centre Science Operations Workshop 2015 ESO, Garching 24-27th November 2015 EUDAT receives funding from

More information

Research Elsevier

Research Elsevier Research Data @ Elsevier From generation through sharing and publishing to discovery IJsbrand Jan Aalbersberg SVP Journal and Data Solutions NDS, Boulder - June 12, 2014 Contributors: Anita de Waard Hylke

More information

Software + Services for Data Storage, Management, Discovery, and Re-Use

Software + Services for Data Storage, Management, Discovery, and Re-Use Software + Services for Data Storage, Management, Discovery, and Re-Use CODATA 22 Conference Stellenbosch, South Africa 25 October 2010 Alex D. Wade Director Scholarly Communication Microsoft External

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia

Empowering People with Knowledge the Next Frontier for Web Search. Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Empowering People with Knowledge the Next Frontier for Web Search Wei-Ying Ma Assistant Managing Director Microsoft Research Asia Important Trends for Web Search Organizing all information Addressing user

More information

Open-Source Natural Language Processing and Computational Archival Science

Open-Source Natural Language Processing and Computational Archival Science Open-Source Natural Language Processing and Computational Archival Science Kalina Bontcheva University of Sheffield @kbontcheva The University of Sheffield, 1995-2018 This work is licensed under the Creative

More information

Data Discovery - Introduction

Data Discovery - Introduction Data Discovery - Introduction Why (benefits of reusing data) How EUDAT's services help with this (in general) Adam Carter In days gone by: Design an experiment Getting Your Data Conduct the experiment

More information

Medici for Digital Cultural Heritage Libraries. George Tsouloupas, PhD The LinkSCEEM Project

Medici for Digital Cultural Heritage Libraries. George Tsouloupas, PhD The LinkSCEEM Project Medici for Digital Cultural Heritage Libraries George Tsouloupas, PhD The LinkSCEEM Project Overview of Digital Libraries A Digital Library: "An informal definition of a digital library is a managed collection

More information

National Materials Data Initiatives

National Materials Data Initiatives National Materials Data Initiatives Chuck Ward Integrity Service Excellence Materials & Manufacturing Directorate Approved for public release, distribution is unlimited. 88ABW-2015-2270 Overview Policy

More information

CLARIN s central infrastructure. Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna

CLARIN s central infrastructure. Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna CLARIN s central infrastructure Dieter Van Uytvanck CLARIN-PLUS Tools & Services Workshop 2 June 2016 Vienna CLARIN? Common Language Resources and Technology Infrastructure Research Infrastructure for

More information

re3data.org - Making research data repositories visible and discoverable

re3data.org - Making research data repositories visible and discoverable re3data.org - Making research data repositories visible and discoverable Robert Ulrich, Karlsruhe Institute of Technology Hans-Jürgen Goebelbecker, Karlsruhe Institute of Technology Frank Scholze, Karlsruhe

More information

Putting Open Access into Practice

Putting Open Access into Practice Putting Open Access into Practice Dr. Nancy Pontika Connecting Repositories (CORE) Knowledge Media Institute Open University Twitter: @oacore VTT, Espoo (Finland) 11-12 May 2015 This work is licensed under

More information

Platform UI Specification (20)

Platform UI Specification (20) Platform UI Specification (20) June 20, 2017 Deliverable Code: D6.5 Version: 1.0 Final Dissemination level: Public This report presents the OpenMinTeD platform user interface design and implementation

More information

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018

Welcome to the Pure International Conference. Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 0 Welcome to the Pure International Conference Jill Lindmeier HR, Brand and Event Manager Oct 31, 2018 1 Mendeley Data Use Synergies with Pure to Showcase Additional Research Outputs Nikhil Joshi Solutions

More information

ELIXIR Compute platform

ELIXIR Compute platform ELIXIR Compute platform Authors and contributors: Alexander Agafonov (UIT NO), Lars Ailo Bongo (UIT - NO), Mikael Borg (BILS - SE), Amelie Cornelis (EMBL-EBI), Rob Finn (EMBL-EBI), Montserrat Gonzalez

More information

DT-ICT : Big data solutions for energy

DT-ICT : Big data solutions for energy DT-ICT-11-2019: Big data solutions for energy info day Stefano Bertolo, DG CONNECT Mario Dionisio, DG ENER Scientific Programme Officers Who we are DG CONNECT, Unit G1 Data Policy and Innovation DG ENERGY,

More information

Why CERIF? Keith G Jeffery Scientific Coordinator ERCIM Anne Assserson eurocris. Keith G Jeffery SDSVoc Workshop Amsterdam

Why CERIF? Keith G Jeffery Scientific Coordinator ERCIM Anne Assserson eurocris. Keith G Jeffery SDSVoc Workshop Amsterdam A Europe-wide Interoperable Virtual Research Environment to Empower Multidisciplinary Research Communities and Accelerate Innovation and Collaboration Why CERIF? Keith G Jeffery Scientific Coordinator

More information

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure Twan Goosen 1 (CLARIN ERIC), Nuno Freire 2, Clemens Neudecker 3, Maria Eskevich

More information

The OpenAIREplus Project

The OpenAIREplus Project Special thanks to Natalia Manola and Yannis Ioannidis (University of Athens), who contributed to these slides The OpenAIREplus Project Paolo Manghi Istituto di Scienza e Tecnologie dell Informazione Consiglio

More information

Science Europe Consultation on Research Data Management

Science Europe Consultation on Research Data Management Science Europe Consultation on Research Data Management Consultation available until 30 April 2018 at http://scieur.org/rdm-consultation Introduction Science Europe and the Netherlands Organisation for

More information

Global Data Sharing The Research Data Alliance

Global Data Sharing The Research Data Alliance Global Data Sharing The Research Data Alliance Dr. Francine Berman Co Chair, RDA Council Chair, RDA/US Hamilton Distinguished Professor of Computer Science, Rensselaer Polytechnic Institute 25/02/2016

More information

Interoperability Standards and Specification

Interoperability Standards and Specification Interoperability Standards and Specification October 31, 2017 Deliverable Code: D5.4 Version: 1.0 Dissemination level: Public First version of the interoperability standards and specification report that

More information

Big Data Value cppp Big Data Value Association Big Data Value ecosystem

Big Data Value cppp Big Data Value Association Big Data Value ecosystem Big Data Value cppp Big Data Value Association Big Data Value ecosystem Laure Le Bars, SAP, BDVA President and BDVe lead Nuria de Lama, ATOS, BDVA Deputy Secretary General, BDVe co-lead Ana García Robles,

More information

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018 Globus Platform Services for Data Publication Greg Nawrocki greg@globus.org University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018 Outline Globus Overview Globus Data Publication v1 Lessons

More information

> Semantic Web Use Cases and Case Studies

> Semantic Web Use Cases and Case Studies > Semantic Web Use Cases and Case Studies Case Study: A Linked Open Data Resource List Management Tool for Undergraduate Students Chris Clarke, Talis Information Limited and Fiona Greig, University of

More information

ICME: Status & Perspectives

ICME: Status & Perspectives ICME: Status & Perspectives from Materials Science and Engineering Surya R. Kalidindi Georgia Institute of Technology New Strategic Initiatives: ICME, MGI Reduce expensive late stage iterations Materials

More information

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources

FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources FREYA Connected Open Identifiers for Discovery, Access and Use of Research Resources Brian Matthews Data Science and Technology Group Scientific Computing Department STFC Persistent Identifiers Long-lasting

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Giuseppe Fiameni (g.fiameni@cineca.it) Claudio Cacciari SuperComputing, Application and Innovation CINECA Johannes Reatz RZG, Germany Damien

More information

Metadata Ingestion and Processinng

Metadata Ingestion and Processinng biomedical and healthcare Data Discovery Index Ecosystem Ingestion and Processinng Jeffrey S. Grethe, Ph.D. 2017 BioCADDIE All Hands Meeting prototype Ingestion Indexing Repositories Ingestion ElasticSearch

More information

Make the most of your access to ScienceDirect

Make the most of your access to ScienceDirect 1 Make the most of your access to ScienceDirect Present Future 2 ScienceDirect Training Deck We re here to help you make the most of your access to ScienceDirect. ScienceDirect offers researchers the latest

More information

The iplant Data Commons

The iplant Data Commons The iplant Data Commons Using irods to Facilitate Data Dissemination, Discovery, and Reproducibility Jeremy DeBarry, jdebarry@iplantcollaborative.org Tony Edgin, tedgin@iplantcollaborative.org Nirav Merchant,

More information

DataONE: Open Persistent Access to Earth Observational Data

DataONE: Open Persistent Access to Earth Observational Data Open Persistent Access to al Robert J. Sandusky, UIC University of Illinois at Chicago The Net Partners Update: ONE and the Conservancy December 14, 2009 Outline NSF s Net Program ONE Introduction Motivating

More information

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT

EUDAT. A European Collaborative Data Infrastructure. Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT EUDAT A European Collaborative Data Infrastructure Daan Broeder The Language Archive MPI for Psycholinguistics CLARIN, DASISH, EUDAT OpenAire Interoperability Workshop Braga, Feb. 8, 2013 EUDAT Key facts

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Damien Lecarpentier CSC-IT Center for Science, Finland CESSDA workshop Tampere, 5 October 2012 EUDAT Towards a pan-european Collaborative

More information

Informatica Enterprise Information Catalog

Informatica Enterprise Information Catalog Data Sheet Informatica Enterprise Information Catalog Benefits Automatically catalog and classify all types of data across the enterprise using an AI-powered catalog Identify domains and entities with

More information

The ELIXIR of Linked Data

The ELIXIR of Linked Data The ELIXIR of Linked Data Professor Carole Goble (UK node) Barend Mons (NL node), Helen Parkinson (EMBL-EBI node) The Interoperability Services Backbone Team European Life Sciences Infrastructure for Biological

More information

CANARIE Mandate Renewal Proposal

CANARIE Mandate Renewal Proposal CANARIE Mandate Renewal Proposal Kathryn Anthonisen BCNET Conference April 23, 2018 Let s connect! @kanthonisen canarie.ca @canarie_inc canarie.ca @canarie_inc 2 Core Purpose Advancement of Canada s Knowledge

More information

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered.

Content Enrichment. An essential strategic capability for every publisher. Enriched content. Delivered. Content Enrichment An essential strategic capability for every publisher Enriched content. Delivered. An essential strategic capability for every publisher Overview Content is at the centre of everything

More information

Progress towards the EOSC

Progress towards the EOSC Progress towards the EOSC Rapid overview of 6 Current Projects: EOSCpilot Juan Bicarregui einfracentral Alasdair Reid OpenAire Natalia Manola EOSC Hub Tiziana Ferrari FREYA Simon Lambert RDA Europe Sara

More information

Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective

Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective Setting up a CIDOC CRM Adoption and Use Strategy CIDOC CRM: Success Stories, Challenges and New Perspective George Bruseker CIDOC 2017 Tblisi, Georgia 27/09/2017 Researcher, Interpreter Goal: A Semantic

More information

Tools for Data Management. Research Data Management : Session 3 9 th June 2015

Tools for Data Management. Research Data Management : Session 3 9 th June 2015 Tools for Data Management Research Data Management : Session 3 9 th June 2015 What do we mean by tools for data? A system that automates in some way the process of creating, transforming, analysing, visualising,

More information

Big Data infrastructure and tools in libraries

Big Data infrastructure and tools in libraries Line Pouchard, PhD Purdue University Libraries Research Data Group Big Data infrastructure and tools in libraries 08/10/2016 DATA IN LIBRARIES: THE BIG PICTURE IFLA/ UNIVERSITY OF CHICAGO BIG DATA: A VERY

More information

Web of Science. Platform Release Nina Chang Product Release Date: December 10, 2017 EXTERNAL RELEASE DOCUMENTATION

Web of Science. Platform Release Nina Chang Product Release Date: December 10, 2017 EXTERNAL RELEASE DOCUMENTATION Web of Science EXTERNAL RELEASE DOCUMENTATION Platform Release 5.27 Nina Chang Product Release Date: December 10, 2017 Document Version: 1.0 Date of issue: December 7, 2017 RELEASE OVERVIEW The following

More information

Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France

Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France ERF, Big data & Open data Brussels, 7-8 May 2014 EU-T0, Data

More information

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013 Data Replication: Automated move and copy of data PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013 Claudio Cacciari c.cacciari@cineca.it Outline The issue

More information

Open Research Online The Open University s repository of research publications and other research outputs

Open Research Online The Open University s repository of research publications and other research outputs Open Research Online The Open University s repository of research publications and other research outputs The Smart Book Recommender: An Ontology-Driven Application for Recommending Editorial Products

More information

CREATING SMART TRANSPORT SERVICES BY FACILITATING THE RE-USE OF OPEN GIS DATA

CREATING SMART TRANSPORT SERVICES BY FACILITATING THE RE-USE OF OPEN GIS DATA OPEN TRANSPORT NET TOMAS MILDORF 16 JUNE 2014 INSPIRE CONFERENCE 2014, AALBORG, DENMARK CREATING SMART TRANSPORT SERVICES BY FACILITATING THE RE-USE OF OPEN GIS DATA 2 1 OTN AT A GLANCE Full title OpenTransportNet

More information

Open Science, FAIR data and effective data management

Open Science, FAIR data and effective data management , FAIR data and effective data management This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License Federica Rosetta Director, Global Strategic Networks

More information

Customising Location of Knowledge. Ann Apps and Ross MacIntyre MIMAS, The University of Manchester, UK

Customising Location of Knowledge. Ann Apps and Ross MacIntyre MIMAS, The University of Manchester, UK Customising Location of Ann Apps and Ross MacIntyre MIMAS, The University of Manchester, UK Outline Supporting scholarly research Overview of finding articles using Zetoc and OpenURL linking Institution

More information

NSF gateway to Scientific literature

NSF gateway to Scientific literature NSF gateway to Scientific literature Workshop on Proposal Writing National Science Foundation 19 June 2012 Sunethra Perera Outline NSF Literature Local Literature at the NSF Local Literature at Other institutions

More information

Regional Information Centre for Scientific and Technological Cooperation with EU, Voronezh State University 1-2/07/2010, Voronezh

Regional Information Centre for Scientific and Technological Cooperation with EU, Voronezh State University 1-2/07/2010, Voronezh REGIONAL NETWORK FOR SUPPORT OF S&T COOPERATION BETWEEN RUSSIAN REGIONS AND THE EU Regional Information Centre for Scientific and Technological Cooperation with EU, Voronezh State University 1-2/07/2010,

More information

ESA EO Programmes for CM16. EOEP-5 Block 4. Bilateral meeting with AT Delegation and Industry Vienna, 24/05/2016. ESA UNCLASSIFIED - For Official Use

ESA EO Programmes for CM16. EOEP-5 Block 4. Bilateral meeting with AT Delegation and Industry Vienna, 24/05/2016. ESA UNCLASSIFIED - For Official Use ESA EO Programmes for CM16 EOEP-5 Block 4 Bilateral meeting with AT Delegation and Industry Vienna, 24/05/2016 EOEP-5 Block-5: EO Science for Society EO Science for Society will foster scientific excellence,

More information

Data Management Checklist

Data Management Checklist Data Management Checklist Managing research data throughout its lifecycle ensures its long-term value and prevents data from falling into digital obsolescence. Proper data management is a key prerequisite

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure - A Nordic Perspective? -

EUDAT. Towards a pan-european Collaborative Data Infrastructure - A Nordic Perspective? - EUDAT Towards a pan-european Collaborative Data Infrastructure - A Nordic Perspective? - Damien Lecarpentier CSC-IT Center for Science, Finland NeIC Conference Trondheim, 16 May 2013 Data trends Exponential

More information

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation

Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation Long-term preservation for INSPIRE: a metadata framework and geo-portal implementation INSPIRE 2010, KRAKOW Dr. Arif Shaon, Dr. Andrew Woolf (e-science, Science and Technology Facilities Council, UK) 3

More information

EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd

EUDAT Data Services & Tools for Researchers and Communities. Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd EUDAT Data Services & Tools for Researchers and Communities Dr. Per Öster Director, Research Infrastructures CSC IT Center for Science Ltd CSC IT CENTER FOR SCIENCE! Founded in 1971 as a technical support

More information

Edinburgh DataShare: Tackling research data in a DSpace institutional repository

Edinburgh DataShare: Tackling research data in a DSpace institutional repository Edinburgh DataShare: Tackling research data in a DSpace institutional repository Robin Rice EDINA and Data Library, Information Services University of Edinburgh, Scotland DSpace User Group Meeting Gothenburg,

More information

Launching the. Data Curation Network NDS/MBDH 2018

Launching the. Data Curation Network NDS/MBDH 2018 NDS/MBDH 2018 Launching the Data Curation Network Lisa Johnston University of Minnesota Jake Carlson University of Michigan Cynthia Hudson-Vitale Penn State Univ. Heidi Imker University of Illinois Wendy

More information

Paving the Rocky Road Toward Open and FAIR in the Field Sciences

Paving the Rocky Road Toward Open and FAIR in the Field Sciences Paving the Rocky Road Toward Open and FAIR Kerstin Lehnert Lamont-Doherty Earth Observatory, Columbia University IEDA (Interdisciplinary Earth Data Alliance), www.iedadata.org IGSN e.v., www.igsn.org Field

More information

Data publication and discovery with Globus

Data publication and discovery with Globus Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,

More information

OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe

OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe Natalia Manola University of Athens Department of Informatics and Telecommunications OpenAIRE From Pilot to Service The Open Knowledge Infrastructure for Europe Outline Open Access in Europe Brief history

More information

Historical Text Mining:

Historical Text Mining: Historical Text Mining Historical Text Mining, and Historical Text Mining: Challenges and Opportunities Dr. Robert Sanderson Dept. of Computer Science University of Liverpool azaroth@liv.ac.uk http://www.csc.liv.ac.uk/~azaroth/

More information

N. Marusov, I. Semenov

N. Marusov, I. Semenov GRID TECHNOLOGY FOR CONTROLLED FUSION: CONCEPTION OF THE UNIFIED CYBERSPACE AND ITER DATA MANAGEMENT N. Marusov, I. Semenov Project Center ITER (ITER Russian Domestic Agency N.Marusov@ITERRF.RU) Challenges

More information

THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I)

THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I) ENVIROfying the Future Internet THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET Project introduction and technical foundations (I) INSPIRE Conference Firenze,

More information

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development Jeremy Fischer Indiana University 9 September 2014 Citation: Fischer, J.L. 2014. ACCI Recommendations on Long Term

More information

Helix Nebula The Science Cloud

Helix Nebula The Science Cloud Helix Nebula The Science Cloud CERN 14 May 2014 Bob Jones (CERN) This document produced by Members of the Helix Nebula consortium is licensed under a Creative Commons Attribution 3.0 Unported License.

More information

21ST century enterprise. HCL Technologies Presents. Roadmap for Data Center Transformation

21ST century enterprise. HCL Technologies Presents. Roadmap for Data Center Transformation 21ST century enterprise HCL Technologies Presents Roadmap for Data Center Transformation june 2016 21st Century Impact on Data Centers The rising wave of digitalization has changed the way IT impacts business.

More information