The challenge of collecting and evaluating LRs for commercial use

Size: px
Start display at page:

Download "The challenge of collecting and evaluating LRs for commercial use"

Transcription

1 Language Technologies Observatory The challenge of collecting and evaluating LRs for commercial use Bente Maegaard, CLARIN ERIC (and University of Copenhagen)

2 Overview of the challenges 2 Which LRs are needed? Where to find them Quality measures Improving LRs Awareness

3 LRs needed 3 (Many more) Language Resources for commercial applications are needed! Large scale LRs for e.g. SMT For many languages and domains Which languages? All European, many important international languages Which domains? Difficult to point to particular domains that are suffering as so little is available LRs are scarce and information about them is scattered LTO has created the LR Catalogue

4 Related work this is not a challenge! 4 Related work META-SHARE, ELRA, CLARIN VLO, OPUS, TAUS, PANACEA, etc The LTO focus in on LRs for commercial use

5 Evaluating LRs, usability 5 No shared method for evaluating the quality of an LR ELRA has validation procedure (Operational) Usability easier to define

6 Elements of quality/usability 6 Ease of access Ease of download and implementation Domain relevance Languages available Source and target language? is it really a translation? Cf. Multilingual websites Is it recent and/or updated? Availability/cost Sometimes size can be a quality

7 Selection criteria 7 Selection criteria for the LTO catalogue Availability Languages covered Validation Longevity (is it updated) Ease of download

8 Metadata 8 Metadata are indispensable Nothing can be found without metadata Time-consuming, find good balance

9 LTO metadata 9 Title Creator Contact person Description Language (mono-, bi- ) Resource type Availability Tags Modality - text URL Domain Format Format descr. Size Production date Comment

10 Challenge: insufficient metadata - Valorisation process 10 Many LRs detected could not go directly into the LTO Catalogue Insufficient metadata Metadata have been added/adapted/improved

11 Challenge: creation of new resources. Tools for new resources 11 If LRs don t exist, create them! Tools for corpus creation Tools for terminology extraction Have been investigated, guides written IPR a problem

12 Number of parallel corpora per language 12

13 Results of data collection investigations - challenges 13 Many LRs exist for academic purposes Not available for commercial use Companies don t share Many bilingual corpora are also multilingual Tools exist which help the creation of new LRs BUT: IPR rules make it very time-consuming to obtain the rights to use what you find on the web. Fair use would really help! Only small developments compared to META- White Papers

14 Results of implementation 14 The LTO catalogue exists The LTO catalogue only contains references to useful LRs We are happy to learn about more LRs that could be part of the LTO catalogue

15 Gap description summary of challenges 15 Coverage gap Languages, domains Awareness gap Demand side often unaware of what actually exists, which makes the LTO Catalogue particularly useful Supply side does not know which are the most needed LRs Quality gap Valorisation is needed Quantity gap Large quantities are needed for modern LT

16 Thank you! LT-Observatory Observatory for LR and MT in Europe Funded by the

On the way to Language Resources sharing: principles, challenges, solutions

On the way to Language Resources sharing: principles, challenges, solutions On the way to Language Resources sharing: principles, challenges, solutions Stelios Piperidis ILSP, RC Athena, Greece spip@ilsp.gr Content on the Multilingual Web, 4-5 April, Pisa, 2011 Co-funded by the

More information

Language Resources. Khalid Choukri ELRA/ELDA 55 Rue Brillat-Savarin, F Paris, France Tel Fax.

Language Resources. Khalid Choukri ELRA/ELDA 55 Rue Brillat-Savarin, F Paris, France Tel Fax. Language Resources By the Other Data Center over 15 years fruitful partnership Khalid Choukri ELRA/ELDA 55 Rue Brillat-Savarin, F-75013 Paris, France Tel. +33 1 43 13 33 33 -- Fax. +33 1 43 13 33 30 choukri@elda.org

More information

CLARIN for Linguists Portal & Searching for Resources. Jan Odijk LOT Summerschool Nijmegen,

CLARIN for Linguists Portal & Searching for Resources. Jan Odijk LOT Summerschool Nijmegen, CLARIN for Linguists Portal & Searching for Resources Jan Odijk LOT Summerschool Nijmegen, 2014-06-23 1 Overview CLARIN Portal Find data and tools 2 Overview CLARIN Portal Find data and tools 3 CLARIN

More information

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure

Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure Bringing Europeana and CLARIN together: Dissemination and exploitation of cultural heritage data in a research infrastructure Twan Goosen 1 (CLARIN ERIC), Nuno Freire 2, Clemens Neudecker 3, Maria Eskevich

More information

META-SHARE : the open exchange platform Overview-Current State-Towards v3.0

META-SHARE : the open exchange platform Overview-Current State-Towards v3.0 META-SHARE : the open exchange platform Overview-Current State-Towards v3.0 Stelios Piperidis Athena RC, Greece spip@ilsp.gr A Strategy for Multilingual Europe Brussels, Belgium, June 20/21, 2012 Co-funded

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a publisher's version. For additional information about this publication click this link. http://hdl.handle.net/2066/40896

More information

ISLE Metadata Initiative (IMDI) PART 1 B. Metadata Elements for Catalogue Descriptions

ISLE Metadata Initiative (IMDI) PART 1 B. Metadata Elements for Catalogue Descriptions ISLE Metadata Initiative (IMDI) PART 1 B Metadata Elements for Catalogue Descriptions Version 3.0.13 August 2009 INDEX 1 INTRODUCTION...3 2 CATALOGUE ELEMENTS OVERVIEW...4 3 METADATA ELEMENT DEFINITIONS...6

More information

GETTING STARTED WITH DIGITAL COMMONWEALTH

GETTING STARTED WITH DIGITAL COMMONWEALTH GETTING STARTED WITH DIGITAL COMMONWEALTH Digital Commonwealth (www.digitalcommonwealth.org) is a Web portal and fee-based repository service for online cultural heritage materials held by Massachusetts

More information

Formats and standards for metadata, coding and tagging. Paul Meurer

Formats and standards for metadata, coding and tagging. Paul Meurer Formats and standards for metadata, coding and tagging Paul Meurer The FAIR principles FAIR principles for resources (data and metadata): Findable (-> persistent identifier, metadata, registered/indexed)

More information

clarin:el an infrastructure for documenting, sharing and processing language data

clarin:el an infrastructure for documenting, sharing and processing language data clarin:el an infrastructure for documenting, sharing and processing language data Stelios Piperidis, Penny Labropoulou, Maria Gavrilidou (Athena RC / ILSP) the problem 19/9/2015 ICGL12, FU-Berlin 2 use

More information

The MultilingualWeb-LT project

The MultilingualWeb-LT project Multilingualism & Drupal The MultilingualWeb-LT project Denver, CO. March 20th, 2012 Seite 1 Cocomore essentials Agency for integrated communication and IT services 120 employees Offices in Germany and

More information

From Open Data to Data- Intensive Science through CERIF

From Open Data to Data- Intensive Science through CERIF From Open Data to Data- Intensive Science through CERIF Keith G Jeffery a, Anne Asserson b, Nikos Houssos c, Valerie Brasse d, Brigitte Jörg e a Keith G Jeffery Consultants, Shrivenham, SN6 8AH, U, b University

More information

Janne Bondi johannessen, Anders Nøklestad, Joel Priestley and Kristin Hagen. WP5: Glossa Integration

Janne Bondi johannessen, Anders Nøklestad, Joel Priestley and Kristin Hagen. WP5: Glossa Integration Janne Bondi johannessen, Anders Nøklestad, Joel Priestley and Kristin Hagen WP5: Glossa Integration WP5 Glossa integration The current Glossa corpus interface and analysis tool will be integrated in the

More information

The Virtual Language Observatory!

The Virtual Language Observatory! The Virtual Language Observatory! Dieter Van Uytvanck! CMDI workshop, Nijmegen! 2012-09-13! 1! Overview! VLO?! What is behind it? Relation to CMDI?! How do I get my data in there?! Demo + excercises!!

More information

PANACEA Project Grant Agreement no.:

PANACEA Project Grant Agreement no.: SEVENTH FRAMEWORK PROGRAMME THEME 3 Information and communication Technologies PANACEA Project Grant Agreement no.: 248064 Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of

More information

Data is the new Oil (Ann Winblad)

Data is the new Oil (Ann Winblad) Data is the new Oil (Ann Winblad) Keith G Jeffery keith.jeffery@keithgjefferyconsultants.co.uk 20140415-16 JRC Workshop Big Open Data Keith G Jeffery 1 Data is the New Oil Like oil has been, data is Abundant

More information

CORLI. a linguistic consortium for corpus, language and interaction

CORLI. a linguistic consortium for corpus, language and interaction CORLI a linguistic consortium for corpus, language and interaction CORLI and HUMA-NUM CORLI = Corpus, Languages, and Interaction a French consortium of Huma-Num involved in linguistic research and teaching

More information

QUICK REFERENCE GUIDE

QUICK REFERENCE GUIDE Folders new projects. Organise your folders to find files quickly and easily 1 Look in your yellow storage Folders it can be organised into simple folder structures to help with browsing 2 Click on your

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Martin Hellmich Slides adapted from Damien Lecarpentier DCH-RP workshop, Manchester, 10 April 2013 Research Infrastructures Research Infrastructure

More information

Versioning with PIDs

Versioning with PIDs Versioning with PIDs Martin Matthiesen, CSC IT Center for Science Ute Dieckmann, University of Helsinki CLARIN annual conference, Pisa, Italy Contents What we do: The repository in a nutshell The Language

More information

Platform UI Specification

Platform UI Specification Platform UI Specification November 25, 2016 Deliverable Code: D6.4 Version: 1.0 Final Dissemination level: Public This report presents the OpenMinTeD platform user interface design and implementation issues

More information

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS

For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS 1 1. USE CASES For each use case, the business need, usage scenario and derived requirements are stated. 1.1 USE CASE 1: EXPLORE AND SEARCH FOR SEMANTIC ASSESTS Business need: Users need to be able to

More information

CACAO PROJECT AT THE 2009 TASK

CACAO PROJECT AT THE 2009 TASK CACAO PROJECT AT THE TEL@CLEF 2009 TASK Alessio Bosca, Luca Dini Celi s.r.l. - 10131 Torino - C. Moncalieri, 21 alessio.bosca, dini@celi.it Abstract This paper presents the participation of the CACAO prototype

More information

THE ULTIMATE SEO MIGRATION GUIDE

THE ULTIMATE SEO MIGRATION GUIDE THE ULTIMATE SEO MIGRATION GUIDE How to avoid a drop in ranking when launching a new website YOU KNOW THAT IT S TIME TO REBUILD YOUR WEBSITE, THERE IS ALWAYS THAT NAGGING CONCERN THAT THE NEW WEBSITE WILL

More information

META-SHARE metadata: Overview of the schema & Interoperability with other schemas

META-SHARE metadata: Overview of the schema & Interoperability with other schemas META-SHARE metadata: Overview of the schema & Interoperability with other schemas Penny Labropoulou & Maria Gavrilidou (ILSP/RC Athena) CMDI Interoperability Workshop Utrecht, Netherlands 4-5 June 2013

More information

Curation module in action - its preliminary findings on VLO metadata quality

Curation module in action - its preliminary findings on VLO metadata quality Curation module in action - its preliminary findings on VLO metadata quality Davor Ostojić, Go Sugimoto, Matej Ďurčo (Austrian Centre for Digital Humanities) CLARIN Annual Conference 2016, Aix-en-Provence,

More information

Knowledge Inventory for hydrogeology research

Knowledge Inventory for hydrogeology research Knowledge Inventory for hydrogeology research This project has received funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No 642047. REDIAM Environmental

More information

META-SHARE: An Open Resource Exchange Infrastructure for Stimulating Research and Innovation

META-SHARE: An Open Resource Exchange Infrastructure for Stimulating Research and Innovation META-SHARE: An Open Resource Exchange Infrastructure for Stimulating Research and Innovation Stelios Piperidis Athena RC, Greece spip@ilsp.athena-innovation.gr Solutions for Multilingual Europe Budapest,

More information

Acrobat Pro: Useful standard features included in Acrobat Professional

Acrobat Pro: Useful standard features included in Acrobat Professional fessional PDF files are extremely pervasive in the printing industry. Although they are a common file format, many people are not taking full advantage of the useful features incorporated into Acrobat

More information

Striving for efficiency

Striving for efficiency Ron Dekker Director CESSDA Striving for efficiency Realise the social data part of EOSC How to Get the Maximum from Research Data Prerequisites and Outcomes University of Tartu, 29 May 2018 Trends 1.Growing

More information

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal

EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal Heinrich Widmann, DKRZ DI4R 2016, Krakow, 28 September 2016 www.eudat.eu EUDAT receives funding from the European Union's Horizon 2020

More information

Platform UI Specification (20)

Platform UI Specification (20) Platform UI Specification (20) June 20, 2017 Deliverable Code: D6.5 Version: 1.0 Final Dissemination level: Public This report presents the OpenMinTeD platform user interface design and implementation

More information

SYMMETRIC STORAGE VIRTUALISATION IN THE NETWORK

SYMMETRIC STORAGE VIRTUALISATION IN THE NETWORK SYMMETRIC STORAGE VIRTUALISATION IN THE NETWORK The symmetric and asymmetric virtualisation models are representatives of storage virtualization in the network. In both approaches it is possible to perform

More information

Multilingual Information Access for Digital Libraries The Metadata Records Translation Project

Multilingual Information Access for Digital Libraries The Metadata Records Translation Project Multilingual Information Access for Digital Libraries The Metadata Records Translation Project Jiangping Chen Http://max.lis.unt.edu/ Jiangping.chen@unt.edu July 2011 Presentation Outline About Me Current

More information

Nuno Freire National Library of Portugal Lisbon, Portugal

Nuno Freire National Library of Portugal Lisbon, Portugal Date submitted: 05/07/2010 UNIMARC in The European Library and related projects Nuno Freire National Library of Portugal Lisbon, Portugal E-mail: nuno.freire@bnportugal.pt Meeting: 148. UNIMARC WORLD LIBRARY

More information

How can CLARIN archive and curate my resources?

How can CLARIN archive and curate my resources? How can CLARIN archive and curate my resources? Christoph Draxler draxler@phonetik.uni-muenchen.de Outline! Relevant resources CLARIN infrastructure European Research Infrastructure Consortium National

More information

The DART-Europe E-theses Portal

The DART-Europe E-theses Portal The DART-Europe E-theses Portal Martin Moyle Digital Curation Manager UCL Library Services, UK m.moyle@ucl.ac.uk ETD 2009, University of Pittsburgh, June 10-13 2009 Contents DART-Europe: background The

More information

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector

Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector Open Archives Initiatives Protocol for Metadata Harvesting Practices for the cultural heritage sector Relais Culture Europe mfoulonneau@relais-culture-europe.org Community report A community report on

More information

Towards a Linked Open Data Cloud of Language Resources in the Legal Domain

Towards a Linked Open Data Cloud of Language Resources in the Legal Domain Building the Legal Knowledge Graph for Smart Compliance Services in Multilingual Europe Towards a Linked Open Data Cloud of Language Resources in the Legal Domain Patricia Martín-Chozas, Elena Montiel-Ponsoda,

More information

Two Traditions of Metadata Development

Two Traditions of Metadata Development Two Traditions of Metadata Development Bibliographic control approach developed before computer technology and internet were commonplace. mainly used in libraries and universities. from early on used rules

More information

OASIS TECHNICAL COMMITTEE FORMAT OF AUTOMOTIVE REPAIR INFORMATION

OASIS TECHNICAL COMMITTEE FORMAT OF AUTOMOTIVE REPAIR INFORMATION OASIS TECHNICAL COMMITTEE FORMAT OF AUTOMOTIVE REPAIR INFORMATION Document Control Deliverable Code SC2-D3 Author(s) John Chelsom and SC2 working group Date 18-12-2002 Version Version 3.0 Notes: This document

More information

COLUMN. Choosing the right CMS authoring tools. Three key criteria will determine the most suitable authoring environment NOVEMBER 2003

COLUMN. Choosing the right CMS authoring tools. Three key criteria will determine the most suitable authoring environment NOVEMBER 2003 KM COLUMN NOVEMBER 2003 Choosing the right CMS authoring tools The authoring environment is the most important aspect of a content management system (CMS), for without content authors, there would be nothing

More information

Self-tuning ongoing terminology extraction retrained on terminology validation decisions

Self-tuning ongoing terminology extraction retrained on terminology validation decisions Self-tuning ongoing terminology extraction retrained on terminology validation decisions Alfredo Maldonado and David Lewis ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin

More information

Pri nceton Prime 3/24/2015. Welcome to What s New with Prime Journals.

Pri nceton Prime 3/24/2015. Welcome to What s New with Prime Journals. Welcome to What s New with Prime Journals. 1 We have made several improvements to Prime Journals. Journal instructions, rules, and error messages have been clarified and updated. Journal entry usability

More information

EUDAT. Towards a pan-european Collaborative Data Infrastructure

EUDAT. Towards a pan-european Collaborative Data Infrastructure EUDAT Towards a pan-european Collaborative Data Infrastructure Damien Lecarpentier CSC-IT Center for Science, Finland CESSDA workshop Tampere, 5 October 2012 EUDAT Towards a pan-european Collaborative

More information

Linport: Interoperable Translation and Localization Project Containers (A Progress Report) Alan Melby and Jeremy Coombs

Linport: Interoperable Translation and Localization Project Containers (A Progress Report) Alan Melby and Jeremy Coombs Linport: Interoperable Translation and Localization Project Containers (A Progress Report) Alan Melby and Jeremy Coombs What is Linport? (this slide and 2 more) An acronym for Language Interoperability

More information

1 Web references are marked by bracketed number order and can be found at the end of this squib.

1 Web references are marked by bracketed number order and can be found at the end of this squib. Squibs The Language Resource Switchboard Claus Zinn University of Tübingen Department of Computational Linguistics claus.zinn@uni-tuebingen.de The CLARIN research infrastructure gives users access to an

More information

Machine Translation Research in META-NET

Machine Translation Research in META-NET Machine Translation Research in META-NET Jan Hajič Institute of Formal and Applied Linguistics Charles University in Prague, CZ hajic@ufal.mff.cuni.cz Solutions for Multilingual Europe Budapest, Hungary,

More information

Visualization as Part of the Linguistic Processing Pipeline

Visualization as Part of the Linguistic Processing Pipeline Visualization as Part of the Linguistic Processing Pipeline Chris Culy and Verena Lyding European Academy Bozen-Bolzano {christopher.culy, verena.lyding}@eurac.edu Visualization of linguistic information

More information

Self Introduction. Presentation Outline. College of Information 3/31/2016. Multilingual Information Access to Digital Collections

Self Introduction. Presentation Outline. College of Information 3/31/2016. Multilingual Information Access to Digital Collections College of Information Multilingual Information Access to Digital Collections Jiangping Chen Http://coolt.lis.unt.edu/ Jiangping.chen@unt.edu April 20, 2016 Self Introduction An Associate Professor at

More information

PCSchool Search User Guide. Search User Guide. Created in version /11

PCSchool Search User Guide. Search User Guide. Created in version /11 Search User Guide Created in version 2007.4.0.5 1/11 Table of Contents Search Keyword Search... 2 Propagate Search Words... 3 Report Format... 3 Report Format... 4 Popup Format... 5 Keyword Search... 7

More information

Comments submitted by IGTF-J (Internet Governance Task Force of Japan) *

Comments submitted by IGTF-J (Internet Governance Task Force of Japan) * submitted by IGTF-J (Internet Governance Task Force of Japan) * http:www.igtf.jp Do you have any comments on the process of determining the issues and their presentation by the WGIG? Yes. The purpose of

More information

Implementation of the Data Seal of Approval

Implementation of the Data Seal of Approval Implementation of the Data Seal of Approval The Data Seal of Approval board hereby confirms that the Trusted Digital repository The Language Bank of Finland complies with the guidelines version 2014-2017

More information

2/1/2016. Discuss website usability essentials Explain principles of design Critique a website in terms of usability and design

2/1/2016. Discuss website usability essentials Explain principles of design Critique a website in terms of usability and design Due Tuesday, Feb. 9 upload to Blackboard Locate five HTML (not Flash) websites you believe exhibit good web design, usability and accessibility principles. Each website s critique is worth 10 points (50

More information

1 Overview chart. PIDs: talk with EPIC PIDs: MoU or advice. Assessment wave 3. VLO overhaul CMDI 1.2

1 Overview chart. PIDs: talk with EPIC PIDs: MoU or advice. Assessment wave 3. VLO overhaul CMDI 1.2 Title Centre Committee work plan 2014 Version 2 Author(s) Dieter Van Uytvanck Date 2014-02- 05 Status To be approved Distribution Centre Committee, NCF, BOD ID CE- 2013-0257 1 Overview chart PIDs: talk

More information

Make the Multilingual Web Work. TPAC 2012 Breakout Session

Make the Multilingual Web Work. TPAC 2012 Breakout Session Make the Multilingual Web Work TPAC 2012 Breakout Session 1. MULTILINGUALWEB-LT WORKING GROUP Gap: Metadata in the deep Web Input from the data base the hidden web : Ob postbank direkt, online-banking,

More information

D5.1.1: DRUPAL MT TRAINING MODULE

D5.1.1: DRUPAL MT TRAINING MODULE : DRUPAL MT TRAINING MODULE Karl Fritsche, Stephan Walter () Distribution: Public MultilingualWeb-LT (LT-Web) Language Technology in the Web FP7-ICT-2011-7 Project no: 287815 Document Information Deliverable

More information

SEO. Drivers You Are Missing in Content Marketing

SEO. Drivers You Are Missing in Content Marketing SEO Drivers You Are Missing in Content Marketing SEO IS ALWAYS CHANGING. HICH MEANS your content strategy what you create and how it is found is ALWAYS CHANGING AS WELL. BUT IS SEO ALWAYS CHANGING? BECAUSE

More information

LIDER: Building Free, Interlinked, and Interoperable Language Resources. Asunción Gómez- Pérez Philipp Cimiano

LIDER: Building Free, Interlinked, and Interoperable Language Resources. Asunción Gómez- Pérez Philipp Cimiano LIDER: Building Free, Interlinked, and Interoperable Language Resources Asunción Gómez- Pérez Philipp Cimiano MulBlingual Web Workshop Riga, 28th of April. 2015 20/11/2014 Presenter name 1. Surveys 2.

More information

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS OPPORTUNITIES) FREME WEBINAR HELD FOR GALA, 28 APRIL 2016

A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N BUSINESS OPPORTUNITIES) FREME WEBINAR HELD FOR GALA, 28 APRIL 2016 Co-funded by the Horizon 2020 Framework Programme of the European Union Grant Agreement Number 644771 www.freme-project.eu A FRAMEWORK FOR MULTILINGUAL AND SEMANTIC ENRICHMENT OF DIGITAL CONTENT (NEW L10N

More information

CMDI and granularity

CMDI and granularity CMDI and granularity Identifier CLARIND-AP3-007 AP 3 Authors Dieter Van Uytvanck, Twan Goosen, Menzo Windhouwer Responsible Dieter Van Uytvanck Reference(s) Version Date Changes by State 1 2011-01-24 Dieter

More information

What Do You Mean It Doesn t Make Sense? Redesigning Finding Aids from the Users Perspective

What Do You Mean It Doesn t Make Sense? Redesigning Finding Aids from the Users Perspective What Do You Mean It Doesn t Make Sense? Redesigning Finding Aids from the Users Perspective Rethinking Finding Aids Archival patrons don t understand archival terminology Don t understand hierarchy Need

More information

EU mhealth Working Group

EU mhealth Working Group EU mhealth Working Group 500 million users 45000 publishers 5% of total app market 3bn downloads Status Guidelines for assessing reliability & validity Lots of work on Scope, Target Group & Criteria https://ec.europa.eu/digital-single-market/en/mhealth

More information

Chain of Data Creation. Data Creation. Lab Notebook. Plan Your Experiment, Experiment With your Plan

Chain of Data Creation. Data Creation. Lab Notebook. Plan Your Experiment, Experiment With your Plan Chain of Data Creation Data Creation 1. Preparation 2. Creation of Metadata 3. Acquisition 4. Building a Permanent Record 5. Data Management 6. Storage 7. Data Sharing Lab Notebook Plan Your Experiment,

More information

Best practices in the design, creation and dissemination of speech corpora at The Language Archive

Best practices in the design, creation and dissemination of speech corpora at The Language Archive LREC Workshop 18 2012-05-21 Istanbul Best practices in the design, creation and dissemination of speech corpora at The Language Archive Sebastian Drude, Daan Broeder, Peter Wittenburg, Han Sloetjes The

More information

WHAT S NEW IN QLIKVIEW 11

WHAT S NEW IN QLIKVIEW 11 WHAT S NEW IN QLIKVIEW 11 QlikView 11 takes Business Discovery to a whole new level by enabling users to more easily share information with coworkers, supporting larger enterprise deployments through enhanced

More information

Multilingual Guides. Many languages: Just one document

Multilingual Guides. Many languages: Just one document Multilingual Guides Many languages: Just one document What s a multilingual guide? Explains how to quickly install/set up a product Particularly used for hardware products Contains more than one language

More information

Storing Data in Objects

Storing Data in Objects Storing Data in Objects Rob Miles Department of Computer Science 28d 08120 Programming 2 Objects and Items I have said for some time that you use objects to represent things in your problem Objects equate

More information

Overview. Finding information and help Adobe Acrobat. Where to find it and why to use it. When converting from Word to Acrobat

Overview. Finding information and help Adobe Acrobat. Where to find it and why to use it. When converting from Word to Acrobat Formatting 101: Adobe Acrobat October 2017 Overview 2 Finding information and help Adobe Acrobat Where to find it and why to use it How to embed fonts When converting from Word to Acrobat Using the grid

More information

TaaS. Terminology as a Service. Project no

TaaS. Terminology as a Service. Project no TaaS Terminology as a Service Project no. 296312 Annual Public Report Year 2013 Contents Project Objectives... 3 Work progress and achievements during the period... 3 Acquiring Terminological Data from

More information

Introduction

Introduction Introduction EuropeanaConnect All-Staff Meeting Berlin, May 10 12, 2010 Welcome to the All-Staff Meeting! Introduction This is a quite big meeting. This is the end of successful project year Project established

More information

First metadata-enabled service in Croatian Webspace

First metadata-enabled service in Croatian Webspace WWW.HR: First metadata-enabled service in Croatian Webspace Igor Ljubi, Gordan Gledec Department of Telecommunications Faculty of Electrical Engineering and Computing University of Zagreb CUC 2001 September

More information

Story Workbench Quickstart Guide Version 1.2.0

Story Workbench Quickstart Guide Version 1.2.0 1 Basic Concepts Story Workbench Quickstart Guide Version 1.2.0 Mark A. Finlayson (markaf@mit.edu) Annotation An indivisible piece of data attached to a text is called an annotation. Annotations, also

More information

Living Library Federica Campus Virtuale Policy document

Living Library Federica Campus Virtuale Policy document Living Library Federica Campus Virtuale Policy document www.federica.unina.it P.O. FESR 2007-2013 Asse V, O.O. 5.1 e-government ed e-inclusion - Progetto: Campus Virtuale 1. Mission of the Living Library

More information

Information Retrieval and Knowledge Organisation

Information Retrieval and Knowledge Organisation Information Retrieval and Knowledge Organisation Knut Hinkelmann Content Information Retrieval Indexing (string search and computer-linguistic aproach) Classical Information Retrieval: Boolean, vector

More information

Metadata Catalogue Issues. Daan Broeder Max-Planck Institute for Psycholinguistics

Metadata Catalogue Issues. Daan Broeder Max-Planck Institute for Psycholinguistics Metadata Catalogue Issues Daan Broeder Max-Planck Institute for Psycholinguistics Introduction Methods of registering resources Metadata Making metadata interoperable Exposing metadata Facilitating resource

More information

Linked Open Data and Semantic Technologies for Research in Agriculture and Forestry

Linked Open Data and Semantic Technologies for Research in Agriculture and Forestry Linked Open and Semantic Technologies for Research in Agriculture and Forestry Platform Linked Nederland 2 April 2015 Rob Lokers, Alterra, Wageningen UR Contents related challenges in agricultural (and

More information

Data Curation Profile Human Genomics

Data Curation Profile Human Genomics Data Curation Profile Human Genomics Profile Author Profile Author Institution Name Contact J. Carlson N. Brown Purdue University J. Carlson, jrcarlso@purdue.edu Date of Creation October 27, 2009 Date

More information

TMX Wish List. FEISGILTT 2016 Dublin. Angelika Zerfaß

TMX Wish List. FEISGILTT 2016 Dublin. Angelika Zerfaß TMX Wish List FEISGILTT 2016 Dublin Angelika Zerfaß TMX usage TMX is being used widely to move data between systems and so far it does a good job. But when asked what TMX should be able to do on top of

More information

B2FIND and Metadata Quality

B2FIND and Metadata Quality B2FIND and Metadata Quality 3 rd EUDAT Conference 25 September 2014 Heinrich Widmann and B2FIND team 1 Outline B2FIND the EUDAT Metadata Service Semantic Mapping of Metadata Quality of Metadata Summary

More information

Ryen W. White, Matthew Richardson, Mikhail Bilenko Microsoft Research Allison Heath Rice University

Ryen W. White, Matthew Richardson, Mikhail Bilenko Microsoft Research Allison Heath Rice University Ryen W. White, Matthew Richardson, Mikhail Bilenko Microsoft Research Allison Heath Rice University Users are generally loyal to one engine Even when engine switching cost is low, and even when they are

More information

Student Success Guide

Student Success Guide Student Success Guide Contents Like a web page, links in this document can be clicked and they will take you to where you want to go. Using a Mouse 6 The Left Button 6 The Right Button 7 The Scroll Wheel

More information

Saving Potential in Technical Documentation

Saving Potential in Technical Documentation Saving Potential in Technical Documentation Shorter time to market, increasingly complex products, and a growing variety of languages are challenges that enterprises which operate on an international scale

More information

Supporting a Locale of One: Global Content Delivery for the Individual

Supporting a Locale of One: Global Content Delivery for the Individual Prof Vinny Wade Supporting a Locale of One: Global Content Delivery for the Individual Trinity College, Dublin University Deputy Director, CNGL Outline Challenge: Changing Face of Content Dynamic Multillingual

More information

Metadata for the Multilingual Web

Metadata for the Multilingual Web 19 Felix Sasaki 1 DFKI / W3C Fellow felix.sasaki@dfki.de We describe the Internationalization Tag Set (ITS) 2.0, an upcoming standard to foster the development of the multilingual Web. ITS 2.0 provides

More information

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data

EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data EUDAT-B2FIND A FAIR and Interdisciplinary Discovery Portal for Research Data Heinrich Widmann, DKRZ Claudia Martens, DKRZ Open Science Days, Berlin, 17 October 2017 www.eudat.eu EUDAT receives funding

More information

ECP-2007-GEO OneGeology-Europe. Annex 1: Cookbook

ECP-2007-GEO OneGeology-Europe. Annex 1: Cookbook ECP-2007-GEO-317001 OneGeology-Europe Annex 1: Cookbook for creating multilingual metadata records using the OneGeology-Europe Metadata system (MIcKA) Authors: Lucie Kondrová, Robert Tomas, Štěpán Kafka

More information

Working towards a Metadata Federation of CLARIN and DARIAH-DE

Working towards a Metadata Federation of CLARIN and DARIAH-DE Working towards a Metadata Federation of CLARIN and DARIAH-DE Thomas Eckart Natural Language Processing Group University of Leipzig, Germany teckart@informatik.uni-leipzig.de Tobias Gradl Media Informatics

More information

HUMBOLDT Application Scenario: Protected Areas

HUMBOLDT Application Scenario: Protected Areas CC by Erlend Schei Copyright by Kecko Copyright by Michael Bezzina CC by Gunnar Ries Copyright by Michael Bezzina Copyright by Michael Bezzina Copyright by Michael Bezzina CC by fs999 CC by Jordan Nielsen

More information

Construction of a Metadata Database for Efficient Development and Use of Language Resources

Construction of a Metadata Database for Efficient Development and Use of Language Resources Construction of a Metadata Database for Efficient Development and Use of Language Resources Hitomi Tohyama, Shunsuke Kozawa, Kiyotaka Uchimoto, Shigeki Matsubara and Hitoshi Isahara Nagoya University,

More information

Enhanced retrieval using semantic technologies:

Enhanced retrieval using semantic technologies: Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008

More information

Datos abiertos de Interés Lingüístico

Datos abiertos de Interés Lingüístico Datos abiertos de Interés Lingüístico Prof. Dr. Asunción Gómez-Pérez Artificial Intelligence Department Universidad Politécnica de Madrid Campus de Montegancedo sn 28660 Boadilla del Monte, Madrid http://www.oeg-upm.net

More information

Number game Experience of a European research infrastructure (CLARIN) for the analysis of web traffic

Number game Experience of a European research infrastructure (CLARIN) for the analysis of web traffic Number game Experience of a European research infrastructure (CLARIN) for the analysis of web traffic Go Sugimoto (Austrian Center for Digital Humanities) CLARIN Annual Conference 2016, Aix-en-Provence,

More information

LAB 5 Implementing an ALU

LAB 5 Implementing an ALU Goals To Do Design a practical ALU LAB 5 Implementing an ALU Learn how to extract performance numbers (area and speed) Draw a block level diagram of the MIPS 32-bit ALU, based on the description in the

More information

BHL-EUROPE: Biodiversity Heritage Library for Europe. Jana Hoffmann, Henning Scholz

BHL-EUROPE: Biodiversity Heritage Library for Europe. Jana Hoffmann, Henning Scholz Nimis P. L., Vignes Lebbe R. (eds.) Tools for Identifying Biodiversity: Progress and Problems pp. 43-48. ISBN 978-88-8303-295-0. EUT, 2010. BHL-EUROPE: Biodiversity Heritage Library for Europe Jana Hoffmann,

More information

Data Access User Needs Study A User Needs Assessment for the Colorado DOT (CDOT)

Data Access User Needs Study A User Needs Assessment for the Colorado DOT (CDOT) Data Access User Needs Study A User Needs Assessment for the Colorado DOT (CDOT) Lou Henefeld, GIS Support Unit Manager, Colorado DOT Louis.henefeld@dot.state.co.us Allen Ibaugh, CEO, Data Transfer Solutions

More information

TERMINOLOGY MANAGEMENT DURING TRANSLATION PROJECTS: PROFESSIONAL TESTIMONY

TERMINOLOGY MANAGEMENT DURING TRANSLATION PROJECTS: PROFESSIONAL TESTIMONY LINGUACULTURE, 1, 2010 TERMINOLOGY MANAGEMENT DURING TRANSLATION PROJECTS: PROFESSIONAL TESTIMONY Nancy Matis Abstract This article briefly presents an overview of the author's experience regarding the

More information

PID System for eresearch

PID System for eresearch the European Persistant Identifier Gesellschaft für wissenschaftliche Datenverarbeitung mbh Göttingen Am Fassberg, 37077 Göttingen ulrich.schwardmann@gwdg.de IZA/Gesis/RatSWD-WS Persistent Identifiers

More information

Networking European Digital Repositories

Networking European Digital Repositories Networking European Digital Repositories What to Network? Researchers generate knowledge This is going to become an amazing paper I only hope I will be able to access it Knowledge is wrapped in publications

More information

1. General requirements

1. General requirements Title CLARIN B Centre Checklist Version 6 Author(s) Peter Wittenburg, Dieter Van Uytvanck, Thomas Zastrow, Pavel Straňák, Daan Broeder, Florian Schiel, Volker Boehlke, Uwe Reichel, Lene Offersgaard Date

More information