The challenge of collecting and evaluating LRs for commercial use
1 Language Technologies Observatory
- The challenge of collecting and evaluating LRs for commercial use
- Bente Maegaard, CLARIN ERIC (and University of Copenhagen)
2 Overview of the challenges
- Which LRs are needed?
- Where to find them
- Quality measures
- Improving LRs
- Awareness
3 LRs needed
- (Many more) language resources for commercial applications are needed!
- Large-scale LRs, e.g. for SMT
- For many languages and domains
- Which languages? All European languages and many important international languages
- Which domains? It is difficult to point to particular domains that are suffering, as so little is available
- LRs are scarce and information about them is scattered
- LTO has created the LR Catalogue
4 Related work: this is not a challenge!
- Related work: META-SHARE, ELRA, CLARIN VLO, OPUS, TAUS, PANACEA, etc.
- The LTO focus is on LRs for commercial use
5 Evaluating LRs, usability
- There is no shared method for evaluating the quality of an LR
- ELRA has a validation procedure
- (Operational) usability is easier to define
6 Elements of quality/usability
- Ease of access
- Ease of download and implementation
- Domain relevance
- Languages available
- Source and target language? Is it really a translation? Cf. multilingual websites
- Is it recent and/or updated?
- Availability/cost
- Sometimes size can be a quality
7 Selection criteria for the LTO catalogue
- Availability
- Languages covered
- Validation
- Longevity (is it updated?)
- Ease of download
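The five criteria above can be read as a checklist applied to each candidate resource. The following is only a minimal sketch of that idea: the slides do not say how each criterion is measured, so the boolean flags, the dictionary keys and the function name `passes_selection` are all hypothetical and chosen purely for illustration.

```python
# Illustrative sketch only: every key name below is an assumption,
# not part of the LTO catalogue or any published schema.

def passes_selection(resource: dict, wanted_languages: set) -> bool:
    """Return True if a candidate LR satisfies all five slide criteria."""
    checks = [
        # Availability: can the resource be used commercially?
        resource.get("commercially_available", False),
        # Languages covered: at least one wanted language is present.
        bool(wanted_languages & set(resource.get("languages", []))),
        # Validation: has the resource been validated in some way?
        resource.get("validated", False),
        # Longevity: is the resource still being updated?
        resource.get("updated", False),
        # Ease of download.
        resource.get("easy_download", False),
    ]
    return all(checks)


candidates = [
    {
        "title": "Hypothetical EN-DA parallel corpus",
        "languages": ["en", "da"],
        "commercially_available": True,
        "validated": True,
        "updated": True,
        "easy_download": True,
    },
    {
        "title": "Hypothetical academic-only corpus",
        "languages": ["en", "de"],
        "commercially_available": False,
        "validated": True,
        "updated": False,
        "easy_download": True,
    },
]

selected = [c["title"] for c in candidates if passes_selection(c, {"da", "de"})]
print(selected)
```

In practice the criteria would probably be weighed rather than applied as strict yes/no gates; the all-or-nothing rule here is just the simplest reading of the slide.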
8 Metadata
- Metadata are indispensable
- Nothing can be found without metadata
- Time-consuming: find a good balance
9 LTO metadata
- Title
- Creator
- Contact person
- Description
- Language (mono-, bi- )
- Resource type
- Availability
- Tags
- Modality - text
- URL
- Domain
- Format
- Format descr.
- Size
- Production date
- Comment
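As an illustration, the metadata elements listed on this slide could be held in a simple record type. This is a sketch under stated assumptions, not the LTO catalogue's actual schema: the class name `LTORecord`, the Python field names, the field types and the helper `missing_fields` are all hypothetical, and only the list of elements comes from the slide.

```python
from dataclasses import dataclass, field, fields
from typing import List


@dataclass
class LTORecord:
    """Hypothetical record mirroring the metadata elements on the slide."""
    title: str
    creator: str = ""
    contact_person: str = ""
    description: str = ""
    language: List[str] = field(default_factory=list)  # mono-, bi-, ...
    resource_type: str = ""
    availability: str = ""
    tags: List[str] = field(default_factory=list)
    modality: str = "text"  # the slide lists text as the modality
    url: str = ""
    domain: str = ""
    format: str = ""
    format_description: str = ""
    size: str = ""
    production_date: str = ""
    comment: str = ""


def missing_fields(record: LTORecord) -> List[str]:
    """Return the names of fields that are still empty."""
    return [f.name for f in fields(record) if getattr(record, f.name) in ("", [])]


example = LTORecord(
    title="Hypothetical parallel corpus",
    language=["en", "da"],
    url="https://example.org/corpus",  # placeholder URL
)
print(missing_fields(example))
```

A helper such as `missing_fields` shows how incomplete descriptions could be spotted before a resource is added to a catalogue.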
10 Challenge: insufficient metadata - Valorisation process
- Many LRs detected could not go directly into the LTO Catalogue
- Insufficient metadata
- Metadata have been added/adapted/improved
11 Challenge: creation of new resources. Tools for new resources
- If LRs don't exist, create them!
- Tools for corpus creation
- Tools for terminology extraction
- These have been investigated and guides written
- IPR is a problem
12 Number of parallel corpora per language [figure]
13 Results of data collection investigations - challenges
- Many LRs exist for academic purposes
- Not available for commercial use
- Companies don't share
- Many bilingual corpora are also multilingual
- Tools exist which help the creation of new LRs
- BUT: IPR rules make it very time-consuming to obtain the rights to use what you find on the web. Fair use would really help!
- Only small developments compared to META- White Papers
14 Results of implementation
- The LTO catalogue exists
- The LTO catalogue only contains references to useful LRs
- We are happy to learn about more LRs that could be part of the LTO catalogue
15 Gap description: summary of challenges
- Coverage gap: languages, domains
- Awareness gap: the demand side is often unaware of what actually exists, which makes the LTO Catalogue particularly useful; the supply side does not know which LRs are most needed
- Quality gap: valorisation is needed
- Quantity gap: large quantities are needed for modern LT
16 Thank you!
- LT-Observatory: Observatory for LR and MT in Europe
- Funded by the