The Virtual Language Observatory
Transcription
1 The Virtual Language Observatory. Dieter Van Uytvanck, CMDI workshop, Nijmegen
2 Overview
- VLO?
- What is behind it? Relation to CMDI?
- How do I get my data in there?
- Demo + exercises
3 Context sketch
- Lots of resources somewhere out there: data collections, corpora, lexica, grammars, multimedia recordings, software, web applications / services
- Old-school linguistic resources: books, articles, CD-ROMs
- It's like a jungle, sometimes...
4 VLO: the idea
- Researcher: where do I start?
- Provide a single entry point giving access to all information
- Because of the large amount of data: drill-down paradigm (decrease search space gradually)
- Multiple ways of exploring: full-text search, facet browsing, geographic overlay
- Unified interface, links to the original context
5 VLO?
- Virtual Language Observatory
- Several parts:
  - Facet browser (real search)
  - Google Earth overlay (visualization)
  - LRT inventory (ad-hoc, last resort metadata entry)
6 Facets?
- A simple way to narrow down the search space, step by step
- Values offered are dynamic: they change with every previous selection made
- Purpose: quickly navigating through a huge amount of metadata
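The drill-down behaviour described on this slide can be illustrated with a minimal Python sketch. It is not the VLO implementation: the records, field names and values below are invented, and the two helper functions are hypothetical. It only shows how the values offered for a facet shrink after each selection.

```python
from collections import Counter

# Hypothetical metadata records; field names and values are invented for illustration.
RECORDS = [
    {"country": "Nepal", "genre": "conversation", "modality": "spoken"},
    {"country": "Nepal", "genre": "narrative", "modality": "spoken"},
    {"country": "Netherlands", "genre": "conversation", "modality": "written"},
    {"country": "Netherlands", "genre": "conversation", "modality": "spoken"},
]


def select(records, **chosen):
    """Keep only records that match every chosen facet value (AND semantics)."""
    return [r for r in records if all(r.get(f) == v for f, v in chosen.items())]


def facet_counts(records, facet):
    """Count the values still available for one facet in the current result set."""
    return Counter(r[facet] for r in records if facet in r)


# Each selection narrows the result set, so the values offered next change too.
step1 = select(RECORDS, country="Nepal")
step2 = select(step1, genre="conversation")

print(facet_counts(RECORDS, "genre"))  # values offered before any selection
print(facet_counts(step1, "genre"))    # values offered after choosing a country
print(len(step2))                      # records left after the second selection
```

Because every selection is combined with AND, a sketch like this also cannot express OR combinations.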
7 Facets?
- Purpose: quickly navigating through a huge amount of resources
- Also useful for metadata curation
- Not the tool to answer research questions!
8 VLO Faceted Browser (1): http://catalog.clarin.eu/ds/vlo
9 VLO Faceted Browser (2)
10 VLO Faceted Browser (3)
- The metadata analyzed is in CMDI format
- Metadata sources:
  - CMDI files harvested from CLARIN centres
  - CMDI'fied OLAC records (from CLARIN centres and others)
  - CMDI'fied LRT inventory records
- You can get to resources directly from search results
11 Exercises (1)
- Find some resources in the catalogue:
  - Corpus Gysseling
  - Telephone conversation recordings in Nepal
12 Exercises (2)
- Find some resources in the Endangered Languages archive which are:
  - (Spoken) discourse with at least two consultants in Asia
  - Or (spoken) discourse with at least two consultants in a face-to-face conversation
13 Limits
- Inherent limit: simple search
  - No OR combinations possible
  - No sophisticated search operations
- Current limit (to be fixed):
  - Full-text search does not cover all fields, only the ones displayed in the VLO
14 CMDI architecture
[Diagram. Labels: metadata catalogue; ISOcat; component registry & editor (metadata modeler); search & semantic mapping (metadata user); metadata editor (metadata creator, metadata curator); joint metadata repository (OAI-PMH service provider); local metadata repository (OAI-PMH data provider, metadata curator); data]
15 Behind the scenes (1)
- Solr + Lucene
- Tomcat web application
- For parsing the CMDI files: VTD-XML
  - Faster than a SAX parser
  - Still full XPath access
  - Memory-efficient (1.3x~1.5x the size of an XML document)
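Since the index behind the browser is Solr, a facet selection plausibly ends up as a Solr query with filter and facet parameters. The sketch below is an assumption, not taken from the VLO code: the core name `vlo-demo`, the field names and the helper `build_facet_query` are hypothetical. Only the request parameters (`q`, `fq`, `facet`, `facet.field`, `rows`, `wt`) are standard Solr ones, and no request is actually sent.

```python
from urllib.parse import urlencode


def build_facet_query(base_url, text, selections, facet_fields, rows=10):
    """Build a Solr select URL: free text in q, facet selections as fq filters."""
    params = [
        ("q", text or "*:*"),   # match everything when no search text is given
        ("rows", rows),
        ("wt", "json"),
        ("facet", "true"),
    ]
    # Ask Solr to return value counts for each facet we want to display.
    params += [("facet.field", field) for field in facet_fields]
    # Each selected value becomes one filter query; Solr ANDs them together.
    params += [("fq", f'{field}:"{value}"') for field, value in selections.items()]
    return f"{base_url}/select?{urlencode(params)}"


# Hypothetical core and field names, for illustration only.
url = build_facet_query(
    "http://localhost:8983/solr/vlo-demo",
    "",
    {"country": "Nepal"},
    ["country", "genre"],
)
print(url)
```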
16 Behind the scenes (2)
17 VLO and ISOcat: natural allies (1)
- The import of metadata files used to be hard-coded
- Now we look at the ISOcat links in the XSDs generated from the CMDI profiles
- Fallback to a hard-coded XPath in case no ISOcat link is found
18 VLO and ISOcat: natural allies (2)
- Import configuration example:
    <facetconcept name="name" allowmultiplevalues="false">
      <concept>
      <concept>
      <concept>
      <!-- no concept in lrt schema -->
      <pattern>/c:cmd/c:components/c:lrtinventoryresource/c:lrtcommon/c:resourcename/text()</pattern>
    </facetconcept>
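The lookup order on these two slides (concept link first, hard-coded path as fallback) can be sketched in Python as follows. This is an illustration under stated assumptions, not the importer itself: the concept URI is a placeholder rather than a real ISOcat identifier, the element names and both helper functions are hypothetical, and XML namespaces are left out to keep the paths short.

```python
import xml.etree.ElementTree as ET

# Hypothetical facet configuration: concept links to look for, plus a fallback path.
FACETS = {
    "name": {
        "concepts": ["http://example.org/datcat/DC-0001"],  # placeholder URI
        "fallback": "./Components/LrtInventoryResource/LrtCommon/ResourceName",
    },
}


def resolve_path(facet, concept_paths):
    """Use the path linked to a known concept in this profile, else the fallback."""
    cfg = FACETS[facet]
    for concept in cfg["concepts"]:
        if concept in concept_paths:
            return concept_paths[concept]
    return cfg["fallback"]


def extract(facet, record_xml, concept_paths):
    """Return the facet value found in one metadata record, or None."""
    root = ET.fromstring(record_xml)
    node = root.find(resolve_path(facet, concept_paths))
    return node.text.strip() if node is not None and node.text else None


# A profile whose schema links the title element to the concept.
corpus_profile = {"http://example.org/datcat/DC-0001": "./Components/Corpus/Title"}
corpus_record = "<CMD><Components><Corpus><Title>Demo corpus</Title></Corpus></Components></CMD>"

# A profile without any concept link: the hard-coded path is used instead.
lrt_record = (
    "<CMD><Components><LrtInventoryResource><LrtCommon>"
    "<ResourceName>Demo tool</ResourceName>"
    "</LrtCommon></LrtInventoryResource></Components></CMD>"
)

print(extract("name", corpus_record, corpus_profile))
print(extract("name", lrt_record, {}))
```

The point of the design is that a new profile needs no importer change as long as its schema carries the concept link; only profiles without links need a hand-written pattern.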
19 How do I get my metadata in there?
- Provide it as CMDI over OAI-PMH
- If that is not possible:
  - Provide it as OLAC over OAI-PMH
  - Provide it as IMDI over OAI-PMH
- If that is not possible either:
  - Enter it into the LRT inventory
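As a rough illustration of the preference order above, the following Python sketch builds an OAI-PMH `ListRecords` request for the best format a repository offers. The `verb`, `metadataPrefix` and `set` parameters come from the OAI-PMH protocol; the endpoint URL, the prefix names `cmdi`, `olac` and `imdi` (each repository chooses its own prefixes) and both helper functions are assumptions made for this example.

```python
from urllib.parse import urlencode

# Preference order from the slide; the prefix names themselves are assumed.
PREFERRED_FORMATS = ["cmdi", "olac", "imdi"]


def pick_format(offered):
    """Return the most preferred metadata prefix the repository offers, or None."""
    for prefix in PREFERRED_FORMATS:
        if prefix in offered:
            return prefix
    return None


def list_records_url(endpoint, metadata_prefix, set_spec=None):
    """Build an OAI-PMH ListRecords request URL for one metadata format."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return f"{endpoint}?{urlencode(params)}"


# Hypothetical endpoint that offers Dublin Core and OLAC, but no CMDI.
prefix = pick_format({"oai_dc", "olac"})
print(list_records_url("https://repository.example.org/oai", prefix))
```

When `pick_format` returns None, none of the three formats is available over OAI-PMH, which corresponds to the last-resort case on the slide.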
20 [Diagram. Labels: definitions (Order, Specimen, Habitat) in ISOcat (ISOcat.org); XSD files (Profile 1, Profile 2, Profile 3) from the component registry; CMDI files (Instance 1, Instance 2, Instance 3) in the metadata repository; ingester (XPath = data category); VLO]
21 Recent additions
- Links to language information: WALS, Wikipedia, Ethnologue, LinguistList and the VLO
- Descriptions in the record listing
- National Project facet
- Feedback link
22 Still to come
- A faceted browser is only as good as its data, so curation steps are needed
- More CMDI metadata
- Some more facets, e.g. year
- Human-readable hdl links
- Interface improvements
23 Questions?
- Ask them now
- Or send a mail to vlw@clarin.eu
- More information: