Annotation Services to Support Collaborative Development of Scholarly Editions

Size: px
Start display at page:

Download "Annotation Services to Support Collaborative Development of Scholarly Editions"

Transcription

1 Annotation Services to Support Collaborative Development of Scholarly Editions Open Annotation Collaboration (OAC) Annotation Demonstration Experiment Report Anna Gerber and Jane Hunter ITEE, The University of Queensland 1. Executive Summary The aim of this experiment has been to evaluate and demonstrate the applicability of OAC in the context of collaborative scholarly editions of literary works. Several specialised types of annotation supporting the scholarly editing process were identified and modelled using the OAC beta data model. A user interface, which displays multiple versions of a literary work with textual transcriptions and facsimiles presented side- by- side was developed, and the Web- browser- based Aus- e- Lit LORE annotation tool 1 was extended to use the OAC model to allow segments of facsimile images and transcriptions, and variations between versions of a work to be annotated with those specialised annotation types. The LOREstore repository was also developed, to store annotations as RDF graphs and to support search, display, retrieval and SPARQL querying of OAC annotations. This report describes the use cases addressed, provides examples of the specialised annotation types that were identified to support the collaborative development of scholarly editions, and discusses the lessons learned from building annotation tools and services using the OAC beta model, during the course of this demonstration experiment. 2. Use Case Context Scholarly editions are the outcome of detailed study of a specific literary work or collection of shorter works such as poems or short stories. When preparing a scholarly edition, scholarly editors aim to provide a comprehensive description of the history of the literary work(s) including information about significant versions and physical forms. The IFLA FRBR model 2 can be used as a bibliographical foundation for describing these versions. In addition to a textual essay, editorial decisions are argued in textual notes, and a textual apparatus is 1 e- lit/ 2 requirements- for- bibliographic- records

2 compiled to record the alterations made between different versions of the work. Annotations can be used to document these textual notes and variations, and can provide an additional layer of information about the documents being studied, and the people or organisations who were involved in the production of the work over time. Annotations in the form of explanatory notes may also address the content of the text, identifying such things as allusions to other works, historical contexts and stylistic significance. Modern scholarly editions are increasingly collaborative ventures with editors, advisers and editorial board members dispersed globally, so there is a critical need for Web- based tools and services to support the collaborative development of scholarly editions by distributed scholars. Although tools such as MEDITE 3, Juxta 4, CollateX 5 and MVD GUI 6 facilitate comparing and displaying variants across multiple digitised versions of a work, there is no established, common model or tool for documenting, sharing or replying to annotations on specific variants within a collaborative edition. Hence the key use cases for this experiment focus on enabling editors to create, share and reply to scholarly commentary attached to variations between versions (FRBR manifestations) of a particular work and to parts of the text (i.e. a segment of a transcription) or facsimile image). Editors also need to be able to retrieve annotations (including their own and those created by collaborators) across versions, through search and querying by target resource. For example, users want to retrieve and display all annotations associated with a particular transcription or image (that is part of an edition) and display them in parallel within the Web browser. 3. Annotation Types We characterised the types of annotations that may be attached to selections of transcriptions, facsimiles and variants to support the production of apparatus and scholarly commentary during development of a collaborative scholarly edition as follows: Variation Annotation Purpose Describe textual variation between versions of a work. Description The scholarly commentary attached to textual variation describes metadata properties such as the date when the original variation occurred and the agent responsible for the change, and also allows links to additional resources such as part of a manuscript image, or supporting documentary evidence. Variation Annotations are eventually published as part of the content a scholarly edition. Implementation We define VariationAnnotation as a subtype of oac:annotation, with a structured data body (the target URI identifies an ORE 3 poleia.lip6.fr/~ganascia/medite_project

3 Resource Map expressed as RDF/XML) Textual Note Purpose Description Implementation Explanatory Note Purpose Description Implementation Comment Purpose Description Implementation Document or provide support for editorial decisions. These will be published as part of an edition. TextualNote is a subtype of oac:annotation, which may target a variant, or any segment of a transcription or facsimile image. Provide explanatory commentary on selected characters, words, paragraphs, sections etc. These will be published as part of an edition. These may, for example, define an obscure word, provide historical context, or may identify a person, place, event or some other allusion in the text, and so we can characterise these further into the following types o Literary Allusion o Classical Allusion o Biblical Allusion o Glossary o Historical Note o Bibliographical Note ExplanatoryNote is a subtype of oac:annotation. Explanatory Notes typically target a segment of a transcription or facsimile image. To facilitate communication between collaborating editors during the editorial process. These are intended for communication during the editing process, and so they will not be published as part of an edition. We use the oac:annotation class Reply Purpose Description Implementation To facilitate communication between collaborating editors during the editorial process. Replies can be attached to any of the above annotation types, and are not published as part of an edition. We use the oac:reply class 4. Example Annotations This section provides examples of typical annotations produced during the experiment. Except for variation annotations, which use an ORE Resource Map body, our annotation tool creates inline text or HTML bodies for all annotation types. We also allow semantic or free- text tags to be attached to any of these annotation types in addition to the commentary e.g. for tagging places, people,

4 events, subjects etc. RDF serialisations for these examples are provided in Appendix A. Figure 1: Explanatory Note (Biblical Allusion) Figure 2: Variation Annotation between two versions of The Buln Buln and the Brolga

5 Figure 3: Textual Note and Reply 5. Summary of Progress 5.1 Accomplishments The LOREstore repository was developed to store annotations as RDF named graphs. We also use it to store data bodies for variation annotations (stored as ORE Resource Maps). It provides a Web interface supporting annotation search and display, repository content and user account management, and a REST API supporting Create, Read, Update and Delete of annotations as well retrieval by annotation identifier, target, keyword search or SPARQL query. Several example works were established and displayed via a Web interface developed using the nmerge 7 / MVD- GUI tool, which compares versions of a literary work side- by- side, with the option to switch between displaying the facsimile or transcription for each version. The LORE annotation sidebar was extended so that OAC Annotations representing comments, explanatory notes and textual notes on a specific version of a work (e.g. a segment of a transcription or facsimile image), and OAC Annotations describing textual variation between versions of a work, can be created and displayed, and to support annotation of annotations (e.g. as replies). Appendix B provides screenshots of the user interface for viewing transcriptions, facsimiles and textual variation and the annotation sidebar, while Appendix C 7 Desmond Schmidt, nmerge,

6 provides screenshots of the web interface developed for lorestore. Appendix D describes conference presentations relating to the experiment. 5.2 Limitations, delays and failures Although the AustLit collection contains many full text resources, for most works the collection only includes a single version. We needed to spend additional time to source, and sometimes to encode (using TEI) additional versions to use as examples during the experiment. Because of the amount of time required to digitise and encode such documents, we decided to work with small examples e.g. a couple of chapters extracted from a larger work, a single story from an anthology, rather than full examples as might be found in a complete edition. We wanted to support annotation of any resource regardless of location e.g. to allow annotation of transcriptions and facsimiles made available online through libraries and archives. Hence, our annotation tool is implemented as a Web browser extension. We worked on porting the LORE annotation sidebar (originally developed for Firefox), so that it could be installed as an extension for Google Chrome to make the tool available to users who did not wish to use Firefox. This development effort relied upon an experimental sidebar API that was made available through Chrome s developer channel. Unfortunately that sidebar API was discontinued and removed from Chrome, so we have not been able to finish porting the extension, and LORE is at present only available for Firefox, which limits the potential user base. We hope to complete this work when a replacement for the sidebar API for Chrome becomes available in the future. 5.3 Supplemental Ontologies Required and Recommendations We used the following ontologies in combination with the OAC model: Dublin Core: creator, created, modified, title, description for annotations and annotation bodies FOAF: for creator metadata FRBR in RDF 8 : for bibliographic metadata for Works, Expressions, Manifestations Annotation Ontology v : PrefixPostfixSelector for identifying segments of text We also developed our own ontology describing the subclasses of oac:annotation listed in Section 3, as well as custom properties to record variation metadata, and to relate digital surrogates (transcriptions and facsimiles) to FRBR entities. Identifying FRBR (and other non- information) entities consistently remains an unresolved issue for interoperable annotations across distributed scholarly editions, annotation systems and content servers, for example, to enable all annotations for the 1603 edition of Hamlet to be retrieved and displayed. We assigned our own local identifiers for FRBR entities for this experiment, but community agreement on conventions for generating or mapping between such ontology/wiki/understandingao

7 identifiers, or use of a name authority will be necessary to achieve seamless sharing of these annotations across systems. We used XPointers, W3C media fragments and AO PrefixPostfixSelectors for describing segments of transcriptions and facsimiles. Describing segments of digital transcriptions and facsimiles independently of their media type or format remains a key challenge to interoperability. This may be achieved through use of a schema that allows segments to be described by line, paragraph and page references; prefix- postfix notation; content offsets, etc., but this requires community agreement. Existing standards for addressing sections of texts apply to specific formats (such as TEI) only. 6. Discussion of Results and Conclusions 6.1 Technical Lessons Learned Initially, we defined many annotation subclasses, for example, to represent different types of Allusion, Historical, Bibliographical notes etc. As the number of subclasses increased, it became impractical to enumerate all possible types in each query, so we tried switching the Storage and Inferencing Layer (SAIL) within the RDF repository to one that supports type inferencing. However, this choice impacts on performance and scalability of the annotation repository, and many highly scalable SAILs do not support inferencing. Type inferencing is also not supported in the JavaScript library that we are using within our annotation client. To avoid the need for type inferencing altogether, we modified the annotation client to explicitly assert that the type of each annotation is oac:annotation by adding an additional rdf:type property. The downside of this approach is that we are storing redundant type information for most annotations. Eventually, we decided to reduce the number of annotation subclasses, and we discuss guidelines for when to subclass in Section The main technical weaknesses with the OAC model that we identified are the complexity of the model for common, basic use cases and that there is more than one way to represent certain information, which will increase the development effort required to produce tools that fully implement the model. For example, for annotations on part of an image, the image segment can be specified using an SVG constraint (with a constrains relationship to the image URI) or using a media fragment identifier in the target URI (with an ispartof relationship to the image URI). In both cases there is no direct link from the annotation object to the image URI. Queries to retrieve all annotations on a given image must examine the target URI (for whole of image annotations), as well as URIs related via these two properties. However, it is the flexibility afforded by this complexity that enables the OAC model to represent complex scholarly annotation use cases, such as those we have presented in this report. It is in the application to such use cases, that simpler models like Annotea have proven to be limited in their capabilities.

8 6.2 Additional use cases identified Create and attribute bodies from existing (offline) content The scholarly editors collaborating with us during this experiment provided scholarly commentary in the form of books and Word documents that we wished to reuse for annotation bodies. While we were creating annotations to represent this content, we realized that supporting import of annotation bodies from existing content, while maintaining the original authorship so that the body content is properly attributed, is an additional use case. It is important to preserve the authorship of the annotation as well, because the annotator has contributed by selecting a segment of a text as the annotation target, and selecting what they consider to be a relevant part of some existing commentary to attach as the annotation body. The OAC model does allow separate authorship for the annotation, target and body, even if the body is provided as inline content with the annotation. We have added this metadata to annotations as required, through the LOREstore content management web interface, and we will extend our annotation client to support display and management of this additional metadata in the future. Attach geographical co-ordinates Attaching geographical co- ordinates to build a map or list of places, as included in some print editions was identified as an additional use case from user feedback during the experiment. We will investigate supporting this use case as future work. Relate multiple resources that are not associated with textual variation During the experiment, users of the annotation tool began to create variation annotations to relate multiple resources, even when the target was not textual variation. It became apparent that the distinction between Textual Note and Variation Annotation is artificial (the only difference being whether the body is RDF). This suggests that we should modify our annotation tool and rethink our annotation class hierarchy, for example, to select a more generic type instead of VariationAnnotation (e.g. oac:dataannotation) so that both explanatory notes and textual notes can support data bodies to link multiple targets. Export annotations to publication formats Users requested that they be able to export annotations to a Word document, PDF or epub, for archiving and sharing offline. We developed an export to Word feature for our annotation tool in response to these requests. This new use case suggests that it would be useful to investigate how to embed OAC annotation metadata within non- Web- based publication formats e.g. using RDFa, to provide an alternative method of interchange, rather than just a one- way export. 6.3 Modelling insights The main strengths of the OAC model for representing annotations for collaborative scholarly editions that we have identified during this experiment are as follows: The OAC model supports multiple targets, and each can be associated with a constraint for specifying the segment of interest, which means that we can

9 create interoperable annotations describing textual variation across multiple versions without extending the model or creating aggregate targets. By comparison, within the Annotea model there is no mechanism for associating an Annotea context (used for selecting the segment of a text) with a target, so when mapping variation annotations to Annotea, we used non- standard extensions, which rely on naming conventions rather than explicit semantic relationships. Targets and bodies can be any media type and can be located on any server. This flexibility means that we can directly annotate digitised materials (e.g. facsimile images) that have been made available through external archives and libraries. It also allows us to create RDF data annotation bodies, so that metadata properties associated with the body can be stored separately rather than included in the annotation graph, making the provenance of those statements clear. Because the OAC model is RDF- based, it is a trivial exercise to extend the model and include properties from existing ontologies within the annotation graph. We use custom properties to link target documents to FRBR entities, allowing us to query and retrieve annotations across multiple versions of the same FRBR expression or work Subclassing of oac:annotation This experiment has clarified our thoughts about when to subclass oac:annotation as opposed to using a tag or some other property to categorise annotations. We recommend subclassing oac:annotation under the following circumstances: For a small, fixed set of types; When the annotation types represent distinct concepts in the domain, and there is a requirement to provide search, filtering, or different presentation or handling of annotations according to those types. For example, printed scholarly editions almost always present explanatory notes in a standalone chapter or appendix, while descriptions of textual variation and textual notes are often published as apparatus in footnotes or on the opposite page to the text. Some editions also include additional sections for glossaries, and for place names with a map, so these would be candidates for subclasses. When the type relates to the semantics of the annotates relationship between the body and the target (e.g. for an explanatory note, the body provides an explanation for the target selection, and for a glossary entry, the body is a definition of the target selection). Conversely, we recommend using tags rather than a subclass to distinguish annotations when: The list of types is large or requires frequent reorganisation or extension; Interpretation is required in order to determine which type applies or multiple types may apply to individual annotations; or The type relates to the content of either the body or the target and not the nature of the relationship between body and target. Based on these insights, we should support tagging explanatory notes as classical, biblical, historical, mythological allusion etc., because allusion is about

10 the target only and is subjective, so it makes sense to create the tag as part of the annotation body rather than using an annotation subclass. The annotation types that could be represented as subclasses for our experiment include TextualNote and ExplanatoryNote, with subclasses Glossary and GeographicalNote. Modifying our annotation tool and repository to support using tags to distinguish annotation types e.g. for types of allusion, remains work to be completed in the future Aggregate Targets vs. Bodies Another modelling issue that we grappled with was whether to use aggregate targets when annotating differences between multiple versions of a work. Our initial approach was to create aggregate targets for Variation Annotations, however, we soon realised that the ordering and relationships that were being asserted in the aggregation by the annotator information that was not recorded prior to the creation of the annotation and which is subject to interpretation actually constituted part of the scholarly commentary content of the annotation, so we decided to use an aggregate body instead. Using multiple targets for the versions rather than an aggregation also greatly simplified the SPARQL queries for retrieving annotations by target (the most common query generated via our annotation tool) Referencing vs. targeting We have found it useful to remember that an oac:annotation reifies an annotates relationship between body and target, and that the body should be about the target(s) as a guideline for when to target vs. when to reference a resource within an annotation. For example, when creating an annotation that asserts that a publisher s editor made a change that occurred between two versions of a work, we may wish to link to some correspondence between the author and publisher so that a future reviewer of the edition will be able to view this evidence. The annotation would target the two versions of the work, however, the commentary is not really about the correspondence, and hence we would reference the letter using a dc:relation property in the variation annotation body, rather than including it as another target of the annotation Conclusion Our experiment has demonstrated that the OAC model can be applied to collaborative authoring of electronic scholarly editions to: support annotations to enable discussion; document textual variation between multiple versions of a work; attach scholarly commentary in the form of explanatory and textual notes. Acknowledgements The authors particularly wish acknowledge the valuable contributions to this experiment made by Dr Roger Osborne (UQ), Professor Paul Eggert (UNSW) and Professor Tim Dolin (Curtin University).

11 Appendix A Annotation RDF Serialisations The following prefixes apply for all rdf: < rdf- syntax- dc: cnt: dcterms: foaf: oac: < Explanatory Note from Figure 1 < dc:language "en" ; dc:title "Go thou and do likewise" ; dcterms:created " T13:32: :00"^^dcterms:W3CDTF; oac:hasbody <urn:uuid:f A0AA- 732F20FCAFEA> ; a < ; a < annotation- ns#explanatorynote> ; oac:hastarget rphy&version1=2#xpointer(string- range(id("id "), "", 431, 26))> dcterms:creator < <urn:uuid:f A0AA- 732F20FCAFEA> a < ; cnt:characterencoding "UTF- 8" ; cnt:chars "Cf. 'Go and do thou likewise' (Luke 10:37), but in the context of procreation, perhaps a faint echo of Be fruitful, and multiply, and replenish the earth (Genesis 1:28)". rphy&version1=2#xpointer(string- range(id("id "), "", 431, 26))> dcterms:ispartof rphy&version1=2>. < a < ; foaf:name "Roger Osborne". Variation Annotation from Figure 2 < dc:title "trying to turn] TSb A running in TSa" ; dcterms:created " T17:35: :00"^^dcterms:W3CDTF ; oac:hasbody < ; a < ; a < annotation- ns#variationannotation> ; oac:hastarget rphy&version1=1#xpointer(string- range(id("id "), "", 145, 10))> ; oac:hastarget rphy&version1=2#xpointer(string- range(id("id "), "", 245, 14))> dcterms:creator < < dc:format "application/rdf+xml".

12 Body for Variation Annotation: < a < ; ore:describes < ; dc:creator "Anna Gerber" ; dc:desciption "Furphy appears to have tested a revision on TSa, by striking through 'running in a single steer' and beginning a replacement that started with 'shouldering' before leaving the revision alone. By the time Furphy completed the BBB TS, 'trying to turn' had been inserted. This suggests that another document lies between TSb and TSa. " ; lit:variation- agent "Joseph Furphy". < a < ; ore:aggregates rphy&version1=1#xpointer(string- range(id(%22id %22),%20%22%22,%20145,%2010))> ; ore:aggregates rphy&version1=2#xpointer(string- range(id(%22id %22),%20%22%22,%20245,%2014))>. rphy&version1=1#xpointer(string- range(id(%22id %22),%20%22%22,%20145,%2010))> dcterms:ispartof rphy&version1=1> ; dc:title "typescript". rphy&version1=2#xpointer(string- range(id(%22id %22),%20%22%22,%20245,%2014))> dcterms:ispartof rphy&version1=2> dc:title "1948". rphy&version1=2> lit:surrogatefor < ; lit:isvariantof rphy&version1=1>. rphy&version1=1> lit:surrogatefor < Textual Note from Figure 3 < dc:language "en" ; dc:title "Amen" ; dcterms:created " T15:54: :00"^^dcterms:W3CDTF ; oac:hasbody <urn:uuid:b C46A- 4DD9- BA3E- 4FB8B4D36BC4> ; a < ; a < annotation- ns#textualnote> ;

13 oac:hastarget rphy&version1=2#xpointer(string- range(id("id "), "", 887, 4))> ; dcterms:creator < <urn:uuid:b C46A- 4DD9- BA3E- 4FB8B4D36BC4> a < ; cnt:characterencoding "UTF- 8" ; cnt:chars "TS continues with a long paragraph in which Jeff Rigby, who was dropped from BB and SIL, advises Mrs Falkland- Pritchard on her career as an authoress. Rigby identifies Dickens, Rousseau s Social Contract, Paine s Rights of Man, Uncle Tom s Cabin and Don Quixote as works that marked an epoch and brought about moral revolution. Some of the exchanges originally given to Rigby in TS fall to Tom Collins, but never any serious moralising about life or art.". rphy&version1=2#xpointer(string- range(id("id "), "", 887, 4))> dcterms:ispartof rphy&version1=2>. Reply from Figure 4 < dc:language "en" ; dc:title "Re: Amen" ; dcterms:created " T16:34: :00"^^dcterms:W3CDTF ; oac:hasbody <urn:uuid:e20d5767-4c0b D6B- 20C72560E418> ; a < ; oac:hastarget < ; dcterms:creator < <urn:uuid:e20d5767-4c0b D6B- 20C72560E418> a < ; cnt:characterencoding "UTF- 8" ; cnt:chars "While not deemed suitable for The Buln Buln and the Brolga, this passage is significant to the argument of Such is Life (1898). Furphy is much more concerned with exploring the 'fiction of facts' and the 'facts of fiction' in the typescript version. Returned to their previous context, the unrevised sections of the Buln Buln and the Brolga perform a different function in a significantly different narrative.". Appendix B Annotation Tool Screenshots The following screenshots illustrate some typical uses of the LORE Annotation sidebar: discussion between collaborators in the form of comments and replies, viewing annotations attached to a single version or variation between two versions of a work, and creating an annotation.

14 Figure B.1: Scholarly discussion through Comments and Replies Figure B.2: UI for creating a variation annotation

15 Figure B.3: Annotations on a single version of a Work Figure B.4: In-browser variation annotation display

16 Figure B.5: Creating an Explanatory Note using the experimental annotation sidebar for Chrome Appendix C Repository Screenshots The lorestore repository was developed for this experiment, to support storage, search and display of OAC annotations and ORE Resource Maps used for variation annotation bodies. The source code for the lorestore repository has been released under a GPL 3.0 open source license, and is available on GitHub at eresearch/lorestore/ A sandbox instance is deployed at Annotation Search and Display The lorestore web interface supports keyword search and search by target. Annotations can be displayed (Fig C.1), and can be retrieved in a variety of formats including TriG, TriX, RDF/XML and JSON, or visualised as a graph (Fig C.2). A SPARQL endpoint and editor (Fig C.3) is also provided to enable custom queries.

17 Figure C.1: Annotation summary display Figure C.2: Graphical annotation display

18 Figure C.3: SPARQL endpoint and query editor REST API A REST API was developed to support creating, reading, updating, deleting and querying of annotations. Documentation and code examples of usage (Fig C.4) are available through the lorestore web interface.

19 Figure C.4: API documentation and usage examples Administration The lorestore repository supports role- based access control and authentication. Annotations and data bodies can be published publicly or privately. Administrators can manage content (Fig C.5) and user accounts (Fig C.6) through the web interface. Figure C.5: Content management functions

20 Figure C.6: User account management functions Appendix D Conference Presentations The following conference presentations have included discussion of or content relating to this annotation demonstration experiment: A. Gerber, R. Osborne, Transforming Communication in Textual Scholarship: Open Annotation for Electronic Editions, Digital Humanities Australasia (DHA) 2012, Canberra, March Slides available: DHA2012- slides- web.pdf The abstract for this presentation is provided as Appendix D.1. R. Osborne, A. Gerber, K. Kilner, "Using LORE", THATCamp Canberra, 7-9 October, 2011, Slides available: THATCamp2011.pdf A. Gerber, "LORE: An open source research tool for Australian literary scholars", linux.conf.au, January, 2011, Brisbane, Australia, Slides available: LCA2011Slides.pdf The following paper has also been accepted and will be presented in July: R. Osborne, A. Gerber, J. Hunter, Ontology- based Annotation for Electronic Editions using the Open Annotation Collaboration (OAC) Data Model, Ontology- based Annotation Workshop, Digital Humanities The abstract is provided as Appendix D.2 (figures have been elided for brevity)

21 Appendix D.1 - Abstract for Paper presented at Digital Humanities Australasia, March 2012 Transforming Communication in Textual Scholarship: Open Annotation for Electronic Editions Anna Gerber & Roger Osborne, The University of Queensland Abstract The Open Annotation Collaboration (OAC) provides a framework for sharing scholarly annotations across clients, servers, collections, applications and architectures. The OAC data model is based on linked data and semantic web principles, and can be tailored to meet the complex scholarly annotation requirements of specific research communities while maintaining interoperability. In this paper, we describe how we have applied the OAC model to support annotation within an electronic edition of Joseph Furphy s Such is Life. When preparing a scholarly edition, the editors aim to provide a comprehensive description of the history of a work, specifically information about significant versions and physical forms. In addition to a substantial textual essay, editorial decisions are argued in textual notes, and a textual apparatus is compiled to record the alterations made between different versions. Modern scholarly editions are frequently collaborative ventures with multiple editors, advisers and an editorial board dispersed globally. The open- source annotation toolkit that we have developed enables editors to relate transcripts with facsimiles; attach textual and explanatory notes to text and image selections; reference secondary sources; record information about textual variations; and to engage in collaborative discussion through comments, questions and replies. The flexibility of the OAC model allows us to use the same toolkit for annotations at all stages of the scholarly editing process, leaving a record of editorial decisions and allowing export for publication in print or electronic form. In 2003, editorial theorist Jerome McGann wrote, In the next fifty years the entirety of our inherited archive of cultural works will have to be reedited within a network of digital storage, access, and dissemination. This system, which is already under development, is transnational and transcultural. Tools such as those being developed for OAC will make a significant contribution to the thought and practical applications that flow from McGann s prediction. Appendix D.2 - Abstract submitted to DH2012 Ontology- Based Annotation Workshop Ontology- based Annotation for Electronic Editions using the Open Annotation Collaboration (OAC) Data Model

22 Roger Osborne, Anna Gerber, Jane Hunter The University of Queensland 1. Introduction Scholarly editions of literary works include significant amounts of information in explanatory notes, textual notes and glossaries. Print- based editions are limited by the amount of page space allocated, but electronic editions can support more comprehensive collections of notes and additional information to supplement the longer historical and textual essays that provide the main scholarly argument about the need for the edition and the validity of the editorial rationale. In an electronic edition, these notes may take the form of annotations. An ontology- based annotation system can extend the usefulness of notes beyond the limits of static, print- based models, and enable their discovery, sharing and re- use via the Web. In an electronic edition that includes facsimiles, transcriptions and collations, annotations provide an extra layer of information about the nature of the documents, the textual content of each document, the textual transmission between documents, and the various people and organisations that played a part in the production of the literary work over time. Annotations can also provide glosses about the text itself, identifying such things as allusions to other works, historical contexts and stylistic significance. Digital images and transcriptions provide a surrogate for the material artefacts held in libraries and archives, enabling the relationships between documents, people and organisations to be efficiently modelled within an ontology- based annotation system. Modelling the relationships between documents, people and organisations makes explicit the many implicit assumptions that exist in the mind of the editor and the intended audience for the edition. It also provides an alternative to text- based communication of standard explanatory notes by supporting graphical and tabular representations of information and by allowing powerful semantic querying, filtering, and faceted browsing within and across electronic editions. The complex range of internal and external relationships that emerge from a scholarly edition not only test the limits of print- based editions, but also test the limits of hierarchical data models. The graph- based, flexible, extensible nature of an ontology- based system is better suited to representing the complete history of literary, philosophical and historical works. In this paper we describe and discuss some of the challenges involved with how we are applying the Open Annotation Collaboration (OAC) data model [1] within the Australian Electronic Scholarly Editing (AustESE) i project to represent these data. Our OAC- based annotation system is enhanced by integration with the IFLA FRBR [2] taxonomy, which provides a solid bibliographical foundation for annotations to traverse all conceptual levels of a work. This benefits the editor by providing a well- structured environment to collect, describe, and analyse a work, but it also benefits readers by providing a wider variety of reading strategies to help them pursue their study of a particular work and its multiple derivative forms.

23 2. Modelling Approach The OAC provides a common data model for representing annotations across tools, architectures and collections. The model, which is expressed as an OWL ontology, is intended to be extensible, so that it can be refined to meet the annotation requirements of specific communities. We have extended the OAC data model with specialised annotation types to support the production of apparatus and commentary within electronic editions by subclassing the oac:annotation class. We categorise annotations as ExplanatoryNotes (providing commentary), TextualNotes (which provide support for editorial decisions), or VariationAnnotations (which describe textual variation between versions of a work). These annotation types can be used in search queries and for filtering and sorting annotations for display and inclusion for print or electronic publication. We have defined additional properties that may be used within the body of a VariationAnnotation, to record metadata about the agent, date or cause of the variation as well as documentary evidence including links to manuscript facsimiles. Within our RDF- based annotation tool and annotation repository ii, we have adopted a Linked Data approach of using HTTP URIs to identify entities that may be referenced within annotations, including documents, agents (people or organisations) and conceptual FRBR entities (Works, Expressions, Manifestations and Items). We use FOAF and Dublin Core to record annotation provenance, and we apply and extend the FRBR ontology iii with properties that relate the transcriptions and corresponding facsimile images that are being annotated. The base oac:annotation and oac:reply types are used in our system to support comments and discussion between collaborating editors content which is not usually considered to be part of the scholarly content of the edition. The flexibility of the OAC model, and particularly its extensibility and support for multiple targets and RDF annotation bodies, allows us to use the same annotation tools and repository at all stages of the scholarly editing process. Semantic tagging can be used in addition to the customised annotation types, to identify annotations that serve different purposes within the editing workflow - ultimately supporting efficient filtering and customised views that can be adapted for different modes of publication or intended audiences. 3. Discussion and Challenges Modern scholarly editions are frequently collaborative ventures with multiple editors, advisers and an editorial board dispersed globally. But to extend electronic editions beyond the closed, finished or abandoned, look- but- don t- touch products, described by Peter Shillingsburg [3], scholarly editing needs to be conducted in collaborative, open- ended electronic environments. Such an environment will support the scholarly editing model advocated by Hans Walter Gabler: one that is predicated on the functional correlation of bodies of material content in a systemics of discourses and argument [4]. Peter Robinson [5] suggests that the future of scholarly editing lies with a network of many servers, all holding different parts of an edition, with many other servers providing a range of services to the readers and scholars interested in this edition. Although the OAC ontology allows us to address these plans by representing the structure

24 of the annotations consistently across a range of tools and servers, we have identified several challenges to interoperability that should be resolved before seamless sharing of annotations within such collaborative, relational and distributed editions can be achieved, including: Identifying entities (e.g. people, documents, works etc.) consistently across annotation systems and content servers, so that queries can retrieve and display all annotations on a given entity. Use of name authorities or community agreement on naming conventions may help to address this issue, however any solution must also be applicable to conceptual entities such as semantic tags and non- extant resources (e.g. missing manuscripts) that may be referenced within annotations. Describing segments of digital transcriptions and facsimiles independently of their media type or format, for example, through use of a common schema that allows segments to be described by line, paragraph and page references; prefix- postfix notation; content offsets, etc. TEI and HTML have addressing schemes but these are low- level and format- specific. Developing strategies to manage the subjectiveness of interpretation that may be involved in deciding how to describe versions of a work in terms of FRBR, and how to relate particular documents to those FRBR entities, particularly when dealing with manuscripts and digital surrogates. One strategy would be to apply semantic inferencing rules to align bibliographic structures between systems. 4. Conclusion By using an ontology- based annotation system to represent knowledge that would normally be assumed of experienced readers of an edition, electronic editions can be made accessible to a wider audience. The ability to search, browse and represent information in graphical and tabular form will greatly assist new readers and novice researchers to navigate the large amounts of information and the complex networks of relationships that are captured in an electronic edition. These features will also benefit scholarly editors by recording the processes of editing in a way that better supports comprehensive checking, verification and review by external bodies. Ultimately, ontology- based annotation systems will enable collaborative, distributed editions to more easily share information across platforms, taking full advantage of the potential of semantic web technology, and accelerating the creation and communication of knowledge. Acknowledgements This work was undertaken as an annotation demonstration experiment through the Open Annotation Collaboration (OAC) and will be further developed for the AustESE project. The OAC is funded by the Andrew W. Mellon Foundation and the partners of the collaboration. The AustESE project is funded through the Australian National eresearch Collaboration Tools and Resources (NeCTAR) eresearch tools program. References [1] R. Sanderson and H. Van De Sompel, Open Annotation: Beta Data Model Guide, [Online]. Available: [Accessed: 12-Apr-2012].

25 [2] IFLA Study Group on the Functional Requirements for Bibliographic Records, Functional requirements for bibliographic records : final report, UBCIM publications, new series, vol. 19, [3] P. Shillingsburg, How Literary Works Exist: Implied, Represented, and Interpreted, in Text and Genre in Reconstruction: Effects of Digitalization on Ideas, Behaviours, Products and Institutions, W. Mccarty, Ed. Cambridge: OpenBook Publishers, 2010, pp [4] H. W. Gabler, Theorizing the Digital Scholarly Edition, Literature Compass, vol. 7, no. 2, pp , [5] P. M. W. Robinson, Towards a Scholarly Editing System for the Next Decades, Lecture Notes in Computer Science, vol. 5402, pp , i AustESE Project: ii lorestore eresearch/lorestore/ iii Expression of Core FRBR Concepts in RDF

LORE: A Compound Object Authoring and Publishing Tool for Literary Scholars based on the FRBR. Anna Gerber, Jane Hunter

LORE: A Compound Object Authoring and Publishing Tool for Literary Scholars based on the FRBR. Anna Gerber, Jane Hunter LORE: A Compound Object Authoring and Publishing Tool for Literary Scholars based on the FRBR Anna Gerber, Jane Hunter Open Repositories 2009 Overview LORE: Literature Object Reuse and Exchange Background

More information

LORE: A Compound Object Authoring and Publishing Tool for the Australian Literature Studies Community

LORE: A Compound Object Authoring and Publishing Tool for the Australian Literature Studies Community LORE: A Compound Object Authoring and Publishing Tool for the Australian Literature Studies Community Anna Gerber 1, Jane Hunter 1 1 University of Queensland St Lucia, Queensland, Australia (617) 3365

More information

Metadata Workshop 3 March 2006 Part 1

Metadata Workshop 3 March 2006 Part 1 Metadata Workshop 3 March 2006 Part 1 Metadata overview and guidelines Amelia Breytenbach Ria Groenewald What metadata is Overview Types of metadata and their importance How metadata is stored, what metadata

More information

The RMap Project: Linking the Products of Research and Scholarly Communication Tim DiLauro

The RMap Project: Linking the Products of Research and Scholarly Communication Tim DiLauro The RMap Project: Linking the Products of Research and Scholarly Communication 2015 04 22 Tim DiLauro Motivation Compound objects fast becoming the norm for outputs of scholarly communication.

More information

The CEN Metalex Naming Convention

The CEN Metalex Naming Convention The CEN Metalex Naming Convention Fabio Vitali University of Bologna CEN Metalex CEN Metalex has been an international effort to create an interchange format between national XML formats for legislation.

More information

Contents. G52IWS: The Semantic Web. The Semantic Web. Semantic web elements. Semantic Web technologies. Semantic Web Services

Contents. G52IWS: The Semantic Web. The Semantic Web. Semantic web elements. Semantic Web technologies. Semantic Web Services Contents G52IWS: The Semantic Web Chris Greenhalgh 2007-11-10 Introduction to the Semantic Web Semantic Web technologies Overview RDF OWL Semantic Web Services Concluding comments 1 See Developing Semantic

More information

Semantic Web Fundamentals

Semantic Web Fundamentals Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2018/19 with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz January 7 th 2019 Overview What is Semantic Web? Technology

More information

Data Exchange and Conversion Utilities and Tools (DExT)

Data Exchange and Conversion Utilities and Tools (DExT) Data Exchange and Conversion Utilities and Tools (DExT) Louise Corti, Angad Bhat, Herve L Hours UK Data Archive CAQDAS Conference, April 2007 An exchange format for qualitative data Data exchange models

More information

DC-Text - a simple text-based format for DC metadata

DC-Text - a simple text-based format for DC metadata DC-Text - a simple text-based format for DC metadata Pete Johnston Eduserv Foundation Tel: +44 1225 474323 pete.johnston@eduserv.org.uk Andy Powell Eduserv Foundation Tel: +44 1225 474319 andy.powell@eduserv.org.uk

More information

COLLABORATIVE EUROPEAN DIGITAL ARCHIVE INFRASTRUCTURE

COLLABORATIVE EUROPEAN DIGITAL ARCHIVE INFRASTRUCTURE COLLABORATIVE EUROPEAN DIGITAL ARCHIVE INFRASTRUCTURE Project Acronym: CENDARI Project Grant No.: 284432 Theme: FP7-INFRASTRUCTURES-2011-1 Project Start Date: 01 February 2012 Project End Date: 31 January

More information

NOTSL Fall Meeting, October 30, 2015 Cuyahoga County Public Library Parma, OH by

NOTSL Fall Meeting, October 30, 2015 Cuyahoga County Public Library Parma, OH by NOTSL Fall Meeting, October 30, 2015 Cuyahoga County Public Library Parma, OH by Roman S. Panchyshyn Catalog Librarian, Assistant Professor Kent State University Libraries This presentation will address

More information

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites

Google indexed 3,3 billion of pages. Google s index contains 8,1 billion of websites Access IT Training 2003 Google indexed 3,3 billion of pages http://searchenginewatch.com/3071371 2005 Google s index contains 8,1 billion of websites http://blog.searchenginewatch.com/050517-075657 Estimated

More information

Adaptable and Adaptive Web Information Systems. Lecture 1: Introduction

Adaptable and Adaptive Web Information Systems. Lecture 1: Introduction Adaptable and Adaptive Web Information Systems School of Computer Science and Information Systems Birkbeck College University of London Lecture 1: Introduction George Magoulas gmagoulas@dcs.bbk.ac.uk October

More information

Annotation & Publishing Standards Work at the W3C

Annotation & Publishing Standards Work at the W3C CNI Fall 2017 Membership Meeting 12 December 2017 - Washington, D.C. Annotation & Publishing Standards Work at the W3C Timothy W. Cole (t-cole3@illinois.edu) University of Illinois at Urbana-Champaign

More information

ResolutionDefinition - PILIN Team Wiki - Trac. Resolve. Retrieve. Reveal Association. Facets. Indirection. Association data. Retrieval Key.

ResolutionDefinition - PILIN Team Wiki - Trac. Resolve. Retrieve. Reveal Association. Facets. Indirection. Association data. Retrieval Key. Resolve. Retrieve. Reveal Association. Facets. Indirection. Association data. Retrieval Key. ResolutionDefinitionBackground 1. Other definitions XRI definition: Resolution is the function of dereferencing

More information

For Attribution: Developing Data Attribution and Citation Practices and Standards

For Attribution: Developing Data Attribution and Citation Practices and Standards For Attribution: Developing Data Attribution and Citation Practices and Standards Board on Research Data and Information Policy and Global Affairs Division National Research Council in collaboration with

More information

Comp 336/436 - Markup Languages. Fall Semester Week 2. Dr Nick Hayward

Comp 336/436 - Markup Languages. Fall Semester Week 2. Dr Nick Hayward Comp 336/436 - Markup Languages Fall Semester 2017 - Week 2 Dr Nick Hayward Digitisation - textual considerations comparable concerns with music in textual digitisation density of data is still a concern

More information

Encoding and Designing for the Swift Poems Project

Encoding and Designing for the Swift Poems Project Encoding and Designing for the Swift Poems Project Jonathan Swift and the Text Encoding Initiative James R. Griffin III Digital Library Developer Lafayette College Libraries Introductions James Woolley

More information

A Rationale for Teaching Hypertext Authoring in Literature Courses

A Rationale for Teaching Hypertext Authoring in Literature Courses Computer Writing and Research Lab White Paper Series: #030822-2 A Rationale for Teaching Hypertext Authoring in Literature Courses David Barndollar dpbarndollar@mail.utexas.edu University of Texas at Austin

More information

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication

University of Bath. Publication date: Document Version Publisher's PDF, also known as Version of record. Link to publication Citation for published version: Patel, M & Duke, M 2004, 'Knowledge Discovery in an Agents Environment' Paper presented at European Semantic Web Symposium 2004, Heraklion, Crete, UK United Kingdom, 9/05/04-11/05/04,.

More information

ANNOTATION STUDIO User s Guide. DRAFT - Version January 2015

ANNOTATION STUDIO User s Guide. DRAFT - Version January 2015 ANNOTATION STUDIO User s Guide DRAFT - Version January 2015 Table of Contents 1. Annotation Studio and How you can use it to improve the classroom experience... 3 2. Description and terminology... 5 2.1

More information

Digital Design of Paper Technologies Workshop

Digital Design of Paper Technologies Workshop Digital Design of Paper Technologies Workshop On the durability and curatibility of Digital Scholarly Editions Daniel Bruder Those who cannot remember the past are condemned to repeat it. Those who cannot

More information

6JSC/Chair/8 25 July 2013 Page 1 of 34. From: Barbara Tillett, JSC Chair To: JSC Subject: Proposals for Subject Relationships

6JSC/Chair/8 25 July 2013 Page 1 of 34. From: Barbara Tillett, JSC Chair To: JSC Subject: Proposals for Subject Relationships Page 1 of 34 From: Barbara Tillett, JSC Chair To: JSC Subject: Proposals for Subject Relationships Related discussion paper and responses: 6JSC/LC rep/3 (May 20, 2011) and responses from ACOC, ALA, BL,

More information

Europeana Data Model. Stefanie Rühle (SUB Göttingen) Slides by Valentine Charles

Europeana Data Model. Stefanie Rühle (SUB Göttingen) Slides by Valentine Charles Europeana Data Model Stefanie Rühle (SUB Göttingen) Slides by Valentine Charles 08th Oct. 2014, DC 2014 Outline Europeana The Europeana Data Model (EDM) Modeling data in EDM Mapping, extensions and refinements

More information

data elements (Delsey, 2003) and by providing empirical data on the actual use of the elements in the entire OCLC WorldCat database.

data elements (Delsey, 2003) and by providing empirical data on the actual use of the elements in the entire OCLC WorldCat database. Shawne D. Miksa, William E. Moen, Gregory Snyder, Serhiy Polyakov, Amy Eklund Texas Center for Digital Knowledge, University of North Texas Denton, Texas, U.S.A. Metadata Assistance of the Functional Requirements

More information

PRINCIPLES AND FUNCTIONAL REQUIREMENTS

PRINCIPLES AND FUNCTIONAL REQUIREMENTS INTERNATIONAL COUNCIL ON ARCHIVES PRINCIPLES AND FUNCTIONAL REQUIREMENTS FOR RECORDS IN ELECTRONIC OFFICE ENVIRONMENTS RECORDKEEPING REQUIREMENTS FOR BUSINESS SYSTEMS THAT DO NOT MANAGE RECORDS OCTOBER

More information

The International Journal of Digital Curation Issue 2, Volume

The International Journal of Digital Curation Issue 2, Volume 28 A Compound Object Authoring and Publishing Tool A Compound Object Authoring and Publishing Tool for Literary Scholars Based on the IFLA-FRBR Model Anna Gerber, Jane Hunter, University of Queensland,

More information

WM2015 Conference, March 15 19, 2015, Phoenix, Arizona, USA

WM2015 Conference, March 15 19, 2015, Phoenix, Arizona, USA OECD NEA Radioactive Waste Repository Metadata Management (RepMet) Initiative (2014-2018) 15614 Claudio Pescatore*, Alexander Carter** *OECD Nuclear Energy Agency 1 (claudio.pescatore@oecd.org) ** Radioactive

More information

KM COLUMN. How to evaluate a content management system. Ask yourself: what are your business goals and needs? JANUARY What this article isn t

KM COLUMN. How to evaluate a content management system. Ask yourself: what are your business goals and needs? JANUARY What this article isn t KM COLUMN JANUARY 2002 How to evaluate a content management system Selecting and implementing a content management system (CMS) will be one of the largest IT projects tackled by many organisations. With

More information

SDMX self-learning package No. 3 Student book. SDMX-ML Messages

SDMX self-learning package No. 3 Student book. SDMX-ML Messages No. 3 Student book SDMX-ML Messages Produced by Eurostat, Directorate B: Statistical Methodologies and Tools Unit B-5: Statistical Information Technologies Last update of content February 2010 Version

More information

Ontology-based Architecture Documentation Approach

Ontology-based Architecture Documentation Approach 4 Ontology-based Architecture Documentation Approach In this chapter we investigate how an ontology can be used for retrieving AK from SA documentation (RQ2). We first give background information on the

More information

PROJECT PERIODIC REPORT

PROJECT PERIODIC REPORT PROJECT PERIODIC REPORT Grant Agreement number: 257403 Project acronym: CUBIST Project title: Combining and Uniting Business Intelligence and Semantic Technologies Funding Scheme: STREP Date of latest

More information

BUILDING THE SEMANTIC WEB

BUILDING THE SEMANTIC WEB BUILDING THE SEMANTIC WEB You might have come across the term Semantic Web Applications often, during talks about the future of Web apps. Check out what this is all about There are two aspects to the possible

More information

Ontology Servers and Metadata Vocabulary Repositories

Ontology Servers and Metadata Vocabulary Repositories Ontology Servers and Metadata Vocabulary Repositories Dr. Manjula Patel Technical Research and Development m.patel@ukoln.ac.uk http://www.ukoln.ac.uk/ Overview agentcities.net deployment grant Background

More information

Semantic Web Fundamentals

Semantic Web Fundamentals Semantic Web Fundamentals Web Technologies (706.704) 3SSt VU WS 2017/18 Vedran Sabol with acknowledgements to P. Höfler, V. Pammer, W. Kienreich ISDS, TU Graz December 11 th 2017 Overview What is Semantic

More information

A Dublin Core Application Profile in the Agricultural Domain

A Dublin Core Application Profile in the Agricultural Domain Proc. Int l. Conf. on Dublin Core and Metadata Applications 2001 A Dublin Core Application Profile in the Agricultural Domain DC-2001 International Conference on Dublin Core and Metadata Applications 2001

More information

Metadata. Week 4 LBSC 671 Creating Information Infrastructures

Metadata. Week 4 LBSC 671 Creating Information Infrastructures Metadata Week 4 LBSC 671 Creating Information Infrastructures Muddiest Points Memory madness Hard drives, DVD s, solid state disks, tape, Digitization Images, audio, video, compression, file names, Where

More information

Linked Data: What Now? Maine Library Association 2017

Linked Data: What Now? Maine Library Association 2017 Linked Data: What Now? Maine Library Association 2017 Linked Data What is Linked Data Linked Data refers to a set of best practices for publishing and connecting structured data on the Web. URIs - Uniform

More information

Library of Congress BIBFRAME Pilot. NOTSL Fall Meeting October 30, 2015

Library of Congress BIBFRAME Pilot. NOTSL Fall Meeting October 30, 2015 Library of Congress BIBFRAME Pilot NOTSL Fall Meeting October 30, 2015 THE BIBFRAME EDITOR AND THE LC PILOT The Semantic Web and Linked Data : a Recap of the Key Concepts Learning Objectives Describe the

More information

Browsing the Semantic Web

Browsing the Semantic Web Proceedings of the 7 th International Conference on Applied Informatics Eger, Hungary, January 28 31, 2007. Vol. 2. pp. 237 245. Browsing the Semantic Web Peter Jeszenszky Faculty of Informatics, University

More information

RDA? GAME ON!! A B C L A / B C C A T S P R E C O N F E R E N C E A P R I L 2 2, : : 0 0 P M

RDA? GAME ON!! A B C L A / B C C A T S P R E C O N F E R E N C E A P R I L 2 2, : : 0 0 P M RDA? GAME ON!! A B C L A / B C C A T S P R E C O N F E R E N C E A P R I L 2 2, 2 0 1 0 1 : 3 0 5 : 0 0 P M S P E A K E R S : L I N D A W O O D C O C K ( K W A N T L E N P O L Y T E C H N I C U N I V E

More information

Profiles Research Networking Software API Guide

Profiles Research Networking Software API Guide Profiles Research Networking Software API Guide Documentation Version: March 13, 2013 Software Version: ProfilesRNS_1.0.3 Table of Contents Overview... 2 PersonID, URI, and Aliases... 3 1) Profiles RNS

More information

DCMI Abstract Model - DRAFT Update

DCMI Abstract Model - DRAFT Update 1 of 7 9/19/2006 7:02 PM Architecture Working Group > AMDraftUpdate User UserPreferences Site Page Actions Search Title: Text: AttachFile DeletePage LikePages LocalSiteMap SpellCheck DCMI Abstract Model

More information

CEN MetaLex. Facilitating Interchange in E- Government. Alexander Boer

CEN MetaLex. Facilitating Interchange in E- Government. Alexander Boer CEN MetaLex Facilitating Interchange in E- Government Alexander Boer aboer@uva.nl MetaLex Initiative taken by us in 2002 Workshop on an open XML interchange format for legal and legislative resources www.metalex.eu

More information

Data Curation Profile Human Genomics

Data Curation Profile Human Genomics Data Curation Profile Human Genomics Profile Author Profile Author Institution Name Contact J. Carlson N. Brown Purdue University J. Carlson, jrcarlso@purdue.edu Date of Creation October 27, 2009 Date

More information

Orchestrating Music Queries via the Semantic Web

Orchestrating Music Queries via the Semantic Web Orchestrating Music Queries via the Semantic Web Milos Vukicevic, John Galletly American University in Bulgaria Blagoevgrad 2700 Bulgaria +359 73 888 466 milossmi@gmail.com, jgalletly@aubg.bg Abstract

More information

Proposal for Implementing Linked Open Data on Libraries Catalogue

Proposal for Implementing Linked Open Data on Libraries Catalogue Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:

More information

Improving access and facilitating research: The music collections in the new catalogues of the French National Library (BnF)

Improving access and facilitating research: The music collections in the new catalogues of the French National Library (BnF) Improving access and facilitating research: The music collections in the new catalogues of the French National Library (BnF) The general catalogue of the BnF First computer catalogue for the users of the

More information

Database of historical places, persons, and lemmas

Database of historical places, persons, and lemmas Database of historical places, persons, and lemmas Natalia Korchagina Outline 1. Introduction 1.1 Swiss Law Sources Foundation as a Digital Humanities project 1.2 Data to be stored 1.3 Final goal: how

More information

JENA: A Java API for Ontology Management

JENA: A Java API for Ontology Management JENA: A Java API for Ontology Management Hari Rajagopal IBM Corporation Page Agenda Background Intro to JENA Case study Tools and methods Questions Page The State of the Web Today The web is more Syntactic

More information

Europeana update: aspects of the data

Europeana update: aspects of the data Europeana update: aspects of the data Robina Clayphan, Europeana Foundation European Film Gateway Workshop, 30 May 2011, Frankfurt/Main Overview The Europeana Data Model (EDM) Data enrichment activity

More information

Towards the Semantic Desktop. Dr. Øyvind Hanssen University Library of Tromsø

Towards the Semantic Desktop. Dr. Øyvind Hanssen University Library of Tromsø Towards the Semantic Desktop Dr. Øyvind Hanssen University Library of Tromsø Agenda Background Enabling trends and technologies Desktop computing and The Semantic Web Online Social Networking and P2P Computing

More information

Metadata and Encoding Standards for Digital Initiatives: An Introduction

Metadata and Encoding Standards for Digital Initiatives: An Introduction Metadata and Encoding Standards for Digital Initiatives: An Introduction Maureen P. Walsh, The Ohio State University Libraries KSU-SLIS Organization of Information 60002-004 October 29, 2007 Part One Non-MARC

More information

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment Shigeo Sugimoto Research Center for Knowledge Communities Graduate School of Library, Information

More information

Meta-Bridge: A Development of Metadata Information Infrastructure in Japan

Meta-Bridge: A Development of Metadata Information Infrastructure in Japan Proc. Int l Conf. on Dublin Core and Applications 2011 Meta-Bridge: A Development of Information Infrastructure in Japan Mitsuharu Nagamori Graduate School of Library, Information and Media Studies, University

More information

Contribution of OCLC, LC and IFLA

Contribution of OCLC, LC and IFLA Contribution of OCLC, LC and IFLA in The Structuring of Bibliographic Data and Authorities : A path to Linked Data BY Basma Chebani Head of Cataloging and Metadata Services, AUB Libraries Presented to

More information

Data is the new Oil (Ann Winblad)

Data is the new Oil (Ann Winblad) Data is the new Oil (Ann Winblad) Keith G Jeffery keith.jeffery@keithgjefferyconsultants.co.uk 20140415-16 JRC Workshop Big Open Data Keith G Jeffery 1 Data is the New Oil Like oil has been, data is Abundant

More information

Development of an Ontology-Based Portal for Digital Archive Services

Development of an Ontology-Based Portal for Digital Archive Services Development of an Ontology-Based Portal for Digital Archive Services Ching-Long Yeh Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd. 3rd Sec. Taipei, 104, Taiwan chingyeh@cse.ttu.edu.tw

More information

Semantic Web and Python Concepts to Application development

Semantic Web and Python Concepts to Application development PyCon 2009 IISc, Bangalore, India Semantic Web and Python Concepts to Application development Vinay Modi Voice Pitara Technologies Private Limited Outline Web Need better web for the future Knowledge Representation

More information

The NEPOMUK project. Dr. Ansgar Bernardi DFKI GmbH Kaiserslautern, Germany

The NEPOMUK project. Dr. Ansgar Bernardi DFKI GmbH Kaiserslautern, Germany The NEPOMUK project Dr. Ansgar Bernardi DFKI GmbH Kaiserslautern, Germany ansgar.bernardi@dfki.de Integrated Project n 27705 Priority 2.4.7 Semantic knowledge based systems NEPOMUK is a three-year Integrated

More information

Internal Structure of Information Packages in Digital Preservation

Internal Structure of Information Packages in Digital Preservation JISTaP http://www.jistap.org Journal of Information Science Theory and Practice Research Paper J. of infosci. theory and practice 2(4): 06-19, 2014 http://dx.doi.org/10.1633/jistap.2014.2.4.1 Internal

More information

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper. Semantic Web Company PoolParty - Server PoolParty - Technical White Paper http://www.poolparty.biz Table of Contents Introduction... 3 PoolParty Technical Overview... 3 PoolParty Components Overview...

More information

FIBO Metadata in Ontology Mapping

FIBO Metadata in Ontology Mapping FIBO Metadata in Ontology Mapping For Open Ontology Repository OOR Metadata Workshop VIII 02 July 2013 Copyright 2010 EDM Council Inc. 1 Overview The Financial Industry Business Ontology Introduction FIBO

More information

Enrichment, Reconciliation and Publication of Linked Data with the BIBFRAME model. Tiziana Possemato Casalini Libri

Enrichment, Reconciliation and Publication of Linked Data with the BIBFRAME model. Tiziana Possemato Casalini Libri Enrichment, Reconciliation and Publication of Linked Data with the BIBFRAME model Tiziana Possemato Casalini Libri - @Cult New cooperative scenarios New context: new ways of cooperating between institutions

More information

SKOS. COMP62342 Sean Bechhofer

SKOS. COMP62342 Sean Bechhofer SKOS COMP62342 Sean Bechhofer sean.bechhofer@manchester.ac.uk Ontologies Metadata Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies

More information

Data formats for exchanging classifications UNSD

Data formats for exchanging classifications UNSD ESA/STAT/AC.234/22 11 May 2011 UNITED NATIONS DEPARTMENT OF ECONOMIC AND SOCIAL AFFAIRS STATISTICS DIVISION Meeting of the Expert Group on International Economic and Social Classifications New York, 18-20

More information

COLUMN. Choosing the right CMS authoring tools. Three key criteria will determine the most suitable authoring environment NOVEMBER 2003

COLUMN. Choosing the right CMS authoring tools. Three key criteria will determine the most suitable authoring environment NOVEMBER 2003 KM COLUMN NOVEMBER 2003 Choosing the right CMS authoring tools The authoring environment is the most important aspect of a content management system (CMS), for without content authors, there would be nothing

More information

USING DC FOR SERVICE DESCRIPTION

USING DC FOR SERVICE DESCRIPTION USING DC FOR SERVICE DESCRIPTION The Nature of Services...2 Content of a service...2 Aggregation/Boundary...3 Use of Elements to Describe Services...4 Resource content: Audience, Coverage, Description,

More information

Reducing Consumer Uncertainty

Reducing Consumer Uncertainty Spatial Analytics Reducing Consumer Uncertainty Towards an Ontology for Geospatial User-centric Metadata Introduction Cooperative Research Centre for Spatial Information (CRCSI) in Australia Communicate

More information

Table of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey.

Table of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey. Table of contents for The organization of information / Arlene G. Taylor and Daniel N. Joudrey. Chapter 1: Organization of Recorded Information The Need to Organize The Nature of Information Organization

More information

Opus: University of Bath Online Publication Store

Opus: University of Bath Online Publication Store Patel, M. (2004) Semantic Interoperability in Digital Library Systems. In: WP5 Forum Workshop: Semantic Interoperability in Digital Library Systems, DELOS Network of Excellence in Digital Libraries, 2004-09-16-2004-09-16,

More information

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata

Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata Meeting Host Supporting Partner Meeting Sponsors Reducing Consumer Uncertainty Towards a Vocabulary for User-centric Geospatial Metadata 105th OGC Technical Committee Palmerston North, New Zealand Dr.

More information

The Semantic Web DEFINITIONS & APPLICATIONS

The Semantic Web DEFINITIONS & APPLICATIONS The Semantic Web DEFINITIONS & APPLICATIONS Data on the Web There are more an more data on the Web Government data, health related data, general knowledge, company information, flight information, restaurants,

More information

EDEN An Epigraphic Web Database of Ancient Inscriptions

EDEN An Epigraphic Web Database of Ancient Inscriptions EDEN An Epigraphic Web Database of Ancient Inscriptions Martin Scholz (FAU Erlangen-Nürnberg) 21.04.2016 Outline Goals, Content, and Structure of EDEN Online Database Semantic Modelling Annotating Text

More information

Solution Architecture Template (SAT) Design Guidelines

Solution Architecture Template (SAT) Design Guidelines Solution Architecture Template (SAT) Design Guidelines Change control Modification Details Version 2.0.0 Alignment with EIRA v2.0.0 Version 1.0.0 Initial version ISA² Action - European Interoperability

More information

A Domain-Customizable SVG-Based Graph Editor for Software Visualizations

A Domain-Customizable SVG-Based Graph Editor for Software Visualizations A Domain-Customizable SVG-Based Graph Editor for Software Visualizations Tony Lin, Feng Zou, Holger M. Kienle and Hausi A. Müller University of Victoria, Canada {gaoyun,fzou,kienle,hausi}@cs.uvic.ca Abstract

More information

warwick.ac.uk/lib-publications

warwick.ac.uk/lib-publications Original citation: Zhao, Lei, Lim Choi Keung, Sarah Niukyun and Arvanitis, Theodoros N. (2016) A BioPortalbased terminology service for health data interoperability. In: Unifying the Applications and Foundations

More information

Ontologies SKOS. COMP62342 Sean Bechhofer

Ontologies SKOS. COMP62342 Sean Bechhofer Ontologies SKOS COMP62342 Sean Bechhofer sean.bechhofer@manchester.ac.uk Metadata Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies

More information

ClickLearn Studio. - A technical guide 9/18/2017

ClickLearn Studio. - A technical guide 9/18/2017 ClickLearn Studio - A technical guide 9/18/2017 All products and companies mentioned in this document are or may be registered trademarks of their respective companies or owners. ClickLearn ApS reserves

More information

For those of you who may not have heard of the BHL let me give you some background. The Biodiversity Heritage Library (BHL) is a consortium of

For those of you who may not have heard of the BHL let me give you some background. The Biodiversity Heritage Library (BHL) is a consortium of 1 2 For those of you who may not have heard of the BHL let me give you some background. The Biodiversity Heritage Library (BHL) is a consortium of natural history and botanical libraries that cooperate

More information

A Tagging Approach to Ontology Mapping

A Tagging Approach to Ontology Mapping A Tagging Approach to Ontology Mapping Colm Conroy 1, Declan O'Sullivan 1, Dave Lewis 1 1 Knowledge and Data Engineering Group, Trinity College Dublin {coconroy,declan.osullivan,dave.lewis}@cs.tcd.ie Abstract.

More information

Introduction and background

Introduction and background page 1 of 9 To: Joint Steering Committee for Development of RDA From: Gordon Dunsire, CILIP representative Subject: RDF representation of RDA relationship designators: discussion paper Introduction and

More information

Tara McPherson School of Cinematic Arts USC Los Angeles, CA, USA

Tara McPherson School of Cinematic Arts USC Los Angeles, CA, USA Tara McPherson School of Cinematic Arts USC Los Angeles, CA, USA Both scholarship + popular culture have gone online There were about 25,400 active scholarly peer-reviewed journals in early 2009, collectively

More information

CSCU9T4: Managing Information

CSCU9T4: Managing Information CSCU9T4: Managing Information CSCU9T4 Spring 2016 1 The Module Module co-ordinator: Dr Gabriela Ochoa Lectures by: Prof Leslie Smith (l.s.smith@cs.stir.ac.uk) and Dr Nadarajen Veerapen (nve@cs.stir.ac.uk)

More information

Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS. Jenn Riley IU Metadata Librarian DLP Brown Bag Series February 25, 2005

Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS. Jenn Riley IU Metadata Librarian DLP Brown Bag Series February 25, 2005 Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS Jenn Riley IU Metadata Librarian DLP Brown Bag Series February 25, 2005 Descriptive metadata Enables users to find relevant materials Used

More information

Software Requirements Specification for the Names project prototype

Software Requirements Specification for the Names project prototype Software Requirements Specification for the Names project prototype Prepared for the JISC Names Project by Daniel Needham, Amanda Hill, Alan Danskin & Stephen Andrews April 2008 1 Table of Contents 1.

More information

Background. Recommendations. SAC13-ANN/11/Rev. SAC/RDA Subcommittee/2013/1 March 8, 2013; rev. July 11, 2013 page 1 of 7

Background. Recommendations. SAC13-ANN/11/Rev. SAC/RDA Subcommittee/2013/1 March 8, 2013; rev. July 11, 2013 page 1 of 7 page 1 of 7 TO: FROM: SUBJECT: RELATED: ALA/ALCTS/CaMMS/Subject Analysis Committee ALA/ALCTS/CaMMS/Committee on Cataloging: Description and Access SAC RDA Subcommittee Treatment of Subjects in RDA: A Discussion

More information

DELIVERABLE. Deliverable D3.2: Prototyping Platform Implemented

DELIVERABLE. Deliverable D3.2: Prototyping Platform Implemented DELIVERABLE Project Acronym: DM2E Grant Agreement number: ICT-PSP-297274 Project Title: Digitised Manuscripts to Europeana Deliverable D3.2: Prototyping Platform Implemented Revision: 1.0 Authors: Christian

More information

Minimal Metadata Standards and MIIDI Reports

Minimal Metadata Standards and MIIDI Reports Dryad-UK Workshop Wolfson College, Oxford 12 September 2011 Minimal Metadata Standards and MIIDI Reports David Shotton, Silvio Peroni and Tanya Gray Image BioInformatics Research Group Department of Zoology

More information

Stylus Studio Case Study: FIXML Working with Complex Message Sets Defined Using XML Schema

Stylus Studio Case Study: FIXML Working with Complex Message Sets Defined Using XML Schema Stylus Studio Case Study: FIXML Working with Complex Message Sets Defined Using XML Schema Introduction The advanced XML Schema handling and presentation capabilities of Stylus Studio have valuable implications

More information

Alexander Haffner. RDA and the Semantic Web

Alexander Haffner. RDA and the Semantic Web Alexander Haffner RDA and the Semantic Web 1 Internationalisation and Interoperability interoperability of information and library systems internationalisation in descriptive cataloguing and subject cataloguing

More information

Porting Social Media Contributions with SIOC

Porting Social Media Contributions with SIOC Porting Social Media Contributions with SIOC Uldis Bojars, John G. Breslin, and Stefan Decker DERI, National University of Ireland, Galway, Ireland firstname.lastname@deri.org Abstract. Social media sites,

More information

RIM Document Editorial Tasks

RIM Document Editorial Tasks 0 0 0 Rim Document Editorial Tasks RIM Document Editorial Tasks V Technical Editorial Services For HL Contract Work Announcement V Technical Editor January 00 Ockham Information Services LLC 0 Adams Street

More information

D WSMO Data Grounding Component

D WSMO Data Grounding Component Project Number: 215219 Project Acronym: SOA4All Project Title: Instrument: Thematic Priority: Service Oriented Architectures for All Integrated Project Information and Communication Technologies Activity

More information

The Europeana Data Model and Europeana Libraries Robina Clayphan

The Europeana Data Model and Europeana Libraries Robina Clayphan The Europeana Data Model and Europeana Libraries Robina Clayphan 27 April 2012, The British Library, London Overview 1. How delighted I am to be here 2. The Europeana Data Model What is it for? What does

More information

case study The Asset Description Metadata Schema (ADMS) A common vocabulary to publish semantic interoperability assets on the Web July 2011

case study The Asset Description Metadata Schema (ADMS) A common vocabulary to publish semantic interoperability assets on the Web July 2011 case study July 2011 The Asset Description Metadata Schema (ADMS) A common vocabulary to publish semantic interoperability assets on the Web DISCLAIMER The views expressed in this document are purely those

More information

From the Web to the Semantic Web: RDF and RDF Schema

From the Web to the Semantic Web: RDF and RDF Schema From the Web to the Semantic Web: RDF and RDF Schema Languages for web Master s Degree Course in Computer Engineering - (A.Y. 2016/2017) The Semantic Web [Berners-Lee et al., Scientific American, 2001]

More information

Metadata: The Theory Behind the Practice

Metadata: The Theory Behind the Practice Metadata: The Theory Behind the Practice Item Type Presentation Authors Coleman, Anita Sundaram Citation Metadata: The Theory Behind the Practice 2002-04, Download date 06/07/2018 12:18:20 Link to Item

More information

The Canadian Information Network for Research in the Social Sciences and Humanities.

The Canadian Information Network for Research in the Social Sciences and Humanities. The Canadian Information Network for Research in the Social Sciences and Humanities http://www.synergiescanada.org Tim Au Yeung and Mary Westell Libraries and Cultural Resources University of Calgary March

More information

Introduction to the Semantic Web

Introduction to the Semantic Web Introduction to the Semantic Web Charlie Abela Department of Artificial Intelligence charlie.abela@um.edu.mt Lecture Outline Course organisation Today s Web limitations Machine-processable data The Semantic

More information