Document Metadata: document technical metadata for digital preservation


By Carol C.H. Chou - Florida Digital Archive (FDA)
Andrea Goethals - Harvard University Library (HUL)
March 24,

Table of Contents
Introduction
Applicable Formats
Data Dictionary
Document Metadata schema
Appendix A: A sample PREMIS document with embedded docmd schema

Introduction

The Florida Digital Archive (FDA) provides digital preservation for the eleven public universities in Florida. Since it was established in 2005, FDA has ingested over nine million files totaling more than 19 terabytes of data. Approximately 50,000 of these files are in document formats such as PDF, Word and OpenDocument Text. Most of these documents come from the Electronic Thesis and Dissertation (ETD) collections of those universities.

Ensuring that all FDA collections remain usable and renderable is one of FDA's critical missions. Extracting technical metadata from documents is essential because it can aid in characterizing the kinds of documents in our preservation collections; listing document properties that may hinder preservation (encryption, external fonts, etc.); and providing requirements for selecting tools and facilities for document transformation, including normalization and migration. In addition, document technical metadata can be used to verify the result of document transformations, ensuring that the properties of the original document are preserved and properly carried over to the new document format.

There are currently many metadata standards for various format groups. For images, there is MIX (NISO Metadata for Images in XML). For text-based formats such as plain text, XML, DTD and HTML, there is the TextMD (Technical Metadata for Text) schema. For audio, there are two emerging standards: AES-X098B (work in progress by the Audio Engineering Society SC Working Group on Digital Library and Archive Systems) and the AMD schema by the Library of Congress. For document formats such as PDF, Word or OpenDocument Text, however, we have found no technical metadata standard to follow. Although JHOVE provides a metadata extraction function for PDF and presents the extracted PDF metadata in the JHOVE schema, the extracted metadata is massive and is expressed page by page.

We hope to develop a document metadata schema that is simpler and can be applied to document formats other than PDF. The document metadata schema may be expressed in XML or in database form. We will also develop a style sheet to convert PDF metadata in the JHOVE schema into the document metadata schema.

Andrea Goethals at Harvard University Library (HUL) has also expressed the need for a document metadata schema. HUL is enhancing its preservation repository, the Digital Repository System (DRS), to accept born-digital content, including documents, and sees this document metadata schema as the first step towards preserving documents for the long term. Together, we are developing this document metadata schema for the use of FDA and the DRS. We hope to gather input from the preservation community to enhance the schema, which may also be useful to other trusted repositories.

Applicable Formats

This metadata schema was developed based on a small set of popular document formats but should be generally applicable to formats that are:
- primarily text; but
- that allow creators a choice of fonts, colors, text size, backgrounds; and
- that support embedded multimedia (images, sounds, video, etc.); and
- that may contain application-specific features; and
- that support page layouts with margins, columns, etc.

These formats include but are not limited to:

| Format | MIME Type | File Extension | Native Applications |
|---|---|---|---|
| Microsoft Word | application/msword; application/ms-word (this second MIME type isn't correct but is sometimes used) | doc | Microsoft Word and Microsoft Office Word |
| Portable Document Format | application/pdf | pdf | Adobe Writer |
| OpenDocument Text | application/vnd.oasis.opendocument.text | odt | OpenOffice.org 2.0 / StarOffice 8 and later |
| Writer 6.0 Document | application/vnd.sun.xml.writer | sxw | OpenOffice.org 1.0 / StarOffice 6.0 and later |
| StarWriter 5.x Document | application/vnd.stardivision.writer | sdw | StarOffice 5.x |
| StarWriter 4.x Document | application/x-starwriter | sdw | StarOffice 4.x |
| WordPerfect Document | application/vnd.wordperfect | wpd | WordPerfect and WordPerfect Office |
| Works Text Document | application/vnd.ms-works | wps | Microsoft Works |

For each metadata element listed in the data dictionary, the document formats listed are those known to contain the associated metadata values directly in the file, or from which the values could be determined indirectly by parsing the file.
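As a rough illustration of how a repository might use the format list above to decide whether a file falls within the scope of this schema, the sketch below maps the listed MIME types to format names. The names `APPLICABLE_FORMATS` and `applicable_format` are hypothetical and not part of the schema, and the sketch assumes that `application/vnd.wordperfect` and `application/vnd.ms-works` are the intended spellings of the WordPerfect and Works MIME types.

```python
# Hypothetical lookup built from the format list above; not part of DocMD itself.
APPLICABLE_FORMATS = {
    "application/msword": "Microsoft Word",
    "application/ms-word": "Microsoft Word",  # incorrect but sometimes used
    "application/pdf": "Portable Document Format",
    "application/vnd.oasis.opendocument.text": "OpenDocument Text",
    "application/vnd.sun.xml.writer": "Writer 6.0 Document",
    "application/vnd.stardivision.writer": "StarWriter 5.x Document",
    "application/x-starwriter": "StarWriter 4.x Document",
    "application/vnd.wordperfect": "WordPerfect Document",  # assumed spelling
    "application/vnd.ms-works": "Works Text Document",  # assumed spelling
}


def applicable_format(mime_type):
    """Return the format name for a MIME type, or None if it is not listed."""
    # Drop parameters such as "; charset=..." and normalize case before lookup.
    key = mime_type.split(";", 1)[0].strip().lower()
    return APPLICABLE_FORMATS.get(key)
```

Because the list is explicitly not exhaustive, a `None` result only means the format is not in this table, not that the schema cannot describe it.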

Data Dictionary

The data dictionary describes the semantic meaning and constraints of document-specific metadata. The document-specific metadata are those document properties that are deemed preservation-worthy and pertain to most document formats. Some elements are included because they will aid in evaluating the completeness of the content after transformations (e.g. number of pages). Other elements are included because they will aid in selecting or aggregating documents for risk analysis, preservation or delivery planning (e.g. Features).

Please note that general preservation and descriptive metadata may also be extracted from documents. This metadata includes size, encryption, title, author/creator, creation date, copyright, digital signature, protection/permission, etc., and may be recorded using standard preservation schemas such as PREMIS and MODS.

Semantic unit: PageCount
Semantic components: None
Definition: Total number of pages in the document
Data Constraint: Min 1
Obligation: Mandatory

Semantic unit: WordCount
Semantic components: None
Definition: Total number of words in the document
Data Constraint: Min 0
Note: This element is included in this schema because it can be valuable for evaluating the completeness of the content after transformations. Caution must be used with this element, however, because tools and applications that can determine the number of words in a document do not always use the same algorithm for determining this value.

Semantic unit: CharacterCount
Semantic components: None
Definition: Total number of characters in the document
Data Constraint: Min 0
Note: This element is included in this schema because it can be valuable for evaluating the completeness of the content after transformations. Caution must be used with this element, however, because tools and applications that can determine the number of characters in a document do not always use the same algorithm for determining this value.

Semantic unit: ParagraphCount
Semantic components: None
Definition: Total number of paragraphs in the document
Data Constraint: Min 0
Note: This element is included in this schema because it can be valuable for evaluating the completeness of the content after transformations. Caution must be used with this element, however, because tools and applications that can determine the number of paragraphs in a document do not always use the same algorithm for determining this value.

Semantic unit: LineCount
Semantic components: None
Definition: Total number of lines in the document
Data Constraint: Min 0
Note: This element is included in this schema because it can be valuable for evaluating the completeness of the content after transformations. Caution must be used with this element, however, because tools and applications that can determine the number of lines in a document do not always use the same algorithm for determining this value.

Semantic unit: TableCount
Semantic components: None
Definition: Total number of tables in the document
Data Constraint: Min 0
Note: This element is included in this schema because it can be valuable for evaluating the completeness of the content after transformations. Caution must be used with this element, however, because tools and applications that can determine the number of tables in a document do not always use the same algorithm for determining this value.

Semantic unit: GraphicsCount
Semantic components: None
Definition: Total number of graphics in the document
Data Constraint: Min 0
Note: This element is included in this schema because it can be valuable for evaluating the completeness of the content after transformations. Caution must be used with this element, however, because tools and applications that can determine the number of graphics in a document do not always use the same algorithm for determining this value.

Semantic unit: Language
Semantic components: None
Definition: A language identifier specifying the natural language used in the document
Data Constraint: String (or some kind of controlled vocabulary like ISO alpha-3 language codes)
Cardinality: 0 - N
Characteristic: Content

Semantic unit: Fonts
Semantic components: FontName, IsEmbedded
Definition: A list of fonts used in the document
Data Constraint: Container
Obligation: Mandatory
Cardinality: 1 - N
Characteristic: Content, Appearance
Note: This element allows a repository to store the names of all fonts used in a document. Some repositories may choose to store only the non-embedded fonts. The use of non-embedded fonts may hinder the long-term preservation of the documents. For example, a document encoded with a proprietary non-embedded math font may not be migratable because that specific math font is unavailable. It is recommended that repositories record at least the non-embedded fonts to assist in identifying documents with potential long-term preservation risks.

Semantic unit: FontName
Semantic components: None
Definition: Name of a font
Data Constraint: String
Characteristic: Content, Appearance

Semantic unit: IsEmbedded
Semantic components: None
Definition: An indication of whether or not a font is embedded in a document
Data Constraint: Y, N
Characteristic: Content, Appearance

Semantic unit: Features
Semantic components: None
Definition: Additional document features
Data Constraint: isTagged, hasLayers, hasTransparency, hasOutline, hasThumbnails, hasAttachments, hasForms, hasAnnotations
Cardinality: 0 - N
Characteristic:
  isTagged: structure
  hasLayers: appearance
  hasTransparency: appearance
  hasOutline: behavior, appearance
  hasThumbnails: appearance
  hasAttachments: structure, behavior
  hasForms: content
  hasAnnotations: content
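The count elements above are intended for checking completeness after a transformation, with the caveat that tools may count words, lines and similar units differently. A minimal sketch of such a check follows; the function `compare_counts`, the dictionary-based input and the relative `tolerance` parameter are all assumptions made for illustration, not something the data dictionary prescribes.

```python
# Count elements from the data dictionary; the tuple itself is an illustrative helper.
COUNT_ELEMENTS = (
    "PageCount", "WordCount", "CharacterCount", "ParagraphCount",
    "LineCount", "TableCount", "GraphicsCount",
)


def compare_counts(original, migrated, tolerance=0.0):
    """Return {element: (before, after)} for counts that differ too much.

    `original` and `migrated` are plain dicts of recorded counts.
    `tolerance` is a relative allowance (0.01 means 1%) meant to absorb
    differences between counting algorithms; it is an assumed parameter.
    """
    differences = {}
    for name in COUNT_ELEMENTS:
        before = original.get(name)
        after = migrated.get(name)
        if before is None or after is None:
            continue  # optional element not recorded on one side
        allowed = tolerance * max(before, after)
        if abs(before - after) > allowed:
            differences[name] = (before, after)
    return differences
```

With a tolerance of zero, any difference is reported; a repository might reasonably use a strict comparison for PageCount and a looser one for WordCount or CharacterCount.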

Document Metadata schema

An XML schema, DocMD, describes the document-specific metadata.

<?xml version="1.0" encoding="utf-8"?>
<!-- Editor: Florida Center for Library Automation (FCLA) and Harvard University Library (HUL)
     Revised: March 17, -->
<xs:schema xmlns:xs=""
           xmlns:docmd=""
           targetNamespace=""
           elementFormDefault="qualified"
           attributeFormDefault="unqualified">
  <xs:element name="document">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="PageCount" minOccurs="1" maxOccurs="1">
          <xs:simpleType>
            <xs:restriction base="xs:integer">
              <xs:minInclusive value="1"/>
            </xs:restriction>
          </xs:simpleType>
        </xs:element>
        <xs:element name="WordCount" minOccurs="0" maxOccurs="1">
          <xs:simpleType>
            <xs:restriction base="xs:integer">
              <xs:minInclusive value="0"/>
            </xs:restriction>
          </xs:simpleType>
        </xs:element>
        <xs:element name="CharacterCount" minOccurs="0" maxOccurs="1">
          <xs:simpleType>
            <xs:restriction base="xs:integer">
              <xs:minInclusive value="0"/>
            </xs:restriction>
          </xs:simpleType>
        </xs:element>
        <xs:element name="ParagraphCount" minOccurs="0" maxOccurs="1">
          <xs:simpleType>
            <xs:restriction base="xs:integer">
              <xs:minInclusive value="0"/>
            </xs:restriction>
          </xs:simpleType>
        </xs:element>
        <xs:element name="LineCount" minOccurs="0" maxOccurs="1">
          <xs:simpleType>
            <xs:restriction base="xs:integer">
              <xs:minInclusive value="0"/>
            </xs:restriction>
          </xs:simpleType>
        </xs:element>
        <xs:element name="TableCount" minOccurs="0" maxOccurs="1">
          <xs:simpleType>
            <xs:restriction base="xs:integer">
              <xs:minInclusive value="0"/>
            </xs:restriction>
          </xs:simpleType>
        </xs:element>
        <xs:element name="GraphicsCount" minOccurs="0" maxOccurs="1">
          <xs:simpleType>
            <xs:restriction base="xs:integer">
              <xs:minInclusive value="0"/>
            </xs:restriction>
          </xs:simpleType>
        </xs:element>
        <xs:element name="Language" minOccurs="0" maxOccurs="unbounded" type="xs:string"/>
        <xs:element name="Font" minOccurs="1" maxOccurs="unbounded">
          <xs:complexType>
            <xs:attribute name="FontName" type="xs:string"/>
            <xs:attribute name="isEmbedded" type="xs:boolean"/>
          </xs:complexType>
        </xs:element>
        <xs:element name="Features" minOccurs="0" maxOccurs="unbounded">
          <xs:simpleType>
            <xs:restriction base="xs:string">
              <xs:enumeration value="isTagged"/>
              <xs:enumeration value="hasOutline"/>
              <xs:enumeration value="hasThumbnails"/>
              <xs:enumeration value="hasLayers"/>
              <xs:enumeration value="hasForms"/>
              <xs:enumeration value="hasAnnotations"/>
              <xs:enumeration value="hasAttachments"/>
              <xs:enumeration value="useTransparency"/>
            </xs:restriction>
          </xs:simpleType>
        </xs:element>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>
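To make the shape of a conforming instance more concrete, the following sketch builds a `document` element with Python's standard `xml.etree.ElementTree`. It is only an illustration under stated assumptions: the function `build_document_metadata` is hypothetical, the element and attribute spellings (`PageCount`, `Font`, `FontName`, `isEmbedded`, `Features`) are assumed from the data dictionary and the schema listing, and the namespace is left as an optional parameter rather than supplied here.

```python
import xml.etree.ElementTree as ET


def build_document_metadata(page_count, fonts, features=(), namespace=None):
    """Build a <document> element in the order the schema sequence lists.

    fonts is a list of (font_name, is_embedded) pairs; features is a list
    of feature strings. All names here are assumptions for illustration.
    """
    if page_count < 1:
        raise ValueError("PageCount must be at least 1")
    if not fonts:
        raise ValueError("at least one Font entry is required")

    def tag(local):
        # Qualify the tag only when a namespace URI is supplied.
        return f"{{{namespace}}}{local}" if namespace else local

    document = ET.Element(tag("document"))
    ET.SubElement(document, tag("PageCount")).text = str(page_count)
    for font_name, is_embedded in fonts:
        ET.SubElement(
            document,
            tag("Font"),
            {"FontName": font_name, "isEmbedded": "true" if is_embedded else "false"},
        )
    for feature in features:
        ET.SubElement(document, tag("Features")).text = feature
    return document
```

The result can be serialized with `ET.tostring(element, encoding="unicode")`. The sketch covers only the mandatory elements plus features; the optional counts and Language would be added in the same way, before `Font`, to respect the sequence order.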

Appendix A: A sample PREMIS document with embedded docmd schema

<premis xmlns:xsi=""
        xmlns="info:lc/xmlns/premis-v2"
        xsi:schemaLocation="info:lc/xmlns/premis-v2"
        version="2.0">
  <object xsi:type="file">
    <objectIdentifier>
      <objectIdentifierType>daitss2</objectIdentifierType>
      <objectIdentifierValue>/users/carol/desktop/work/testdata/pdf/etd.pdf</objectIdentifierValue>
    </objectIdentifier>
    <objectCharacteristics>
      <compositionLevel>0</compositionLevel>
      <size></size>
      <format>
        <formatDesignation>
          <formatName>pdf</formatName>
          <formatVersion>1.3</formatVersion>
        </formatDesignation>
        <formatRegistry>
          <formatRegistryName>pronom</formatRegistryName>
          <formatRegistryKey>fmt/17</formatRegistryKey>
        </formatRegistry>
      </format>
      <creatingApplication>
        <creatingApplicationName>thesis (Electronic thesis).doc - Microsoft Word</creatingApplicationName>
        <dateCreatedByApplication>Tue Apr 24 16:22:40 EDT 2001</dateCreatedByApplication>
      </creatingApplication>
      <fixity>
        <messageDigestAlgorithm>md5</messageDigestAlgorithm>
        <messageDigest>6bf12f206a3e70c88cfe2aa5213dd227</messageDigest>
      </fixity>
      <objectCharacteristicsExtension>
        <doc xmlns="">
          <document>
            <PageCount>123</PageCount>
            <Font FontName="Arial" isEmbedded="false"/>
            <Font FontName="TimesNewRoman,BoldItalic" isEmbedded="false"/>
            <Font FontName="BookmanOldStyle" isEmbedded="false"/>
            <Font FontName="Arial,Bold" isEmbedded="false"/>
            <Font FontName="TimesNewRoman,Italic" isEmbedded="false"/>
            <Feature>hasThumbnails</Feature>
          </document>
        </doc>
      </objectCharacteristicsExtension>
    </objectCharacteristics>
  </object>
</premis>
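A repository could read the embedded document metadata back out of such a PREMIS record to flag preservation risks, for example by listing the fonts that are not embedded. The sketch below uses Python's standard `xml.etree.ElementTree`; the function names are hypothetical, the matching is done on local element names so that it works whatever namespace is declared, and it accepts either `isEmbedded` or `isembedded` as the attribute spelling, which is an assumption. A missing attribute is treated conservatively as "not embedded".

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag, if present."""
    return tag.rsplit("}", 1)[-1]


def non_embedded_fonts(root):
    """Return the FontName of every Font element not marked as embedded."""
    names = []
    for element in root.iter():
        if local_name(element.tag) != "Font":
            continue
        # Accept either attribute spelling; a missing value counts as not embedded.
        flag = element.get("isEmbedded", element.get("isembedded", "false"))
        if flag.strip().lower() not in ("true", "1", "y"):
            names.append(element.get("FontName"))
    return names
```

Applied to a parsed copy of the sample above (for instance via `ET.fromstring`), this would return all five font names, since every font in the sample is marked as not embedded.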


More information

Messages are securely encrypted using HTTPS. HTTPS is the most commonly used secure method of exchanging data among web browsers.

Messages are securely encrypted using HTTPS. HTTPS is the most commonly used secure method of exchanging data among web browsers. May 6, 2009 9:39 SIF Specifications SIF Implementation Specification The SIF Implementation Specification is based on the World Wide Web Consortium (W3C) endorsed Extensible Markup Language (XML) which

More information

XML Schema. Mario Alviano A.Y. 2017/2018. University of Calabria, Italy 1 / 28

XML Schema. Mario Alviano A.Y. 2017/2018. University of Calabria, Italy 1 / 28 1 / 28 XML Schema Mario Alviano University of Calabria, Italy A.Y. 2017/2018 Outline 2 / 28 1 Introduction 2 Elements 3 Simple and complex types 4 Attributes 5 Groups and built-in 6 Import of other schemes

More information

Specifications for SHORT System Document Submission Service

Specifications for SHORT System Document Submission Service Specifications for SHOT System Document Submission Service Version 1.3 August 2015 Version 1.3 August 2015 1 evision History Version Date Major Changes 1.0 December 2010 Initial version. 1.1 February 2011

More information

Extensible Markup Language Processing

Extensible Markup Language Processing CHAPTER 2 Revised: June 24, 2009, This chapter describes the Extensible Markup Language (XML) process in the Common Object Request Broker Architecture (CORBA) adapter. XML and Components Along with XML,

More information

CMS SOAP CLIENT SOFTWARE REQUIREMENTS SPECIFICATION

CMS SOAP CLIENT SOFTWARE REQUIREMENTS SPECIFICATION CMS SOAP CLIENT SOFTWARE REQUIREMENTS SPECIFICATION CONTENTS 1. Introduction 1.1. Purpose 1.2. Scope Of Project 1.3. Glossary 1.4. References 1.5. Overview Of Document 2. Overall Description 2.1. System

More information

Pattern/Object Markup Language (POML): A Simple XML Schema for Object Oriented Code Description

Pattern/Object Markup Language (POML): A Simple XML Schema for Object Oriented Code Description Pattern/Object Markup Language (POML): A Simple XML Schema for Object Oriented Code Description Jason McC. Smith Apr 7, 2004 Abstract Pattern/Object Markup Language (or POML) is a simple XML Schema for

More information

Qualys Cloud Platform (VM, PC) v8.x API Release Notes

Qualys Cloud Platform (VM, PC) v8.x API Release Notes API Release Notes Version 8.18.1 March 19, 2019 This new version of the Qualys Cloud Platform (VM, PC) includes improvements to the Qualys API. You ll find all the details in our user guides, available

More information

[MS-MSL]: Mapping Specification Language File Format. Intellectual Property Rights Notice for Open Specifications Documentation

[MS-MSL]: Mapping Specification Language File Format. Intellectual Property Rights Notice for Open Specifications Documentation [MS-MSL]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation ( this documentation ) for protocols,

More information

DRS 2 Glossary. access flag An object access flag records the least restrictive access flag recorded for one of the object s files: ο ο

DRS 2 Glossary. access flag An object access flag records the least restrictive access flag recorded for one of the object s files: ο ο Harvard University Information Technology Library Technology Services DRS 2 Glossary access flag An object access flag records the least restrictive access flag recorded for one of the object s files:

More information

Digital Preservation at NARA

Digital Preservation at NARA Digital Preservation at NARA Policy, Records, Technology Leslie Johnston Director of Digital Preservation US National Archives and Records Administration (NARA) ARMA, April 18, 2018 Policy Managing Government

More information

[MS-QDEFF]: Query Definition File Format. Intellectual Property Rights Notice for Open Specifications Documentation

[MS-QDEFF]: Query Definition File Format. Intellectual Property Rights Notice for Open Specifications Documentation [MS-QDEFF]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation ( this documentation ) for protocols,

More information

[MS-DPAD]: Alert Definition Data Portability Overview. Intellectual Property Rights Notice for Open Specifications Documentation

[MS-DPAD]: Alert Definition Data Portability Overview. Intellectual Property Rights Notice for Open Specifications Documentation [MS-DPAD]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation ( this documentation ) for protocols,

More information

Customer Market Results Interface (CMRI) For RC Interface Specification. Version: 1.0.0

Customer Market Results Interface (CMRI) For RC Interface Specification. Version: 1.0.0 Customer Market Results Interface (CMRI) For RC Interface Specification Version: 1.0.0 November 1, 2018 Revision History Date Version Description 11/01/2018 1.0.0 Initial document release Page 2 of 10

More information

DFP Mobile Ad Network and Rich Media API

DFP Mobile Ad Network and Rich Media API DFP Mobile Ad Network and Rich Media API v2.0, 12 June 2012 Background DFP Mobile is adopting a single open API for integrating with all ad networks and rich media vendors. This has the following benefits:

More information

Positioning Additional Constraints

Positioning Additional Constraints Positioning Additional Constraints Issue XML Schema 1.1 allows additional constraints to be imposed on elements and attributes, above and beyond the constraints specified by their data type. Where should

More information

No Trade Secrets. Microsoft does not claim any trade secret rights in this documentation.

No Trade Secrets. Microsoft does not claim any trade secret rights in this documentation. [MS-DPAD]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation for protocols, file formats, languages,

More information

[MS-DPRDL]: Report Definition Language Data Portability Overview. Intellectual Property Rights Notice for Open Specifications Documentation

[MS-DPRDL]: Report Definition Language Data Portability Overview. Intellectual Property Rights Notice for Open Specifications Documentation [MS-DPRDL]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation ( this documentation ) for protocols,

More information

[MS-OXWSSYNC]: Mailbox Contents Synchronization Web Service Protocol Specification

[MS-OXWSSYNC]: Mailbox Contents Synchronization Web Service Protocol Specification [MS-OXWSSYNC]: Mailbox Contents Synchronization Web Service Protocol Specification Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes

More information

ONVIF. XML Schema Version and Extension Handling White Paper

ONVIF. XML Schema Version and Extension Handling White Paper ONVIF 1 XML Schema Extension Handling ONVIF XML Schema Version and Extension Handling White Paper Version 1.2 December, 2015 1 Background 1.1 Purpose Version and extensions handling for XML schemas are

More information

3GPP TS V8.2.0 ( )

3GPP TS V8.2.0 ( ) TS 24.623 V8.2.0 (2009-12) Technical Specification 3rd Generation Partnership Project; Technical Specification Group Core Network and Terminals; Extensible Markup Language (XML) Configuration Access Protocol

More information

No Trade Secrets. Microsoft does not claim any trade secret rights in this documentation.

No Trade Secrets. Microsoft does not claim any trade secret rights in this documentation. [MS-MSL]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation for protocols, file formats, languages,

More information

Using Inventory Export Guide

Using Inventory Export Guide Introducing Inventory Import and Export XML Using Inventory Export Guide To Manage Your Inventory Data Version 1.0 ADD TO CART XML API GUIDE 5/28/13 PAGE 1 Copyright 2013 Shopatron, Inc. Using Inventory

More information

No Trade Secrets. Microsoft does not claim any trade secret rights in this documentation.

No Trade Secrets. Microsoft does not claim any trade secret rights in this documentation. [MS-TPXS]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation for protocols, file formats, languages,

More information

Semantic Web. XML and XML Schema. Morteza Amini. Sharif University of Technology Fall 94-95

Semantic Web. XML and XML Schema. Morteza Amini. Sharif University of Technology Fall 94-95 ه عا ی Semantic Web XML and XML Schema Morteza Amini Sharif University of Technology Fall 94-95 Outline Markup Languages XML Building Blocks XML Applications Namespaces XML Schema 2 Outline Markup Languages

More information

OCIMF. SIRE Crew Web Services 2.0

OCIMF. SIRE Crew Web Services 2.0 OCIMF SIRE Crew Web Services 2.0 v1.0.03 1 March 2012 Introduction OCIMF SIRE Web Services V2 are available at the following URLs: http://wsv2.ocimf-sire.com/ocimfservices.asmx https://wsv2.ocimf-sire.com/ocimfservices.asmx

More information

PLANNED RESOURCE SCHEDULE DOCUMENT UML MODEL AND SCHEMA

PLANNED RESOURCE SCHEDULE DOCUMENT UML MODEL AND SCHEMA 1 PLANNED RESOURCE SCHEDULE DOCUMENT UML MODEL AND SCHEMA 2019-02-12 APPROVED DOCUMENT 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42

More information

RESOURCE SCHEDULE CONFIRMATION DOCUMENT UML MODEL AND SCHEMA

RESOURCE SCHEDULE CONFIRMATION DOCUMENT UML MODEL AND SCHEMA 1 RESOURCE SCHEDULE CONFIRMATION DOCUMENT UML MODEL AND SCHEMA 2019-02-12 APPROVED DOCUMENT 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40

More information

Week 5 Aim: Description. Source Code

Week 5 Aim: Description. Source Code Week 5 Aim: Write an XML file which will display the Book information which includes the following: 1) Title of the book 2) Author Name 3) ISBN number 4) Publisher name 5) Edition 6) Price Write a Document

More information

Test Assertions Part 2 - Test Assertion Markup Language Version 1.0

Test Assertions Part 2 - Test Assertion Markup Language Version 1.0 Test Assertions Part 2 - Test Assertion Markup Language Version 1.0 Draft 1.0.2 6 January 2010 Specification URIs: This Version: Previous Version: [NA] Latest Version: http://docs.oasis-open.org/tag/taml/v1.0/testassertionmarkuplanguage-1.0.html

More information

UNAVAILABILITY DOCUMENT UML MODEL AND SCHEMA

UNAVAILABILITY DOCUMENT UML MODEL AND SCHEMA 1 UNAVAILABILITY DOCUMENT UML MODEL AND SCHEMA 2017-01-27 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 Table of Contents 1 Objective...

More information

Automated Load Forecast System (ALFS) For RC Interface Specification

Automated Load Forecast System (ALFS) For RC Interface Specification Automated Load Forecast System (ALFS) For RC Interface Specification Version: 1.0 October 22, 2018 Revision History Date Version Description 10/23/2018 1.0 Initial document release related to the Load

More information

REDISPATCH DOCUMENT UML MODEL AND SCHEMA

REDISPATCH DOCUMENT UML MODEL AND SCHEMA 1 REDISPATCH DOCUMENT UML MODEL AND SCHEMA 2019-02-12 APPROVED DOCUMENT 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 Table of Contents 1

More information

Metadata for SAML 1.0 Web Browser Profiles

Metadata for SAML 1.0 Web Browser Profiles 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 Metadata for SAML 1.0 Web Browser Profiles Working Draft 00, 12 November 2002 Document identifier: draft-sstc-saml-meta-data-00 Location:

More information

Manage Desktop Layout

Manage Desktop Layout You can define the layout of the Finesse desktop on the Desktop Layout tab. Important Requirements, such as processor speed and RAM, for clients that access the Finesse desktop can vary. Desktops that

More information

D-Cinema Packaging Caption and Closed Subtitle

D-Cinema Packaging Caption and Closed Subtitle SMPTE STANDARD SMPTE 429-12-2008 D-Cinema Packaging Caption and Closed Subtitle Page 1 of 11 pages Table of Contents Page Foreword... 2 Intellectual Property... 2 1 Scope... 3 2 Conformance Notation...

More information

Draft Digital Preservation Policy for IGNCA. Dr. Aditya Tripathi Banaras Hindu University Varanasi

Draft Digital Preservation Policy for IGNCA. Dr. Aditya Tripathi Banaras Hindu University Varanasi Draft Digital Preservation Policy for IGNCA Dr. Aditya Tripathi Banaras Hindu University Varanasi aditya@bhu.ac.in adityatripathi@hotmail.com Digital Preservation Born Digital Object Regardless of U S

More information

[MS-QDEFF]: Query Definition File Format. Intellectual Property Rights Notice for Open Specifications Documentation

[MS-QDEFF]: Query Definition File Format. Intellectual Property Rights Notice for Open Specifications Documentation [MS-QDEFF]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation for protocols, file formats, languages,

More information

Publications Office. TED Website - Notice Viewer WS Technical Specifications Document - Appendix D - NoticeViewer

Publications Office. TED Website - Notice Viewer WS Technical Specifications Document - Appendix D - NoticeViewer Publications Office Subject NoticeViewer WS API Version / Status 1.03 Release Date 17/02/2017 Filename Document Reference TED_WEBSITE-TSP-Technical_Specifications_Document-v1.03 TED-TSP-Appendix D Table

More information

[MS-OXSHRMSG]: Sharing Message Attachment Schema. Intellectual Property Rights Notice for Open Specifications Documentation

[MS-OXSHRMSG]: Sharing Message Attachment Schema. Intellectual Property Rights Notice for Open Specifications Documentation [MS-OXSHRMSG]: Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft publishes Open Specifications documentation for protocols, file formats, languages,

More information

Descriptive Metadata for Spartan Archive RO Collection Qualified Dublin Core

Descriptive Metadata for Spartan Archive RO Collection Qualified Dublin Core Descriptive Metadata for Spartan Archive RO Collection Qualified Dublin Core Element Qualifier Scheme Example (RO instance) identifier msu- uahc:ua6.7- AcademicPrograms title Academic Programs creator

More information

Web Service Provider Example - Enabling Visible Business

Web Service Provider Example - Enabling Visible Business Web Services Example Web Service Provider Example - Enabling Visible Business Company A makes earrings. One of their suppliers, Company B, provides the glass beads that are used in the earrings. Company

More information