- What we actually mean by documents (the FRBR hierarchy) - What are the components of documents


Introduction to XML for parliamentary documents (and all other kinds of documents, actually)
Prof. Fabio Vitali, University of Bologna
Part 1

Purpose of these slides
Introduce the principal aspects of electronic management of documents:
- What we actually mean by documents (the FRBR hierarchy)
- What are the components of documents
- What do we mean by data and metadata about documents
Introduce some technologies related to electronic management of documents:
- XML
- DTDs
- XML Schema
- XSLT
- RDF and OWL
all somehow connected and related to parliamentary documents (but not necessarily only to them)

Parliamentary activities
A complex production system that generates documents with different legal status:
- Bills and acts, to become the law of a country
- Debate records (or hansards), to become a lasting log of the activities of the parliament
- Daily and weekly announcements, tablings and reports, to organize and master the internal logistics
Each document is of the utmost importance and must be printed in large quantities and/or made available to a wide part of the population daily and in a very limited amount of time.

Computer support for parliamentary activities
Support for generating documents
- Drafting activities, record keeping, translation into national languages, etc.
Support for workflow
- Management of documents across their lifecycle, storage, security, timely involvement of relevant individuals and offices
Support for citizens' access
- Multi-channel publication (on paper and on the web), search, classification, identification
Further activities
- Consolidation, comparison, language synchronization, etc.
How the Web can help
The Web was born as a publishing medium, and HTML helped make it a big success.
HTML is constrained by its own simplicity:
- Excessive reliance on typographic rather than semantic description
- Few rules, which are not even strongly enforced
A new language, called XML, was invented to solve that:
- Clear differentiation between appearance and meaning
- Strong syntactic rules, strictly enforced to guarantee uniformity, homogeneity and sophisticated applications

XML
XML (Extensible Markup Language) is a very widely adopted W3C standard.
XML is pure syntax, without pre-defined semantics. This allows document designers to provide their own semantics.
Thanks to the associated languages (DTD, XSLT, RDF) we can create sophisticated applications with great flexibility of use.
XML makes it possible to create markup languages that are readable, generic, structured and hierarchical.

Parliamentary documents and XML
XML is ideal for representing parliamentary documents (and especially bills and acts):
- They have a well-defined structure, which is systematic and standardized
- There are required and optional parts according to rules and tradition
- There are containment constraints that determine the global correctness of the document
- There are references to other texts (schedules, other acts, etc.) that can fruitfully be used to create a hypertext network

Why is XML good?
Energy / Information
- Conversion is difficult
- Conversion is very easy

Documents

3 problems
How do we refer precisely to the normative content of the documents and of their parts?

3 problems (2)
- When they are different physical copies of the same document (two identical books)
- When they are different ways in which the same words appear (an MS Word file and its printout on paper)
- When they are two different sets of sentences with the same name and purpose (two versions of the same act)
How do we refer precisely to the normative content of the documents and of their parts?

3 problems (2)
- The words and punctuation it is composed of
- The way in which it is shown on the page (pagination, typography, colors, margins and fonts)
- The conceptual role of each fragment (this sentence is a title, this is a clause, this is a reference, etc.)
How do we refer precisely to the normative content of the documents and of their parts?

3 problems (2)
How do we refer precisely to the normative content of the documents and of their parts?
- The meaning
- The words
- The name

3 solutions
- The IFLA FRBR hierarchy: from abstract ideas to physical files (Work, Expression, Manifestation, Item)
- The SGML components: from meaning to typography (Content, presentation, structure)
- The semantic web approach: applying semantics where it fits (data, metadata, ontology)

The IFLA FRBR hierarchy (1)
Work: a distinct intellectual creation.
Expression: the specific form in which a work is realized.
- In our model, all variants and versions of a text that incorporate amendments and updates to an earlier version are considered expressions of the same work.
Manifestation: the representation of an expression according to the requirements of a medium.
- In our model, a manifestation is a representation of an expression as an electronic document in a specific format.
Item: a single exemplar of a manifestation.
- All copies of the same (identical) manifestation are items. All items are accessible in a specific position on a specific computer.
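The four FRBR levels nest inside one another, which can be sketched as a small data model. This is only an illustration under assumptions: the class and field names (`data_format`, `location`, `item_locations`) and the file paths are invented for the example and do not come from the slides or from any FRBR tooling.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Item:
    """A single exemplar of a manifestation, found at a concrete location."""
    location: str


@dataclass
class Manifestation:
    """An expression rendered in a specific format (e.g. XML, PDF)."""
    data_format: str
    items: List[Item] = field(default_factory=list)


@dataclass
class Expression:
    """One version of a work, e.g. the original or an amended text."""
    version: str
    manifestations: List[Manifestation] = field(default_factory=list)


@dataclass
class Work:
    """A distinct intellectual creation, independent of any version."""
    title: str
    expressions: List[Expression] = field(default_factory=list)


def item_locations(work: Work) -> List[str]:
    """Collect the location of every item that ultimately realizes the work."""
    return [
        item.location
        for expression in work.expressions
        for manifestation in expression.manifestations
        for item in manifestation.items
    ]


# Illustrative data only: the file paths below are made up.
act = Work(
    title="Italian Act 3/2001",
    expressions=[
        Expression(
            version="original",
            manifestations=[
                Manifestation("xml", [Item("/home/me/act-3-2001.xml")]),
            ],
        ),
        Expression(
            version="amended",
            manifestations=[
                Manifestation("xml", [Item("/home/me/act-3-2001-amended.xml")]),
                Manifestation("pdf", []),
            ],
        ),
    ],
)

print(item_locations(act))
```

Only the `Item` level carries a location, which mirrors the point that works, expressions and manifestations are abstract while items sit on a specific computer.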
The IFLA FRBR hierarchy (2)
Work:
- The play Hamlet by William Shakespeare
- The Italian act #3 (5 January 2001)
Expression:
- The first quarto of Hamlet (1603)
- The first folio of Hamlet (1623)
- The movie version of Hamlet by Kenneth Branagh (1996)
- The original version of Italian act 2001
- The amended version of Italian act 3/2001 as of 19/12/2003

The IFLA FRBR hierarchy (3)
Manifestation:
- One of the printed versions of the first folio version of Hamlet (e.g. Penguin Books, 1994)
- One of the computer versions of Hamlet (e.g. Project Gutenberg)
- The NIR XML version of the amended version of Italian Act 3/2001 as of 19/12/
- The printed version of the original version of Italian Act 3/2001 in the Italian Gazette #2 (2001)
Item:
- My own copy of Hamlet by Penguin Books; the copy of Hamlet on Project Gutenberg's own site
- The copy of the NIR XML version of Italian Act 3/2001 on my computer. The one I copy onto your computers.

The SGML components (1)
Content
- What exactly was written in the document.
- The content is composed of words, punctuation, sentences, images, paragraphs and so on.
Structure
- How the content is organized.
- All documents have an internal organization, composed of subdivisions, hierarchies, preambles and conclusions, attachments, and so on. Within a paragraph, all parts that have a relevance (e.g. references, quotations, etc.)
Presentation
- The typographical choices made to present a document on screen or on paper.

The SGML components (2)
The structure adds meaning to pieces of content.
- The text "Interpretation" assumes meaning once we know it is the title of article #2 of the Italian Act 3/2001.
The structure connects the presentation to the content.
- Once we know that the text "Interpretation" is the title of an article, we can apply the typographical choices associated with article titles.
The structure can be used to test the correctness of a document.
- We can deduce that a document is incorrect if there is no title associated with an article.

The SGML components (3)
The content itself can be divided into categories:
- Pure content appears in the document because it is instrumental to the message conveyed by the document. For instance, the text "THE RETIREMENT BENEFITS AUTHORITY". This is what we are really interested in.
- Structural content appears because it marks the beginning or the end of a structure. For instance, the text "Part II". This can be used for deducing information about the structure.
- Presentation-oriented content appears because it is dictated by the presentation choices of the document. For instance, page numbers and repeating headers. This can be safely ignored and thrown away.

The Semantic Web approach (1)
Data:
- The actual text as it was initially provided by the author of the document.
Metadata:
- Any consideration, comment or additional information that can be expressed about the content and the document.
- Metadata is generated either by human intervention or through automated processes.
Ontology (in short):
- A representation of the conceptual model that shapes all metadata associated with a document.
The semantic web approach (2)
Authors' contribution: data
- The words, punctuation and breaks, exactly as they were written and accepted by the original author (for legislation, the legislative body)
Editors' contribution: metadata
- Publication data. Lifecycle information. Footnotes. Analysis of provisions.
- Metadata is useless unless it is provided following a precise schema, called an ontology.
In a way, editors are the authors of the metadata. Put another way, metadata is information about a document that was not provided by its authors.

Markup
We call markup the additions to a written text that let applications work on the text:
- Structural markup
- Descriptive markup
- Presentation markup
With XML, we add markup to the text of a document so that further applications can work on it. XML uses a special syntax to distinguish text from markup.

XML Markup (1)
XML markup clearly distinguishes elements, text (or #PCDATA) and attributes.
An element is contained within a start tag and an end tag, which are distinguishable through angle brackets:
- <title>interpretation.</title>
The content of an element can be:
- Just text (simple text elements)
- Other elements (structural elements)
- A mix of text and other elements (mixed content elements)
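The three kinds of element content can be told apart mechanically. The sketch below uses Python's standard `xml.etree.ElementTree` parser. The sample document is hypothetical, loosely modelled on the fragments in the slides, and the helper name `content_kind` is an assumption made for the example.

```python
import xml.etree.ElementTree as ET


def content_kind(element: ET.Element) -> str:
    """Classify an element as 'empty', 'text', 'structural' or 'mixed'."""
    has_children = len(element) > 0
    # Text can sit before the first child (.text) or after any child (.tail).
    pieces = [element.text] + [child.tail for child in element]
    has_text = any(piece and piece.strip() for piece in pieces)
    if has_children and has_text:
        return "mixed"
    if has_children:
        return "structural"
    if has_text:
        return "text"
    return "empty"


# Hypothetical sample document, loosely modelled on the fragments in the slides.
SAMPLE = """
<section id="sec1">
  <title>Interpretation.</title>
  <clause id="sec1-cla1">As in <ref href="#sec2">section 2</ref> of this act.</clause>
</section>
"""

root = ET.fromstring(SAMPLE)
kinds = {element.tag: content_kind(element) for element in root.iter()}
print(kinds)
```

Whitespace used only for indentation is ignored here, so the section counts as structural even though the parser reports newlines between its children.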

XML Markup (2)
Within the start tag we can sometimes find attributes, i.e. additional information about the element:
<act contains="singleversion"> </act>
- A special attribute is href, which indicates the destination of a reference:
  As in <ref href="#sec2">section 2</ref> of this act.
- Another special attribute is id, which provides a reliable name for the element to be used in references:
  <clause id="sec1-cla1"> </clause>
  <section id="sec2"> </section>
In a way, metadata is information about the document, while the attribute is information about the element.

Naming documents and fragments
Uniform Resource Identifiers
- These are used throughout the World Wide Web to indicate resources.
- The best known are the URLs (Uniform Resource Locators), which are used to navigate the web.
Fragment Identifiers
- Within a document, one can point to a specific part of the text through the fragment identifier.
- This corresponds to an element whose attribute id is part3.

Naming documents and fragments (2)
In our case the situation is more complex.
Works, expressions and manifestations are not physical resources, but abstract entities. Yet references are rarely (or never) to items, but to those concepts.
So works, expressions and manifestations must have their own URI, which is not a URL (i.e., it does not correspond to a physical address on a computer).
The act of finding out the URL of the item that best represents the manifestation we are looking for is called URI resolution.

Markups and languages

Procedural and descriptive markup
With procedural markup we precisely indicate the task to apply to each fragment of text in order to, say, display the document. We indicate bold, italic, font name, font size, margins, etc.
Basically, the actual usage determines the markup inserted in the text.
With descriptive markup we precisely identify the structural or semantic role of each text fragment. Rather than bold or font size, we indicate aspects such as heading, caption, quotation, paragraph, reference, etc.
Basically, since structural and semantic roles are independent of usage, I fill the document with persistent information.

Structured and hierarchical markup
Markup can be used to identify and exploit structures, i.e. the organization of content in connected fragments.
It is possible to identify rules that define a concept of correctness of text. Structures can be suggested (descriptive markup) or required (prescriptive markup). Documents are correct (valid) if they adhere to the rules specified.
Some structures can be hierarchical. Legislative documents are often a hierarchy of containers. Correctly capturing the hierarchy of containment is an important characteristic of markup languages.
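To make the hierarchy of containers concrete, and to show how the `id` and `href` attributes from the XML Markup (2) slide work together, here is a minimal sketch with the standard `xml.etree.ElementTree` module. The sample act and the helper names (`index_by_id`, `resolve_fragment`, `container_path`) are assumptions made for illustration. It handles only same-document references of the form `#name` and does not attempt the URI resolution of works or expressions described earlier.

```python
import xml.etree.ElementTree as ET

# Hypothetical act; element names are illustrative, not taken from a real schema.
ACT = """
<act contains="singleversion">
  <section id="sec1">
    <clause id="sec1-cla1">As in <ref href="#sec2">section 2</ref> of this act.</clause>
  </section>
  <section id="sec2">
    <clause id="sec2-cla1">Second section.</clause>
  </section>
</act>
"""


def index_by_id(root: ET.Element) -> dict:
    """Map every id attribute value to the element that carries it."""
    return {el.get("id"): el for el in root.iter() if el.get("id") is not None}


def resolve_fragment(root: ET.Element, href: str):
    """Return the element a same-document reference such as '#sec2' points to."""
    if not href.startswith("#"):
        raise ValueError("only same-document fragment references are handled here")
    return index_by_id(root).get(href[1:])


def container_path(root: ET.Element, target: ET.Element) -> list:
    """List the tags of the containers from the root down to the target."""
    parents = {child: parent for parent in root.iter() for child in parent}
    path = [target.tag]
    while target in parents:
        target = parents[target]
        path.append(target.tag)
    return list(reversed(path))


root = ET.fromstring(ACT)
ref = root.find(".//ref")
target = resolve_fragment(root, ref.get("href"))
print(target.tag, target.get("id"))
print(container_path(root, ref))
```

The parent map is rebuilt on each call for simplicity. A larger document would build it once and reuse it.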

Markup meta-language
A meta-language is a language to define languages, a grammar to build new languages.
XML is not a markup language, but a language used to create markup languages.
XML does not provide suggestions on how to define specific aspects of a document: bold or italic or reference or paragraph. Rather, it provides a grammar with which such aspects can be defined in a new language.

Document Type Definitions and XML schema languages
The DTD or the XML Schema (XSD) are documents that describe an XML-based language. They are the necessary step between the metalanguage and the language.
A schema document contains the list of allowed elements, attributes and repeatable document fragments (entities).
A schema document further contains the set of all constraints that all elements and attributes must satisfy. Constraints are expressed in terms of presence, repeatability and order.

DTDs
DTD is the most basic validation language for XML documents.
- A W3C standard. Indeed, part of the XML language definition itself.
- Uses its own (odd) syntax
- Compact, easy to learn and manage
- Can stay with the XML document or be referred to by the XML document
- Adequately expressive on structures, less so on data content
- Universally known and used. All tools support it.

XML Schema
XML Schema is another validation language:
- Also a W3C standard, but independent of XML (and independently evolving: version 1.1 is to be standardized later this year)
- Uses XML-based syntax
- Much longer, precise, difficult to read and use
- Needs to stay outside of the XML document
- More precise both for structures and data content: you can require a date fragment to actually contain a valid date
- Also widely known, but fights against a number of competitors, among which Relax NG, an ISO standard.
- Aimed at cross-pollination between information engineering and database structuring.
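The kinds of constraints a schema expresses (presence, repeatability, order, and for XML Schema also data content) can be illustrated with a hand-written check. This is not a DTD or XSD validator, only a sketch of the idea in plain Python. The rules it enforces (exactly one `title`, placed first, and `date` elements in `YYYY-MM-DD` form) and the element names are assumptions chosen for the example.

```python
import xml.etree.ElementTree as ET
from datetime import date


def check_article(article: ET.Element) -> list:
    """Return a list of human-readable problems found in one <article>."""
    problems = []
    children = [child.tag for child in article]
    # Presence and repeatability: exactly one <title>.
    if children.count("title") != 1:
        problems.append("article must contain exactly one title")
    # Order: the title, when present, must come first.
    elif children[0] != "title":
        problems.append("title must be the first child of article")
    # Data content: a <date> must hold a real calendar date (YYYY-MM-DD).
    for node in article.iter("date"):
        try:
            date.fromisoformat((node.text or "").strip())
        except ValueError:
            problems.append(f"invalid date: {node.text!r}")
    return problems


good = ET.fromstring(
    "<article><title>Interpretation</title>"
    "<clause>In force from <date>2001-01-05</date>.</clause></article>"
)
bad = ET.fromstring(
    "<article><clause>In force from <date>2001-02-30</date>.</clause></article>"
)

print(check_article(good))
print(check_article(bad))
```

In practice these rules would be declared in a DTD or XSD and enforced by a validating parser rather than coded by hand. The point here is only what "presence, repeatability and order" and "a valid date" mean operationally.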
Displaying XML documents: XSLT
Displaying XML documents is a downstream activity: it is very easy.
XSLT (Extensible Stylesheet Language Transformations) is used to generate displayable versions of XML documents.
XSLT is very flexible, and the same XML document can use many different XSLT stylesheets for different media and with different graphical layouts and typographical characteristics.
XSLT can be used for generating both online and print versions of the same document.

Metadata and the Semantic Web
Traditional Web technologies have only dealt with display on screen (and, partially, on paper).
Metadata is information stored about documents, and can be used for proper cataloguing, classification, search and sophisticated applications.
The Semantic Web
- RDF, OWL, Ontologies, Topic Maps, etc.
- Connected initiatives to provide web applications with the capabilities to reason about, rather than just display, documents
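Going back to the XSLT slide, the idea that one descriptive XML source can feed several presentations can be sketched as follows. To keep the example free of third-party dependencies it uses plain Python functions in place of real XSLT stylesheets, so it shows the principle rather than XSLT itself. The sample section and the function names `to_html` and `to_plain_text` are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from html import escape

# Hypothetical source fragment; the markup says what things are, not how they look.
SECTION = ET.fromstring(
    "<section id='sec2'><title>Interpretation</title>"
    "<clause>First clause.</clause><clause>Second clause.</clause></section>"
)


def to_html(section: ET.Element) -> str:
    """Render a section for the web: a heading plus one paragraph per clause."""
    parts = [f"<h2>{escape(section.findtext('title', ''))}</h2>"]
    for clause in section.findall("clause"):
        parts.append(f"<p>{escape(clause.text or '')}</p>")
    return "\n".join(parts)


def to_plain_text(section: ET.Element) -> str:
    """Render the same section as numbered plain text, e.g. for print."""
    lines = [section.findtext("title", "").upper()]
    for number, clause in enumerate(section.findall("clause"), 1):
        lines.append(f"  {number}. {clause.text or ''}")
    return "\n".join(lines)


print(to_html(SECTION))
print(to_plain_text(SECTION))
```

Neither function changes the source document, which is why descriptive markup can serve new media later without being rewritten.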

Next
After the break we shall discuss:
- The syntax of DTDs
- The basic ideas of XML Schema
- The fundamental concepts of XSLT
- A few points about metadata, metadata schemas, and ontologies

Conclusions
Markup languages are necessary for enriching data with information about the usages and the applications that can use the data.
Descriptive markup is the best starting point for the creation of new markup languages.
XML is best among markup languages for several reasons:
- It is a non-proprietary, widely accepted standard
- It is structured, hierarchical, descriptive
- It allows both prescriptive and descriptive approaches
- Tools exist for all operating systems and computer architectures

End of presentation


More information

Create web pages in HTML with a text editor, following the rules of XHTML syntax and using appropriate HTML tags Create a web page that includes

Create web pages in HTML with a text editor, following the rules of XHTML syntax and using appropriate HTML tags Create a web page that includes CMPT 165 INTRODUCTION TO THE INTERNET AND THE WORLD WIDE WEB By Hassan S. Shavarani UNIT2: MARKUP AND HTML 1 IN THIS UNIT YOU WILL LEARN THE FOLLOWING Create web pages in HTML with a text editor, following

More information

Proposal for Implementing Linked Open Data on Libraries Catalogue

Proposal for Implementing Linked Open Data on Libraries Catalogue Submitted on: 16.07.2018 Proposal for Implementing Linked Open Data on Libraries Catalogue Esraa Elsayed Abdelaziz Computer Science, Arab Academy for Science and Technology, Alexandria, Egypt. E-mail address:

More information

DSD: A Schema Language for XML

DSD: A Schema Language for XML DSD: A Schema Language for XML Nils Klarlund, AT&T Labs Research Anders Møller, BRICS, Aarhus University Michael I. Schwartzbach, BRICS, Aarhus University Connections between XML and Formal Methods XML:

More information

Structured documents

Structured documents Structured documents An overview of XML Structured documents Michael Houghton 15/11/2000 Unstructured documents Broadly speaking, text and multimedia document formats can be structured or unstructured.

More information

Semistructured Content

Semistructured Content On our first day Semistructured Content 1 Structured data : database system tagged, typed well-defined semantic interpretation Semi-structured data: tagged - (HTML?) some help with semantic interpretation

More information
