XML. extensible Markup Language. ... and its usefulness for linguists

Size: px
Start display at page:

Download "XML. extensible Markup Language. ... and its usefulness for linguists"

Transcription

1 XML extensible Markup Language... and its usefulness for linguists Thomas Mayer Fachbereich Sprachwissenschaft, Universität Konstanz Seminar Computerlinguistik II (Miriam Butt) 23. Juli 2004

2 0.Outline Thomas Mayer XML 1

3 0.Outline Preliminaries Thomas Mayer XML 1

4 0.Outline Preliminaries Introduction to XML Thomas Mayer XML 1

5 0.Outline Preliminaries Introduction to XML Using XML for grammar writing (with XLE) Thomas Mayer XML 1

6 1.Preliminaries At least two types of applications can be distinguished Thomas Mayer XML 2

7 1.Preliminaries At least two types of applications can be distinguished Document-centric applications (e.g. Information Retrieval) mainly text documents with little structure (chapters, sections,... ) Thomas Mayer XML 2

8 1.Preliminaries At least two types of applications can be distinguished Document-centric applications (e.g. Information Retrieval) mainly text documents with little structure (chapters, sections,... ) Data-centric applications ( traditional database applications) very regular data; sometimes, however, we might want some flexibility Thomas Mayer XML 2

9 XML was designed to satisfy the needs of both worlds Thomas Mayer XML 3

10 XML was designed to satisfy the needs of both worlds It was designed to be simple, generic and extensible Thomas Mayer XML 3

11 XML was designed to satisfy the needs of both worlds It was designed to be simple, generic and extensible XML documents are pure ASCII files = Humans can read them Thomas Mayer XML 3

12 2.Introduction to XML Thomas Mayer XML 4

13 2.Introduction to XML 1. XML a new HTML? Thomas Mayer XML 4

14 2.Introduction to XML 1. XML a new HTML? 2. The structure of an XML document Thomas Mayer XML 4

15 2.Introduction to XML 1. XML a new HTML? 2. The structure of an XML document 3. XML data query and retrieval Thomas Mayer XML 4

16 2.1XML a new HTML?... definitely not! HTML (HyperText Markup Language) is for displaying web contents on the browser HTML is a subset of XML (HTML XML SGML) XML could be used to determine the layout of a web page, but it is much more powerful Thomas Mayer XML 5

17 2.2The structure of an XML document head <?xml version="1.0"? encoding="iso "?>... body <note> <to>tove</to> <from>jani</from> <subject>reminder</subject> <message>take a break!</message> </note> Thomas Mayer XML 6

18 The basic element of an XML document is a tag Thomas Mayer XML 7

19 The basic element of an XML document is a tag A tag usually consists of a start tag (e.g. <foo>) and its corresponding end tag (e.g. </foo>) Thomas Mayer XML 7

20 The basic element of an XML document is a tag A tag usually consists of a start tag (e.g. <foo>) and its corresponding end tag (e.g. </foo>) Exception: empty elements (e.g. <foo />) Thomas Mayer XML 7

21 The basic element of an XML document is a tag A tag usually consists of a start tag (e.g. <foo>) and its corresponding end tag (e.g. </foo>) Exception: empty elements (e.g. <foo />) Elements can have attributes, i.e. pairs of attributes and values separated by an equation sign: Thomas Mayer XML 7

22 <foo attribute="value"> Thomas Mayer XML 8

23 <foo attribute="value"> Element names may contain the following characters: letters (capital and small), numbers, underscores ( ), hyphens ( - ), dots (. ) and colons ( : ). Thomas Mayer XML 8

24 However, there are special elements... Thomas Mayer XML 9

25 However, there are special elements... Processing Instructions (PIs): information for programs that work on the XML document <?xml-stylesheet href="headlines.css" type="te Thomas Mayer XML 9

26 However, there are special elements... Processing Instructions (PIs): information for programs that work on the XML document <?xml-stylesheet href="headlines.css" type="te Special elements starting with! : Thomas Mayer XML 9

27 However, there are special elements... Processing Instructions (PIs): information for programs that work on the XML document <?xml-stylesheet href="headlines.css" type="te Special elements starting with! : <!DOCTYPE...> Thomas Mayer XML 9

28 However, there are special elements... Processing Instructions (PIs): information for programs that work on the XML document <?xml-stylesheet href="headlines.css" type="te Special elements starting with! : <!DOCTYPE...> <![CDATA[...]]> Thomas Mayer XML 9

29 However, there are special elements... Processing Instructions (PIs): information for programs that work on the XML document <?xml-stylesheet href="headlines.css" type="te Special elements starting with! : <!DOCTYPE...> <![CDATA[...]]> <! > comments as in HTML Thomas Mayer XML 9

30 The building blocks of XML documents Elements Tags Attributes PCDATA CDATA Processing Instructions Thomas Mayer XML 10

31 Comments Entities Thomas Mayer XML 11

32 PCDATA sections Thomas Mayer XML 12

33 PCDATA sections character strings between a start and an end tag <foo>... whatever... </foo> Thomas Mayer XML 12

34 PCDATA sections character strings between a start and an end tag <foo>... whatever... </foo> Escape sequences for special characters <, >, ", They begin with an ampersand and end with a semi-colon (e.g. < for <) Thomas Mayer XML 12

35 PCDATA sections character strings between a start and an end tag <foo>... whatever... </foo> Escape sequences for special characters <, >, ", They begin with an ampersand and end with a semi-colon (e.g. < for <) Thomas Mayer XML 12

36 You can define your own entities in the same way (e.g. &myentity;) Thomas Mayer XML 13

37 Well-formed vs. valid XML documents Thomas Mayer XML 14

38 Well-formed vs. valid XML documents An XML document is well-formed if it observes the rules of the XML specification (i.e. the restrictions on element names, proper nesting, etc.) Thomas Mayer XML 14

39 Well-formed vs. valid XML documents An XML document is well-formed if it observes the rules of the XML specification (i.e. the restrictions on element names, proper nesting, etc.) An XML document is valid if it is wellformed and conforms to its Document Type Definition (DTD) (either inline in the XML document or as an external reference), which specifies some rules Thomas Mayer XML 14

40 regarding the nesting of certain elements, etc. Thomas Mayer XML 15

41 regarding the nesting of certain elements, etc. What does a DTD look like? Thomas Mayer XML 15

42 Document Type Definition (DTD) Thomas Mayer XML 16

43 Document Type Definition (DTD) Internal DTD: <?xml version="1.0" encoding="iso "?> <!DOCTYPE note [... ]> Thomas Mayer XML 16

44 Document Type Definition (DTD) Internal DTD: <?xml version="1.0" encoding="iso "?> <!DOCTYPE note [... ]> External DTD: <?xml version="1.0" encoding="iso "?> Thomas Mayer XML 16

45 <!DOCTYPE note SYSTEM "note.dtd">... Thomas Mayer XML 17

46 An example DTD ("note.dtd"): <?xml version="1.0"?> <!ELEMENT note (to,from,subject,message)> <!ELEMENT to(#pcdata)> <!ELEMENT from (#PCDATA)> <!ELEMENT subject (#PCDATA)> <!ELEMENT message (#PCDATA)> Thomas Mayer XML 18

47 2.3XML related standards Thomas Mayer XML 19

48 2.3XML related standards XPath Thomas Mayer XML 19

49 2.3XML related standards XPath XQuery Thomas Mayer XML 19

50 2.3XML related standards XPath XQuery XSLT Thomas Mayer XML 19

51 XPath Thomas Mayer XML 20

52 XPath Querying XML data essentially means to identify (or address) nodes, and to test further properties of these nodes. Thomas Mayer XML 20

53 XPath Querying XML data essentially means to identify (or address) nodes, and to test further properties of these nodes. There are certain relationships between nodes (axes) Thomas Mayer XML 20

54 XPath Querying XML data essentially means to identify (or address) nodes, and to test further properties of these nodes. There are certain relationships between nodes (axes) Given the relationships preceding/following and ancestor/descendant each node Thomas Mayer XML 20

55 partitions the whole XML document into four disjoint regions: Thomas Mayer XML 21

56 Thomas Mayer XML 22

57 The syntax of XPath Thomas Mayer XML 23

58 The syntax of XPath The path expression is the core construct of XPath Thomas Mayer XML 23

59 The syntax of XPath The path expression is the core construct of XPath Each path consists of one or more steps, syntactically separated by /. s 0 /s 1 /... /s n Thomas Mayer XML 23

60 The syntax of XPath The path expression is the core construct of XPath Each path consists of one or more steps, syntactically separated by /. s 0 /s 1 /... /s n The entire XPath expression will always return a sequence of nodes (duplicate free and in document order) Thomas Mayer XML 23

61 XQuery Thomas Mayer XML 24

62 XQuery XQuery is a standard means to query XML databases Thomas Mayer XML 24

63 XQuery XQuery is a standard means to query XML databases It extends XPath by adding some powerful new operators and an elaborate type system Thomas Mayer XML 24

64 XSLT Thomas Mayer XML 25

65 XSLT XSL (extensible Stylesheet Language) consists of two important components: Thomas Mayer XML 25

66 XSLT XSL (extensible Stylesheet Language) consists of two important components: one for formatting XML data Thomas Mayer XML 25

67 XSLT XSL (extensible Stylesheet Language) consists of two important components: one for formatting XML data one for transforming XML data into other XML data Thomas Mayer XML 25

68 XSLT XSL (extensible Stylesheet Language) consists of two important components: one for formatting XML data one for transforming XML data into other XML data The second component is often abbreviated to XSLT (XSL Transformation) Thomas Mayer XML 25

69 3.Grammar writing with XML XML can be used for... Thomas Mayer XML 26

70 3.Grammar writing with XML XML can be used for... documenting grammars Thomas Mayer XML 26

71 3.Grammar writing with XML XML can be used for... documenting grammars annotating corpora Thomas Mayer XML 26

72 3.Grammar writing with XML XML can be used for... documenting grammars annotating corpora encoding grammars Thomas Mayer XML 26

73 3.Grammar writing with XML XML can be used for... documenting grammars annotating corpora encoding grammars... Thomas Mayer XML 26

74 3.1Documenting grammars Idea: The source grammar (XLE) is annotated with XML tags for documentation purposes. Together with a source documentation you can extract the information from the grammar and build a complete (output) documentation for the grammar. Thomas Mayer XML 27

75 Thomas Mayer XML 28

76 3.2Annotating corpora We already discussed that... Thomas Mayer XML 29

77 3.3Encoding grammars A rule such as NP --> D: ^=!; N: ^=!; can be encoded as... Thomas Mayer XML 30

78 <rule> <mother>np</mother> <daughters> <daughter> <c_struc>d</c_struc> <f_struc>^=!</f_struc> </daughter> <daughter> <c_struc>n</c_struc> <f_struc>^=!</f_struc> </daughter> </daughters> </rule> Thomas Mayer XML 31

79 Several output files could be generated Thomas Mayer XML 32

80 Several output files could be generated grammar code Thomas Mayer XML 32

81 Several output files could be generated grammar code pdf-file with tree structure Thomas Mayer XML 32

82 Several output files could be generated grammar code pdf-file with tree structure html-file with links to the rules Thomas Mayer XML 32

83 Have a nice term break!

M359 Block5 - Lecture12 Eng/ Waleed Omar

M359 Block5 - Lecture12 Eng/ Waleed Omar Documents and markup languages The term XML stands for extensible Markup Language. Used to label the different parts of documents. Labeling helps in: Displaying the documents in a formatted way Querying

More information

Introduction to XML. An Example XML Document. The following is a very simple XML document.

Introduction to XML. An Example XML Document. The following is a very simple XML document. Introduction to XML Extensible Markup Language (XML) was standardized in 1998 after 2 years of work. However, it developed out of SGML (Standard Generalized Markup Language), a product of the 1970s and

More information

XML: Introduction. !important Declaration... 9:11 #FIXED... 7:5 #IMPLIED... 7:5 #REQUIRED... Directive... 9:11

XML: Introduction. !important Declaration... 9:11 #FIXED... 7:5 #IMPLIED... 7:5 #REQUIRED... Directive... 9:11 !important Declaration... 9:11 #FIXED... 7:5 #IMPLIED... 7:5 #REQUIRED... 7:4 @import Directive... 9:11 A Absolute Units of Length... 9:14 Addressing the First Line... 9:6 Assigning Meaning to XML Tags...

More information

XML. Marie Dubremetz Uppsala, April 2014

XML. Marie Dubremetz Uppsala, April 2014 XML Marie Dubremetz marie.dubremetz@lingfil.uu.se Uppsala, April 2014 Presentation Plan 1 Introduction 2 XML Specificities and Motivations 3 XML: Vocabulary and Techniques Uppsala May 2015 2/37 Table of

More information

Data Presentation and Markup Languages

Data Presentation and Markup Languages Data Presentation and Markup Languages MIE456 Tutorial Acknowledgements Some contents of this presentation are borrowed from a tutorial given at VLDB 2000, Cairo, Agypte (www.vldb.org) by D. Florescu &.

More information

Web Services Part I. XML Web Services. Instructor: Dr. Wei Ding Fall 2009

Web Services Part I. XML Web Services. Instructor: Dr. Wei Ding Fall 2009 Web Services Part I Instructor: Dr. Wei Ding Fall 2009 CS 437/637 Database-Backed Web Sites and Web Services 1 XML Web Services XML Web Services = Web Services A Web service is a different kind of Web

More information

XML: Extensible Markup Language

XML: Extensible Markup Language XML: Extensible Markup Language CSC 375, Fall 2015 XML is a classic political compromise: it balances the needs of man and machine by being equally unreadable to both. Matthew Might Slides slightly modified

More information

What is XML? XML is designed to transport and store data.

What is XML? XML is designed to transport and store data. What is XML? XML stands for extensible Markup Language. XML is designed to transport and store data. HTML was designed to display data. XML is a markup language much like HTML XML was designed to carry

More information

7.1 Introduction. extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML

7.1 Introduction. extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML 7.1 Introduction extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML Lax syntactical rules Many complex features that are rarely used HTML is a markup language,

More information

Copyright 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 7 XML

Copyright 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 7 XML Chapter 7 XML 7.1 Introduction extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML Lax syntactical rules Many complex features that are rarely used HTML

More information

XML. Objectives. Duration. Audience. Pre-Requisites

XML. Objectives. Duration. Audience. Pre-Requisites XML XML - extensible Markup Language is a family of standardized data formats. XML is used for data transmission and storage. Common applications of XML include business to business transactions, web services

More information

markup language carry data define your own tags self-descriptive W3C Recommendation

markup language carry data define your own tags self-descriptive W3C Recommendation XML intro What is XML? XML stands for EXtensible Markup Language XML is a markup language much like HTML XML was designed to carry data, not to display data XML tags are not predefined. You must define

More information

XML. Presented by : Guerreiro João Thanh Truong Cong

XML. Presented by : Guerreiro João Thanh Truong Cong XML Presented by : Guerreiro João Thanh Truong Cong XML : Definitions XML = Extensible Markup Language. Other Markup Language : HTML. XML HTML XML describes a Markup Language. XML is a Meta-Language. Users

More information

Chapter 1: Getting Started. You will learn:

Chapter 1: Getting Started. You will learn: Chapter 1: Getting Started SGML and SGML document components. What XML is. XML as compared to SGML and HTML. XML format. XML specifications. XML architecture. Data structure namespaces. Data delivery,

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411 1 Extensible

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1 Slide 27-1 Chapter 27 XML: Extensible Markup Language Chapter Outline Introduction Structured, Semi structured, and Unstructured Data. XML Hierarchical (Tree) Data Model. XML Documents, DTD, and XML Schema.

More information

XML. Jonathan Geisler. April 18, 2008

XML. Jonathan Geisler. April 18, 2008 April 18, 2008 What is? IS... What is? IS... Text (portable) What is? IS... Text (portable) Markup (human readable) What is? IS... Text (portable) Markup (human readable) Extensible (valuable for future)

More information

W3C XML XML Overview

W3C XML XML Overview Overview Jaroslav Porubän 2008 References Tutorials, http://www.w3schools.com Specifications, World Wide Web Consortium, http://www.w3.org David Hunter, et al.: Beginning, 4th Edition, Wrox, 2007, 1080

More information

Markup Languages SGML, HTML, XML, XHTML. CS 431 February 13, 2006 Carl Lagoze Cornell University

Markup Languages SGML, HTML, XML, XHTML. CS 431 February 13, 2006 Carl Lagoze Cornell University Markup Languages SGML, HTML, XML, XHTML CS 431 February 13, 2006 Carl Lagoze Cornell University Problem Richness of text Elements: letters, numbers, symbols, case Structure: words, sentences, paragraphs,

More information

Introduction to XML. XML: basic elements

Introduction to XML. XML: basic elements Introduction to XML XML: basic elements XML Trying to wrap your brain around XML is sort of like trying to put an octopus in a bottle. Every time you think you have it under control, a new tentacle shows

More information

Author: Irena Holubová Lecturer: Martin Svoboda

Author: Irena Holubová Lecturer: Martin Svoboda NPRG036 XML Technologies Lecture 1 Introduction, XML, DTD 19. 2. 2018 Author: Irena Holubová Lecturer: Martin Svoboda http://www.ksi.mff.cuni.cz/~svoboda/courses/172-nprg036/ Lecture Outline Introduction

More information

Introduction Syntax and Usage XML Databases Java Tutorial XML. November 5, 2008 XML

Introduction Syntax and Usage XML Databases Java Tutorial XML. November 5, 2008 XML Introduction Syntax and Usage Databases Java Tutorial November 5, 2008 Introduction Syntax and Usage Databases Java Tutorial Outline 1 Introduction 2 Syntax and Usage Syntax Well Formed and Valid Displaying

More information

Part 2: XML and Data Management Chapter 6: Overview of XML

Part 2: XML and Data Management Chapter 6: Overview of XML Part 2: XML and Data Management Chapter 6: Overview of XML Prof. Dr. Stefan Böttcher 6. Overview of the XML standards: XML, DTD, XML Schema 7. Navigation in XML documents: XML axes, DOM, SAX, XPath, Tree

More information

The XML Metalanguage

The XML Metalanguage The XML Metalanguage Mika Raento mika.raento@cs.helsinki.fi University of Helsinki Department of Computer Science Mika Raento The XML Metalanguage p.1/442 2003-09-15 Preliminaries Mika Raento The XML Metalanguage

More information

CSS, Cascading Style Sheets

CSS, Cascading Style Sheets CSS, Cascading Style Sheets HTML was intended to define the content of a document This is a heading This is a paragraph This is a table element Not how they look (aka style)

More information

XML. extensible Markup Language. Overview. Overview. Overview XML Components Document Type Definition (DTD) Attributes and Tags An XML schema

XML. extensible Markup Language. Overview. Overview. Overview XML Components Document Type Definition (DTD) Attributes and Tags An XML schema XML extensible Markup Language An introduction in XML and parsing XML Overview XML Components Document Type Definition (DTD) Attributes and Tags An XML schema 3011 Compiler Construction 2 Overview Overview

More information

EXtensible Markup Language XML

EXtensible Markup Language XML EXtensible Markup Language XML 1 What is XML? XML stands for EXtensible Markup Language XML is a markup language much like HTML XML was designed to carry data, not to display data XML tags are not predefined.

More information

Chapter 13 XML: Extensible Markup Language

Chapter 13 XML: Extensible Markup Language Chapter 13 XML: Extensible Markup Language - Internet applications provide Web interfaces to databases (data sources) - Three-tier architecture Client V Application Programs Webserver V Database Server

More information

Overview. Introduction. Introduction XML XML. Lecture 16 Introduction to XML. Boriana Koleva Room: C54

Overview. Introduction. Introduction XML XML. Lecture 16 Introduction to XML. Boriana Koleva Room: C54 Overview Lecture 16 Introduction to XML Boriana Koleva Room: C54 Email: bnk@cs.nott.ac.uk Introduction The Syntax of XML XML Document Structure Document Type Definitions Introduction Introduction SGML

More information

CSI 3140 WWW Structures, Techniques and Standards. Representing Web Data: XML

CSI 3140 WWW Structures, Techniques and Standards. Representing Web Data: XML CSI 3140 WWW Structures, Techniques and Standards Representing Web Data: XML XML Example XML document: An XML document is one that follows certain syntax rules (most of which we followed for XHTML) Guy-Vincent

More information

COMP9321 Web Application Engineering. Extensible Markup Language (XML)

COMP9321 Web Application Engineering. Extensible Markup Language (XML) COMP9321 Web Application Engineering Extensible Markup Language (XML) Dr. Basem Suleiman Service Oriented Computing Group, CSE, UNSW Australia Semester 1, 2016, Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2442

More information

Introduction to XML Zdeněk Žabokrtský, Rudolf Rosa

Introduction to XML Zdeněk Žabokrtský, Rudolf Rosa NPFL092 Technology for Natural Language Processing Introduction to XML Zdeněk Žabokrtský, Rudolf Rosa November 28, 2018 Charles Univeristy in Prague Faculty of Mathematics and Physics Institute of Formal

More information

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward Comp 336/436 - Markup Languages Fall Semester 2017 - Week 4 Dr Nick Hayward XML - recap first version of XML became a W3C Recommendation in 1998 a useful format for data storage and exchange config files,

More information

TASC Consulting Technical Writing Courseware Training

TASC Consulting Technical Writing Courseware Training Understanding XML Aruna Panangipally TASC Consulting Technical Writing Courseware Training Session Outline Why should a technical writer know XML? The Beginning Understanding markup languages Origins of

More information

Delivery Options: Attend face-to-face in the classroom or remote-live attendance.

Delivery Options: Attend face-to-face in the classroom or remote-live attendance. XML Programming Duration: 5 Days Price: $2795 *California residents and government employees call for pricing. Discounts: We offer multiple discount options. Click here for more info. Delivery Options:

More information

Well-formed XML Documents

Well-formed XML Documents Well-formed XML Documents Asst. Prof. Dr. Kanda Runapongsa Saikaew (krunapon@kku.ac.th) Dept. of Computer Engineering Khon Kaen University 1 Agenda Types of XML documents Why Well-formed XML Documents

More information

XML databases. Jan Chomicki. University at Buffalo. Jan Chomicki (University at Buffalo) XML databases 1 / 9

XML databases. Jan Chomicki. University at Buffalo. Jan Chomicki (University at Buffalo) XML databases 1 / 9 XML databases Jan Chomicki University at Buffalo Jan Chomicki (University at Buffalo) XML databases 1 / 9 Outline 1 XML data model 2 XPath 3 XQuery Jan Chomicki (University at Buffalo) XML databases 2

More information

Delivery Options: Attend face-to-face in the classroom or via remote-live attendance.

Delivery Options: Attend face-to-face in the classroom or via remote-live attendance. XML Programming Duration: 5 Days US Price: $2795 UK Price: 1,995 *Prices are subject to VAT CA Price: CDN$3,275 *Prices are subject to GST/HST Delivery Options: Attend face-to-face in the classroom or

More information

Solutions. a. Yes b. No c. Cannot be determined without the DTD. d. Schema. 9. Explain the term extensible. 10. What is an attribute?

Solutions. a. Yes b. No c. Cannot be determined without the DTD. d. Schema. 9. Explain the term extensible. 10. What is an attribute? Chapter 7: Information Representation Method XML Solutions Summative Assessment Multiple-Choice Questions (MCQs) 1. XML was developed to overcome the limitations of the markup language. a. EDI b. SGML

More information

Structured documents

Structured documents Structured documents An overview of XML Structured documents Michael Houghton 15/11/2000 Unstructured documents Broadly speaking, text and multimedia document formats can be structured or unstructured.

More information

Introduction to XML. M2 MIA, Grenoble Université. François Faure

Introduction to XML. M2 MIA, Grenoble Université. François Faure M2 MIA, Grenoble Université Example tove jani reminder dont forget me this weekend!

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 2, 2017 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid= 2465 1

More information

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward Comp 336/436 - Markup Languages Fall Semester 2018 - Week 4 Dr Nick Hayward XML - recap first version of XML became a W3C Recommendation in 1998 a useful format for data storage and exchange config files,

More information

.. Cal Poly CPE/CSC 366: Database Modeling, Design and Implementation Alexander Dekhtyar..

.. Cal Poly CPE/CSC 366: Database Modeling, Design and Implementation Alexander Dekhtyar.. .. Cal Poly CPE/CSC 366: Database Modeling, Design and Implementation Alexander Dekhtyar.. XML in a Nutshell XML, extended Markup Language is a collection of rules for universal markup of data. Brief History

More information

Chapter 7: XML Namespaces

Chapter 7: XML Namespaces 7. XML Namespaces 7-1 Chapter 7: XML Namespaces References: Tim Bray, Dave Hollander, Andrew Layman: Namespaces in XML. W3C Recommendation, World Wide Web Consortium, Jan 14, 1999. [http://www.w3.org/tr/1999/rec-xml-names-19990114],

More information

Semantic Web. XML and XML Schema. Morteza Amini. Sharif University of Technology Fall 94-95

Semantic Web. XML and XML Schema. Morteza Amini. Sharif University of Technology Fall 94-95 ه عا ی Semantic Web XML and XML Schema Morteza Amini Sharif University of Technology Fall 94-95 Outline Markup Languages XML Building Blocks XML Applications Namespaces XML Schema 2 Outline Markup Languages

More information

Chapter 1: Semistructured Data Management XML

Chapter 1: Semistructured Data Management XML Chapter 1: Semistructured Data Management XML XML - 1 The Web has generated a new class of data models, which are generally summarized under the notion semi-structured data models. The reasons for that

More information

but XML goes far beyond HTML: it describes data

but XML goes far beyond HTML: it describes data The XML Meta-Language 1 Introduction to XML The father of markup languages: XML = EXtensible Markup Language is a simplified version of SGML Originally created to overcome the limitations of HTML the HTML

More information

Digital Asset Management 2. Introduction to Digital Media Format

Digital Asset Management 2. Introduction to Digital Media Format Digital Asset Management 2. Introduction to Digital Media Format 2009-09-24 Outline Image format and coding methods Audio format and coding methods Video format and coding methods Introduction to HTML

More information

02 Structured Web. Semantic Web. Documents in XML

02 Structured Web. Semantic Web. Documents in XML Semantic Web 02 Structured Web Documents in XML Imran Ihsan Assistant Professor, Department of Computer Science Air University, Islamabad, Pakistan www.imranihsan.com Role of XML in the Semantic Web Most

More information

XML: Managing with the Java Platform

XML: Managing with the Java Platform In order to learn which questions have been answered correctly: 1. Print these pages. 2. Answer the questions. 3. Send this assessment with the answers via: a. FAX to (212) 967-3498. Or b. Mail the answers

More information

SDPL : XML Basics 2. SDPL : XML Basics 1. SDPL : XML Basics 4. SDPL : XML Basics 3. SDPL : XML Basics 5

SDPL : XML Basics 2. SDPL : XML Basics 1. SDPL : XML Basics 4. SDPL : XML Basics 3. SDPL : XML Basics 5 2 Basics of XML and XML documents 2.1 XML and XML documents Survivor's Guide to XML, or XML for Computer Scientists / Dummies 2.1 XML and XML documents 2.2 Basics of XML DTDs 2.3 XML Namespaces XML 1.0

More information

XML Introduction 1. XML Stands for EXtensible Mark-up Language (XML). 2. SGML Electronic Publishing challenges -1986 3. HTML Web Presentation challenges -1991 4. XML Data Representation challenges -1996

More information

Chapter 1: Semistructured Data Management XML

Chapter 1: Semistructured Data Management XML Chapter 1: Semistructured Data Management XML 2006/7, Karl Aberer, EPFL-IC, Laboratoire de systèmes d'informations répartis XML - 1 The Web has generated a new class of data models, which are generally

More information

XML. COSC Dr. Ramon Lawrence. An attribute is a name-value pair declared inside an element. Comments. Page 3. COSC Dr.

XML. COSC Dr. Ramon Lawrence. An attribute is a name-value pair declared inside an element. Comments. Page 3. COSC Dr. COSC 304 Introduction to Database Systems XML Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca XML Extensible Markup Language (XML) is a markup language that allows for

More information

XML 2 APPLICATION. Chapter SYS-ED/ COMPUTER EDUCATION TECHNIQUES, INC.

XML 2 APPLICATION. Chapter SYS-ED/ COMPUTER EDUCATION TECHNIQUES, INC. XML 2 APPLIATION hapter SYS-ED/ OMPUTER EDUATION TEHNIQUES, IN. Objectives You will learn: How to create an XML document. The role of the document map, prolog, and XML declarations. Standalone declarations.

More information

Using XML Learning Objects Current and Future Benefits for Web Based Training Scenarios

Using XML Learning Objects Current and Future Benefits for Web Based Training Scenarios Using XML Learning Objects Current and Future Benefits for Web Based Training Scenarios Maik Stührenberg Applied and Computational Linguistics Group Justus-Liebig-Universität Gießen maik.stuehrenberg@uni-giessen.de

More information

The concept of DTD. DTD(Document Type Definition) Why we need DTD

The concept of DTD. DTD(Document Type Definition) Why we need DTD Contents Topics The concept of DTD Why we need DTD The basic grammar of DTD The practice which apply DTD in XML document How to write DTD for valid XML document The concept of DTD DTD(Document Type Definition)

More information

웹기술및응용. XML Basics 2018 년 2 학기. Instructor: Prof. Young-guk Ha Dept. of Computer Science & Engineering

웹기술및응용. XML Basics 2018 년 2 학기. Instructor: Prof. Young-guk Ha Dept. of Computer Science & Engineering 웹기술및응용 XML Basics 2018 년 2 학기 Instructor: Prof. Young-guk Ha Dept. of Computer Science & Engineering 목차 q Introduction to XML q XML Document Structure and Basic Syntax 2 Introduction to XML XML (extensible

More information

Introduction to Database Systems CSE 414

Introduction to Database Systems CSE 414 Introduction to Database Systems CSE 414 Lecture 14-15: XML CSE 414 - Spring 2013 1 Announcements Homework 4 solution will be posted tomorrow Midterm: Monday in class Open books, no notes beyond one hand-written

More information

XML Structures. Web Programming. Uta Priss ZELL, Ostfalia University. XML Introduction Syntax: well-formed Semantics: validity Issues

XML Structures. Web Programming. Uta Priss ZELL, Ostfalia University. XML Introduction Syntax: well-formed Semantics: validity Issues XML Structures Web Programming Uta Priss ZELL, Ostfalia University 2013 Web Programming XML1 Slide 1/32 Outline XML Introduction Syntax: well-formed Semantics: validity Issues Web Programming XML1 Slide

More information

Introduction to Database Systems CSE 444

Introduction to Database Systems CSE 444 Introduction to Database Systems CSE 444 Lecture 25: XML 1 XML Outline XML Syntax Semistructured data DTDs XPath Coverage of XML is much better in new edition Readings Sections 11.1 11.3 and 12.1 [Subset

More information

XML. Rodrigo García Carmona Universidad San Pablo-CEU Escuela Politécnica Superior

XML. Rodrigo García Carmona Universidad San Pablo-CEU Escuela Politécnica Superior XML Rodrigo García Carmona Universidad San Pablo-CEU Escuela Politécnica Superior XML INTRODUCTION 2 THE XML LANGUAGE XML: Extensible Markup Language Standard for the presentation and transmission of information.

More information

Enhanced XML Retrieval with Flexible Constraints Evaluation

Enhanced XML Retrieval with Flexible Constraints Evaluation University of Milano Bicocca Department of Informatics, Systems and Communication (DISCo) Enhanced XML Retrieval with Flexible Constraints Evaluation Ph.D dissertation of Emanuele Panzeri Supervisor: Prof.

More information

XML and DTD. Mario Alviano A.Y. 2017/2018. University of Calabria, Italy 1 / 28

XML and DTD. Mario Alviano A.Y. 2017/2018. University of Calabria, Italy 1 / 28 1 / 28 XML and DTD Mario Alviano University of Calabria, Italy A.Y. 2017/2018 Outline 2 / 28 1 Introduction 2 XML syntax 3 Namespace 4 Document Type Definition (DTD) 5 Exercises Outline 3 / 28 1 Introduction

More information

XML & Related Languages

XML & Related Languages Unit 1 1 Introduction: XML XML = extensible Markup Language the universal format for structured documents and data on the Web. www.w3c.org/xml simple, very flexible text format derived from SGML (ISO 8879).

More information

CSC Web Technologies, Spring Web Data Exchange Formats

CSC Web Technologies, Spring Web Data Exchange Formats CSC 342 - Web Technologies, Spring 2017 Web Data Exchange Formats Web Data Exchange Data exchange is the process of transforming structured data from one format to another to facilitate data sharing between

More information

Appendix H XML Quick Reference

Appendix H XML Quick Reference HTML Appendix H XML Quick Reference What Is XML? Extensible Markup Language (XML) is a subset of the Standard Generalized Markup Language (SGML). XML allows developers to create their own document elements

More information

Semistructured data, XML, DTDs

Semistructured data, XML, DTDs Semistructured data, XML, DTDs Introduction to Databases Manos Papagelis Thanks to Ryan Johnson, John Mylopoulos, Arnold Rosenbloom and Renee Miller for material in these slides Structured vs. unstructured

More information

Additional Readings on XPath/XQuery Main source on XML, but hard to read:

Additional Readings on XPath/XQuery Main source on XML, but hard to read: Introduction to Database Systems CSE 444 Lecture 10 XML XML (4.6, 4.7) Syntax Semistructured data DTDs XML Outline April 21, 2008 1 2 Further Readings on XML Additional Readings on XPath/XQuery Main source

More information

Introduction to Data Management CSE 344

Introduction to Data Management CSE 344 Introduction to Data Management CSE 344 Lecture 11: XML and XPath 1 XML Outline What is XML? Syntax Semistructured data DTDs XPath 2 What is XML? Stands for extensible Markup Language 1. Advanced, self-describing

More information

Chapter 1: XML Syntax

Chapter 1: XML Syntax 1. XML Syntax 1-1 Chapter 1: XML Syntax References: Boc DuCharme: XML The Annotated Specification. Prentice Hall, 1999. Tim Bray, Jean Paoli, C.M. Sperberg-McQueen: Extensible Markup Language (XML) 1.0,

More information

- XML. - DTDs - XML Schema - XSLT. Web Services. - Well-formedness is a REQUIRED check on XML documents

- XML. - DTDs - XML Schema - XSLT. Web Services. - Well-formedness is a REQUIRED check on XML documents Purpose of this day Introduction to XML for parliamentary documents (and all other kinds of documents, actually) Prof. Fabio Vitali University of Bologna Introduce the principal aspects of electronic management

More information

CountryData Technologies for Data Exchange. Introduction to XML

CountryData Technologies for Data Exchange. Introduction to XML CountryData Technologies for Data Exchange Introduction to XML What is XML? EXtensible Markup Language Format is similar to HTML, but XML deals with data structures, while HTML is about presentation Open

More information

Chapter 10: Understanding the Standards

Chapter 10: Understanding the Standards Disclaimer: All words, pictures are adopted from Learning Web Design (3 rd eds.) by Jennifer Niederst Robbins, published by O Reilly 2007. Chapter 10: Understanding the Standards CSc2320 In this chapter

More information

Bioinforma)cs Resources XML / Web Access

Bioinforma)cs Resources XML / Web Access Bioinforma)cs Resources XML / Web Access Lecture & Exercises Prof. B. Rost, Dr. L. Richter, J. Reeb Ins)tut für Informa)k I12 XML Infusion (in 10 sec) compila)on from hkp://www.w3schools.com/xml/default.asp

More information

Marker s feedback version

Marker s feedback version Two hours Special instructions: This paper will be taken on-line and this is the paper format which will be available as a back-up UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE Semi-structured Data

More information

10/24/12. What We Have Learned So Far. XML Outline. Where We are Going Next. XML vs Relational. What is XML? Introduction to Data Management CSE 344

10/24/12. What We Have Learned So Far. XML Outline. Where We are Going Next. XML vs Relational. What is XML? Introduction to Data Management CSE 344 What We Have Learned So Far Introduction to Data Management CSE 344 Lecture 12: XML and XPath A LOT about the relational model Hand s on experience using a relational DBMS From basic to pretty advanced

More information

Introduction to XML. Chapter 133

Introduction to XML. Chapter 133 Chapter 133 Introduction to XML A. Multiple choice questions: 1. Attributes in XML should be enclosed within. a. single quotes b. double quotes c. both a and b d. none of these c. both a and b 2. Which

More information

XML, DTD, and XPath. Announcements. From HTML to XML (extensible Markup Language) CPS 116 Introduction to Database Systems. Midterm has been graded

XML, DTD, and XPath. Announcements. From HTML to XML (extensible Markup Language) CPS 116 Introduction to Database Systems. Midterm has been graded XML, DTD, and XPath CPS 116 Introduction to Database Systems Announcements 2 Midterm has been graded Graded exams available in my office Grades posted on Blackboard Sample solution and score distribution

More information

EMERGING TECHNOLOGIES. XML Documents and Schemas for XML documents

EMERGING TECHNOLOGIES. XML Documents and Schemas for XML documents EMERGING TECHNOLOGIES XML Documents and Schemas for XML documents Outline 1. Introduction 2. Structure of XML data 3. XML Document Schema 3.1. Document Type Definition (DTD) 3.2. XMLSchema 4. Data Model

More information

PART. Oracle and the XML Standards

PART. Oracle and the XML Standards PART I Oracle and the XML Standards CHAPTER 1 Introducing XML 4 Oracle Database 10g XML & SQL E xtensible Markup Language (XML) is a meta-markup language, meaning that the language, as specified by the

More information

extensible Markup Language

extensible Markup Language extensible Markup Language XML is rapidly becoming a widespread method of creating, controlling and managing data on the Web. XML Orientation XML is a method for putting structured data in a text file.

More information

Outline. XML vs. HTML and Well Formed vs. Valid. XML Overview. CSC309 Tutorial --XML 4. Edward Xia

Outline. XML vs. HTML and Well Formed vs. Valid. XML Overview. CSC309 Tutorial --XML 4. Edward Xia CSC309 Tutorial XML Edward Xia November 7, 2003 Outline XML Overview XML DOCTYPE Element Declarations Attribute List Declarations Entity Declarations CDATA Stylesheet PI XML Namespaces A Complete Example

More information

Introduction to Database Systems CSE 414

Introduction to Database Systems CSE 414 Introduction to Database Systems CSE 414 Lecture 13: XML and XPath 1 Announcements Current assignments: Web quiz 4 due tonight, 11 pm Homework 4 due Wednesday night, 11 pm Midterm: next Monday, May 4,

More information

General introduction to the TEI, markup, and XML

General introduction to the TEI, markup, and XML General introduction to the TEI, markup, and XML TEI@Oxford July 2009 In which we are introduced Aims of Course 1. Examine the concept of markup and XML encoding 2. Provide hands-on experience in using

More information

XML in Databases. Albrecht Schmidt. al. Albrecht Schmidt, Aalborg University 1

XML in Databases. Albrecht Schmidt.   al. Albrecht Schmidt, Aalborg University 1 XML in Databases Albrecht Schmidt al@cs.auc.dk http://www.cs.auc.dk/ al Albrecht Schmidt, Aalborg University 1 What is XML? (1) Where is the Life we have lost in living? Where is the wisdom we have lost

More information

[MS-XML]: Microsoft Extensible Markup Language (XML) 1.0 Fourth Edition Standards Support Document

[MS-XML]: Microsoft Extensible Markup Language (XML) 1.0 Fourth Edition Standards Support Document [MS-XML]: Microsoft Extensible Markup Language (XML) 1.0 Fourth Edition Standards Support Document Intellectual Property Rights Notice for Open Specifications Documentation Technical Documentation. Microsoft

More information

Part II: Semistructured Data

Part II: Semistructured Data Inf1-DA 2011 2012 II: 22 / 119 Part II Semistructured Data XML: II.1 Semistructured data, XPath and XML II.2 Structuring XML II.3 Navigating XML using XPath Corpora: II.4 Introduction to corpora II.5 Querying

More information

Using Attribute Grammars to Uniformly Represent Structured Documents - Application to Information Retrieval

Using Attribute Grammars to Uniformly Represent Structured Documents - Application to Information Retrieval Using Attribute Grammars to Uniformly Represent Structured Documents - Application to Information Retrieval Alda Lopes Gançarski Pierre et Marie Curie University, Laboratoire d Informatique de Paris 6,

More information

Data Exchange. Hyper-Text Markup Language. Contents: HTML Sample. HTML Motivation. Cascading Style Sheets (CSS) Problems w/html

Data Exchange. Hyper-Text Markup Language. Contents: HTML Sample. HTML Motivation. Cascading Style Sheets (CSS) Problems w/html Data Exchange Contents: Mariano Cilia / cilia@informatik.tu-darmstadt.de Origins (HTML) Schema DOM, SAX Semantic Data Exchange Integration Problems MIX Model 1 Hyper-Text Markup Language HTML Hypertext:

More information

Mulberry Classes Guide to Using the Oxygen XML Editor (v20.0)

Mulberry Classes Guide to Using the Oxygen XML Editor (v20.0) Mulberry Classes Guide to Using the Oxygen XML Editor (v20.0) Mulberry Technologies, Inc. 17 West Jefferson Street, Suite 207 Rockville, MD 20850 Phone: 301/315-9631 Fax: 301/315-8285 info@mulberrytech.com

More information

Web scraping and crawling, open data, markup languages and data shaping. Paolo Boldi Dipartimento di Informatica Università degli Studi di Milano

Web scraping and crawling, open data, markup languages and data shaping. Paolo Boldi Dipartimento di Informatica Università degli Studi di Milano Web scraping and crawling, open data, markup languages and data shaping Paolo Boldi Dipartimento di Informatica Università degli Studi di Milano Data Analysis Three steps Data Analysis Three steps In every

More information

Introduction to Semistructured Data and XML. Overview. How the Web is Today. Based on slides by Dan Suciu University of Washington

Introduction to Semistructured Data and XML. Overview. How the Web is Today. Based on slides by Dan Suciu University of Washington Introduction to Semistructured Data and XML Based on slides by Dan Suciu University of Washington CS330 Lecture April 8, 2003 1 Overview From HTML to XML DTDs Querying XML: XPath Transforming XML: XSLT

More information

2009 Martin v. Löwis. Data-centric XML. XML Syntax

2009 Martin v. Löwis. Data-centric XML. XML Syntax Data-centric XML XML Syntax 2 What Is XML? Extensible Markup Language Derived from SGML (Standard Generalized Markup Language) Two goals: large-scale electronic publishing exchange of wide variety of data

More information

EXtensible Markup Language XML

EXtensible Markup Language XML EXtensible Markup Language XML Main source: W3C School tutorials 1 Mark-up Languages A way of describing information in a document. Standard Generalized Mark-Up Language (SGML) - a specification for a

More information

INTERNET PROGRAMMING XML

INTERNET PROGRAMMING XML INTERNET PROGRAMMING XML Software Engineering Branch / 4 th Class Computer Engineering Department University of Technology OUTLINES XML Basic XML Advanced 2 HTML & CSS & JAVASCRIPT & XML DOCUMENTS HTML

More information

x ide xml Integrated Development Environment Specifications Document 1 Project Description 2 Specifi fications

x ide xml Integrated Development Environment Specifications Document 1 Project Description 2 Specifi fications x ide xml Integrated Development Environment Specifications Document Colin Hartnett (cphartne) 7 February 2003 1 Project Description There exist many integrated development environments that make large

More information

Databases and Internet Applications

Databases and Internet Applications Databases and Internet Applications Part 1 Chapter 7.1-7.5 Comp 521 Files and Databases Fall 2016 1 Uniform Resource Identifiers Uniform naming schema to identify resources on the Internet A resource can

More information

Comp 336/436 - Markup Languages. Fall Semester Week 2. Dr Nick Hayward

Comp 336/436 - Markup Languages. Fall Semester Week 2. Dr Nick Hayward Comp 336/436 - Markup Languages Fall Semester 2017 - Week 2 Dr Nick Hayward Digitisation - textual considerations comparable concerns with music in textual digitisation density of data is still a concern

More information