Introduction to Semistructured Data and XML. Contents


Contents
  Overview
  What is XML?
  How the Web is Today
  New Universal Data Exchange Format: XML
  What is the W3C?
  Semistructured Data
  What is Self-describing Data?
  The Semistructured Data Model
  Characteristics of Semistructured Data
  Conversion from XML to Objects
  Conversion from Objects to XML
  XML's origin is document processing, not databases
  From HTML to XML
  HTML
  XML
  Is XML a Database?
  Databases and XML
  The extensible Markup Language (XML)
  Markup Languages
  A brief history of markup
  XML is ...
  XML Features and motivations
  XML features
  XML: Motivation
  XML Structure
  Rules for Well-Formed XML
  Motivation for Nesting
  Structure of XML
  Attributes Vs. Subelements
  Namespaces
  XML: A Simple Example
  XML Data Model - A Tree
  XML Document Schema
  Document Type Definition (DTD)
  Element Specification in DTD
  Attribute specification in DTD: IDs and IDREFs
  Limitations of DTDs
  XML Processing: The XML Parser
  Well-Formed XML Documents
  Important XML Standards
  XML Terminology Summary

Introduction to Semistructured Data and XML

Sources
  o Database System Concepts, Silberschatz
  o Database Management Systems, R. Ramakrishnan
  o Some slides by Dan Suciu, University of Washington

Overview
  o What is XML?
  o Semistructured data
  o HTML vs. XML
  o XML Terminology
  o Namespace
  o DTD
  o XML Schema

What is XML?
XML stands for EXtensible Markup Language.
XML is a markup language much like HTML.
XML was designed to describe data.
XML tags are not predefined in XML. You must define your own tags.
XML is self-describing.
XML uses a DTD (Document Type Definition) or a schema to formally describe the data.

XML is a standard for data exchange.
All major database products have been retrofitted with facilities to store and construct XML documents.
There are already database products designed specifically to work with XML documents rather than relational or object-oriented data.
XML is closely related to object-oriented and so-called semistructured data.

XML can be used to exchange data
In the real world, computer systems and databases contain data in incompatible formats. One of the most time-consuming challenges for developers has been exchanging data between such systems over the Internet. Converting the data to XML can greatly reduce this complexity and create data that can be read by different types of applications.

XML is suitable for semistructured data and has become a standard:
  o Easy to describe object-like data
  o Self-describing
  o Doesn't require a schema (but one can be provided optionally)

XML-related documents and languages:
  o DTDs: an older way to specify a schema
  o XML Schema: a newer, more powerful (and much more complex!) way of specifying a schema

  o Query and transformation languages: XPath, XSLT, XQuery

How the Web is Today
HTML documents
  o often generated by applications
  o consumed by humans only
  o easy access: across platforms, across organizations
No application interoperability:
  o HTML is not understood by applications

New Universal Data Exchange Format: XML
A recommendation from the W3C
XML = data
XML generated by applications
XML consumed by applications
Easy access: across platforms, organizations

What is the W3C?
Group of member organizations (more than 400)
  o Hosted by MIT, INRIA, Keio University
  o 50 full-time staff members

Posts specifications for the Web
  o Notes: submitted by a member, made public for comments, no endorsement yet
  o Working drafts: a specification that is under consideration and open to comment
  o Recommendation: an accepted working draft becomes a recommendation; since the W3C is not a government body, it cannot use the term "standard"

XML is suitable for semistructured data
  o Easy to describe object-like data
  o Self-describing
  o Doesn't require a schema (but one can be provided optionally)

Semistructured Data
Examples of data sources with non-rigid structure
  o Biological data
  o Web data
To make the previous student list suitable for machine consumption on the Web, it should be
  o self-describing (some schema-like information, such as attribute names, is part of the data itself)

What is Self-describing Data?

The Semistructured Data Model
Syntax for Semistructured Data
Observe: nested tuples, set values, oids!

Characteristics of Semistructured Data
  o Missing or additional attributes
  o Multiple attributes
  o Different types in different objects
  o Heterogeneous collections
Comparison with Relational Data

Conversion from XML to Objects
Conversion from Objects to XML
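The two conversions named above can be illustrated with a minimal Python sketch. It is not the mapping used in the original slides: the helper names `to_xml` and `to_object`, the `student` record, and the choice of dicts and lists as the "object" side are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

def to_xml(tag, value):
    """Object -> XML: build an element from a nested dict, list, or scalar."""
    elem = ET.Element(tag)
    if isinstance(value, dict):
        for key, child in value.items():
            # A list models a set-valued attribute: one child element per member.
            items = child if isinstance(child, list) else [child]
            for item in items:
                elem.append(to_xml(key, item))
    else:
        elem.text = str(value)
    return elem

def to_object(elem):
    """XML -> object: leaf elements become strings, repeated tags become lists."""
    if len(elem) == 0:
        return elem.text
    result = {}
    for child in elem:
        value = to_object(child)
        if child.tag in result:
            existing = result[child.tag]
            if not isinstance(existing, list):
                existing = [existing]
                result[child.tag] = existing
            existing.append(value)
        else:
            result[child.tag] = value
    return result

# Hypothetical record with a set-valued "course" attribute.
student = {"name": "Ann", "course": ["CS330", "CS414"]}
xml_text = ET.tostring(to_xml("student", student), encoding="unicode")
print(xml_text)
print(to_object(ET.fromstring(xml_text)))
```

The round trip is lossy in general: every scalar comes back as a string, and a one-element list comes back as a plain value, because the XML text alone does not say whether a tag may repeat. That gap is one reason a schema is useful.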

XML's origin is document processing, not databases
Allows things like standalone text (useless for databases)
  o <foo> Hello <moo>123</moo> Bye </foo>
Attributes aren't strictly needed; they just increase the number of ways to represent the same thing.
XML data is ordered, while database data is not:
  <something><foo>1</foo><bar>2</bar></something>
is different from
  <something><bar>2</bar><foo>1</foo></something>
but these two complex values are the same:
  [something: [bar:2, foo:1]]
  [something: [foo:1, bar:2]]

Overview of XML
The HyperText Markup Language (HTML)
  o A simple language for distributing text-based information
XML is extensible, unlike HTML
  o Users can add new tags, and separately specify how each tag should be handled for display
XML combines with other Web technologies to yield
  o A distributed information Web
Like HTML, but any number of different tags can be used (up to the document author)

Unlike HTML, no semantics behind the tags
  o For instance, HTML's <table> </table> means: render the contents as a table; in XML it doesn't mean anything
Unlike HTML, XML is intolerant of errors
  o Browsers will render buggy HTML pages
  o XML processors are not supposed to process buggy XML documents

From HTML to XML
HTML describes the presentation

HTML

  <h1> Bibliography </h1>
  <p> <i> Foundations of Databases </i> Abiteboul, Hull, Vianu <br> Addison Wesley, 1995
  <p> <i> Data on the Web </i> Abiteboul, Buneman, Suciu <br> Morgan Kaufmann, 1999

XML

  <bibliography>
    <book>
      <title> Foundations </title>
      <author> Abiteboul </author>
      <author> Hull </author>
      <author> Vianu </author>
      <publisher> Addison Wesley </publisher>
      <year> 1995 </year>
    </book>
  </bibliography>

XML describes the content
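Because the XML version labels each piece of content, an application can pick out titles and authors directly instead of guessing from layout tags. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`; the function name `authors_by_title` is made up for illustration, and the title is spelled out in full as in the HTML version.

```python
import xml.etree.ElementTree as ET

BIBLIOGRAPHY = """
<bibliography>
  <book>
    <title>Foundations of Databases</title>
    <author>Abiteboul</author>
    <author>Hull</author>
    <author>Vianu</author>
    <publisher>Addison Wesley</publisher>
    <year>1995</year>
  </book>
</bibliography>
"""

def authors_by_title(xml_text):
    """Map each book title to the list of its authors, in document order."""
    root = ET.fromstring(xml_text)
    return {
        book.findtext("title"): [author.text for author in book.findall("author")]
        for book in root.findall("book")
    }

print(authors_by_title(BIBLIOGRAPHY))
```

Doing the same with the HTML version would require knowing that the italic text is the title and that the authors follow it, which is exactly the presentation-specific knowledge XML avoids.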

Is XML a Database?
An XML document is a database only in the strictest sense of the term. That is, it is a collection of data. In many ways, this makes it no different from any other file; after all, all files contain data of some sort.
As a "database" format, XML has some advantages. For example,
  o it is self-describing (the markup describes the structure and type names of the data, although not the semantics),
  o it is portable (Unicode), and
  o it can describe data in tree or graph structures.
It also has some disadvantages. For example,

  o it is verbose, and
  o access to the data is slow due to parsing and text conversion.
Do XML and its surrounding technologies constitute a DBMS? The answer to this question is, "Sort of."
On the plus side, XML provides many of the things found in databases:
  o storage (XML documents),
  o schemas (DTDs, XML Schemas, and so on),
  o query languages (XQuery, XPath, XQL, XML-QL, QUILT, etc.),
  o programming interfaces (SAX, DOM, JDOM), and so on.
On the minus side, it lacks many of the things found in real databases:
  o efficient storage, indexes,
  o security, transactions and data integrity,
  o multi-user access, triggers,
  o and so on.
It may be possible to use an XML document or documents as a database in environments
  o with small amounts of data,
  o few users, and
  o modest performance requirements.
It will fail in most production environments,
  o which have many users,

  o strict data integrity requirements,
  o and the need for good performance.
Examples of less sophisticated data sets for which an XML document might be suitable as a database are
  o personal contact lists (names, phone numbers, addresses, etc.),
  o descriptions of the MP3s you've got.
However, given the low price and ease of use of databases like dBASE and Access, there seems little reason to use an XML document as a database even in these cases.

Databases and XML
Database content can be presented in XML
  o An XML processor can access a DBMS or file system and convert data to XML
  o A Web server can serve content as either XML or HTML
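The step "access a DBMS and convert data to XML" can be sketched in a few lines of Python with the standard `sqlite3` and `xml.etree.ElementTree` modules. The table name `accounts`, its columns, and the helper `table_to_xml` are assumptions chosen for illustration; a real system would use whatever export facility its DBMS provides.

```python
import sqlite3
import xml.etree.ElementTree as ET

def table_to_xml(conn, table, row_tag):
    """Export every row of `table` as a <row_tag> element, one child per column."""
    # The table name is interpolated directly, so it must come from trusted code.
    cursor = conn.execute(f"SELECT * FROM {table}")
    columns = [description[0] for description in cursor.description]
    root = ET.Element(table)
    for row in cursor:
        elem = ET.SubElement(root, row_tag)
        for column, value in zip(columns, row):
            ET.SubElement(elem, column).text = str(value)
    return root

# Hypothetical in-memory database holding one account row.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts (account_number TEXT, branch_name TEXT, balance INTEGER)"
)
conn.execute("INSERT INTO accounts VALUES ('A-102', 'Perryridge', 400)")

bank_xml = ET.tostring(table_to_xml(conn, "accounts", "account"), encoding="unicode")
print(bank_xml)
```

Column names become tag names here, which only works when they are legal XML names; values are written as text, so type information such as INTEGER is lost unless a schema accompanies the document.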

The extensible Markup Language (XML)
A metalanguage
  o A language used to describe other languages using markup
Markup describes properties of the data
Designed to be structured
  o Strict rules about how data can be formatted
Designed to be extensible
  o Can define own terms and markup

Markup Languages
XML has its roots in document management.
XML is derived from a language for structuring large documents known as the Standard Generalized Markup Language (SGML).
To understand XML, it is important to understand its roots as a document markup language.
The term markup refers to anything in a document that is not intended to be part of the printed output.
In electronic document processing, a markup language is a formal description of
  o what part of the document is content,
  o what part is markup, and
  o what the markup means.

Markup languages evolved from specifying instructions for how to print parts of the document to specifying the function of the content.
For instance, with functional markup, text representing section headings (for this section, the words "Markup Languages") would be marked up as being a section heading, instead of being marked up as text to be printed in large size, bold font.
Such functional markup allows the document to be formatted differently in different situations. It also helps different parts of a large document, or different pages in a large Web site, to be formatted in a uniform manner.
Functional markup also helps automate extraction of key parts of documents.
For the family of markup languages that includes HTML, SGML, and XML, the markup takes the form of tags enclosed in angle brackets, <>.
Tags are used in pairs, with <tag> and </tag> delimiting the beginning and the end of the portion of the document to which the tag refers. For example, the title of a document might be marked up as follows.
  <title> Database System Concepts </title>
Unlike HTML, XML does not prescribe the set of tags allowed, and the set may be specialized as needed. This feature is the key to XML's major role in data representation and exchange, whereas HTML is used primarily for document formatting.
The ability to specify new tags and to create nested tag structures makes XML a great way to exchange data, not just documents.

Much of the use of XML has been in data exchange applications, not as a replacement for HTML.
Tags make data (relatively) self-documenting, describing:
  o Syntax: the permitted arrangement or structure of letters and words in a language, as defined by a grammar (XML)
  o Semantics: the meaning of letters or words in a language

A brief history of markup
GML: Generalised Markup Language
  o Developed in the 60s and 70s by IBM
  o Used for IBM technical manuals
SGML: Standard Generalized Markup Language (standardized GML)
  o 70s and 80s, with an ANSI standard in 1983
  o Flexible and very general, but difficult and costly
HTML
  o Early 90s: compact markup for hypertext docs
  o Now seen as a step backwards
XML

XML is ...
  o Simpler than SGML
  o More flexible than HTML
  o An application of SGML and a toolkit; however, it is common to refer to documents as being "written in XML"
  o Surrounded by a family of technologies which extend its use (e.g., transformation)

XML Features and motivations
XML features
  o Represents most kinds of information
  o Easily customizable
  o Allows validation of documents
  o Easy to read by humans and machines
  o Open standard, managed by the W3C

A Piece of XML
Tags provide context for each value and allow the semantics of the value to be identified.

Another Piece of XML

XML: Advantages
Compared to storage of data in a database, the XML representation may be inefficient, since tag names are repeated throughout the document.
However, in spite of this disadvantage, an XML representation has significant advantages when it is used to exchange data.
  o First, the presence of the tags makes the message self-documenting; that is, a schema need not be consulted to understand the meaning of the text.
  o Second, the format of the document is not rigid. For example, if some sender adds additional information, such as a tag lastaccessed noting the last date on which an account was accessed, the recipient of the XML data may simply ignore the tag. The ability to recognize and ignore unexpected tags allows the format of the data to evolve over time, without invalidating existing applications.
  o Finally, since the XML format is widely accepted, a wide variety of tools are available to assist in its processing, including browser software and database tools.
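The second advantage, ignoring unexpected tags, is easy to see in code. The sketch below is a minimal Python illustration, not a prescribed API: the function `read_account`, the field list `KNOWN_FIELDS`, and the sample date are hypothetical. The recipient asks only for the fields it knows, so a message that carries an extra lastaccessed tag is read exactly like the older format.

```python
import xml.etree.ElementTree as ET

# Fields this (hypothetical) recipient was written to understand.
KNOWN_FIELDS = ("account-number", "branch-name", "balance")

def read_account(xml_text):
    """Extract only the known fields; any other child element is ignored."""
    account = ET.fromstring(xml_text)
    return {name: account.findtext(name, default="").strip() for name in KNOWN_FIELDS}

OLD_MESSAGE = (
    "<account><account-number>A-102</account-number>"
    "<branch-name>Perryridge</branch-name><balance>400</balance></account>"
)
# A newer sender adds a tag the recipient has never seen (sample date is made up).
NEW_MESSAGE = OLD_MESSAGE.replace(
    "</account>", "<lastaccessed>2003-04-08</lastaccessed></account>"
)

print(read_account(OLD_MESSAGE))
print(read_account(NEW_MESSAGE))
```

A fixed-position format such as comma-separated values would need both sides to change together; here the sender can evolve first and the recipient can catch up later.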

XML: Motivation
Data interchange is critical in today's networked world
  o Examples:
      Banking: funds transfer
      Order processing (especially inter-company orders)
      Scientific data
        Chemistry: ChemML
        Genetics: BSML (Bio-Sequence Markup Language)
  o Paper flow of information between organizations is being replaced by electronic flow of information
Each application area has its own set of standards for representing information.
XML has become the basis for all new-generation data interchange formats.
Just as SQL is the dominant language for querying relational data, XML is becoming the dominant format for data exchange.
Each XML-based standard defines what the valid elements are, using
  o XML type specification languages to specify the syntax
      DTD (Document Type Definition)
      XML Schema
  o Plus textual descriptions of the semantics

A wide variety of tools is available for parsing, browsing and querying XML documents/data.
Many other specifications based upon XML are coming out
  o XSL/XSLT, XQL, XPath, XPointer, XLink, MathML, CML, BIOML, GAME, BSML, XML-

Communication and Integration
XML can represent
  o many kinds of structured data used in business applications
  o database data
XML is particularly useful as a data format when an application must communicate with another application, or integrate information from several other applications.

XML Structure
Structure of XML Data
The fundamental construct in an XML document is the element.
Element: a section of data beginning with <tagname> and ending with the matching </tagname>
Tag: a label for a section of data

Example
Elements must be properly nested
  o Proper nesting
      <account> <balance> ... </balance> </account>
  o Improper nesting
      <account> <balance> ... </account> </balance>
  o Formally: every start tag must have a unique matching end tag that is in the context of the same parent element.
  o Every document must have a single top-level element
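The nesting rules above can be checked mechanically by handing the text to any conforming parser. The sketch below uses Python's standard `xml.etree.ElementTree`; the helper name `nesting_error` is an assumption for illustration, and the exact wording of the error message depends on the underlying parser.

```python
import xml.etree.ElementTree as ET

def nesting_error(xml_text):
    """Return None if the text parses, otherwise the parser's error message."""
    try:
        ET.fromstring(xml_text)
        return None
    except ET.ParseError as err:
        return str(err)

proper = "<account><balance>400</balance></account>"
improper = "<account><balance>400</account></balance>"
two_roots = "<account/><account/>"

for sample in (proper, improper, two_roots):
    print(sample, "->", nesting_error(sample))
```

The improperly nested sample and the sample with two top-level elements are both rejected, while the properly nested one parses without error.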

Rules for Well-Formed XML
Some basic rules for XML
  o All tags must be balanced: <TAG>...</TAG>
  o Empty tags are expressed as <EMPTY_TAG/>
  o Tags must be properly nested: <B><I>...</I></B>, not <B><I>...</B></I>
  o All element attribute values must be quoted: <TAG name="value">
  o Tag names are case-sensitive: <TAG> != <Tag>
  o Comments are allowed: <!-- ... -->
  o A document should begin with an XML declaration: <?xml version="1.0"?>
  o Special characters must be escaped

Special Characters
Some characters need to be escaped because they have special significance:

  Character   Escaped form
  <           &lt;
  >           &gt;
  &           &amp;
  '           &apos;
  "           &quot;

If they were not escaped, they would be processed as markup by the XML engine.
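In practice escaping is best left to a library rather than done by hand. The following sketch uses `escape`, `unescape`, and `quoteattr` from Python's standard `xml.sax.saxutils`; the sample text and the `note`/`kind` names are made up for illustration.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr, unescape

text = 'AT&T <fees> "waived"'

# escape() handles &, < and > for element content.
escaped = escape(text)
print(escaped)

# quoteattr() escapes the value and wraps it in suitable quotes for an attribute.
document = f"<note kind={quoteattr(text)}>{escaped}</note>"
print(document)

# Parsing reverses the escaping, so the application sees the original text.
note = ET.fromstring(document)
print(note.text == text, note.get("kind") == text)
print(unescape(escaped) == text)
```

Escaping happens only in the serialized text; once parsed, element content and attribute values contain the plain characters again.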

A Piece of XML
Example of Nested Elements

Motivation for Nesting
Nesting of data is useful in data transfer
  o Example: elements representing customer-id, customer name, and address nested within an order element

Nesting is not supported, or is discouraged, in relational databases
  o With multiple orders, customer name and address are stored redundantly
  o Normalization replaces the nested structures in each order by a foreign key into a table storing customer name and address information
But nesting is appropriate when transferring data
  o An external application does not have direct access to data referenced by a foreign key
Nested representations are widely used in XML data interchange applications to avoid joins.

Structure of XML
A mixture of text with sub-elements is legal in XML.
  o Example:
      <account>
        This account is seldom used any more.
        <account-number> A-102 </account-number>
        <branch-name> Perryridge </branch-name>
        <balance> 400 </balance>
      </account>
  o Useful in a document processing context, but discouraged for more structured data representation such as database content in XML
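How a parser exposes mixed content depends on the API. As a minimal sketch, Python's standard `xml.etree.ElementTree` stores text that precedes the first child in `.text` and text that follows a child in that child's `.tail`; the variable names below are illustrative only.

```python
import xml.etree.ElementTree as ET

account = ET.fromstring(
    "<account>This account is seldom used any more."
    "<account-number>A-102</account-number>"
    "<branch-name>Perryridge</branch-name>"
    "<balance>400</balance></account>"
)

# Free text before the first subelement lives on the parent element.
note = account.text
# The structured part is reachable through the child elements.
fields = {child.tag: child.text for child in account}
print(note)
print(fields)

# Text after a subelement is attached to that subelement as its "tail".
foo = ET.fromstring("<foo> Hello <moo>123</moo> Bye </foo>")
print(repr(foo.text), repr(foo[0].text), repr(foo[0].tail))
```

Code that treats XML as database-style records usually reads only the children and never looks at `.text` or `.tail` of container elements, which is why stray text is discouraged for structured data.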

Attributes
Elements can have attributes
  <account acct-type="checking" monthly-fee="5">
    <account-number> A-102 </account-number>
    <branch-name> Perryridge </branch-name>
    <balance> 400 </balance>
  </account>
Attributes are specified by name="value" pairs inside the starting tag of an element.
Attribute values are strings and do not contain markup.
An element may have several attributes, but each attribute name can only occur once in a given tag.

Another Example
  <account acct-type="checking" monthly-fee="5">
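The three properties of attributes stated above (name/value pairs in the start tag, string values, no repeated names) can be observed directly with a parser. This is a small sketch using Python's standard `xml.etree.ElementTree`; the helper `parses` is a made-up name.

```python
import xml.etree.ElementTree as ET

account = ET.fromstring('<account acct-type="checking" monthly-fee="5"/>')

# Attributes arrive as a mapping from name to string value.
attributes = dict(account.attrib)
print(attributes)

# The value "5" is a string; converting it to a number is up to the application.
monthly_fee = int(account.get("monthly-fee"))
print(monthly_fee)

def parses(xml_text):
    """True if the text is accepted by the parser, False otherwise."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False

# Repeating an attribute name, or leaving a value unquoted, is rejected.
print(parses('<account acct-type="checking" acct-type="savings"/>'))
print(parses("<account acct-type=checking/>"))
```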

Attributes Vs. Subelements
Distinction between subelement and attribute
  o In the context of documents, attributes are part of markup, while subelement contents are part of the basic document contents
  o In the context of data representation, the difference is unclear and may be confusing
      The same information can be represented in two ways:
        <account account-number="A-101"> ... </account>
        <account> <account-number>A-101</account-number> ... </account>
  o Suggestion: use attributes for identifiers of elements, and use subelements for contents

More on XML Syntax
Elements without subelements or text content can be abbreviated by ending the start tag with /> and deleting the end tag
  <account number="A-101" branch="Perryridge" balance="200"/>

Other XML Constructs
XML declaration, comments, processing instructions, DTD
XML Declaration
  <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
Comments
  <!-- this is a comment -->

Processing Instruction
  <?xml-stylesheet href="book.css" type="text/css"?>

Namespaces
Since anybody can create their own tags, naming collisions are possible.
XML data has to be exchanged between organizations.
The same tag name may have a different meaning in different organizations, causing confusion in exchanged documents.
Specifying a unique string as an element name avoids confusion.
Better solution: use unique-name:element-name
Avoid using long unique names all over the document by using XML Namespaces.

Example
Want to provide two different ratings for each movie
  <movie>
    <name>Apollo 13</name>
    <off-rating>PG 13</off-rating>
    <qual-rating>excellent</qual-rating>
  </movie>

Better
  <movie xmlns:off="…" xmlns:qual="…">
    <name>Apollo 13</name>
    <off:rating>PG 13</off:rating>
    <qual:rating>excellent</qual:rating>
  </movie>

Namespaces
  <bank xmlns:FB="…">
    <FB:branch>
      <FB:branchname>Downtown</FB:branchname>
      <FB:branchcity> Brooklyn </FB:branchcity>
    </FB:branch>
  </bank>
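To a namespace-aware parser the prefix is only shorthand: what identifies an element is the namespace URI together with the local name. The sketch below shows this with Python's standard `xml.etree.ElementTree`. The two `http://example.org/...` URIs are placeholders invented for the example, since the URIs of the original movie example are not given.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URIs, assumed for illustration only.
NS = {
    "off": "http://example.org/official-rating",
    "qual": "http://example.org/quality-rating",
}

MOVIE = f"""
<movie xmlns:off="{NS['off']}" xmlns:qual="{NS['qual']}">
  <name>Apollo 13</name>
  <off:rating>PG-13</off:rating>
  <qual:rating>excellent</qual:rating>
</movie>
"""

movie = ET.fromstring(MOVIE)

# The prefix map tells findtext() which URI each prefix stands for.
official = movie.findtext("off:rating", namespaces=NS)
quality = movie.findtext("qual:rating", namespaces=NS)
print(official, quality)

# Internally each tag is stored as {namespace-uri}local-name.
print(movie[1].tag)
```

Two documents may use different prefixes for the same URI and still be talking about the same element; only the URI has to match.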

XML: A Simple Example
Here's a simple example of an XML document, in this case a made-up document describing information related to ordering parts (from a factory, for example).
Note how the tags are written: surrounded by angle brackets.
Tags of the form <Tag> always have a matching end tag of the form </Tag>; this is one of the syntax rules.
Tags that don't have ends (and don't have content inside them) can be written as <Tag />, with a special trailing slash. That's another syntax rule.
The start and end tag, plus the content in between, is called an element, and is the basic component of an XML document. Elements must be organized hierarchically; again, this is one of the syntax rules.
Elements can be assigned properties, called attributes, by putting the attribute and its value in the start tag, as in <order ref="xyxxy">
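A small stand-in for such a parts order makes the hierarchy concrete. The document below is hypothetical (only the `<order ref="xyxxy">` start tag comes from the text; the `part`, `quantity`, and `rush` elements are invented), and the `outline` helper is a sketch that prints one line per element, indented by depth, using Python's standard `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET

# Hypothetical order document; element names other than <order> are assumptions.
ORDER = """
<order ref="xyxxy">
  <part number="P-1"><quantity>2</quantity></part>
  <part number="P-2"><quantity>5</quantity></part>
  <rush/>
</order>
"""

def outline(elem, depth=0):
    """Return one line per element: tag plus attributes, indented by depth."""
    label = elem.tag
    if elem.attrib:
        label += " " + " ".join(f'{k}="{v}"' for k, v in elem.attrib.items())
    lines = ["  " * depth + label]
    for child in elem:
        lines.extend(outline(child, depth + 1))
    return lines

print("\n".join(outline(ET.fromstring(ORDER))))
```

The indentation of the output mirrors the nesting of start and end tags: each element has exactly one parent, and the single top-level element is the root.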


XML Data Model - A Tree
This illustrates how the document structure maps directly onto a data tree (a directed acyclic graph, actually).

XML Document Schema
Database schemas constrain
  o what information can be stored, and
  o the data types of stored values
XML documents are not required to have an associated schema.
However, schemas are very important for XML data exchange
  o Otherwise, a site cannot automatically interpret data received from another site
It is possible to use XML as is, as long as it follows the rules for well-formed XML

But this doesn't allow the structure to be validated
  o Can't check that all elements are present and correct
  o Can't check that attributes are correct
  o Can't specify the value type of attributes
  o Can't describe the format to others
Two mechanisms for specifying an XML schema, i.e., for defining and validating XML
  o Document Type Definition (DTD)
      Widely used
  o XML Schema
      Newer, increasing use

Document Type Definition (DTD)
A DTD is a file which contains a formal definition of the permitted structure of the document.
A DTD constrains the structure of XML data
  o What elements can occur
  o What attributes an element can/must have
  o What subelements can/must occur inside each element, and how many times
A DTD does not constrain data types
  o All values are represented as strings in XML

DTD syntax
  o <!ELEMENT element (subelements-specification)>
  o <!ATTLIST element (attributes)>

Element Specification in DTD
Subelements can be specified as
  o names of elements, or
  o #PCDATA (parsed character data), i.e., character strings
Example
  <!ELEMENT depositor (customer-name, account-number)>
  <!ELEMENT customer-name (#PCDATA)>
  <!ELEMENT account-number (#PCDATA)>
Subelement specifications may use regular expressions
  <!ELEMENT bank ((account | customer | depositor)+)>
Notation:
  o "|" - alternatives
  o "+" - 1 or more occurrences
  o "*" - 0 or more occurrences

Bank DTD
  <!DOCTYPE bank [
    <!ELEMENT bank ((account | customer | depositor)+)>
    <!ELEMENT account (account-number, branch-name, balance)>
    <!ELEMENT customer (customer-name, customer-street, customer-city)>
    <!ELEMENT depositor (customer-name, account-number)>
    <!ELEMENT account-number (#PCDATA)>
    <!ELEMENT branch-name (#PCDATA)>
    <!ELEMENT balance (#PCDATA)>
    <!ELEMENT customer-name (#PCDATA)>
    <!ELEMENT customer-street (#PCDATA)>
    <!ELEMENT customer-city (#PCDATA)>
  ]>

Attribute specification in DTD: for each attribute
  o A name
  o A type
      CDATA (character data)
      ID (identifier), IDREF (ID reference), or IDREFS (multiple IDREFs); more on this later
  o A default declaration
      mandatory (#REQUIRED)
      has a default value (value), or
      neither (#IMPLIED): no default value has been provided
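Returning to the element declarations: because a content model is a regular expression over child element names, its meaning can be illustrated with an ordinary regex engine. The sketch below is not a DTD validator (Python's standard library does not include one). The content models of three bank DTD elements are rewritten by hand as Python regexes over a space-terminated list of child tags, and `CONTENT_MODELS` and `children_match` are names invented for this illustration.

```python
import re
import xml.etree.ElementTree as ET

# Hand-translated content models: each child tag is followed by one space.
CONTENT_MODELS = {
    "bank": r"((account|customer|depositor) )+",
    "account": r"account-number branch-name balance ",
    "depositor": r"customer-name account-number ",
}

def children_match(elem):
    """Check the sequence of child tags of `elem` against its content model."""
    model = CONTENT_MODELS.get(elem.tag)
    if model is None:
        return True  # no rule recorded for this element in the sketch
    sequence = "".join(child.tag + " " for child in elem)
    return re.fullmatch(model, sequence) is not None

good = ET.fromstring(
    "<bank><account><account-number>A-101</account-number>"
    "<branch-name>Downtown</branch-name><balance>500</balance></account></bank>"
)
wrong_order = ET.fromstring(
    "<account><balance>500</balance><account-number>A-101</account-number>"
    "<branch-name>Downtown</branch-name></account>"
)
empty_bank = ET.fromstring("<bank/>")

print(all(children_match(e) for e in good.iter()))
print(children_match(wrong_order), children_match(empty_bank))
```

The comma in a DTD sequence fixes the order of subelements, which is why the reordered account fails, and the "+" on the bank model is why an empty bank fails.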

Example of Attribute specification in DTD
A DTD specification for the element account, which has an attribute acct-type of type CDATA with default value checking
  <!ATTLIST account acct-type CDATA "checking">
A DTD specification for the element customer
  <!ATTLIST customer
      customer-id ID     #REQUIRED
      accounts    IDREFS #REQUIRED>

IDs and IDREFs
An element can have at most one attribute of type ID.
The ID attribute value of each element in an XML document must be distinct
  o Thus the ID attribute value is an object identifier
An attribute of type IDREF must contain the ID value of an element in the same document.
An attribute of type IDREFS contains a set of (0 or more) ID values. Each ID value must be the ID value of an element in the same document.
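The two ID/IDREF rules (IDs are distinct; every reference points at an existing ID) are what a validating parser enforces. Python's standard library parser is non-validating, so the sketch below performs the same checks by hand. The function `check_references`, the sample document, and the assumption that account elements carry an `account-number` ID and an `owners` IDREFS attribute are illustrative choices, not part of the DTD fragments shown above.

```python
import xml.etree.ElementTree as ET

def check_references(root, id_attrs, idrefs_attrs):
    """Return (duplicate_ids, dangling_refs) for the named ID / IDREFS attributes."""
    seen, duplicates = set(), set()
    for elem in root.iter():
        for name in id_attrs:
            value = elem.get(name)
            if value is None:
                continue
            if value in seen:
                duplicates.add(value)
            seen.add(value)
    dangling = set()
    for elem in root.iter():
        for name in idrefs_attrs:
            # An IDREFS value is a whitespace-separated list of IDs.
            for ref in elem.get(name, "").split():
                if ref not in seen:
                    dangling.add(ref)
    return duplicates, dangling

# Hypothetical bank data; A-402 is referenced but never defined.
BANK = ET.fromstring(
    '<bank>'
    '<account account-number="A-401" owners="C100 C102"/>'
    '<customer customer-id="C100" accounts="A-401"/>'
    '<customer customer-id="C102" accounts="A-401 A-402"/>'
    '</bank>'
)

print(check_references(BANK, ["account-number", "customer-id"], ["owners", "accounts"]))
```

Note that the check only asks whether a referenced ID exists somewhere in the document; it cannot require that `owners` refers to a customer rather than to another account.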

Bank DTD with Attributes
XML data with ID and IDREF attributes

Limitations of DTDs
No typing of text elements and attributes
  o All values are strings; no integers, reals, etc.
Difficult to specify unordered sets of subelements
  o Order is usually irrelevant in databases
  o (A | B)* allows specification of an unordered set, but
      Cannot ensure that each of A and B occurs only once
IDs and IDREFs are untyped
  o The owners attribute of an account may contain a reference to another account, which is meaningless
      The owners attribute should ideally be constrained to refer to customer elements
Can have a single key item (ID), but:
  o No support for multi-attribute keys

XML Schema
XML Schema is a more sophisticated schema language which addresses the drawbacks of DTDs. It
  o Includes primitive data types (integers, strings, dates, etc.)
      Also, constraints on min/max values
  o Is specified in XML format, unlike DTDs
      More standard representation, but verbose
  o Is integrated with namespaces

  o Has many more features
      List types, uniqueness and foreign key constraints, inheritance, ...
  o Supports value-based constraints
      E.g., (integers > 100)

More on XML Schema
BUT: significantly more complicated than DTDs, not yet widely used.
An XML document that conforms to a given schema is said to be schema valid and is called an instance of the schema.
As with DTDs, the XML Schema specification does not require an XML processor to actually use the document schema
  o In contrast with databases, where ALL data MUST comply with the schema

XML Processing: The XML Parser
The parser must verify that the XML is syntactically correct.
Such data is said to be well-formed
  o The minimal requirement to be XML

A parser MUST stop processing if the data isn't well-formed
  o E.g., stop processing and throw an exception to the XML-based application. The XML 1.0 spec requires this behaviour.
This is how all parsers work. They all have some sort of API so that a program can get at the XML data. All parsers must also indicate failure, and refuse to process the XML data further, if the data violates the basic syntax requirements of XML. These requirements are called well-formedness constraints: data that doesn't comply with them is not well-formed XML.

Well-Formed XML Documents
XML documents are subject to two specific constraints
  o Well-formedness: an XML document is well-formed if:
      It has a root element
      Every opening tag is followed by a matching closing tag, and the elements are properly nested inside each other
      Any attribute occurs at most once in a given opening tag, its value is provided, and the value is quoted
    This ensures that the XML document parses into a labeled tree.
  o Validity: an XML document is valid if it obeys the document type definition (DTD) or XML schema that you use to specify the legal syntax of the document
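The stop-on-error behaviour can be observed with an event-based parser. The sketch below uses Python's standard `xml.sax` module; the class `TagCollector`, the function `parse_until_error`, and the sample documents are assumptions for illustration. The handler records each start tag it is told about, and the function returns those tags together with the error message, if any.

```python
import xml.sax

class TagCollector(xml.sax.ContentHandler):
    """Records the name of every start tag the parser reports."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def startElement(self, name, attrs):
        self.tags.append(name)

def parse_until_error(xml_text):
    """Return (tags reported by the parser, error message or None)."""
    handler = TagCollector()
    try:
        xml.sax.parseString(xml_text.encode("utf-8"), handler)
        return handler.tags, None
    except xml.sax.SAXParseException as err:
        return handler.tags, err.getMessage()

well_formed = "<order><item/><item/></order>"
# The second <item> is never closed, so </order> does not match.
broken = "<order><item/><item></order><late/>"

print(parse_until_error(well_formed))
print(parse_until_error(broken))
```

For the broken document the parser raises an exception at the mismatched end tag and never reports the `late` element that follows it. Whether an application should trust the events it received before the error is a separate design decision; many applications discard them.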

XML Schema Version of Bank DTD
A Piece of XML

Important XML Standards
  o XSL/XSLT: presentation and transformation standards
  o XPath/XPointer/XLink: standards for linking to documents and to elements within them
  o Namespaces: for resolving name clashes
  o DOM: Document Object Model for manipulating XML documents
  o SAX: Simple API for XML parsing
  o XQuery: query language

XML Terminology Summary
Tags: book, title, author, ...
  o start tag: <book>, end tag: </book>
Elements: <book> ... </book>, <author> ... </author>
  o elements can be nested
  o empty element: <red></red> (can be abbreviated <red/>)
XML document: has a single root element
Well-formed XML document: has matching tags
Valid XML document: conforms to a schema


More information

Semistructured data, XML, DTDs

Semistructured data, XML, DTDs Semistructured data, XML, DTDs Introduction to Databases Manos Papagelis Thanks to Ryan Johnson, John Mylopoulos, Arnold Rosenbloom and Renee Miller for material in these slides Structured vs. unstructured

More information

Introduction to XML Zdeněk Žabokrtský, Rudolf Rosa

Introduction to XML Zdeněk Žabokrtský, Rudolf Rosa NPFL092 Technology for Natural Language Processing Introduction to XML Zdeněk Žabokrtský, Rudolf Rosa November 28, 2018 Charles Univeristy in Prague Faculty of Mathematics and Physics Institute of Formal

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1 Slide 27-1 Chapter 27 XML: Extensible Markup Language Chapter Outline Introduction Structured, Semi structured, and Unstructured Data. XML Hierarchical (Tree) Data Model. XML Documents, DTD, and XML Schema.

More information

Parallel/Distributed Databases XML

Parallel/Distributed Databases XML Parallel/Distributed Databases XML Mihai Pop CMSC424 most slides courtesy of Amol Deshpande Project due today Admin Sign up for demo, if you haven't already myphpbib.sourceforge.net - example publication

More information

A tutorial report for SENG Agent Based Software Engineering. Course Instructor: Dr. Behrouz H. Far. XML Tutorial.

A tutorial report for SENG Agent Based Software Engineering. Course Instructor: Dr. Behrouz H. Far. XML Tutorial. A tutorial report for SENG 609.22 Agent Based Software Engineering Course Instructor: Dr. Behrouz H. Far XML Tutorial Yanan Zhang Department of Electrical and Computer Engineering University of Calgary

More information

Structured documents

Structured documents Structured documents An overview of XML Structured documents Michael Houghton 15/11/2000 Unstructured documents Broadly speaking, text and multimedia document formats can be structured or unstructured.

More information

XML: Extensible Markup Language

XML: Extensible Markup Language XML: Extensible Markup Language CSC 375, Fall 2015 XML is a classic political compromise: it balances the needs of man and machine by being equally unreadable to both. Matthew Might Slides slightly modified

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411 1 Extensible

More information

extensible Markup Language

extensible Markup Language extensible Markup Language XML is rapidly becoming a widespread method of creating, controlling and managing data on the Web. XML Orientation XML is a method for putting structured data in a text file.

More information

Introduction to Database Systems CSE 444

Introduction to Database Systems CSE 444 Introduction to Database Systems CSE 444 Lecture 25: XML 1 XML Outline XML Syntax Semistructured data DTDs XPath Coverage of XML is much better in new edition Readings Sections 11.1 11.3 and 12.1 [Subset

More information

M359 Block5 - Lecture12 Eng/ Waleed Omar

M359 Block5 - Lecture12 Eng/ Waleed Omar Documents and markup languages The term XML stands for extensible Markup Language. Used to label the different parts of documents. Labeling helps in: Displaying the documents in a formatted way Querying

More information

Data Formats and APIs

Data Formats and APIs Data Formats and APIs Mike Carey mjcarey@ics.uci.edu 0 Announcements Keep watching the course wiki page (especially its attachments): https://grape.ics.uci.edu/wiki/asterix/wiki/stats170ab-2018 Ditto for

More information

Copyright 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 7 XML

Copyright 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 7 XML Chapter 7 XML 7.1 Introduction extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML Lax syntactical rules Many complex features that are rarely used HTML

More information

ADT XML, XPath & XQuery

ADT XML, XPath & XQuery Introduction to XML, XPath & XQuery Chapter 10 in Silberschatz, Korth, Sudarshan Database System Concepts Stefan Manegold Stefan.Manegold@cwi.nl http://www.cwi.nl/~manegold/ 2 Google Hits of 3-letter combinations

More information

COMP9321 Web Application Engineering. Extensible Markup Language (XML)

COMP9321 Web Application Engineering. Extensible Markup Language (XML) COMP9321 Web Application Engineering Extensible Markup Language (XML) Dr. Basem Suleiman Service Oriented Computing Group, CSE, UNSW Australia Semester 1, 2016, Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2442

More information

7.1 Introduction. extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML

7.1 Introduction. extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML 7.1 Introduction extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML Lax syntactical rules Many complex features that are rarely used HTML is a markup language,

More information

.. Cal Poly CPE/CSC 366: Database Modeling, Design and Implementation Alexander Dekhtyar..

.. Cal Poly CPE/CSC 366: Database Modeling, Design and Implementation Alexander Dekhtyar.. .. Cal Poly CPE/CSC 366: Database Modeling, Design and Implementation Alexander Dekhtyar.. XML in a Nutshell XML, extended Markup Language is a collection of rules for universal markup of data. Brief History

More information

Digital Asset Management 3. Multimedia Database System

Digital Asset Management 3. Multimedia Database System Digital Asset Management 3. Multimedia Database System 2015-10-13 QoS Outline 1. MM content organization 2. MM database system architecture 3. MM system service model 4. Multimedia Data Storage 5. Multimedia

More information

Overview. Structured Data. The Structure of Data. Semi-Structured Data Introduction to XML Querying XML Documents. CMPUT 391: XML and Querying XML

Overview. Structured Data. The Structure of Data. Semi-Structured Data Introduction to XML Querying XML Documents. CMPUT 391: XML and Querying XML Database Management Systems Winter 2004 CMPUT 391: XML and Querying XML Lecture 12 Overview Semi-Structured Data Introduction to XML Querying XML Documents Dr. Osmar R. Zaïane University of Alberta Chapter

More information

CSCI3030U Database Models

CSCI3030U Database Models CSCI3030U Database Models CSCI3030U RELATIONAL MODEL SEMISTRUCTURED MODEL 1 Content Design of databases. relational model, semistructured model. Database programming. SQL, XPath, XQuery. Not DBMS implementation.

More information

XML. Extensible Markup Language

XML. Extensible Markup Language XML Extensible Markup Language Acknowledgements and copyrights: these slides are a result of combination of notes and slides with contributions from: Michael Kiffer, Arthur Bernstein, Philip Lewis, Hanspeter

More information

The XML Metalanguage

The XML Metalanguage The XML Metalanguage Mika Raento mika.raento@cs.helsinki.fi University of Helsinki Department of Computer Science Mika Raento The XML Metalanguage p.1/442 2003-09-15 Preliminaries Mika Raento The XML Metalanguage

More information

XML in Databases. Albrecht Schmidt. al. Albrecht Schmidt, Aalborg University 1

XML in Databases. Albrecht Schmidt.   al. Albrecht Schmidt, Aalborg University 1 XML in Databases Albrecht Schmidt al@cs.auc.dk http://www.cs.auc.dk/ al Albrecht Schmidt, Aalborg University 1 What is XML? (1) Where is the Life we have lost in living? Where is the wisdom we have lost

More information

Semistructured Data and XML

Semistructured Data and XML Semistructured Data and XML Computer Science E-66 Harvard University David G. Sullivan, Ph.D. Structured Data The logical models we've covered thus far all use some type of schema to define the structure

More information

The Xlint Project * 1 Motivation. 2 XML Parsing Techniques

The Xlint Project * 1 Motivation. 2 XML Parsing Techniques The Xlint Project * Juan Fernando Arguello, Yuhui Jin {jarguell, yhjin}@db.stanford.edu Stanford University December 24, 2003 1 Motivation Extensible Markup Language (XML) [1] is a simple, very flexible

More information

XML Origin and Usages

XML Origin and Usages Kapitel 1 XML Outline XML Basics DTDs, XML Schema XPath, XSLT, XQuery SQL/XML Application Programming Integration N. Ritter, WfWS, Kapitel1, SS 2005 1 XML Origin and Usages Defined by the WWW Consortium

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 2, 2017 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid= 2465 1

More information

Chapter 1: Semistructured Data Management XML

Chapter 1: Semistructured Data Management XML Chapter 1: Semistructured Data Management XML XML - 1 The Web has generated a new class of data models, which are generally summarized under the notion semi-structured data models. The reasons for that

More information

Author: Irena Holubová Lecturer: Martin Svoboda

Author: Irena Holubová Lecturer: Martin Svoboda NPRG036 XML Technologies Lecture 1 Introduction, XML, DTD 19. 2. 2018 Author: Irena Holubová Lecturer: Martin Svoboda http://www.ksi.mff.cuni.cz/~svoboda/courses/172-nprg036/ Lecture Outline Introduction

More information

SDPL : XML Basics 2. SDPL : XML Basics 1. SDPL : XML Basics 4. SDPL : XML Basics 3. SDPL : XML Basics 5

SDPL : XML Basics 2. SDPL : XML Basics 1. SDPL : XML Basics 4. SDPL : XML Basics 3. SDPL : XML Basics 5 2 Basics of XML and XML documents 2.1 XML and XML documents Survivor's Guide to XML, or XML for Computer Scientists / Dummies 2.1 XML and XML documents 2.2 Basics of XML DTDs 2.3 XML Namespaces XML 1.0

More information

Data Presentation and Markup Languages

Data Presentation and Markup Languages Data Presentation and Markup Languages MIE456 Tutorial Acknowledgements Some contents of this presentation are borrowed from a tutorial given at VLDB 2000, Cairo, Agypte (www.vldb.org) by D. Florescu &.

More information

XML. COSC Dr. Ramon Lawrence. An attribute is a name-value pair declared inside an element. Comments. Page 3. COSC Dr.

XML. COSC Dr. Ramon Lawrence. An attribute is a name-value pair declared inside an element. Comments. Page 3. COSC Dr. COSC 304 Introduction to Database Systems XML Dr. Ramon Lawrence University of British Columbia Okanagan ramon.lawrence@ubc.ca XML Extensible Markup Language (XML) is a markup language that allows for

More information

XML: Introduction. !important Declaration... 9:11 #FIXED... 7:5 #IMPLIED... 7:5 #REQUIRED... Directive... 9:11

XML: Introduction. !important Declaration... 9:11 #FIXED... 7:5 #IMPLIED... 7:5 #REQUIRED... Directive... 9:11 !important Declaration... 9:11 #FIXED... 7:5 #IMPLIED... 7:5 #REQUIRED... 7:4 @import Directive... 9:11 A Absolute Units of Length... 9:14 Addressing the First Line... 9:6 Assigning Meaning to XML Tags...

More information

Chapter 1: Semistructured Data Management XML

Chapter 1: Semistructured Data Management XML Chapter 1: Semistructured Data Management XML 2006/7, Karl Aberer, EPFL-IC, Laboratoire de systèmes d'informations répartis XML - 1 The Web has generated a new class of data models, which are generally

More information

XML, DTD, and XPath. Announcements. From HTML to XML (extensible Markup Language) CPS 116 Introduction to Database Systems. Midterm has been graded

XML, DTD, and XPath. Announcements. From HTML to XML (extensible Markup Language) CPS 116 Introduction to Database Systems. Midterm has been graded XML, DTD, and XPath CPS 116 Introduction to Database Systems Announcements 2 Midterm has been graded Graded exams available in my office Grades posted on Blackboard Sample solution and score distribution

More information

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward Comp 336/436 - Markup Languages Fall Semester 2017 - Week 4 Dr Nick Hayward XML - recap first version of XML became a W3C Recommendation in 1998 a useful format for data storage and exchange config files,

More information

The concept of DTD. DTD(Document Type Definition) Why we need DTD

The concept of DTD. DTD(Document Type Definition) Why we need DTD Contents Topics The concept of DTD Why we need DTD The basic grammar of DTD The practice which apply DTD in XML document How to write DTD for valid XML document The concept of DTD DTD(Document Type Definition)

More information

Well-formed XML Documents

Well-formed XML Documents Well-formed XML Documents Asst. Prof. Dr. Kanda Runapongsa Saikaew (krunapon@kku.ac.th) Dept. of Computer Engineering Khon Kaen University 1 Agenda Types of XML documents Why Well-formed XML Documents

More information

XML Introduction 1. XML Stands for EXtensible Mark-up Language (XML). 2. SGML Electronic Publishing challenges -1986 3. HTML Web Presentation challenges -1991 4. XML Data Representation challenges -1996

More information

11. EXTENSIBLE MARKUP LANGUAGE (XML)

11. EXTENSIBLE MARKUP LANGUAGE (XML) 11. EXTENSIBLE MARKUP LANGUAGE (XML) Introduction Extensible Markup Language is a Meta language that describes the contents of the document. So these tags can be called as self-describing data tags. XML

More information

EXtensible Markup Language (XML) a W3C standard to complement HTML A markup language much like HTML

EXtensible Markup Language (XML)   a W3C standard to complement HTML A markup language much like HTML XML and XPath EXtensible Markup Language (XML) a W3C standard to complement HTML A markup language much like HTML origins: structured text SGML motivation: HTML describes presentation XML describes content

More information

Introduction to XML. Chapter 133

Introduction to XML. Chapter 133 Chapter 133 Introduction to XML A. Multiple choice questions: 1. Attributes in XML should be enclosed within. a. single quotes b. double quotes c. both a and b d. none of these c. both a and b 2. Which

More information

CSE 544 Data Models. Lecture #3. CSE544 - Spring,

CSE 544 Data Models. Lecture #3. CSE544 - Spring, CSE 544 Data Models Lecture #3 1 Announcements Project Form groups by Friday Start thinking about a topic (see new additions to the topic list) Next paper review: due on Monday Homework 1: due the following

More information

2009 Martin v. Löwis. Data-centric XML. XML Syntax

2009 Martin v. Löwis. Data-centric XML. XML Syntax Data-centric XML XML Syntax 2 What Is XML? Extensible Markup Language Derived from SGML (Standard Generalized Markup Language) Two goals: large-scale electronic publishing exchange of wide variety of data

More information

CS145 Introduction. About CS145 Relational Model, Schemas, SQL Semistructured Model, XML

CS145 Introduction. About CS145 Relational Model, Schemas, SQL Semistructured Model, XML CS145 Introduction About CS145 Relational Model, Schemas, SQL Semistructured Model, XML 1 Content of CS145 Design of databases. E/R model, relational model, semistructured model, XML, UML, ODL. Database

More information

Markup Languages SGML, HTML, XML, XHTML. CS 431 February 13, 2006 Carl Lagoze Cornell University

Markup Languages SGML, HTML, XML, XHTML. CS 431 February 13, 2006 Carl Lagoze Cornell University Markup Languages SGML, HTML, XML, XHTML CS 431 February 13, 2006 Carl Lagoze Cornell University Problem Richness of text Elements: letters, numbers, symbols, case Structure: words, sentences, paragraphs,

More information

XML. extensible Markup Language. Overview. Overview. Overview XML Components Document Type Definition (DTD) Attributes and Tags An XML schema

XML. extensible Markup Language. Overview. Overview. Overview XML Components Document Type Definition (DTD) Attributes and Tags An XML schema XML extensible Markup Language An introduction in XML and parsing XML Overview XML Components Document Type Definition (DTD) Attributes and Tags An XML schema 3011 Compiler Construction 2 Overview Overview

More information

EMERGING TECHNOLOGIES

EMERGING TECHNOLOGIES EMERGING TECHNOLOGIES XML (Part 3): XQuery Outline 1. Introduction 2. Structure of XML data 3. XML Document Schema 3.1. Document Type Definition (DTD) 3.2. XMLSchema 4. Data Model for XML documents. 5.

More information

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward

Comp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward Comp 336/436 - Markup Languages Fall Semester 2018 - Week 4 Dr Nick Hayward XML - recap first version of XML became a W3C Recommendation in 1998 a useful format for data storage and exchange config files,

More information

Overview. Introduction. Introduction XML XML. Lecture 16 Introduction to XML. Boriana Koleva Room: C54

Overview. Introduction. Introduction XML XML. Lecture 16 Introduction to XML. Boriana Koleva Room: C54 Overview Lecture 16 Introduction to XML Boriana Koleva Room: C54 Email: bnk@cs.nott.ac.uk Introduction The Syntax of XML XML Document Structure Document Type Definitions Introduction Introduction SGML

More information

Chapter 1: Getting Started. You will learn:

Chapter 1: Getting Started. You will learn: Chapter 1: Getting Started SGML and SGML document components. What XML is. XML as compared to SGML and HTML. XML format. XML specifications. XML architecture. Data structure namespaces. Data delivery,

More information

Delivery Options: Attend face-to-face in the classroom or remote-live attendance.

Delivery Options: Attend face-to-face in the classroom or remote-live attendance. XML Programming Duration: 5 Days Price: $2795 *California residents and government employees call for pricing. Discounts: We offer multiple discount options. Click here for more info. Delivery Options:

More information

XDS An Extensible Structure for Trustworthy Document Content Verification Simon Wiseman CTO Deep- Secure 3 rd June 2013

XDS An Extensible Structure for Trustworthy Document Content Verification Simon Wiseman CTO Deep- Secure 3 rd June 2013 Assured and security Deep-Secure XDS An Extensible Structure for Trustworthy Document Content Verification Simon Wiseman CTO Deep- Secure 3 rd June 2013 This technical note describes the extensible Data

More information

XML Structures. Web Programming. Uta Priss ZELL, Ostfalia University. XML Introduction Syntax: well-formed Semantics: validity Issues

XML Structures. Web Programming. Uta Priss ZELL, Ostfalia University. XML Introduction Syntax: well-formed Semantics: validity Issues XML Structures Web Programming Uta Priss ZELL, Ostfalia University 2013 Web Programming XML1 Slide 1/32 Outline XML Introduction Syntax: well-formed Semantics: validity Issues Web Programming XML1 Slide

More information

XML. Rodrigo García Carmona Universidad San Pablo-CEU Escuela Politécnica Superior

XML. Rodrigo García Carmona Universidad San Pablo-CEU Escuela Politécnica Superior XML Rodrigo García Carmona Universidad San Pablo-CEU Escuela Politécnica Superior XML INTRODUCTION 2 THE XML LANGUAGE XML: Extensible Markup Language Standard for the presentation and transmission of information.

More information

XML 2 APPLICATION. Chapter SYS-ED/ COMPUTER EDUCATION TECHNIQUES, INC.

XML 2 APPLICATION. Chapter SYS-ED/ COMPUTER EDUCATION TECHNIQUES, INC. XML 2 APPLIATION hapter SYS-ED/ OMPUTER EDUATION TEHNIQUES, IN. Objectives You will learn: How to create an XML document. The role of the document map, prolog, and XML declarations. Standalone declarations.

More information

XML Processing & Web Services. Husni Husni.trunojoyo.ac.id

XML Processing & Web Services. Husni Husni.trunojoyo.ac.id XML Processing & Web Services Husni Husni.trunojoyo.ac.id Based on Randy Connolly and Ricardo Hoar Fundamentals of Web Development, Pearson Education, 2015 Objectives 1 XML Overview 2 XML Processing 3

More information

XML and Web Services

XML and Web Services XML and Web Services Lecture 8 1 XML (Section 17) Outline XML syntax, semistructured data Document Type Definitions (DTDs) XML Schema Introduction to XML based Web Services 2 Additional Readings on XML

More information

Informatics 1: Data & Analysis

Informatics 1: Data & Analysis Informatics 1: Data & Analysis Lecture 9: Trees and XML Ian Stark School of Informatics The University of Edinburgh Tuesday 11 February 2014 Semester 2 Week 5 http://www.inf.ed.ac.uk/teaching/courses/inf1/da

More information

markup language carry data define your own tags self-descriptive W3C Recommendation

markup language carry data define your own tags self-descriptive W3C Recommendation XML intro What is XML? XML stands for EXtensible Markup Language XML is a markup language much like HTML XML was designed to carry data, not to display data XML tags are not predefined. You must define

More information

CSC Web Technologies, Spring Web Data Exchange Formats

CSC Web Technologies, Spring Web Data Exchange Formats CSC 342 - Web Technologies, Spring 2017 Web Data Exchange Formats Web Data Exchange Data exchange is the process of transforming structured data from one format to another to facilitate data sharing between

More information

Solutions. a. Yes b. No c. Cannot be determined without the DTD. d. Schema. 9. Explain the term extensible. 10. What is an attribute?

Solutions. a. Yes b. No c. Cannot be determined without the DTD. d. Schema. 9. Explain the term extensible. 10. What is an attribute? Chapter 7: Information Representation Method XML Solutions Summative Assessment Multiple-Choice Questions (MCQs) 1. XML was developed to overcome the limitations of the markup language. a. EDI b. SGML

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Chapter 1: Introduction Purpose of Database Systems Database Languages Relational Databases Database Design Data Models Database Internals Database Users and Administrators Overall

More information

The Semi-Structured Data Model. csc343, Introduction to Databases Diane Horton originally based on slides by Jeff Ullman Fall 2017

The Semi-Structured Data Model. csc343, Introduction to Databases Diane Horton originally based on slides by Jeff Ullman Fall 2017 The Semi-Structured Data Model csc343, Introduction to Databases Diane Horton originally based on slides by Jeff Ullman Fall 2017 Recap: Data models A data model is a notation for describing data, including:

More information

Using UML To Define XML Document Types

Using UML To Define XML Document Types Using UML To Define XML Document Types W. Eliot Kimber ISOGEN International, A DataChannel Company Created On: 10 Dec 1999 Last Revised: 14 Jan 2000 Defines a convention for the use of UML to define XML

More information

PART. Oracle and the XML Standards

PART. Oracle and the XML Standards PART I Oracle and the XML Standards CHAPTER 1 Introducing XML 4 Oracle Database 10g XML & SQL E xtensible Markup Language (XML) is a meta-markup language, meaning that the language, as specified by the

More information

XML. Document Type Definitions. Database Systems and Concepts, CSCI 3030U, UOIT, Course Instructor: Jarek Szlichta

XML. Document Type Definitions. Database Systems and Concepts, CSCI 3030U, UOIT, Course Instructor: Jarek Szlichta XML Document Type Definitions 1 XML XML stands for extensible Markup Language. XML was designed to describe data. XML has come into common use for the interchange of data over the Internet. 2 Well-Formed

More information

Introduction to XML. An Example XML Document. The following is a very simple XML document.

Introduction to XML. An Example XML Document. The following is a very simple XML document. Introduction to XML Extensible Markup Language (XML) was standardized in 1998 after 2 years of work. However, it developed out of SGML (Standard Generalized Markup Language), a product of the 1970s and

More information

Introduction to XML. Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University

Introduction to XML. Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University Introduction to XML Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University http://gear.kku.ac.th/~krunapon/xmlws 1 Topics p What is XML? p Why XML? p Where does XML

More information

extensible Markup Language (XML) Basic Concepts

extensible Markup Language (XML) Basic Concepts (XML) Basic Concepts Giuseppe Della Penna Università degli Studi di L Aquila dellapenna@univaq.it http://www.di.univaq.it/gdellape This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike

More information

W3C XML XML Overview

W3C XML XML Overview Overview Jaroslav Porubän 2008 References Tutorials, http://www.w3schools.com Specifications, World Wide Web Consortium, http://www.w3.org David Hunter, et al.: Beginning, 4th Edition, Wrox, 2007, 1080

More information

Introduction to XML (Extensible Markup Language)

Introduction to XML (Extensible Markup Language) Introduction to XML (Extensible Markup Language) 1 History and References XML is a meta-language, a simplified form of SGML (Standard Generalized Markup Language) XML was initiated in large parts by Jon

More information

Introduction to XML 3/14/12. Introduction to XML

Introduction to XML 3/14/12. Introduction to XML Introduction to XML Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University http://gear.kku.ac.th/~krunapon/xmlws 1 Topics p What is XML? p Why XML? p Where does XML

More information

Delivery Options: Attend face-to-face in the classroom or via remote-live attendance.

Delivery Options: Attend face-to-face in the classroom or via remote-live attendance. XML Programming Duration: 5 Days US Price: $2795 UK Price: 1,995 *Prices are subject to VAT CA Price: CDN$3,275 *Prices are subject to GST/HST Delivery Options: Attend face-to-face in the classroom or

More information

Data Exchange. Hyper-Text Markup Language. Contents: HTML Sample. HTML Motivation. Cascading Style Sheets (CSS) Problems w/html

Data Exchange. Hyper-Text Markup Language. Contents: HTML Sample. HTML Motivation. Cascading Style Sheets (CSS) Problems w/html Data Exchange Contents: Mariano Cilia / cilia@informatik.tu-darmstadt.de Origins (HTML) Schema DOM, SAX Semantic Data Exchange Integration Problems MIX Model 1 Hyper-Text Markup Language HTML Hypertext:

More information

XML & Related Languages

XML & Related Languages Unit 1 1 Introduction: XML XML = extensible Markup Language the universal format for structured documents and data on the Web. www.w3c.org/xml simple, very flexible text format derived from SGML (ISO 8879).

More information

CSI 3140 WWW Structures, Techniques and Standards. Representing Web Data: XML

CSI 3140 WWW Structures, Techniques and Standards. Representing Web Data: XML CSI 3140 WWW Structures, Techniques and Standards Representing Web Data: XML XML Example XML document: An XML document is one that follows certain syntax rules (most of which we followed for XHTML) Guy-Vincent

More information

Introduction. Web Pages. Example Graph

Introduction. Web Pages. Example Graph COSC 454 DB And the Web Introduction Overview Dynamic web pages XML and databases Reference: (Elmasri & Navathe, 5th ed) Ch. 26 - Web Database Programming Using PHP Ch. 27 - XML: Extensible Markup Language

More information

CSE 880. Advanced Database Systems. Semistuctured Data and XML

CSE 880. Advanced Database Systems. Semistuctured Data and XML CSE 880 Advanced Database Systems Semistuctured Data and XML S. Pramanik 1 Semistructured Data 1. Data is self describing with schema embedded to the data itself. 2. Theembeddedschemacanchangewithtimejustlike

More information

RepCom: A Customisable Report Generator Component System using XML-driven, Component-based Development Approach

RepCom: A Customisable Report Generator Component System using XML-driven, Component-based Development Approach RepCom: A Customisable Generator Component System using XML-driven, Component-based Development Approach LEONG CHEE HOONG, DR LEE SAI PECK Faculty of Computer Science & Information Technology University

More information

XML and information exchange. XML extensible Markup Language XML

XML and information exchange. XML extensible Markup Language XML COS 425: Database and Information Management Systems XML and information exchange 1 XML extensible Markup Language History 1988 SGML: Standard Generalized Markup Language Annotate text with structure 1992

More information