Web Data Management: An Introduction to Semistructured and XML Databases
|
|
- Louisa Webb
- 6 years ago
- Views:
Transcription
1 Web Data Management: An Introduction to Semistructured and XML Databases V. CHRISTOPHIDES I FUNDULAKI Department of Computer Science University of Crete ICS - FORTH, Heraklion, Crete 1 A bit of History Research: Internet industry: 1950 s: Lisp [Mac Carthy] 1957: Sputnik launches ARPA 1960 s: Tree languages [Buchi] 1972: First demonstration of ARPANET 1970 s: Relational DBs [Codd] 1989: Number of hosts breaks 100, : Graphlog [Univ. Toronto] 1991: CERN releases the World Wide Web HTML as the support for information 1994: O2 extensions [INRIA] 1997: 20 Million Hosts,1 Million Web sites 1995: Tsimmis & OEM [Stanford] 1998 : W3C releases XML to represent 1995: UnQL [UPenn] information on the Web Need to handle irregular Web data. Use graph data models.? XML provides a syntax for irregular textual Web information. 2 1
2 Documents vs Databases Document world Database world plenty of small documents usually static implicit structure section, paragraph, toc, tagging human friendly content form/layout, annotation paradigms Save as, WYSIWYG metadata author name, date, subject a few large databases usually dynamic explicit structure types records machine friendly content data, methods paradigms Data Independence, Transaction Management, Query Languages metadata schema description 3 What to do with them Documents Database editing spell-checking updating cleaning counting words retrieving (IR) printing querying composing/transforming 4 2
3 Query Languages Document Retrieval Claude Monet and San Diego Museum of Art Database Querying select p from Artists a, a.artwork p where a.first = Claude and a.last = Monet and p.located = San Diego Museum of Art 5 How the Web is Today? Information and its presentations are mixed up in the form of HTML documents all intended for human consumption many generated automatically by applications Easy to fetch any Web page, from any server, any platform access through a uniform interface 6 3
4 The Secrets of HTML Success Everybody can write it: HTML is simple HTML is textual: it is human readable, you can use any editor,... Hypertext Links Everybody can read it: HTML is portable on any platform The browser is the universal application Everybody can search it: Keyword-based Search Engines: high recall, low precision It connects pieces of information together Through hypertext links 7 What s Wrong with HTML If written properly, normal HTML markup may reflect document presentation, but it cannot adequately represent the semantics & structure of data Artist Name Artifact Title <B>MONET, Claude<B><BR> Date Haystacks at Chailly at Sunrise<BR> 1865<BR> Dimensions Oil on canvas<br> Material 30 x 60 cm (11 7/8 x 23 3/4 in.)<br> San Diego Museum of Art <BR> Image <P> Reference Museum <IMG SRC= m/monet/hayricks.jpg > 8 4
5 HTML Document Presentation 9 But Modern Web Applications Need More! Infomediaries: Community Web Portals Digital Museums & Libraries Electronic commerce: On-line Catalogs & Procurement Comparison Shoppers Market Places Virtual Enterprises Scientific applications: E-learning Data & Knowledge Grids Advanced Information Management finding, extracting, representing, interpreting, maintaining Flexible, Quick Interoperation: the ability to uniformly share, interpret and manipulate heterogeneous information applications cannot consume HTML More than HTML documents: Data on the Web More than Web browsers: Web-enabled Applications 10 5
6 Paradigm Shift on the Web New Web standard XML: XML generated by applications XML consumed by applications Data exchange: across platforms across organizations XML Data application application object-relational Integrate WEB (HTTP) Transform Warehouse Web: from collection of documents to Web data published as documents relational data application legacy data 11 XML Data Representation: The Document View <ARTIST> Element Name Element Content <NAME> <FIRST>Claude</FIRST> <LAST>Monet</LAST> </NAME> <ARTWORK> <ARTIFACT> <TITLE>Haystacks at Chailly at Sunrise</TITLE> Attribute Name <DATE>1865</DATE> <MATERIAL>Oil on canvas</material> <DIM Metric= cm > <HEIGHT>30</HEIGHT><WIDTH>60</WIDTH></DIM> <DIM Metric= in > <HEIGHT>11 7/8</HEIGHT><WIDTH>23 3/4</WIDTH></DIM> <LOCATION>San Diego Museum of Art</LOCATION> <IMAGE File= /> </ARTIFACT> </ARTWORK> </ARTIST> Empty Element Attribute Value 12 6
7 XML Data Representation: The Database View ARTIST NAME ARTWORK FIRST LAST ARTIFACT Claude MONET TITLE DATE Haystacks 1865 Oil on canvas DIM DIM IMAGE...hayricks.jpg H W H W /8 3/4 MATERIAL LOCATION San Diego Mus. 13 The Secrets of XML Popularity It looks like HTML... Simple, familiar, easy to learn, human-readable Universal and portable Supported by the W3C: trusted and quickly adopted by the industry but it s more than HTML! flexible: you can represent any information extensible: you can represent it the way you want! Increasing precision in XML specifications Well-Formed: already better than plain text Valid: Structure conforms to a DTD or an XML Schema <?XML!> 14 7
8 Well-Formed XML An object is said to be a well-formed XML document if it meets all the well-formedness constraints (WFCs) of the XML syntax: tags (etc.) are syntactically correct every tag has an end-tag tags are properly nested there exists a root By definition if a document is not well-formed, it is not XML This means that there is no an XML document which is not wellformed, and XML processors are not required to do anything with such documents 15 Valid XML A well-formed document is valid only if it contains a proper DTD (or Schema) and if the document obeys the constraints of that DTD (or Schema) and therefore the XML Validity Constraints (VCs) only declared tags (element or attribute names) are used all tag occurrences conform to specified content models Examples: The following XML Document is well-formed but not valid <ARTIST>Claude Monet</ARTIST> The following XML Document is not even well-formed <FIRST>Claude</FIRST><LAST>Monet</LAST> 16 8
9 XML Document Type Definition (DTD) <!DOCTYPE artist [ <!ELEMENT artist (name, born, death, artwork, nationality?, influences)> <!ATTLIST artist oid ID #REQUIRED> <!ELEMENT name (first, last)> <!ELEMENT first (#PCDATA)> <!ELEMENT last (#PCDATA)>... <!ELEMENT artwork (artifact+)> <!ELEMENT artifact (title, date, material, dim*, location, image)> <!ELEMENT title (#PCDATA)>... <!ELEMENT dim (height, width)> <!ATTLIST dim metric (cm in) cm > <!ELEMENT location (#PCDATA)> <!ELEMENT image EMPTY> <!ATTLIST image file ENTITY #REQUIRED> <!ELEMENT influences (PCDATA aref)*> <!ELEMENT aref EMPTY> <!ATTLIST aref oref IDREF #IMPLIED> ]> 17 XML Anatomy 18 9
10 Is XML the Solution to Interoperability? Document = medium for exchanging information Still need to agree on: DTDs or Schemas Meaning of tags Operations on data Meaning of operations ARTIST ARTIST NAME ARTWORK NAME ARTWORK FIRST LAST ClaudeMONET TITLE DATE Hayst1865 acks MATERIAL ARTIFACT DIM Oil on canvas DIM IMAGE H W H W hayricks.jpg /83/4 LOCATION San Diego Mus. Communication FIRST LAST ClaudeMONET TITLE DATE Hayst1865 acks MATERIAL ARTIFACT Application 1 Application 2 DIM Oil on canvas DIM IMAGE H W H W hayricks.jpg /83/4 LOCATION San Diego Mus. 19 Large Scale Interoperation on the Web? Communication Partner using DTD B?? XML-based Communication using DTD A Communication Partner using DTD C Sender using DTD A Recipient using DTD A 20 10
11 Interoperability is still an Open Issue! Semantic discrepancies : Synonymy & Polysemy & Taxonomy <ARTIFACT> vs. <ARTEFACT> is <ARTWORK> paintings or songs? how < Style= Impressionism > is related to < Style= Pointillism >? Structural discrepancies : Aggregation <NAME><FIRST>Claude</FIRST><LAST>Monet</LAST></NAME> vs<name>claude Monet</NAME> Type <ARTIFACT Kind= Painting >... </ARTIFACT> vs<painting>claude Monet</PAINTING> Syntactic discrepancies : <ARTIST Name= Claude Monet >... </ARTIST> vs<artist> <NAME>Claude Monet</NAME>... </ARTIST> More than Web Data: Semantics on the Web More than Web Applications: Web Services 21 The Semantic Web Vision: A Web of Meaning The Next Generation Web aims to provide infrastructure for expressing information in a precise, human-readable, and machineinterpretable form Enable both syntactic and semantic/ structural interoperability among independently-developed Web applications, allowing them to efficiently perform sophisticated tasks for humans Enable Web resources (data & applications) to be accessible by their meaning rather than by keywords and syntactic forms Conceptual Navigation & Querying Inference Services (Picasso is an Artist) Techniques Artists Semantic Relationships Artifacts Museums 22 11
12 Web Innovation: From Web Sites to Web Services Packets Connect TCP/IP Browse/ Present Integrate/ Web Sites Today Collaborate Transact Program/ Web Services HTML XML UDDI, SOAP, RosettaNet 1970s 1980s 1990s Middleware Evolution & Interoperability 24 12
13 About the W3C and the XML Activities Membership organization Core XML WG Different types of groups inside W3C: extensible Markup Language (XML 1.0), namespaces, Infoset Working groups XML Linking WG Interest groups XML Pointer Language (Xpointer), Coordination groups XML Linking language Status for W3C documents: XML Schema WG Working draft XML Query WG Last Call XML Data Model, Algebra and Candidate/proposed Query Language recommendation Document Object Model WG Recommendation ~ Standard XSL WG XPath (with XML Linking WG) Transformation and stylesheet language (XSLT/XSL) 25 W3C XML Related Specifications W3C rec industry std W3C draft Open std XML Core XML 1.0 XML namespaces Xfragment RDF Canonical Xpath APIs JDOM JAXP DOM 1 DOM 2 DOM 3 XML signature XSLT XSL Xpointer XML query. Xlink XML base Infoset XHTML events MathML SMIL 1 & 2 SVG... XHTML 1.0 SAX 1 SAX 2 XML schema SOAP UDDI Biztalk FinXML CSS 1 XML-RPC ebxml IFX dirxml CSS 2 WSDL WDDX XMI CSS 3... FpML Style Protocols Web Services Application areas Xforms Modularized XHTML 100's more... XHTML basic Ian GRAHAM 26 13
14 XML is Just the Beginning... We now want to build advanced Web applications There is an urgent need for XML tools Designing XML tools is a data management problem: XML 1.0 to describe structured documents = Syntax for trees XML data models to describe the information content = Data model for trees XML schemas to describe the structure of information = Data definition language for trees XML languages to describe information processing = Data manipulation language for trees 27 Why is Database Work Important? Databases are large collections of data Even basic APIs to XML will fail on large XML documents Other reasons -- all the good things that the DB community brought the world: Data models, integrity constraints, and schemas Query languages, optimizers, fast joins Views, Updates Concurrency Federated database systems The emergence of XML underscores the importance of semistructured data 28 14
15 XML Main Characteristics: Semistructured Data Data Schema is not what it used to be: not given in advance (self-describing, schema-less), descriptive, not prescriptive (designed by document, not db experts), partial (documents and data mixed together), rapidly evolving (without notice), may be large (compared to the size of the data) Data Types are not what they used to be: elements and attributes are not strongly typed missing or additional attributes multiple attributes elements in the same collection may have different types i.e., heterogeneous collections attributes with different types in different elements 29 Schemas are Useful for Data readers What info is in a given collection? Thus, what queries might make sense? Data writers What should I call this piece of info? Is it okay to put this kind of data here? Efficient/effective data manipulation Optimize query processing Facilitate integration of multiple data sources Improve storage Construct indexes, statistics Forbid certain types of updates 30 15
16 Needs also for a DB Paradigm Shift Managing semistructured/xml data requires rethinking the design of components of a DBMS: How do we model it? directed labeled trees with references (i.e. graphs) How do we query it? a new standard based on functional languages and regular path expressions How do we store the data? looking for structural patterns How do we optimize queries? beginning to understand (algebras, indexes, etc.) What about Integrity constraints, views, updates,,? 31 Towards a Convergence Databases: relax rigid constraints imposed by schemas Move to a dynamic type system: semistructured data SGML, Document Management Origins HTML, Web Pages Documents: enrich formatting instructions with structuring/ semantic information Add types to documents: XML Semistructured Data ~ = XML ANS.1, ACeDB Scientific Data Formats Semi-structured data models for data integration 32 16
17 What This Course is About What the database community has done: Semistructured data model: SSD-exps, labeled graphs OEM, UnQL and YATL Schemas, Storage, Query Optimization What the Web community has done: Data formats and APIs: XML 1.0, DOM Transformation and Stylesheet languages (XSLT/XSL) Where they meet and where they differ Comparison to relational and object-oriented data models Present emerging XML technology as a data management issue XML Data models XML Data Definition (Schema) Languages XML Data Manipulation (Query) languages 33 Your Research Projects XML Data Semantics Type Systems Structural & Integrity Constraints Incremental Validation XML Query Processing XQuery Algebras Tree Query Pattern Containment & Minimization XPath Engines Stream-based Query Processing XML Query Optimization Storage Schemes Labelling & Indexing Schemes Structural Joins & Cost Models Data Statistics & Compression Benchmarks, Real & Synthetic Data XML Data Management Updates, Evolution & Versioning Access Control & Active Rules Data Publishing & Relational Databases Warehouses & View Maintenance XML Database Systems Commercial DBMS Native DBMS 34 17
18 Retrospective 1960 s: Data Centric 1970 s: Process Centric 1980 s: Object Oriented 1990 s: Component Based 2000 s: XML? 35 Data was our First Focus Record Layouts Printer Layouts System Flow Charts Decision Tables 60 s Data Batch Jobs were a Series of small Programs 36 18
19 Then we Focused on Logic GOTO-Less Programming Structured Programming Top-Down Design 60 s Data 70 s Logic Programs Became Very Large 37 Object Oriented Programming Focused on Runtime Behavior Common Terms for Analysis and Design Tightly Coupled Code 60 s Data 70 s Logic Code Reuse was the Holy Grail, Rarely Achieved 80 s OO 38 19
20 Component Programming Shifted the Focus to Interfaces Code Reuse IDE-Based Composition Limited Acceptance 60 s Data 90 s Comp 70 s Logic Serialization Tied to Code 80 s OO 39 XML Returns the Focus to Data XML Wrappers for Incompatible Systems Industry-Specific Markup Languages XML for Persistent Data and Composition 00 s XML 90 s Comp 70 s Logic XML Enables Middleware for Application-Specific Data 80 s OO 40 20
XML: Extensible Markup Language
XML: Extensible Markup Language CSC 375, Fall 2015 XML is a classic political compromise: it balances the needs of man and machine by being equally unreadable to both. Matthew Might Slides slightly modified
More informationDelivery Options: Attend face-to-face in the classroom or remote-live attendance.
XML Programming Duration: 5 Days Price: $2795 *California residents and government employees call for pricing. Discounts: We offer multiple discount options. Click here for more info. Delivery Options:
More informationDelivery Options: Attend face-to-face in the classroom or via remote-live attendance.
XML Programming Duration: 5 Days US Price: $2795 UK Price: 1,995 *Prices are subject to VAT CA Price: CDN$3,275 *Prices are subject to GST/HST Delivery Options: Attend face-to-face in the classroom or
More informationIntroduction to XML. XML: basic elements
Introduction to XML XML: basic elements XML Trying to wrap your brain around XML is sort of like trying to put an octopus in a bottle. Every time you think you have it under control, a new tentacle shows
More informationA tutorial report for SENG Agent Based Software Engineering. Course Instructor: Dr. Behrouz H. Far. XML Tutorial.
A tutorial report for SENG 609.22 Agent Based Software Engineering Course Instructor: Dr. Behrouz H. Far XML Tutorial Yanan Zhang Department of Electrical and Computer Engineering University of Calgary
More informationXML Metadata Standards and Topic Maps
XML Metadata Standards and Topic Maps Erik Wilde 16.7.2001 XML Metadata Standards and Topic Maps 1 Outline what is XML? a syntax (not a data model!) what is the data model behind XML? XML Information Set
More informationXML Applications. Introduction Jaana Holvikivi 1
XML Applications Introduction 1.4.2009 Jaana Holvikivi 1 Outline XML standards Application areas 1.4.2009 Jaana Holvikivi 2 Basic XML standards XML a meta language for the creation of languages to define
More informationChapter 13 XML: Extensible Markup Language
Chapter 13 XML: Extensible Markup Language - Internet applications provide Web interfaces to databases (data sources) - Three-tier architecture Client V Application Programs Webserver V Database Server
More informationXML for Java Developers G Session 8 - Main Theme XML Information Rendering (Part II) Dr. Jean-Claude Franchitti
XML for Java Developers G22.3033-002 Session 8 - Main Theme XML Information Rendering (Part II) Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical
More informationCopyright 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley. Chapter 7 XML
Chapter 7 XML 7.1 Introduction extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML Lax syntactical rules Many complex features that are rarely used HTML
More informationIntroduction to XML. Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University
Introduction to XML Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University http://gear.kku.ac.th/~krunapon/xmlws 1 Topics p What is XML? p Why XML? p Where does XML
More information7.1 Introduction. extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML
7.1 Introduction extensible Markup Language Developed from SGML A meta-markup language Deficiencies of HTML and SGML Lax syntactical rules Many complex features that are rarely used HTML is a markup language,
More informationIntroduction to XML 3/14/12. Introduction to XML
Introduction to XML Asst. Prof. Dr. Kanda Runapongsa Saikaew Dept. of Computer Engineering Khon Kaen University http://gear.kku.ac.th/~krunapon/xmlws 1 Topics p What is XML? p Why XML? p Where does XML
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411 1 Extensible
More informationThe XML Metalanguage
The XML Metalanguage Mika Raento mika.raento@cs.helsinki.fi University of Helsinki Department of Computer Science Mika Raento The XML Metalanguage p.1/442 2003-09-15 Preliminaries Mika Raento The XML Metalanguage
More informationIntroduction to Semistructured Data and XML. Overview. How the Web is Today. Based on slides by Dan Suciu University of Washington
Introduction to Semistructured Data and XML Based on slides by Dan Suciu University of Washington CS330 Lecture April 8, 2003 1 Overview From HTML to XML DTDs Querying XML: XPath Transforming XML: XSLT
More informationAgenda. Summary of Previous Session. XML for Java Developers G Session 7 - Main Theme XML Information Rendering (Part II)
XML for Java Developers G22.3033-002 Session 7 - Main Theme XML Information Rendering (Part II) Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical
More informationKINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY. (An NBA Accredited Programme) ACADEMIC YEAR / EVEN SEMESTER
KINGS COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY (An NBA Accredited Programme) ACADEMIC YEAR 2012-2013 / EVEN SEMESTER YEAR / SEM : IV / VIII BATCH: 2009-2013 (2008 Regulation) SUB CODE
More informationWeb Technologies Present and Future of XML
Web Technologies Present and Future of XML Faculty of Computer Science A.I.Cuza University of Iasi, Romania busaco@infoiasi.ro http://www.infoiasi.ro/~busaco Ph.D. Student: Multimedia Object Manipulation
More informationextensible Markup Language
extensible Markup Language XML is rapidly becoming a widespread method of creating, controlling and managing data on the Web. XML Orientation XML is a method for putting structured data in a text file.
More informationChapter 1: Getting Started. You will learn:
Chapter 1: Getting Started SGML and SGML document components. What XML is. XML as compared to SGML and HTML. XML format. XML specifications. XML architecture. Data structure namespaces. Data delivery,
More informationAuthor: Irena Holubová Lecturer: Martin Svoboda
NPRG036 XML Technologies Lecture 1 Introduction, XML, DTD 19. 2. 2018 Author: Irena Holubová Lecturer: Martin Svoboda http://www.ksi.mff.cuni.cz/~svoboda/courses/172-nprg036/ Lecture Outline Introduction
More informationComp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward
Comp 336/436 - Markup Languages Fall Semester 2017 - Week 4 Dr Nick Hayward XML - recap first version of XML became a W3C Recommendation in 1998 a useful format for data storage and exchange config files,
More informationXML: Introduction. !important Declaration... 9:11 #FIXED... 7:5 #IMPLIED... 7:5 #REQUIRED... Directive... 9:11
!important Declaration... 9:11 #FIXED... 7:5 #IMPLIED... 7:5 #REQUIRED... 7:4 @import Directive... 9:11 A Absolute Units of Length... 9:14 Addressing the First Line... 9:6 Assigning Meaning to XML Tags...
More informationData Presentation and Markup Languages
Data Presentation and Markup Languages MIE456 Tutorial Acknowledgements Some contents of this presentation are borrowed from a tutorial given at VLDB 2000, Cairo, Agypte (www.vldb.org) by D. Florescu &.
More informationPASS4TEST. IT Certification Guaranteed, The Easy Way! We offer free update service for one year
PASS4TEST IT Certification Guaranteed, The Easy Way! \ http://www.pass4test.com We offer free update service for one year Exam : 000-141 Title : XML and related technologies Vendors : IBM Version : DEMO
More informationComp 336/436 - Markup Languages. Fall Semester Week 4. Dr Nick Hayward
Comp 336/436 - Markup Languages Fall Semester 2018 - Week 4 Dr Nick Hayward XML - recap first version of XML became a W3C Recommendation in 1998 a useful format for data storage and exchange config files,
More informationCopyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 27-1
Slide 27-1 Chapter 27 XML: Extensible Markup Language Chapter Outline Introduction Structured, Semi structured, and Unstructured Data. XML Hierarchical (Tree) Data Model. XML Documents, DTD, and XML Schema.
More informationExtreme Java G Session 3 - Sub-Topic 5 XML Information Rendering. Dr. Jean-Claude Franchitti
Extreme Java G22.3033-007 Session 3 - Sub-Topic 5 XML Information Rendering Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical Sciences 1 Agenda
More informationMarkup Languages SGML, HTML, XML, XHTML. CS 431 February 13, 2006 Carl Lagoze Cornell University
Markup Languages SGML, HTML, XML, XHTML CS 431 February 13, 2006 Carl Lagoze Cornell University Problem Richness of text Elements: letters, numbers, symbols, case Structure: words, sentences, paragraphs,
More informationXML. Jonathan Geisler. April 18, 2008
April 18, 2008 What is? IS... What is? IS... Text (portable) What is? IS... Text (portable) Markup (human readable) What is? IS... Text (portable) Markup (human readable) Extensible (valuable for future)
More informationIntroduction to Semistructured Data and XML
Introduction to Semistructured Data and XML Chapter 27, Part D Based on slides by Dan Suciu University of Washington Database Management Systems, R. Ramakrishnan 1 How the Web is Today HTML documents often
More informationM359 Block5 - Lecture12 Eng/ Waleed Omar
Documents and markup languages The term XML stands for extensible Markup Language. Used to label the different parts of documents. Labeling helps in: Displaying the documents in a formatted way Querying
More informationXML. extensible Markup Language. ... and its usefulness for linguists
XML extensible Markup Language... and its usefulness for linguists Thomas Mayer thomas.mayer@uni-konstanz.de Fachbereich Sprachwissenschaft, Universität Konstanz Seminar Computerlinguistik II (Miriam Butt)
More information11. EXTENSIBLE MARKUP LANGUAGE (XML)
11. EXTENSIBLE MARKUP LANGUAGE (XML) Introduction Extensible Markup Language is a Meta language that describes the contents of the document. So these tags can be called as self-describing data tags. XML
More informationStructured documents
Structured documents An overview of XML Structured documents Michael Houghton 15/11/2000 Unstructured documents Broadly speaking, text and multimedia document formats can be structured or unstructured.
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2017 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid= 2465 1
More informationIntroduction to Semistructured Data and XML. Contents
Contents Overview... 106 What is XML?... 106 How the Web is Today... 108 New Universal Data Exchange Format: XML... 108 What is the W3C?... 108 Semistructured Data... 110 What is Self-describing Data?...
More informationAgenda. XML Generics. XML for Java Developers G Session 1 - Main Theme Markup Language Technologies (Part I)
XML for Java Developers G22.3033-002 Session 1 - Main Theme Markup Language Technologies (Part I) Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical
More informationAgenda. Summary of Previous Session. XML for Java Developers G Session 6 - Main Theme XML Information Processing (Part II)
XML for Java Developers G22.3033-002 Session 6 - Main Theme XML Information Processing (Part II) Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical
More informationW3C XML XML Overview
Overview Jaroslav Porubän 2008 References Tutorials, http://www.w3schools.com Specifications, World Wide Web Consortium, http://www.w3.org David Hunter, et al.: Beginning, 4th Edition, Wrox, 2007, 1080
More informationIntroduction. Leonidas Fegaras University of Texas at Arlington. Web Data Management and XML L1: Introduction 1
Introduction Leonidas Fegaras University of Texas at Arlington Web Data Management and XML L1: Introduction 1 Information Class: TuTh 2:00-3:20pm (NH 111) Instructor: Leonidas Fegaras Office: GACB 115
More informationChapter 1: Semistructured Data Management XML
Chapter 1: Semistructured Data Management XML XML - 1 The Web has generated a new class of data models, which are generally summarized under the notion semi-structured data models. The reasons for that
More informationData Exchange. Hyper-Text Markup Language. Contents: HTML Sample. HTML Motivation. Cascading Style Sheets (CSS) Problems w/html
Data Exchange Contents: Mariano Cilia / cilia@informatik.tu-darmstadt.de Origins (HTML) Schema DOM, SAX Semantic Data Exchange Integration Problems MIX Model 1 Hyper-Text Markup Language HTML Hypertext:
More information- XML. - DTDs - XML Schema - XSLT. Web Services. - Well-formedness is a REQUIRED check on XML documents
Purpose of this day Introduction to XML for parliamentary documents (and all other kinds of documents, actually) Prof. Fabio Vitali University of Bologna Introduce the principal aspects of electronic management
More informationXML Technologies Dissected Erik Wilde Swiss Federal Institute of Technology, Zürich
XML Technologies Dissected Erik Wilde Swiss Federal Institute of Technology, Zürich The lack of well-defined information models in many XML technologies can generate compatibility problems and lower the
More informationCOMP9321 Web Application Engineering. Extensible Markup Language (XML)
COMP9321 Web Application Engineering Extensible Markup Language (XML) Dr. Basem Suleiman Service Oriented Computing Group, CSE, UNSW Australia Semester 1, 2016, Week 4 http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2442
More informationContents. Markup Language and the need of XML. Using environment XML and growth direction. To understand dxml standard.
Contents Markup Language and the need of XML. Using environment XML and growth direction. To understand dxml standard. d An application of XML documents. Markup Language and the need of XML Markup definition
More informationUser Interaction: XML and JSON
User Interaction: XML and JSON Assoc. Professor Donald J. Patterson INF 133 Fall 2012 1 HTML and XML 1989: Tim Berners-Lee invents the Web with HTML as its publishing language Based on SGML Separates data
More informationDEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
SHRI ANGALAMMAN COLLEGE OF ENGINEERING & TECHNOLOGY (An ISO 9001:2008 Certified Institution) SIRUGANOOR,TRICHY-621105. DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Year/Sem: IV / VII CS1401 INTERNET
More informationXML: some structural principles
XML: some structural principles Hayo Thielecke University of Birmingham www.cs.bham.ac.uk/~hxt October 18, 2011 1 / 25 XML in SSC1 versus First year info+web Information and the Web is optional in Year
More informationXML Technologies. Doc. RNDr. Irena Holubova, Ph.D. Web pages:
XML Technologies Doc. RNDr. Irena Holubova, Ph.D. holubova@ksi.mff.cuni.cz Web pages: http://www.ksi.mff.cuni.cz/~holubova/nprg036/ Outline Introduction to XML format, overview of XML technologies DTD
More information- What we actually mean by documents (the FRBR hierarchy) - What are the components of documents
Purpose of these slides Introduction to XML for parliamentary documents (and all other kinds of documents, actually) Prof. Fabio Vitali University of Bologna Part 1 Introduce the principal aspects of electronic
More informationChapter 1: Semistructured Data Management XML
Chapter 1: Semistructured Data Management XML 2006/7, Karl Aberer, EPFL-IC, Laboratoire de systèmes d'informations répartis XML - 1 The Web has generated a new class of data models, which are generally
More informationXML Processing & Web Services. Husni Husni.trunojoyo.ac.id
XML Processing & Web Services Husni Husni.trunojoyo.ac.id Based on Randy Connolly and Ricardo Hoar Fundamentals of Web Development, Pearson Education, 2015 Objectives 1 XML Overview 2 XML Processing 3
More informationSDPL : XML Basics 2. SDPL : XML Basics 1. SDPL : XML Basics 4. SDPL : XML Basics 3. SDPL : XML Basics 5
2 Basics of XML and XML documents 2.1 XML and XML documents Survivor's Guide to XML, or XML for Computer Scientists / Dummies 2.1 XML and XML documents 2.2 Basics of XML DTDs 2.3 XML Namespaces XML 1.0
More information1.264 Lecture 13 XML
1.264 Lecture 13 XML What is XML? Extensible Markup Language (XML) is: a World Wide Web Consortium (W3C) proposed recommendation for a file format to easily and cheaply distribute electronic documents
More informationIntroduction Syntax and Usage XML Databases Java Tutorial XML. November 5, 2008 XML
Introduction Syntax and Usage Databases Java Tutorial November 5, 2008 Introduction Syntax and Usage Databases Java Tutorial Outline 1 Introduction 2 Syntax and Usage Syntax Well Formed and Valid Displaying
More informationXML. Objectives. Duration. Audience. Pre-Requisites
XML XML - extensible Markup Language is a family of standardized data formats. XML is used for data transmission and storage. Common applications of XML include business to business transactions, web services
More informationIntroduction to Web Technologies
Introduction to Web Technologies James Curran and Tara Murphy 16th April, 2009 The Internet CGI Web services HTML and CSS 2 The Internet is a network of networks ˆ The Internet is the descendant of ARPANET
More informationDatabase Systems: Design, Implementation, and Management Tenth Edition. Chapter 14 Database Connectivity and Web Technologies
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 14 Database Connectivity and Web Technologies Database Connectivity Mechanisms by which application programs connect and communicate
More information.. Cal Poly CPE/CSC 366: Database Modeling, Design and Implementation Alexander Dekhtyar..
.. Cal Poly CPE/CSC 366: Database Modeling, Design and Implementation Alexander Dekhtyar.. XML in a Nutshell XML, extended Markup Language is a collection of rules for universal markup of data. Brief History
More informationXML and information exchange. XML extensible Markup Language XML
COS 425: Database and Information Management Systems XML and information exchange 1 XML extensible Markup Language History 1988 SGML: Standard Generalized Markup Language Annotate text with structure 1992
More informationXML Update. Royal Society of the Arts London, December 8, Jon Bosak Sun Microsystems
XML Update Royal Society of the Arts London, December 8, 1998 Jon Bosak Sun Microsystems XML Basics...A-1 The XML Concept...B-1 XML in Context...C-1 XML and Open Standards...D-1 XML Update XML Basics XML
More information5/19/2015. Objectives. JavaScript, Sixth Edition. Introduction to the World Wide Web (cont d.) Introduction to the World Wide Web
Objectives JavaScript, Sixth Edition Chapter 1 Introduction to JavaScript When you complete this chapter, you will be able to: Explain the history of the World Wide Web Describe the difference between
More informationXML: the document format of the future?
Arco User Conference 99 XML: the document format of the future? Hans C. Arents senior IT market analyst I.T. Works Guiding the IT Professional Innovation Center, Technologiepark 3, B-9052 Gent (Belgium),
More informationCSI 3140 WWW Structures, Techniques and Standards. Representing Web Data: XML
CSI 3140 WWW Structures, Techniques and Standards Representing Web Data: XML XML Example XML document: An XML document is one that follows certain syntax rules (most of which we followed for XHTML) Guy-Vincent
More informationEMERGING TECHNOLOGIES. XML Documents and Schemas for XML documents
EMERGING TECHNOLOGIES XML Documents and Schemas for XML documents Outline 1. Introduction 2. Structure of XML data 3. XML Document Schema 3.1. Document Type Definition (DTD) 3.2. XMLSchema 4. Data Model
More informationDevice Independent Principles for Adapted Content Delivery
Device Independent Principles for Adapted Content Delivery Tayeb Lemlouma 1 and Nabil Layaïda 2 OPERA Project Zirst 655 Avenue de l Europe - 38330 Montbonnot, Saint Martin, France Tel: +33 4 7661 5281
More informationKINGS COLLEGE OF ENGINEERING 1
KINGS COLLEGE OF ENGINEERING Department of Computer Science & Engineering Academic Year 2011 2012(Odd Semester) QUESTION BANK Subject Code/Name: CS1401-Internet Computing Year/Sem : IV / VII UNIT I FUNDAMENTALS
More informationWeb Programming Paper Solution (Chapter wise)
What is valid XML document? Design an XML document for address book If in XML document All tags are properly closed All tags are properly nested They have a single root element XML document forms XML tree
More informationSEARCH SEMI-STRUCTURED DATA ON WEB
SEARCH SEMI-STRUCTURED DATA ON WEB Sabin-Corneliu Buraga 1, Teodora Rusu 2 1 Faculty of Computer Science, Al.I.Cuza University of Iaşi, Romania Berthelot Str., 16 6600 Iaşi, Romania, tel: +40 (32 201529,
More informationSERVICE ORIENTED ARCHITECTURE 2 MARK QUESTION WITH ANSWER
SERVICE ORIENTED ARCHITECTURE 2 MARK QUESTION WITH ANSWER UNIT-I 1. What is XML? XML is a set of rules for structuring, storing and transferring information. This language is used to describe the data
More informationIntroduction to XML Zdeněk Žabokrtský, Rudolf Rosa
NPFL092 Technology for Natural Language Processing Introduction to XML Zdeněk Žabokrtský, Rudolf Rosa November 28, 2018 Charles Univeristy in Prague Faculty of Mathematics and Physics Institute of Formal
More informationXML Primer Plus By Nicholas Chase
Table of Contents Index XML Primer Plus By Nicholas Chase Publisher : Sams Publishing Pub Date : December 16, 2002 ISBN : 0-672-32422-9 Pages : 1024 This book presents XML programming from a conceptual
More informationIntroduction. Web Pages. Example Graph
COSC 454 DB And the Web Introduction Overview Dynamic web pages XML and databases Reference: (Elmasri & Navathe, 5th ed) Ch. 26 - Web Database Programming Using PHP Ch. 27 - XML: Extensible Markup Language
More informationXML: Managing with the Java Platform
In order to learn which questions have been answered correctly: 1. Print these pages. 2. Answer the questions. 3. Send this assessment with the answers via: a. FAX to (212) 967-3498. Or b. Mail the answers
More informationIT6801-SERVICE ORIENTED ARCHITECTURE
ST.JOSEPH COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING IT 6801-SERVICE ORIENTED ARCHITECTURE UNIT I 2 MARKS 1. Define XML. Extensible Markup Language(XML) is a markup language
More informationEMC Documentum xdb. High-performance native XML database optimized for storing and querying large volumes of XML content
DATA SHEET EMC Documentum xdb High-performance native XML database optimized for storing and querying large volumes of XML content The Big Picture Ideal for content-oriented applications like dynamic publishing
More informationThe Wonderful World of XML. Presented by Laurie K. Brooks AML Consulting, Inc.
The Wonderful World of XML Presented by Laurie K. Brooks AML Consulting, Inc. XML Precursors Hypertext and Multimedia => Hypermedia Internet => World Wide Web Generalized Markup => SGML and HTML Proprietary
More informationDocument-Centric Computing
Document-Centric Computing White Paper Abstract A document is a basic instrument for business and personal interaction and for capturing and communicating information and knowledge. Until the invention
More informationData Formats and APIs
Data Formats and APIs Mike Carey mjcarey@ics.uci.edu 0 Announcements Keep watching the course wiki page (especially its attachments): https://grape.ics.uci.edu/wiki/asterix/wiki/stats170ab-2018 Ditto for
More informationDatabases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016
+ Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis kostas.stefanidis@uta.fi http://www.uta.fi/sis/tie/dbir/index.html http://people.uta.fi/~kostas.stefanidis/dbir16/dbir16-main.html
More informationService Oriented Architectures Visions Concepts Reality
Service Oriented Architectures Visions Concepts Reality CSC March 2006 Alexander Schatten Vienna University of Technology Vervest und Heck, 2005 A Service Oriented Architecture enhanced by semantics, would
More informationXML. Rodrigo García Carmona Universidad San Pablo-CEU Escuela Politécnica Superior
XML Rodrigo García Carmona Universidad San Pablo-CEU Escuela Politécnica Superior XML INTRODUCTION 2 THE XML LANGUAGE XML: Extensible Markup Language Standard for the presentation and transmission of information.
More informationSome more XML applications and XML-related standards (XLink, XPointer, XForms)
Some more XML applications and XML-related standards (XLink, XPointer, XForms) Patryk Czarnik XML and Applications 2014/2015 Lecture 12 19.01.2015 Standards for inter-document relations XPointer addressing
More informationAgenda Summary of Previous Session / Review New Syllabus
XML for Java Developers G22.3033-002 Session 2 - Main Theme Markup Language Technologies (Part II) Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical
More informationTeiid Designer User Guide 7.5.0
Teiid Designer User Guide 1 7.5.0 1. Introduction... 1 1.1. What is Teiid Designer?... 1 1.2. Why Use Teiid Designer?... 2 1.3. Metadata Overview... 2 1.3.1. What is Metadata... 2 1.3.2. Editing Metadata
More informationSemistructured Content
On our first day Semistructured Content 1 Structured data : database system tagged, typed well-defined semantic interpretation Semi-structured data: tagged - (HTML?) some help with semantic interpretation
More informationUser Interaction: XML and JSON
User Interaction: XML and JSON Asst. Professor Donald J. Patterson INF 133 Fall 2011 1 What might a design notebook be like? Cooler What does a design notebook entry look like? HTML and XML 1989: Tim Berners-Lee
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 12 (Wrap-up) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411
More informationCOPYRIGHTED MATERIAL. Contents. Part I: Introduction 1. Chapter 1: What Is XML? 3. Chapter 2: Well-Formed XML 23. Acknowledgments
Acknowledgments Introduction ix xxvii Part I: Introduction 1 Chapter 1: What Is XML? 3 Of Data, Files, and Text 3 Binary Files 4 Text Files 5 A Brief History of Markup 6 So What Is XML? 7 What Does XML
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 1, 2017 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 12 (Wrap-up) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2457
More informationRich Web Application Backplane
IBM Software Group Rich Web Application Backplane John Boyer One way to look at it Markup delivered by a web application today must juggle hardening requirements Layout and rendition of text, images, interaction
More informationXML Overview, part 1
XML Overview, part 1 Norman Gray Revision 1.4, 2002/10/30 XML Overview, part 1 p.1/28 Contents The who, what and why XML Syntax Programming with XML Other topics The future http://www.astro.gla.ac.uk/users/norman/docs/
More informationIntroduction to Web Services & SOA
References: Web Services, A Technical Introduction, Deitel & Deitel Building Scalable and High Performance Java Web Applications, Barish Service-Oriented Programming (SOP) SOP A programming paradigm that
More informationIntroduction to Database Systems CSE 414
Introduction to Database Systems CSE 414 Lecture 14-15: XML CSE 414 - Spring 2013 1 Announcements Homework 4 solution will be posted tomorrow Midterm: Monday in class Open books, no notes beyond one hand-written
More informationIT2353 WEB TECHNOLOGY Question Bank UNIT I 1. What is the difference between node and host? 2. What is the purpose of routers? 3. Define protocol. 4.
IT2353 WEB TECHNOLOGY Question Bank UNIT I 1. What is the difference between node and host? 2. What is the purpose of routers? 3. Define protocol. 4. Why are the protocols layered? 5. Define encapsulation.
More informationUsing UML To Define XML Document Types
Using UML To Define XML Document Types W. Eliot Kimber ISOGEN International, A DataChannel Company Created On: 10 Dec 1999 Last Revised: 14 Jan 2000 Defines a convention for the use of UML to define XML
More informationCWI. Multimedia on the Semantic Web. Jacco van Ossenbruggen, Lynda Hardman, Frank Nack. Multimedia and Human-Computer Interaction CWI, Amsterdam
Multimedia on the Semantic Web Jacco van Ossenbruggen, Lynda Hardman, Frank Nack Multimedia and Human-Computer Interaction, Amsterdam Short history of the Web in three generations (see thesis for long
More information