Databases and the World Wide Web

Size: px
Start display at page:

Download "Databases and the World Wide Web"

Transcription

1 Databases and the World Wide Web Paolo Atzeni D.I.A. - Università di Roma Tre thanks to the Araneus group: G. Mecca, P. Merialdo, A. Masci, V. Crescenzi, G. Sindoni, Outline Databases and database research Web-based Information Systems The Araneus project Querying Web sites Web-site design Integration on the Web The Road Runner Project 1

2 Database technology Developed with business applications in mind (banks, accounting, personnel, inventory, reservations, ) for integrated databases but within a controlled environments (one organization or a portion of it) 3 Database technology Major achievements (Silberschatz et al and 1996) relational systems (physical independence, usability, flexibility) transaction management (reliability) distribution 4 2

3 The current trend Widen / increase / enlarge, from various perspectives new types of data more users (not only humans) more cooperating systems more diverse systems (distributed, heterogeneous, autonomous) 5 The need for cross-fertilization database technology was developed within the domain of business applications; other domains have other requirements (also very different from each other) database technology and X technology can be complementary, with potential mutual benefit basic problems have been solved; specific areas may have specific problems, for which the general solutions need not be satisfactory 6 3

4 A ten-year goal for database research The Asilomar report (Bernstein et al. Sigmod Record 1999, The information utility: make it easy for everyone to store, organize, access, and analyze the majority of human information online 7 General themes for database research Development of plug and play database systems Scalability of federated databases Integration of data and processes Integration of structured and semistrucured data 8 4

5 WIS: Web-based Information Systems Complexity of data Data-intensive WIS WIS Web-presence sites Service-oriented sites Complexity of functions 9 Database approaches to the Web bottom-up: accessing information from Web sources top-down: designing and maintaining Web sites global: integrating existing sites and offering the information through new ones 5

6 Outline Databases and database research Web-based Information Systems The Araneus project Querying Web sites Web-site design Integration on the Web The Road Runner Project The Araneus Project Started in 1996 Goals: extend database-like techniques to Web data management About 15 people involved (researchers, developers, students) Several prototype systems 12 6

7 The Araneus Approach identification of sites of interest sites are wrapped to extract information sites are navigated and data are extracted data from the sites are integrated into a single database a new site is generated from the integrated database Wrapper Layer Site Generation Layer Araneus Web Site Development System (Homer) Integration Layer Mediator Layer Wrapper Layer SQL and Java Araneus Query System (Ulixes) Wrapper Layer Araneus Wrapper Toolkit (Minerva) 13 Example Applications The Integrated Web Museum a site integrating data coming from the Uffizi, Louvre and Capodimonte Web sites The ACM Sigmod Record XML Version XML+XSL Version of an existing (and currently running) Web site The Integrated Patent Search Site allows integrated searches in patent Web sites 14 7

8 Integration of Web Sites: The Integrated Web Museum Integrates data coming from several Virtual Museums from the Web uffizi.firenze.it, louvre.fr and Capodimonte Data are re-organized: Uffizi, paintings organized by rooms Louvre, Capodimonte, works organized by collections Integrated Museum, data organized by author 15 The Integrated Web Museum DB 16 8

9 Web Site Re-Engineering: ACM Sigmod Record in XML The ACM Sigmod Record Site (acm.org/sigmod/record) a site in HTML The ACM Sigmod Record Site, XML Edition a site in XML with XSL Stylesheets

10 19 Integrated Patent Search 20 10

11 Integrated Patent Search: Result 21 Building Applications in Araneus Phase A: Reverse Engineering Existing Sites Phase B: Data Integration Phase C: Developing New Integrated Sites 22 11

12 Building Applications in Araneus Phase A: Reverse Engineering First Step: Deriving the logical structure of data in the site ADM Scheme Second Step: Wrapping pages in order to map physical HTML sources to database objects Third Step: Extracting Data from the Site by Queries and Navigation 23 Modeling Web Sites: The ARANEUS Data Model ADM - ODMG-like model - Pages are URL-identified nested objects - Heterogeneous Union-Types Page-type: describes a set of homogeneous pages Site-scheme: set of page-types connected by links 24 12

13 Proceedings Sigmod Record On-Line acm.org/sigmod HomePage Abstract & IT OrdinaryIssue about 1000 pages integrated with other data dblp.uni-trier.de 25 HomePage Sigmod Record Proceedings OrdinaryIssue Abstract & IT 26 13

14 The Integrated Web Museum 27 The Integrated Web Museum 28 14

15 Building Applications in Araneus Phase A: Reverse Engineering First Step: Deriving the logical structure of data in the site ADM Scheme Second Step: Wrapping pages in order to map physical HTML sources to database objects Third Step: Extracting Data from the Site by Queries and Navigation 29 Wrapping Web Sites: The Araneus Wrapper Toolkit The need for wrappers March, 98 Internet wrapper Date: TEXT HTML ADM Pages are often logically homogeneous but physically heterogeneous 30 15

16 The Araneus Wrapper Toolkit <table> <tr> <td><a href="9903/index.html">march 1999</a></td> <td><font size=-1>vol 28, No 1</font> <B>(Semantic Interoperability in Global Information Systems)</B></td> </tr> <tr>... </tr> </table> $OrdinaryIssues := <table>( <tr><td><a*>$date</a></td> <td><font*>vol $Volume, No $Number</font> [<b>( $Description ) </b>] </td> </tr>)+ </table> 31 The Araneus Wrapper Toolkit Advantages Procedural Languages Flexible in presence of irregularities and exceptions Allows document restructurings Grammars Concise and declarative Simple in the case of strong regularity Drawbacks Tedious even with document strongly structured Difficult to maintain Not flexible in presence of irregularities Allow for limited restructurings 32 16

17 The Araneus Wrapper Toolkit: Minerva + Editor Minerva is a Wrapper generator It couples a declarative, grammar-based approach with the flexibility of procedural programming: grammar-based procedural exception handling 33 Building Applications in Araneus Phase A: Reverse Engineering First Step: Deriving the logical structure of data in the site ADM Scheme Second Step: Wrapping pages in order to map physical HTML sources to database objects Third Step: Extracting Data from the Site by Queries and Navigation 34 17

18 Queries over Web Sites: Query Interfaces: Ulixes Example of SQL Query: Titles and Years of Papers Published by Codd at SIGMOD " CREATE VIEW TitlesOfCoddPapers (Title, Year) OVER AS SELECT ProceedingsPage.SectionList.Articles.Title, HomePage.YearList.Year FROM HomePage.YearList.NumberList.ToIsssues -> ProceedingsPage.SectionList.Articles WHERE ProceedingsPage.SectionList.Articles.Authors LIKE %Codd% 35 Queries over Web Sites: Query Interfaces: Polyphemus 36 18

19 Building Applications in Araneus Phase A: Reverse Engineering Existing Sites Phase B: Data Integration Phase C: Developing New Integrated Sites 37 Building Applications in Araneus Phase C: Web Site Development Model-Based Development in Araneus CASE-Tool Approach (Homer) 38 19

20 Model-Based Development in Araneus Clear Separation between Site Design and Site Implementation: high-level models for site design tools for site implementation Overall Goal: users should concentrate on design issues they should not write any line of code Flexibility in Site Implementation: Transparent Generation of HTML, XML (or WML) sites Independence from the actual page-generation tool 39 Model-Based Development in Araneus Homer, a CASE-Tool for Web Site Development The Araneus Site Design Methodology High-Level Models: Modeling data: relational (or object-relational) Modeling hypertext: ADM Modeling presentation: Telemachus Styles 40 20

21 Presentation Modeling: Telemachus Requirements: precise notion of style for pages platform independence rapid prototyping and flexible maintenance working with sample pages ( templates ) Telemachus Styles: Attribute Styles: formatting directives for attributes in pages Page Styles: collections of attribute styles plus header and footer 41 Telemachus: attribute styles In the database: CoursePage Course: TEXT;... Table: Courses CourseName... Operating Systems... In the hypertext: Course: [<FONT color= red > <B>] [</B></FONT>] In the page: <FONT color= red ><B> Operating Systems </B></FONT> 42 21

22 Telemachus: attribute styles Course: [<TABLE border="0" rows= 2" cols= 3"> <TR> <TD><IMG src= dotyel.gif width="30"></td> <TD><FONT color= white"><b>course Name:</B></FONT> <TD><FONT color= yellow face= Arial ><B>] [ </B></FONT></TD> </TR> <TR><TD></TD><TD COLSPAN= 2 ><HR></TD></TR> </TABLE><HR>] 43 Telemachus: page styles one attribute style for each attribute in the page header and footer Header Footer 44 22

23 Telemachus: page styles Style for CoursePage Header: [<HTML STYLE="font-size:12pt"> <BODY background= bicolor.gif"> <CENTER> <TABLE width="100%" BORDER="0"> ] Course: [<B>] [</B>] Description: [<BR><B>Description</B>] [] ToInstructor: [<BR>] [<BR>] InstructorName : [... Footer: <FONT FACE="Times New Roman" COLOR="yellow"><B>] [</B></FONT></A>] [<TABLE>... </BODY> </HTML>] Easily implemented with any tool (including XSL) Users don t want to write so much code 45 Telemachus: templates Sample pages with placeholders Users work with Templates and Telemachus generates styles from them

24 Presentation Design with Telemachus Site Template user editing Telemachus user editing Telemachus Page Templ. 1 Es: PersonPage Page Templ. 2 Es: CoursePage Page Templ. N Es:GroupPage Page Style 1 Page Style 2 Page Style n This approach is strongly based on ADM; 47 Homer: A Case Tool for Web Sites Putting All Together: different models and different activities: database design (relational or object-relational); hypertext design (ADM); presentation design (Telemachus Styles); a precise Design Methodology; Homer: a Case Tool for the Design Process: graphical interface to assist users in the design; automatic production of code for page generation

25 Homer: A Case Tool for Web Sites Homer Step 1: ER Scheme user editing DB Scheme Step 2: ADM Scheme user editing Homer Step 3: Page Templ. Site Generation Code 49 Homer: A Case Tool for Web Sites Step 1: User Draws an Entity-Relationship Scheme; 50 25

26 Homer: A Case Tool for Web Sites 51 Homer: A Case Tool for Web Sites Step 2: ER Scheme ==> ADM Scheme; 52 26

27 Homer: A Case Tool for Web Sites Step 2: ER Scheme ==> ADM Scheme; 53 Homer: A Case Tool for Web Sites Homer Engine: all intermediate models (ER, Relational...) are seen as subsets of ADM design steps are transformations (views) nested-relational algebra with URL-invention view composition Implementation: HTML, XML with XSL stylesheets (WML) Penelope, JSP,

28 After Araneus Current Research (of my group;unfortunately,not really mine): automatic data extraction Web data sources theroadrunner Project 55 RoadRunner: motivation Web: standard platform for next generation IS the largest knowledge base ever developed Intended to be browsed by humans Hard to compute over Web: the modern legacy system Tools for data extraction, integration and transformation 56 28

29 RoadRuner: the Ultimate Goal URL of a data-intensive site Es: BookTitle The HTML Sourcebook Computer Networks Database Systems Author J. Graham A. Tannenbaum R. Elmasri, S. Navathe Editor RoadRunner Data on the Web S. Abiteboul, P. Buneman, D. Suciu HTML pages Automatic Wrappers database tables (or XML docs) 57 Background Manual wrappers are largely used today (Es: shopping agents) Several Recent Works on Automatic Wrapper Generation [Kushmerick et al, IJCAI 97] WIEN: machine learning background [Adelberg SIGMOD 98] NoDoSE: real-time databases background [Embley et al, SIGMOD 99] conceptual modeling background 58 29

30 Automatic Wrapper Generators Different inspirations, different techniques, some common features Rougly speaking, in all systems: one HTML page at a time (multiple-record document) some a priori knowledge about the page organization wrapper inference is based on labeled examples provided to the system 59 Wrapper Generation from Examples FC Kaiserslautern Kaiserslautern Club Location User Interaction or Domain-Specific Heuristics 60 30

31 Originality and Contributions Goal: Automating the process to the largest extent No a priori knowledge about the structure No need for labeled examples Main Ideas: HTML pages = encodings of a database wrapper generation = decoding process by comparing homogeneous pages (similarities and differences) 61 Overall System Architecture Classifier site URL Page clusters {C 1,C 2,,C n } Expander per C i =1 Matcher per C i >1 Wrappers 62 31

WEB SITES NEED MODELS AND SCHEMES

WEB SITES NEED MODELS AND SCHEMES WEB SITES NEED MODELS AND SCHEMES Paolo Atzeni atzeni@dia.uniroma3.it http://www.dia.uniroma3.it/~atzeni Dipartimento di informatica e automazione Università di Roma Tre Outline Databases and information

More information

is that the question?

is that the question? To structure or not to structure, is that the question? Paolo Atzeni Based on work done with (or by!) G. Mecca, P.Merialdo, P. Papotti, and many others Lyon, 24 August 2009 Content A personal (and questionable

More information

XXII. Website Design. The Web

XXII. Website Design. The Web XXII. Website Design The Web Hypertext Data Independence Data Models for Hypertext Documents The Araneus Data Model (ADM) The Navigational Conceptual Model (NCM) The Araneus Methodology for Website Design

More information

Semistructured Data and Mediation

Semistructured Data and Mediation Semistructured Data and Mediation Prof. Letizia Tanca Dipartimento di Elettronica e Informazione Politecnico di Milano SEMISTRUCTURED DATA FOR THESE DATA THERE IS SOME FORM OF STRUCTURE, BUT IT IS NOT

More information

Basi di dati e World Wide Web

Basi di dati e World Wide Web Basi di dati e World Wide Web Paolo Atzeni 30/05/2003 Problems with many Web-sites design information is often poorly organized and difficult to access it is not even clear which pieces of information

More information

Universita degli Studi di Roma Tre. Dipartimento di Informatica e Automazione. Design and Maintenance of. Data-Intensive Web Sites

Universita degli Studi di Roma Tre. Dipartimento di Informatica e Automazione. Design and Maintenance of. Data-Intensive Web Sites Universita degli Studi di Roma Tre Dipartimento di Informatica e Automazione Via della Vasca Navale, 84 { 00146 Roma, Italy. Design and Maintenance of Data-Intensive Web Sites Paolo Atzeni y, Giansalvatore

More information

Java Framework for Database-Centric Web Site Engineering

Java Framework for Database-Centric Web Site Engineering Java Framework for Database-Centric Web Site Engineering Beat Signer, Michael Grossniklaus and Moira C. Norrie fsigner, grossniklaus, norrieg@inf.ethz.ch Institute for Information Systems ETH Zurich CH-8092

More information

Information Discovery, Extraction and Integration for the Hidden Web

Information Discovery, Extraction and Integration for the Hidden Web Information Discovery, Extraction and Integration for the Hidden Web Jiying Wang Department of Computer Science University of Science and Technology Clear Water Bay, Kowloon Hong Kong cswangjy@cs.ust.hk

More information

ABSTRACT The Araneus project aims at developing tools for data-management on the World Wide Web. Web-based information systems deal with data of heter

ABSTRACT The Araneus project aims at developing tools for data-management on the World Wide Web. Web-based information systems deal with data of heter Universita degli Studi di Roma Tre Dipartimento di Informatica e Automazione Via della Vasca Navale, 84 { 00146 Roma, Italy. From Databases to Web-Bases: The Araneus Experience G. Mecca 1, P. Atzeni 2,

More information

Semistructured Data and Mediation

Semistructured Data and Mediation Semistructured Data and Mediation Prof. Letizia Tanca Dipartimento di Elettronica e Informazione Politecnico di Milano SEMISTRUCTURED DATA FOR THESE DATA THERE IS SOME FORM OF STRUCTURE, BUT IT IS NOT

More information

A Survey on Unsupervised Extraction of Product Information from Semi-Structured Sources

A Survey on Unsupervised Extraction of Product Information from Semi-Structured Sources A Survey on Unsupervised Extraction of Product Information from Semi-Structured Sources Abhilasha Bhagat, ME Computer Engineering, G.H.R.I.E.T., Savitribai Phule University, pune PUNE, India Vanita Raut

More information

ISSN (Online) ISSN (Print)

ISSN (Online) ISSN (Print) Accurate Alignment of Search Result Records from Web Data Base 1Soumya Snigdha Mohapatra, 2 M.Kalyan Ram 1,2 Dept. of CSE, Aditya Engineering College, Surampalem, East Godavari, AP, India Abstract: Most

More information

Extracting Data From Web Sources. Giansalvatore Mecca

Extracting Data From Web Sources. Giansalvatore Mecca Extracting Data From Web Sources Giansalvatore Mecca Università della Basilicata mecca@unibas.it http://www.difa.unibas.it/users/gmecca/ EDBT Summer School 2002 August, 26 2002 Outline Subject of this

More information

Building Multi-device, Content-Centric Applications Using WebML and the W3I3 Tool Suite

Building Multi-device, Content-Centric Applications Using WebML and the W3I3 Tool Suite Building Multi-device, Content-Centric Applications Using WebML and the W3I3 Tool Suite Angela Bonifati 1, Stefano Ceri 1, Piero Fraternali 1, and Andrea Maurino 1 Politecnico di Milano,Piazza L. da Vinci

More information

MIWeb: Mediator-based Integration of Web Sources

MIWeb: Mediator-based Integration of Web Sources MIWeb: Mediator-based Integration of Web Sources Susanne Busse and Thomas Kabisch Technical University of Berlin Computation and Information Structures (CIS) sbusse,tkabisch@cs.tu-berlin.de Abstract MIWeb

More information

EXTRACTION AND ALIGNMENT OF DATA FROM WEB PAGES

EXTRACTION AND ALIGNMENT OF DATA FROM WEB PAGES EXTRACTION AND ALIGNMENT OF DATA FROM WEB PAGES Praveen Kumar Malapati 1, M. Harathi 2, Shaik Garib Nawaz 2 1 M.Tech, Computer Science Engineering, 2 M.Tech, Associate Professor, Computer Science Engineering,

More information

HERA: Automatically Generating Hypermedia Front- Ends for Ad Hoc Data from Heterogeneous and Legacy Information Systems

HERA: Automatically Generating Hypermedia Front- Ends for Ad Hoc Data from Heterogeneous and Legacy Information Systems HERA: Automatically Generating Hypermedia Front- Ends for Ad Hoc Data from Heterogeneous and Legacy Information Systems Geert-Jan Houben 1,2 1 Eindhoven University of Technology, Dept. of Mathematics and

More information

Html basics Course Outline

Html basics Course Outline Html basics Course Outline Description Learn the essential skills you will need to create your web pages with HTML. Topics include: adding text any hyperlinks, images and backgrounds, lists, tables, and

More information

Design concepts for data-intensive applications

Design concepts for data-intensive applications 6 th International Conference on Applied Informatics Eger, Hungary, January 27 31, 2004. Design concepts for data-intensive applications Attila Adamkó Department of Information Technology, Institute of

More information

Introduction. Web Pages. Example Graph

Introduction. Web Pages. Example Graph COSC 454 DB And the Web Introduction Overview Dynamic web pages XML and databases Reference: (Elmasri & Navathe, 5th ed) Ch. 26 - Web Database Programming Using PHP Ch. 27 - XML: Extensible Markup Language

More information

Handling Irregularities in ROADRUNNER

Handling Irregularities in ROADRUNNER Handling Irregularities in ROADRUNNER Valter Crescenzi Universistà Roma Tre Italy crescenz@dia.uniroma3.it Giansalvatore Mecca Universistà della Basilicata Italy mecca@unibas.it Paolo Merialdo Universistà

More information

A survey: Web mining via Tag and Value

A survey: Web mining via Tag and Value A survey: Web mining via Tag and Value Khirade Rajratna Rajaram. Information Technology Department SGGS IE&T, Nanded, India Balaji Shetty Information Technology Department SGGS IE&T, Nanded, India Abstract

More information

Using Webspaces to Model Document Collections on the Web

Using Webspaces to Model Document Collections on the Web Using Webspaces to Model Document Collections on the Web Roelof van Zwol and Peter M.G. Apers University of Twente, Department of Computer Science P.O.box 217, 7500 AE, Enschede, the Netherlands {zwol,

More information

Automatic Generation of Wrapper for Data Extraction from the Web

Automatic Generation of Wrapper for Data Extraction from the Web Automatic Generation of Wrapper for Data Extraction from the Web 2 Suzhi Zhang 1, 2 and Zhengding Lu 1 1 College of Computer science and Technology, Huazhong University of Science and technology, Wuhan,

More information

Web Data Extraction. Craig Knoblock University of Southern California. This presentation is based on slides prepared by Ion Muslea and Kristina Lerman

Web Data Extraction. Craig Knoblock University of Southern California. This presentation is based on slides prepared by Ion Muslea and Kristina Lerman Web Data Extraction Craig Knoblock University of Southern California This presentation is based on slides prepared by Ion Muslea and Kristina Lerman Extracting Data from Semistructured Sources NAME Casablanca

More information

Object Extraction. Output Tagging. A Generated Wrapper

Object Extraction. Output Tagging. A Generated Wrapper Wrapping Data into XML Wei Han, David Buttler, Calton Pu Georgia Institute of Technology College of Computing Atlanta, Georgia 30332-0280 USA fweihan, buttler, calton g@cc.gatech.edu Abstract The vast

More information

Manual Wrapper Generation. Automatic Wrapper Generation. Grammar Induction Approach. Overview. Limitations. Website Structure-based Approach

Manual Wrapper Generation. Automatic Wrapper Generation. Grammar Induction Approach. Overview. Limitations. Website Structure-based Approach Automatic Wrapper Generation Kristina Lerman University of Southern California Manual Wrapper Generation Manual wrapper generation requires user to Specify the schema of the information source Single tuple

More information

X-S Framework Leveraging XML on Servlet Technology

X-S Framework Leveraging XML on Servlet Technology X-S Framework Leveraging XML on Servlet Technology Rajesh Kumar R Abstract This paper talks about a XML based web application framework that is based on Java Servlet Technology. This framework leverages

More information

Certified HTML5 Developer VS-1029

Certified HTML5 Developer VS-1029 VS-1029 Certified HTML5 Developer Certification Code VS-1029 HTML5 Developer Certification enables candidates to develop websites and web based applications which are having an increased demand in the

More information

Review for Exam 1 CS474 (Norton)

Review for Exam 1 CS474 (Norton) Review for Exam 1 CS474 (Norton) What is a Database? Properties of a database Stores data to derive information Data in a database is, in general: Integrated Shared Persistent Uses of Databases The Integrated

More information

Using Data-Extraction Ontologies to Foster Automating Semantic Annotation

Using Data-Extraction Ontologies to Foster Automating Semantic Annotation Using Data-Extraction Ontologies to Foster Automating Semantic Annotation Yihong Ding Department of Computer Science Brigham Young University Provo, Utah 84602 ding@cs.byu.edu David W. Embley Department

More information

Web Data Extraction Using Tree Structure Algorithms A Comparison

Web Data Extraction Using Tree Structure Algorithms A Comparison Web Data Extraction Using Tree Structure Algorithms A Comparison Seema Kolkur, K.Jayamalini Abstract Nowadays, Web pages provide a large amount of structured data, which is required by many advanced applications.

More information

SEEK: Scalable Extraction of Enterprise Knowledge

SEEK: Scalable Extraction of Enterprise Knowledge SEEK: Scalable Extraction of Enterprise Knowledge Joachim Hammer Dept. of CISE University of Florida 26-Feb-2002 1 Project Overview Faculty Joachim Hammer Mark Schmalz Computer Science Students Sangeetha

More information

Database System Concepts and Architecture

Database System Concepts and Architecture CHAPTER 2 Database System Concepts and Architecture Copyright 2017 Ramez Elmasri and Shamkant B. Navathe Slide 2-2 Outline Data Models and Their Categories History of Data Models Schemas, Instances, and

More information

2004 WebGUI Users Conference

2004 WebGUI Users Conference WebGUI Site Design 2004 WebGUI Users Conference General Rules of Web Design Content is King good content is more important than anything else. keeps people interested. even if your design is bad, content

More information

IBM Rational Developer for System z Version 7.5

IBM Rational Developer for System z Version 7.5 Providing System z developers with tools for building traditional and composite applications in an SOA and Web 2.0 environment IBM Rational Developer for System z Version 7.5 Highlights Helps developers

More information

FROM 4D WRITE TO 4D WRITE PRO INTRODUCTION. Presented by: Achim W. Peschke

FROM 4D WRITE TO 4D WRITE PRO INTRODUCTION. Presented by: Achim W. Peschke 4 D S U M M I T 2 0 1 8 FROM 4D WRITE TO 4D WRITE PRO Presented by: Achim W. Peschke INTRODUCTION In this session we will talk to you about the new 4D Write Pro. I think in between everyone knows what

More information

Aspects of an XML-Based Phraseology Database Application

Aspects of an XML-Based Phraseology Database Application Aspects of an XML-Based Phraseology Database Application Denis Helic 1 and Peter Ďurčo2 1 University of Technology Graz Insitute for Information Systems and Computer Media dhelic@iicm.edu 2 University

More information

Google Sites Guide Nursing Student Portfolio

Google Sites Guide Nursing Student Portfolio Google Sites Guide Nursing Student Portfolio Use the template as base, but customize it according to your design! Change the colors and text, but maintain the required pages and information. Topic Outline:

More information

1/6/ :28 AM Approved New Course (First Version) CS 50A Course Outline as of Fall 2014

1/6/ :28 AM Approved New Course (First Version) CS 50A Course Outline as of Fall 2014 1/6/2019 12:28 AM Approved New Course (First Version) CS 50A Course Outline as of Fall 2014 CATALOG INFORMATION Dept and Nbr: CS 50A Title: WEB DEVELOPMENT 1 Full Title: Web Development 1 Last Reviewed:

More information

Reference Services Division Presents. Microsoft Word 2

Reference Services Division Presents. Microsoft Word 2 Reference Services Division Presents Microsoft Word 2 Welcome to Word 2. This handout includes step-by-step instructions for each of the tasks we will be covering in class. Changes to Word 2007 There are

More information

Blackboard staff how to guide Accessible Course Design

Blackboard staff how to guide Accessible Course Design The purpose of this guide is to help online course authors in creating accessible content using the Blackboard page editor. The advice is based primarily on W3C s Web Content Accessibility Guidelines 1.0

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe Slide 2-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe Slide 2-1 Copyright 2007 Ramez Elmasri and Shamkant B. Navathe Slide 2-1 Chapter 2 Database System Concepts and Architecture Copyright 2007 Ramez Elmasri and Shamkant B. Navathe Outline Data Models and Their Categories

More information

Keywords Data alignment, Data annotation, Web database, Search Result Record

Keywords Data alignment, Data annotation, Web database, Search Result Record Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web

More information

Hyper Text Markup Language HTML: A Tutorial

Hyper Text Markup Language HTML: A Tutorial Hyper Text Markup Language HTML: A Tutorial Ahmed Othman Eltahawey December 21, 2016 The World Wide Web (WWW) is an information space where documents and other web resources are located. Web is identified

More information

Features and Requirements for an XML View Definition Language: Lessons from XML Information Mediation

Features and Requirements for an XML View Definition Language: Lessons from XML Information Mediation Page 1 of 5 Features and Requirements for an XML View Definition Language: Lessons from XML Information Mediation 1. Introduction C. Baru, B. Ludäscher, Y. Papakonstantinou, P. Velikhov, V. Vianu XML indicates

More information

Web Systems & Technologies: An Introduction

Web Systems & Technologies: An Introduction Web Systems & Technologies: An Introduction Prof. Ing. Andrea Omicini Ingegneria Due, Università di Bologna a Cesena andrea.omicini@unibo.it 2005-2006 Web Systems Architecture Basic architecture information

More information

Enterprise Modernization for IBM System z:

Enterprise Modernization for IBM System z: Enterprise Modernization for IBM System z: Transform 3270 green screens to Web UI using Rational Host Access Transformation Services for Multiplatforms Extend a host application to the Web using System

More information

Custom Tables with the LandXML Report Extension David Zavislan, P.E.

Custom Tables with the LandXML Report Extension David Zavislan, P.E. December 2-5, 2003 MGM Grand Hotel Las Vegas Custom Tables with the LandXML Report Extension David Zavislan, P.E. CV41-2 Learn some basic concepts of LandXML and the extensible Stylesheet Language (XSL)

More information

WEB DATA EXTRACTION METHOD BASED ON FEATURED TERNARY TREE

WEB DATA EXTRACTION METHOD BASED ON FEATURED TERNARY TREE WEB DATA EXTRACTION METHOD BASED ON FEATURED TERNARY TREE *Vidya.V.L, **Aarathy Gandhi *PG Scholar, Department of Computer Science, Mohandas College of Engineering and Technology, Anad **Assistant Professor,

More information

By Ryan Stevenson. Guidebook #2 HTML

By Ryan Stevenson. Guidebook #2 HTML By Ryan Stevenson Guidebook #2 HTML Table of Contents 1. HTML Terminology & Links 2. HTML Image Tags 3. HTML Lists 4. Text Styling 5. Inline & Block Elements 6. HTML Tables 7. HTML Forms HTML Terminology

More information

CHAPTER 2 MARKUP LANGUAGES: XHTML 1.0

CHAPTER 2 MARKUP LANGUAGES: XHTML 1.0 WEB TECHNOLOGIES A COMPUTER SCIENCE PERSPECTIVE CHAPTER 2 MARKUP LANGUAGES: XHTML 1.0 Modified by Ahmed Sallam Based on original slides by Jeffrey C. Jackson reserved. 0-13-185603-0 HTML HELLO WORLD! Document

More information

Table-Based Web Pages

Table-Based Web Pages Table-Based Web Pages Web Authoring and Design Benjamin Kenwright Outline What do we mean by Table-Based Web Sites? Review Table Tags/Structure Tips/Debugging/Applications Summary Review/Discussion Submissions/Quizzes/GitHub

More information

QRG: Using the WYSIWYG Editor

QRG: Using the WYSIWYG Editor WYSIWYG Editor QRG: Using the WYSIWYG Editor WYSIWYG stands for What You See Is What You Get. The WYSIWYG Editor is the reason you don t need to be an IT Programmer to write content for your web page.

More information

Information Management (IM)

Information Management (IM) 1 2 3 4 5 6 7 8 9 Information Management (IM) Information Management (IM) is primarily concerned with the capture, digitization, representation, organization, transformation, and presentation of information;

More information

A CRITICAL EVALUATION OF SCHEMA INTEGRATION METHODOLOGIES AND TRENDS

A CRITICAL EVALUATION OF SCHEMA INTEGRATION METHODOLOGIES AND TRENDS Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2002 Proceedings Americas Conference on Information Systems (AMCIS) December 2002 A CRITICAL EVALUATION OF SCHEMA INTEGRATION METHODOLOGIES

More information

Web Publishing Basics II

Web Publishing Basics II Web Publishing Basics II Jeff Pankin Information Services and Technology Table of Contents Course Objectives... 2 Create a Site Definition... 3 The Dreamweaver CS4 Interface... 4 Panels are groups of icons

More information

Joining the BRICKS Network - A Piece of Cake

Joining the BRICKS Network - A Piece of Cake Joining the BRICKS Network - A Piece of Cake Robert Hecht and Bernhard Haslhofer 1 ARC Seibersdorf research - Research Studios Studio Digital Memory Engineering Thurngasse 8, A-1090 Wien, Austria {robert.hecht

More information

Formatting documents in Microsoft Word Using a Windows Operating System

Formatting documents in Microsoft Word Using a Windows Operating System Formatting documents in Microsoft Word Using a Windows Operating System 2017-07-20 Research & Scholarship, McLaughlin Library University of Guelph 50 Stone Road East Guelph, Ontario N1G 2W1 2 Contents

More information

Web Systems & Technologies: An Introduction

Web Systems & Technologies: An Introduction Web Systems & Technologies: An Introduction Prof. Ing. Andrea Omicini Ingegneria Due, Università di Bologna a Cesena andrea.omicini@unibo.it 2006-2007 Web Systems Architecture Basic architecture information

More information

Creating Accessible PDFs

Creating Accessible PDFs Creating Accessible PDFs Using Word to Create Accessible PDF Documents This documentation is designed to be a tool for students, faculty and staff. When authoring electronic documents, it is important

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Outline The Need for Databases Data Models Relational Databases Database Design Storage Manager Query

More information

International Jmynal of Intellectual Advancements and Research in Engineering Computations

International Jmynal of Intellectual Advancements and Research in Engineering Computations www.ijiarec.com ISSN:2348-2079 DEC-2015 International Jmynal of Intellectual Advancements and Research in Engineering Computations VIRTUALIZATION OF DISTIRIBUTED DATABASES USING XML 1 M.Ramu ABSTRACT Objective

More information

A Framework for the Internationalization of Data-Intensive Web Applications

A Framework for the Internationalization of Data-Intensive Web Applications A Framework for the Internationalization of Data-Intensive Web Applications Alberto Belussi and Roberto Posenato Dipartimento di Informatica - Università degli Studi di Verona Ca Vignal, Strada le Grazie,

More information

Introduction to IBM Rational HATS For IBM System i (5250)

Introduction to IBM Rational HATS For IBM System i (5250) Introduction to IBM Rational HATS For IBM System i (5250) Introduction to IBM Rational HATS 1 Lab instructions This lab teaches you how to use IBM Rational HATS to create a Web application capable of transforming

More information

HTML. LBSC 690: Jordan Boyd-Graber. October 1, LBSC 690: Jordan Boyd-Graber () HTML October 1, / 29

HTML. LBSC 690: Jordan Boyd-Graber. October 1, LBSC 690: Jordan Boyd-Graber () HTML October 1, / 29 HTML LBSC 690: Jordan Boyd-Graber October 1, 2012 LBSC 690: Jordan Boyd-Graber () HTML October 1, 2012 1 / 29 Goals Review Assignment 1 Assignment 2 and Midterm Hands on HTML LBSC 690: Jordan Boyd-Graber

More information

Developing Web Applications

Developing Web Applications Developing Web Applications Ralph Moseley Middlesex University IIICENTCNNIAL 1807 ewiley 2007 13ICCNTENNIAL John Wiley & Sons, Ltd Preface Introduction Features Additional Materials Trademarks Acknowledgments

More information

Searching the Deep Web

Searching the Deep Web Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index

More information

IBM DB2 Web Query Tool Version 1.3

IBM DB2 Web Query Tool Version 1.3 IBM DB2 Web Query Tool Version 1.3 A database workbench for DB2 and the Web Functions and features Agenda What is DB2 Web Query Tool? What can you do with DB2 Web Query Tool? How does DB2 Web Query Tool

More information

Internet Application Developer

Internet Application Developer Internet Application Developer SUN-Java Programmer Certification Building a Web Presence with XHTML & XML 5 days or 12 evenings $2,199 CBIT 081 J A V A P R O G R A M M E R Fundamentals of Java and Object

More information

BRA BIHAR UNIVERSITY, MUZAFFARPUR DIRECTORATE OF DISTANCE EDUCATION

BRA BIHAR UNIVERSITY, MUZAFFARPUR DIRECTORATE OF DISTANCE EDUCATION BSCIT/3 RD /BIT13-OOPS with Java Q. 1. What do you mean by Java Virtual Machine? Q. 2. Define Bytecode. Write different features of Java. Q. 3. How do you compile and execute a Java program? Q. 4. Discuss

More information

From ER Diagrams to the Relational Model. Rose-Hulman Institute of Technology Curt Clifton

From ER Diagrams to the Relational Model. Rose-Hulman Institute of Technology Curt Clifton From ER Diagrams to the Relational Model Rose-Hulman Institute of Technology Curt Clifton Review Entity Sets and Attributes Entity set: collection of things in the DB Attribute: property of an entity calories

More information

Certified HTML Designer VS-1027

Certified HTML Designer VS-1027 VS-1027 Certification Code VS-1027 Certified HTML Designer Certified HTML Designer HTML Designer Certification allows organizations to easily develop website and other web based applications which are

More information

Searching the Deep Web

Searching the Deep Web Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index

More information

ESRI stylesheet selects a subset of the entire body of the metadata and presents it as if it was in a tabbed dialog.

ESRI stylesheet selects a subset of the entire body of the metadata and presents it as if it was in a tabbed dialog. Creating Metadata using ArcCatalog (ACT) 1. Choosing a metadata editor in ArcCatalog ArcCatalog comes with FGDC metadata editor, which create FGDC-compliant documentation. Metadata in ArcCatalog stored

More information

Grid-Based Data Mining and the KNOWLEDGE GRID Framework

Grid-Based Data Mining and the KNOWLEDGE GRID Framework Grid-Based Data Mining and the KNOWLEDGE GRID Framework DOMENICO TALIA (joint work with M. Cannataro, A. Congiusta, P. Trunfio) DEIS University of Calabria ITALY talia@deis.unical.it Minneapolis, September

More information

Presentation + Integration + Extension delivering business intelligence

Presentation + Integration + Extension delivering business intelligence Figure 1. BI:Scope Report Display Figure 2. Print Preview Presentation + Integration + Extension delivering business intelligence BI:Scope is a web enabled, rich client, Report Deployment product for business

More information

Introduction to Semistructured Data and XML

Introduction to Semistructured Data and XML Introduction to Semistructured Data and XML Chapter 27, Part D Based on slides by Dan Suciu University of Washington Database Management Systems, R. Ramakrishnan 1 How the Web is Today HTML documents often

More information

Wide Area Query Systems The Hydra of Databases

Wide Area Query Systems The Hydra of Databases Wide Area Query Systems The Hydra of Databases Stonebraker et al. 96 Gribble et al. 02 Zachary G. Ives University of Pennsylvania January 21, 2003 CIS 650 Data Sharing and the Web The Vision A World Wide

More information

A Review on Identifying the Main Content From Web Pages

A Review on Identifying the Main Content From Web Pages A Review on Identifying the Main Content From Web Pages Madhura R. Kaddu 1, Dr. R. B. Kulkarni 2 1, 2 Department of Computer Scienece and Engineering, Walchand Institute of Technology, Solapur University,

More information

XML and XSLT. XML and XSLT 10 February

XML and XSLT. XML and XSLT 10 February XML and XSLT XML (Extensible Markup Language) has the following features. Not used to generate layout but to describe data. Uses tags to describe different items just as HTML, No predefined tags, just

More information

HTML TIPS FOR DESIGNING.

HTML TIPS FOR DESIGNING. This is the first column. Look at me, I m the second column.

More information

TIRA: Text based Information Retrieval Architecture

TIRA: Text based Information Retrieval Architecture TIRA: Text based Information Retrieval Architecture Yunlu Ai, Robert Gerling, Marco Neumann, Christian Nitschke, Patrick Riehmann yunlu.ai@medien.uni-weimar.de, robert.gerling@medien.uni-weimar.de, marco.neumann@medien.uni-weimar.de,

More information

Introduction to Semistructured Data and XML. Overview. How the Web is Today. Based on slides by Dan Suciu University of Washington

Introduction to Semistructured Data and XML. Overview. How the Web is Today. Based on slides by Dan Suciu University of Washington Introduction to Semistructured Data and XML Based on slides by Dan Suciu University of Washington CS330 Lecture April 8, 2003 1 Overview From HTML to XML DTDs Querying XML: XPath Transforming XML: XSLT

More information

Oracle Application Express 5 New Features

Oracle Application Express 5 New Features Oracle Application Express 5 New Features 20th HrOUG conference October 16, 2015 Vladislav Uvarov Software Development Manager Database Server Technologies Division Copyright 2015, Oracle and/or its affiliates.

More information

STARCOUNTER. Technical Overview

STARCOUNTER. Technical Overview STARCOUNTER Technical Overview Summary 3 Introduction 4 Scope 5 Audience 5 Prerequisite Knowledge 5 Virtual Machine Database Management System 6 Weaver 7 Shared Memory 8 Atomicity 8 Consistency 9 Isolation

More information

Chapter 4 Notes. Creating Tables in a Website

Chapter 4 Notes. Creating Tables in a Website Chapter 4 Notes Creating Tables in a Website Project for Chapter 4 Statewide Realty Web Site Chapter Objectives Define table elements Describe the steps used to plan, design, and code a table Create a

More information

MetaNews: An Information Agent for Gathering News Articles On the Web

MetaNews: An Information Agent for Gathering News Articles On the Web MetaNews: An Information Agent for Gathering News Articles On the Web Dae-Ki Kang 1 and Joongmin Choi 2 1 Department of Computer Science Iowa State University Ames, IA 50011, USA dkkang@cs.iastate.edu

More information

Mathematics. Computer Applications. New Jersey Quality Single Accountability Continuum (NJQSAC) Department: Course Title.

Mathematics. Computer Applications. New Jersey Quality Single Accountability Continuum (NJQSAC) Department: Course Title. Textbook(s): Date: Content Unit 1 MS PowerPoint September 1-11 Name the parts of the PowerPoint Window. How do you use a Design Template? What types of toolbars are there in PowerPoint? How do you insert

More information

Course Outline Faculty of Computing and Information Technology

Course Outline Faculty of Computing and Information Technology Course Outline Faculty of Computing and Information Technology Title Code Instructor Name Credit Hours Prerequisite Prerequisite Skill/Knowledge/Understanding Category Course Goals Statement of Course

More information

Handout created by Cheryl Tice, Instructional Support for Technology, GST BOCES

Handout created by Cheryl Tice, Instructional Support for Technology, GST BOCES Handout created by Cheryl Tice, Instructional Support for Technology, GST BOCES Intro to FrontPage OVERVIEW: This handout provides a general overview of Microsoft FrontPage. AUDIENCE: All Instructional

More information

Chapter 1 Introduction to HTML, XHTML, and CSS

Chapter 1 Introduction to HTML, XHTML, and CSS Chapter 1 Introduction to HTML, XHTML, and CSS MULTIPLE CHOICE 1. The world s largest network is. a. the Internet c. Newsnet b. the World Wide Web d. both A and B A PTS: 1 REF: HTML 2 2. ISPs utilize data

More information

Unit 5 Web Publishing Systems Page 1 of 13 Part 4 HTML Part 4

Unit 5 Web Publishing Systems Page 1 of 13 Part 4 HTML Part 4 Unit 5 Web Publishing Systems Page 1 of 13 Part 4 HTML 4.01 Version: 4.01 Transitional Hypertext Markup Language is the coding behind web publishing. In this tutorial, basic knowledge of HTML will be covered

More information

Maintenance of Data and Metadata in Web Site Management Systems

Maintenance of Data and Metadata in Web Site Management Systems Università degli Studi di Roma La Sapienza Dottorato di Ricerca in Informatica Medica XI Ciclo 1999 Maintenance of Data and Metadata in Web Site Management Systems Giuseppe Sindoni Università degli Studi

More information

CS425 Fall 2016 Boris Glavic Chapter 1: Introduction

CS425 Fall 2016 Boris Glavic Chapter 1: Introduction CS425 Fall 2016 Boris Glavic Chapter 1: Introduction Modified from: Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Textbook: Chapter 1 1.2 Database Management System (DBMS)

More information

PRIMIX SOLUTIONS. Core Labs. Tapestry : Java Web Components Whitepaper

PRIMIX SOLUTIONS. Core Labs. Tapestry : Java Web Components Whitepaper PRIMIX SOLUTIONS Core Labs Tapestry : Java Web s Whitepaper CORE LABS Tapestry: Java Web s Whitepaper Primix Solutions One Arsenal Marketplace Phone (617) 923-6639 Fax (617) 923-5139 Tapestry contact information:

More information

A SURVEY- WEB MINING TOOLS AND TECHNIQUE

A SURVEY- WEB MINING TOOLS AND TECHNIQUE International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(4), pp.212-217 DOI: http://dx.doi.org/10.21172/1.74.028 e-issn:2278-621x A SURVEY- WEB MINING TOOLS AND TECHNIQUE Prof.

More information

The figure below shows the Dreamweaver Interface.

The figure below shows the Dreamweaver Interface. Dreamweaver Interface Dreamweaver Interface In this section you will learn about the interface of Dreamweaver. You will also learn about the various panels and properties of Dreamweaver. The Macromedia

More information

Where do these data come from? What technologies do they use?? Whatever they use, they need models (schemas, metadata, )

Where do these data come from? What technologies do they use?? Whatever they use, they need models (schemas, metadata, ) Week part 2: Database Applications and Technologies Data everywhere SQL Databases, Packaged applications Data warehouses, Groupware Internet databases, Data mining Object-relational databases, Scientific

More information

Zen Garden. CSS Zen Garden

Zen Garden. CSS Zen Garden CSS Patrick Behr CSS HTML = content CSS = display It s important to keep them separated Less code in your HTML Easy maintenance Allows for different mediums Desktop Mobile Print Braille Zen Garden CSS

More information