Application software: office packets, databases and data warehouses.

Introduction to Computer Systems (9)
Application software: office packets, databases and data warehouses.
Piotr Mielecki, Ph.D.

1. Office packets

1.1. Definitions.

An office packet, usually called an office application suite or productivity suite, is a set of applications intended for typical office workers and/or knowledge workers. The components are generally distributed together, have a consistent user interface and can usually interact with each other, sometimes in ways that the operating system would not normally allow (through mechanisms like Object Linking and Embedding (OLE), for example).

Most office application suites include at least:
- Word processor (more formally known as a document preparation system): an application used for the production (including composition, editing, formatting and possibly printing) of any sort of printable material, stored as an electronic document.
- Spreadsheet: a rectangular table (or grid) of arranged information, very often financial. The electronic spreadsheet supports automatic calculations (mathematical, statistical, financial etc.) and tools for graphical presentation (different kinds of diagrams).

In addition to these, the suite may contain:
- Presentation program: designed for preparing sets of electronic slides, usually based on the contents of other electronic documents and different multimedia formats. Applications of this kind usually have poor support for editing multimedia files or documents, but the OLE mechanism (an advanced graphics editor acting as an OLE server, for example) can be used to speed up work on a presentation. MS-PowerPoint has become a standard for applications of this kind.
- Desktop database tool: an application which can be used to create small (desktop) databases, or as the client and/or report generator for remote client-server (SQL) databases. Currently MS-Access is the most popular application of this kind.
- Graphics suite: an application (or set of applications) designed for editing different bitmap and vector graphics formats. Currently the more advanced graphics suites (like CorelDraw! or Adobe Photoshop, for example) are not included in any particular office package, but they can act as OLE servers for them. On the other hand, the OpenOffice.org suite has its own, reasonably capable vector graphics editor.

Other additional components of the office suite:
- Communication tools, including an e-mail client and/or Personal Information Manager (PIM) or a groupware package. Microsoft Outlook is a good example of this kind.
- Programming language designed to support automatic processing of documents and of the data included in these documents (like Visual Basic for Applications in MS-Office, for example).

1.2. Common problems.

One of the most important problems is the standard format (or set of formats) for electronic documents. Attempts were (and still are) made to establish a format suitable for office packets from different manufacturers. A format that does not depend on a particular office suite would be much better for all customers: they could change their office software without changing document formats, and still have access to older documents. They could also exchange electronic documents with each other (via e-mail, for example) without necessarily using the same office software to display them and work on them. Of course, large manufacturers like Microsoft are not interested in developing universal standards.

The Rich Text Format (RTF) is one of the best-known formats for word processors (formatted text files), but its implementation in MS-Word is still not quite correct. The ISO/IEC Open Document Format (ODF), based on XML and supporting formatted text files, spreadsheets, diagrams and presentations, is an alternative to closed formats like DOC (DOCX), XLS or PPT. The international non-profit consortium which develops this standard is the Organization for the Advancement of Structured Information Standards (OASIS).

The other important issue is good support for work in organized groups. That means document-flow management inside the organization's structure, sometimes across many locations. In a small team it is relatively easy to share documents in a Local Area Network (LAN) environment, sometimes using Virtual Private Network (VPN) channels to connect to the company's LAN from remote locations, and to interchange document files simply by accessing them in shared folders or by ordinary e-mail. Large companies have problems with the safe and efficient flow of different documents (and their subsequent versions), created by individuals or by workgroups together.

Application software which can support these tasks is rather sophisticated and usually database-oriented (that is, an efficient database server with appropriate client software is the center of the company's document-flow system). IBM Lotus Notes / Domino is probably the most advanced system of this kind at present. Domino is the name of the server software package, while Notes is the client application (a bit similar to MS-Outlook) which organizes the user's work. Other well-known solutions are Microsoft Exchange (relatively poor in comparison with Lotus Notes but much cheaper and fully integrated with Active Directory, based on a non-SQL database and with MS-Outlook as its standard client) and Novell GroupWise.

2. Databases and Data Warehouses

2.1. Definitions.

A database is a structured collection of records or data stored in a computer system so that a computer program, or a person using a query language (SQL in most cases), can consult it to answer queries or append new data. The records retrieved in answer to queries are information that can be used to make decisions. The computer software used to manage and query a database is known as a Database Management System (DBMS).

The central concept of a database is that of a collection of records, or pieces of information. Typically, for a given database, there is a structural description of the type of facts held in that database: this description is known as a schema. The schema describes the objects that are represented in the database and the relationships among them. There are a number of different ways of organizing a schema, that is, of modeling the database structure: they are known as database models (or data models).

The data model in most common use today is the relational model, which represents all information in the form of multiple related tables, each consisting of rows and columns (the formal definition uses mathematical terminology and is much harder to follow, so IT professionals rarely use it). This model represents relationships through values that are common to more than one table: keys. Other models, such as the hierarchical model and the network model, use a more explicit representation of relationships.

2.2. Relational databases.

Three key terms are used extensively in relational database models:
- Relations: a relation is a table with columns and rows (note: the term "relation" does not mean a relationship between two or more tables).
- Attributes: the named columns of the relation are called attributes.
- Domains: the domain is the set of values an attribute is allowed to take.

The basic data structure of the relational model is the table, where information about a particular kind of object, an entity (a student, for example), is represented in columns and rows (rows are also called tuples). Thus "relation" refers to any of the tables in the database: a relation is a set of tuples. The columns enumerate the attributes of the entity (the student's last name, first name, unique ID number and date of birth, for example), and a row is an actual instance of the entity (a particular student) represented by the relation. As a result, each tuple (or simply row) of the table of students holds the attributes of a single student.

All relations (tables) in a relational database have to adhere to some basic rules to qualify as relations:
- the ordering of columns in a table is not important,
- there cannot be identical tuples (rows) in a table,
- each tuple contains a single value (only one at a time) for each of its attributes, i.e. attribute values are atomic.

A relational database contains multiple tables, each similar to the one in the flat database model. One of the advantages of the relational model is that any value occurring in two different records (belonging to the same table or, more frequently, to different tables) implies a relationship between those two records. In order to enforce integrity constraints, relationships between records in tables can also be defined explicitly, by identifying parent-child relationships characterized by a cardinality:
- 1 to 1 (or 0),
- 1 to Many,
- Many to Many.

Tables can also have a designated single attribute, or a set of attributes, that acts as a key, which can be used to uniquely identify each tuple in the table. Such a unique key is called a primary key. Keys are commonly used to join or combine data from two or more tables. For example, the Students table may contain a column named Faculty holding a value that matches the key of a Faculties table (note that one student can be related to many faculties, so this example with a single Faculty column in the Students table is perhaps oversimplified). Keys are also critical in the creation of indices (indexes), which facilitate fast retrieval of data from large tables. Any column can be a key, or multiple columns can be grouped together into a compound key.
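The Students / Faculties example can be sketched with Python's standard sqlite3 module. This is only an illustration under stated assumptions: the column names (student_id, faculty_id and so on), the index name and the sample rows are hypothetical and do not come from the lecture.

```python
import sqlite3

# In-memory database, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when asked

conn.executescript("""
CREATE TABLE Faculties (
    faculty_id INTEGER PRIMARY KEY,   -- primary key: identifies each faculty
    name       TEXT NOT NULL
);
CREATE TABLE Students (
    student_id INTEGER PRIMARY KEY,   -- primary key: identifies each student
    last_name  TEXT NOT NULL,
    first_name TEXT NOT NULL,
    faculty    INTEGER REFERENCES Faculties(faculty_id)  -- matches the key of Faculties
);
-- An index on the joining column helps retrieval from large tables.
CREATE INDEX idx_students_faculty ON Students(faculty);
""")

# Hypothetical sample data.
conn.executemany("INSERT INTO Faculties VALUES (?, ?)",
                 [(1, "Mathematics"), (2, "Physics")])
conn.executemany("INSERT INTO Students VALUES (?, ?, ?, ?)",
                 [(101, "Kowalski", "Jan", 1), (102, "Nowak", "Anna", 2)])

# Combine data from both tables through the shared key value.
joined = conn.execute("""
    SELECT s.last_name, s.first_name, f.name
    FROM Students AS s
    JOIN Faculties AS f ON s.faculty = f.faculty_id
    ORDER BY s.student_id
""").fetchall()

# The primary key rejects a second row with an already used student_id.
try:
    conn.execute("INSERT INTO Students VALUES (101, 'Duplicate', 'Row', 1)")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

print(joined)
print(duplicate_rejected)
```

A student related to many faculties would need a separate linking table holding (student_id, faculty_id) pairs instead of the single faculty column used here.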

2.3. Relational operations.

Users (or programs) request data from a relational database by sending it a query written in a special language, usually a dialect of Structured Query Language (SQL). Although SQL was originally intended for end-users, it is much more common for SQL queries to be embedded into software that provides an easier user interface (see the chapter about Data Warehouses). In response to a query, the database returns a result set, which is just a list of rows from one or more (related) tables, containing the answers.

The simplest query just returns all the rows from a single table of interest:

    SELECT * FROM <table>

More often, the rows are filtered in some way to return just the answer wanted:

    SELECT <column_list> FROM <table> WHERE <condition>

Very often data from multiple tables are combined into one result by doing a JOIN. There are a number of relational operations in addition to JOIN.
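The two query templates above can be tried out with Python's standard sqlite3 module. The sketch below assumes a hypothetical Students table with made-up columns and rows; only the shape of the SELECT statements follows the text.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE Students (
        student_id INTEGER PRIMARY KEY,
        last_name  TEXT,
        birth_year INTEGER
    )
""")
# Hypothetical sample rows.
conn.executemany("INSERT INTO Students VALUES (?, ?, ?)",
                 [(1, "Kowalski", 1999), (2, "Nowak", 2001), (3, "Wojcik", 2000)])

# SELECT * FROM <table>: every row and every column of one table.
all_rows = conn.execute(
    "SELECT * FROM Students ORDER BY student_id"
).fetchall()

# SELECT <column_list> FROM <table> WHERE <condition>: chosen columns of
# the rows that satisfy the condition. The "?" placeholder passes the
# value separately from the SQL text.
filtered = conn.execute(
    "SELECT last_name FROM Students WHERE birth_year >= ? ORDER BY last_name",
    (2000,),
).fetchall()

print(all_rows)
print(filtered)
```

In both cases the result set comes back as a list of rows (Python tuples), which is the "list of rows" the text describes.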

2.4. Database normalization.

Database normalization is a technique for designing relational database tables to minimize duplication of information and, in so doing, to safeguard the database against certain types of logical or structural problems (data anomalies). For example, when multiple instances of a given piece of information occur in a table, the possibility exists that these instances will not be kept consistent when the data within the table is updated, leading to a loss of data integrity. A table that is sufficiently normalized is less vulnerable to problems of this kind. On the other hand, in systems designed to hold important electronic documentation (medical records, for example) it is very important to save all the subsequent versions of each document, so very similar or even identical tuples can sometimes be found. Good design should make it possible to distinguish between them (using a version number column, for example).
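The update anomaly described above can be reproduced in a few lines. The sketch below uses Python's standard sqlite3 module; the tables StudentsFlat, Students and Faculties, their columns and the phone numbers are all hypothetical and serve only to contrast a table that repeats a fact with a normalized design that stores it once.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Not normalized: the faculty phone number is repeated in every student row.
CREATE TABLE StudentsFlat (
    student_id    INTEGER PRIMARY KEY,
    last_name     TEXT,
    faculty_name  TEXT,
    faculty_phone TEXT
);
INSERT INTO StudentsFlat VALUES
    (1, 'Kowalski', 'Physics', '555-0100'),
    (2, 'Nowak',    'Physics', '555-0100');

-- Normalized: the phone number is stored once, in the Faculties table.
CREATE TABLE Faculties (
    faculty_id INTEGER PRIMARY KEY,
    name       TEXT,
    phone      TEXT
);
CREATE TABLE Students (
    student_id INTEGER PRIMARY KEY,
    last_name  TEXT,
    faculty_id INTEGER REFERENCES Faculties(faculty_id)
);
INSERT INTO Faculties VALUES (10, 'Physics', '555-0100');
INSERT INTO Students VALUES (1, 'Kowalski', 10), (2, 'Nowak', 10);
""")

# An update that reaches only one of the duplicated values leaves the
# flat table with two different phone numbers for the same faculty.
conn.execute("UPDATE StudentsFlat SET faculty_phone = '555-0199' WHERE student_id = 1")
flat_phones = conn.execute("""
    SELECT DISTINCT faculty_phone FROM StudentsFlat
    WHERE faculty_name = 'Physics' ORDER BY faculty_phone
""").fetchall()

# In the normalized design there is exactly one place to update.
conn.execute("UPDATE Faculties SET phone = '555-0199' WHERE faculty_id = 10")
normalized_phones = conn.execute("""
    SELECT DISTINCT f.phone
    FROM Students AS s JOIN Faculties AS f ON s.faculty_id = f.faculty_id
""").fetchall()

print(flat_phones)        # two conflicting values
print(normalized_phones)  # a single, consistent value
```

The price of the normalized design is the extra JOIN needed to read a student together with the faculty phone number.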

Database theory describes a table's degree of normalization in terms of normal forms of successively higher degrees. A table in third normal form (3NF), for example, is consequently in second normal form (2NF) as well. Higher degrees of normalization typically involve more tables and create the need for a larger number of joins, which can reduce performance. Accordingly, more highly normalized tables are typically used in database applications involving many isolated transactions, while less normalized tables tend to be used in database applications that do not need to map complex relationships between data entities and data attributes.

Although the normal forms are often defined informally in terms of the characteristics of tables, rigorous definitions of the normal forms are concerned with the characteristics of pure mathematical constructs known as relations (a more theoretical notion than just tables). Whenever information is represented relationally, it is meaningful to consider the extent to which the representation is normalized.

2.5. Data Warehouses

Definitions.

A Data Warehouse is something more than a database-supported information system or simply a database itself. It is the main repository of all of an organization's historical data, its corporate memory. It contains the raw material for management's decision support systems, such as Enterprise Resource Planning (ERP) systems first of all. The critical factor leading to the use of a Data Warehouse is that a data analyst can perform complex queries and analysis, such as data mining, on the information without slowing down the operational systems, that is, the application-level information systems which support everyday work (accounting, human-resource management and so on).

Formally we can define a Data Warehouse in the following terms:
- Subject-oriented: the data in the database is organized so that all the data elements relating to the same real-world event or object are linked together.
- Time-variant: all the changes to the data in the database are tracked and recorded so that reports can be produced showing changes over time.
- Non-volatile: data in the database is never overwritten or deleted; once committed, the data is static and read-only, but retained for future reporting.
- Integrated: the database contains data from most or all of an organization's operational applications, and this data is made consistent.

Operational Systems are optimized for simplicity and speed of modification, using Online Transaction Processing (OLTP) for data entry and retrieval, and database normalization and an entity-relationship model for clear design. The Data Warehouse is optimized for reporting and analysis (Online Analytical Processing, OLAP). Frequently data in Data Warehouses is heavily denormalized, summarized or stored in a dimension-based model. However, this is not always required to achieve very short query response times.

History.

Data Warehouses are a distinct type of computer database, first developed during the late 1980s and early 1990s. They were developed to meet a growing demand for management information and analysis that could not be met by operational systems. Operational systems were unable to meet this need for a range of reasons:
- The processing load of sophisticated reporting affected the response time of the operational systems, so everyday work in strongly computer-supported organizations became slower.
- The database designs of operational systems were not optimized for advanced information analysis and reporting.
- Most organizations had more than one operational system (several domain subsystems), so company-wide reporting could not be supported from a single system.
- Development of reports in operational systems often required writing specific computer programs, which was slow, difficult and expensive.

As a result, separate computer databases began to be built that were specifically designed to support management information and advanced analysis purposes. These Data Warehouses were able to bring in data from a range of different data sources, such as mainframe computers, minicomputers, as well as personal computers and office automation software such as spreadsheets, and integrate this information in a single place. This capability, coupled with user-friendly reporting tools and freedom from operational impacts, has led to the growth of this type of computer system.

As technology improved (lower cost for more performance) and user requirements increased (faster data load cycle times and more features), Data Warehouses have evolved through several fundamental stages:
- Off-line Operational Databases: Data Warehouses in this initial stage are developed by simply copying the database of an operational system to an off-line server, where the processing load of reporting does not affect the operational system's performance.

- Off-line Data Warehouses: Data Warehouses at this stage of evolution are updated on a regular time cycle (usually daily, weekly or monthly) from the operational systems, and the data is stored in an integrated, reporting-oriented data structure (different from the operational one). In this case Extract-Transform-Load (ETL) processes performed by enterprise-level integration tools are responsible for supplying data from the operational systems.
- Real Time Data Warehouses: Data Warehouses at this stage are updated on a transaction or event basis, every time an operational system performs a transaction (e.g. an order, a delivery or a booking). An on-line Enterprise Service Bus (ESB) is responsible for supplying data from the operational systems.
- Integrated Data Warehouses: Data Warehouses at this stage are used to generate activity or transactions that are passed back into the operational systems for use in the daily activity of the organization. Today's application-level protocols, defined for data interchange between different domain-oriented applications used by one organization or by several organizations working together (HL7 in medicine, for example), were designed on this concept and can be used with ESB subsystems.
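The Extract-Transform-Load cycle of an off-line Data Warehouse can be sketched in miniature with Python's standard sqlite3 module. Everything here is an assumption made for illustration: the Orders and DailySales tables, their columns, the cleaning rule (trimming and upper-casing customer names) and the sample rows. Real enterprise-level integration tools are far more elaborate.

```python
import sqlite3

def extract(operational):
    """Extract: read the raw transaction rows from the operational system."""
    return operational.execute(
        "SELECT order_id, customer, amount_cents, order_date FROM Orders"
    ).fetchall()

def transform(rows):
    """Transform: make customer names consistent and sum amounts per day and customer."""
    totals = {}
    for _order_id, customer, amount_cents, order_date in rows:
        key = (order_date, customer.strip().upper())
        totals[key] = totals.get(key, 0) + amount_cents
    return [(day, customer, total) for (day, customer), total in sorted(totals.items())]

def load(warehouse, rows):
    """Load: append the summarized rows to the reporting-oriented table."""
    warehouse.executemany(
        "INSERT INTO DailySales (sale_date, customer, total_cents) VALUES (?, ?, ?)",
        rows,
    )
    warehouse.commit()

# Hypothetical operational system with inconsistently entered customer names.
operational = sqlite3.connect(":memory:")
operational.execute(
    "CREATE TABLE Orders (order_id INTEGER PRIMARY KEY, customer TEXT, "
    "amount_cents INTEGER, order_date TEXT)"
)
operational.executemany(
    "INSERT INTO Orders VALUES (?, ?, ?, ?)",
    [(1, " acme ", 1000, "2024-01-05"),
     (2, "ACME", 2500, "2024-01-05"),
     (3, "Globex", 700, "2024-01-06")],
)

# Separate warehouse database with its own reporting-oriented structure.
warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE DailySales (sale_date TEXT, customer TEXT, total_cents INTEGER)"
)

# One run of the periodic (for example nightly) load cycle.
load(warehouse, transform(extract(operational)))
daily_sales = warehouse.execute(
    "SELECT sale_date, customer, total_cents FROM DailySales ORDER BY sale_date, customer"
).fetchall()
print(daily_sales)
```

Keeping the warehouse in a separate database mirrors the point made earlier: reporting queries run against DailySales and put no load on the operational Orders table.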

Architecture of Data Warehouses.

The concept of data warehousing dates back at least to the mid ties, and possibly earlier. In essence, it was intended to provide an architectural model for the flow of data from operational systems to decision support environments. It attempted to address the various problems associated with this flow, and the high costs associated with it. In the absence of such an architecture, there usually existed an enormous amount of redundancy in the delivery of management information. In larger corporations it was typical for multiple decision support projects to operate independently, each serving different users but often requiring much of the same data. The process of gathering, cleaning and integrating data from various sources, often legacy systems (old computer systems or application programs that continue to be used because the organization does not want to replace or redesign them), was typically replicated for each decision support project. Moreover, legacy systems were frequently being revisited as new requirements emerged, each requiring a different view of the legacy data.

Based on analogies with real-life warehouses, Data Warehouses were intended as large-scale collection, storage and staging areas for corporate data. From here data could be distributed to "retail stores" or data marts, which were tailored for access by decision support users (or "consumers"). While the Data Warehouse was designed to manage the bulk supply of data from its suppliers (e.g. operational systems), and to handle the organization and storage of this data, the "retail stores" or data marts could focus on packaging and presenting selections of the data to end-users, to meet specific management information needs.

Somewhere along the way this analogy and architectural vision was lost, as some vendors and industry speakers redefined the Data Warehouse as simply a management reporting database. This is a subtle but important deviation from the original vision of the Data Warehouse as the hub of a management information architecture, where the decision support systems were actually the data marts or "retail stores".

Data storage models for Data Warehouses.

The goal of a Data Warehouse is to bring data together from a variety of existing databases to support management and reporting needs. The generally accepted principle is that data should be stored at its most elementary level, because this provides the most useful and flexible basis for reporting and information analysis. However, because different projects focus on different requirements, there are alternative methods for designing and implementing data warehouses. There are two leading approaches to organizing the data in a data warehouse: the dimensional approach and the normalized approach.

The dimensional approach is very useful in data mart design, but it can result in serious long-term data integration and abstraction problems when used in a Data Warehouse. In the dimensional approach, transaction data is partitioned into either measured facts, which are generally numeric data capturing specific values, or dimensions, which contain the reference information that gives each transaction its context. As an example, a sales transaction could be broken up into facts such as the number of products ordered and the price paid, and dimensions such as date, customer, product (and its price), geographical location and salesperson.

The main advantage of the dimensional approach is that the Data Warehouse is easy for business staff with limited IT experience to understand and use. Also, because the data is pre-joined into the dimensional form, the data warehouse tends to operate very quickly. The main disadvantage of the dimensional approach is that it is difficult to add or change the stored information later, if the company changes the way in which it does business.
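The sales example can be sketched as a small dimensional (star-like) layout using Python's standard sqlite3 module: one fact table holding the numeric measures and one table per dimension. The table names (FactSales, DimDate, DimCustomer, DimProduct), the columns and the sample rows are hypothetical and only illustrate the split between facts and dimensions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimensions: reference information that gives each transaction its context.
CREATE TABLE DimDate     (date_id INTEGER PRIMARY KEY, day TEXT, month TEXT);
CREATE TABLE DimCustomer (customer_id INTEGER PRIMARY KEY, name TEXT, region TEXT);
CREATE TABLE DimProduct  (product_id INTEGER PRIMARY KEY, name TEXT);

-- Facts: numeric measures, each row pointing at its dimensions.
CREATE TABLE FactSales (
    date_id          INTEGER REFERENCES DimDate(date_id),
    customer_id      INTEGER REFERENCES DimCustomer(customer_id),
    product_id       INTEGER REFERENCES DimProduct(product_id),
    quantity         INTEGER,
    price_paid_cents INTEGER
);

INSERT INTO DimDate VALUES (1, '2024-01-05', '2024-01'), (2, '2024-02-10', '2024-02');
INSERT INTO DimCustomer VALUES (1, 'Acme', 'North'), (2, 'Globex', 'South');
INSERT INTO DimProduct VALUES (1, 'Widget'), (2, 'Gadget');
INSERT INTO FactSales VALUES
    (1, 1, 1, 3, 3000),
    (1, 2, 2, 1, 1500),
    (2, 1, 2, 2, 3000);
""")

# A typical analytical question: units sold and revenue per month and region.
report = conn.execute("""
    SELECT d.month, c.region, SUM(f.quantity), SUM(f.price_paid_cents)
    FROM FactSales AS f
    JOIN DimDate     AS d ON f.date_id = d.date_id
    JOIN DimCustomer AS c ON f.customer_id = c.customer_id
    GROUP BY d.month, c.region
    ORDER BY d.month, c.region
""").fetchall()
print(report)
```

Every report follows the same pattern: start from the fact table, join the dimensions of interest and aggregate, which is one reason such a layout is comparatively easy for business users to query.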

The normalized approach uses database normalization. In this method, the data in the Data Warehouse is stored in third normal form (3NF). Tables are then grouped together by subject areas that reflect the general definition of the data (customer, product, finance, etc.). The main advantage of this approach is that it is quite straightforward to add new information to the database. The primary disadvantage is that, because of the number of tables involved, it can be rather slow to produce information and reports. Furthermore, since the segregation of facts and dimensions is not explicit in this type of data model, it is difficult for users to join the required data elements into meaningful information without a precise understanding of the data structure (which requires detailed documentation of the database and more advanced skills).

[Figure: Operational (Domain) Systems (Sales, ERP, Accounting, Personal Resources, Production) supply data through Integration Tools (ETL, ESB) to the Business Intelligence (BI) System, which consists of the Data Warehouse and BI applications (Data Marts) used by analytical workers.]


ALTERNATE SCHEMA DIAGRAMMING METHODS DECISION SUPPORT SYSTEMS. CS121: Relational Databases Fall 2017 Lecture 22 ALTERNATE SCHEMA DIAGRAMMING METHODS DECISION SUPPORT SYSTEMS CS121: Relational Databases Fall 2017 Lecture 22 E-R Diagramming 2 E-R diagramming techniques used in book are similar to ones used in industry

More information

Introduction to Relational Databases. Introduction to Relational Databases cont: Introduction to Relational Databases cont: Relational Data structure

Introduction to Relational Databases. Introduction to Relational Databases cont: Introduction to Relational Databases cont: Relational Data structure Databases databases Terminology of relational model Properties of database relations. Relational Keys. Meaning of entity integrity and referential integrity. Purpose and advantages of views. The relational

More information

Data Warehousing. Overview

Data Warehousing. Overview Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake

More information

0. Database Systems 1.1 Introduction to DBMS Information is one of the most valuable resources in this information age! How do we effectively and efficiently manage this information? - How does Wal-Mart

More information

Data Strategies for Efficiency and Growth

Data Strategies for Efficiency and Growth Data Strategies for Efficiency and Growth Date Dimension Date key (PK) Date Day of week Calendar month Calendar year Holiday Channel Dimension Channel ID (PK) Channel name Channel description Channel type

More information

Chapter 3. Databases and Data Warehouses: Building Business Intelligence

Chapter 3. Databases and Data Warehouses: Building Business Intelligence Chapter 3 Databases and Data Warehouses: Building Business Intelligence How Can a Business Increase its Intelligence? Summary Overview of Main Concepts Details/Design of a Relational Database Creating

More information

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures) CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm

More information
