Data Warehouse. TobiasGroup, Inc Crow Drive Suite 218 Macedonia, Ohio USA

Size: px
Start display at page:

Download "Data Warehouse. TobiasGroup, Inc Crow Drive Suite 218 Macedonia, Ohio USA"

Transcription

1 Data Warehouse TobiasGroup, Inc Crow Drive Suite 218 Macedonia, Ohio USA

2 1. Introduction 2 2. Data Warehousing 3 History 3 The Goals of a Data Warehouse 4 Makes an organization s information accessible. 4 Makes the organization s information consistent. 4 Is an adaptive and resilient source of information. 4 Is a secure bastion that protects the organization s information asset. 4 Is the foundation for decision making. 4 Data Warehouse Information Flow 5 Data Access 7 Data Cleansing 7 Business Rule Application 7 Data Translation 7 Warehouse Databases 7 Querying 7 Information Access 8 Basic Elements of a Data Warehouse 9 Source System 9 Staging Area 9 Presentation Area 10 End User Data Access Tools 10 Metadata 10 Basic Processes of the Data Warehouse 10 Conforming Dimensions 10 Extracting 10 Transforming 10 Loading and Indexing 11 Quality Assurance Checking 11 Release/Publishing 11 Updating 11 Querying 11 Data Feedback/Feeding in Reverse 11 Auditing 11 Securing 11 Backing Up and Recovering Terms and Definitions 13 Data Warehousing 13 Common Terms 14

3 1. Introduction At any given time the optimal IT architecture depends on a few important factors. They include; the business requirements of the enterprise, the available technology of the time, and the accumulated investments of the enterprise from earlier technologies. Today, technology is moving at a record pace. IT professionals strive to meet business requirements through existing technologies while planning and implementing new technology into their business architecture. Competitiveness and business success or failures have an increasing dependence on the resulting IT architecture. From a business perspective, information management requirements fall into specific categories. Data warehousing focuses on the operational and decisional categories. In his book The Data Warehouse Toolkit, Dr. Ralph Kimball describes the data warehouse as The place where people can go to access their data. Simply put, a data warehouse is a central repository of information. Data warehouses have a common infrastructure and common business definitions for data elements. They bring together large volumes of quantitative business information obtained from transaction processing systems, operational systems, outside sources, and sometimes other systems designed to collect data not normally captured by the first three. The information is cleansed and transformed so that it is complete and reliable, and is collected and retained over time so that changes and trends can be identified. This document recommends a lifecycle approach to building a data warehouse. The comprehensive lifecycle approach has been developed over the past seven years. It is based upon a combination of published best-practices to building a data warehouse and our development and implementation experience. The lifecycle approach emphasizes a focus on architecture and process design during the first phase of implementation. One subject area is normally delivered; the second and subsequent phases add more subject areas and dimensions. This reiterative approach ensures that sufficient attention is given to each subject area added to the warehouse. It also gives the business community time to resolve definition and conformance issues. Although relatively new to the data warehousing market, the Microsoft set of tools are recommended because of price and the relative ease of use of the tools. Skills are easily transferable from one tool to another. The individual components of the Microsoft Data Warehousing Framework (Data Transformation Services - DTS, Relational Database Management System RDBMS, and OLAP Services, and Repository are becoming the de-facto standard tools for data warehousing. TOBIASGROUP MARCH 1999 PAGE 2 OF 18 02/07/01

4 2. Data Warehousing History Data warehousing methodology has grown out of the need for immediate and comprehensive access to enterprise information. Fast, informed business decisions are no longer a competitive advantage, but a requirement. In the past, information technology has focused almost entirely on OLTP (On Line Transaction Processing) systems. OLTP or Operational Systems track our customers and orders, process general ledger and other accounting data, tell us about inventory levels and how much was spent on raw material last year, but do little to answer questions requiring data from multiple operational systems. When someone asks such a question, traditional MIS gathers information from the OLTP systems and delivers a new report containing the answer. Hopefully, this only takes a few days. The goal of Data Warehousing is to answer questions in a few minutes or seconds rather than days. Operational systems are designed for fast data entry and storage, and immediate retrieval of simple information. Usually based on a simple query - name and address of customer #34865, quantity of item AB-11 in stock, OLTP s do deliver sophisticated reports but usually not in real time. To paraphrase one IT professional - Operational reporting gives a perfect picture of the wake of the boat, but does little to help steer a course. Data Warehousing is NOT an attempt to replace or redesign any operational system. OLTP systems have been described as the heart of an enterprise. Most corporations could not operate for a single day without them. Data Warehousing IS an attempt to make information technology just as valuable as the brain of an enterprise. To answer questions in seconds, new system design and implementation philosophies are required. Data Warehouses are Information Access Systems. Data must be stored in new ways for faster access. OLTP systems are optimized for fast entry and quick record processing. On the other hand, data warehouses must retrieve large amounts of information quickly and deliver it to an end user s desktop. Business information must be extracted from the OLTP system and loaded into the Data Warehouse s new fast access formats. A Data Warehouse must then be able to deliver loaded information in a variety of formats. Hardcopy and spreadsheet straight line processing is no longer sufficient. A comprehensive repository of business rules information is also required. Business definitions about sales periods or even the simple what is customer? have previously been defined differently in different departments and facilities. Well yes, but my number also includes or Our period ends on, so you can t apply that number to these figures are familiar comments in today s conference room. Redefining and consolidating all the business rules in the warehouse eliminates these difficulties. End users have many new tools for information access that require complex information. Most of these new tools translate business language questions into database queries that were previously only written in the MIS department. In addition, there are now many applications that perform very sophisticated calculations and modeling. TOBIASGROUP MARCH 1999 PAGE 3 OF 18 02/07/01

5 The Goals of a Data Warehouse One of the most important assets of an organization is its information. These assets are usually kept by an organization in three forms: the operational systems of record; distributed, ad-hoc, or departmental documents and databases utilized to satisfy reporting requirements; and the data warehouse. While most of the ad-hoc documents, usually in spreadsheets, will be replaced over time with the warehoused data, the data warehouse will never be a substitute for the operational systems. The data warehouse has profoundly different needs, clients, structures, and rhythms than the operational systems. The operational systems of record are where data is put in, and the data warehouse is where the data is taken out. While the basic goals of the operational systems are to capture the daily transactions and to aid in the day-to-day running of that business, the data warehouse: Makes an organization s information accessible. The contents of the data warehouse are understandable and navigable, and the access is characterized by fast performance. Understandable means correctly labeled and obvious. Navigable means recognizing the destination on the screens and getting there in one click. Fast performance means zero wait time. In addition, accessibility of the data means the data warehouse can be the source of information for many data-hungry business improvement efforts such as process modeling and simulation, budgeting and forecasting, activity-based costing, and new product development. Makes the organization s information consistent. Information from one part of the organization can be matched with information from another part of the organization. If two measures of an organization have the same name, then they must mean the same thing. Conversely, if two measures don t mean the same thing, then they are labeled differently. Consistent information means high quality information. It means that all of the information is accounted for and is complete. Is an adaptive and resilient source of information. The data warehouse is designed for continuous change. When new questions are asked of the data warehouse, the existing data and the technologies are not changed or disrupted. When new data is added to the data warehouse, the existing data are not changed or disrupted. The design of the separate data marts that make up the data warehouse must be distributed and incremental. Is a secure bastion that protects the organization s information asset. The data warehouse not only controls access to the data effectively, but gives its owners great visibility into the uses and abuses of that data, even after it has left the data warehouse. Is the foundation for decision making. The data warehouse has the right data in it to support decision-making. There is only one true output from a data warehouse: the decisions that are made after the data warehouse has presented its evidence. TOBIASGROUP MARCH 1999 PAGE 4 OF 18 02/07/01

6 Data Warehouse Information Flow Over the past two decades, information technology has been adapted and integrated into all aspects of the enterprise. Information access was never ignored, but was usually designed by operational systems professionals. OLTP and reporting systems were departmentalized and so diverse that information gathering was problematic. Most of today s information access systems look something like the following:??figure 2-1 Information Access Today Because data needs to be gathered and translated from a variety of different sources, reports may take more than a day to run. The 4GL reporting tools are only understood by MIS professionals, so new reports and report modifications may take days or weeks. Each department has its own business rules making reports hard or impossible to correlate. TOBIASGROUP MARCH 1999 PAGE 5 OF 18 02/07/01

7 In contrast, a Data Warehouse collects data from the OLTP sources, applies a consistent set of business rules, and stores the data in a readily accessible format separate from the OLTP sources. Since data gathering is done once a day or month at low processing times, updated information is always available to the end user. Data Warehouse information access is illustrated in the following diagram.??figure 2-2 Data Warehouse Information Access TOBIASGROUP MARCH 1999 PAGE 6 OF 18 02/07/01 There are several important considerations in the process model above. Data generally, but not exclusively, moves from the left to the right. In the Data Access layer, information is gathered from the existing operational systems. Data Staging is arguably the most important and the most complex layer. It can be divided into Data Cleansing, Business Rule Application, and Data Translation. Information requests are compiled in the Querying layer. Finally, data is processed, displayed, and reported in the Information Access layer. Process Management applications drive the warehouse and control each layer using Metadata Functions. Metadata or data about data is used in every layer and defines each process. At a minimum, Metadata includes data warehouse group, table, and field names with descriptions; original data source; allowable values and formats; and simple business rules. Ideally, complex business rules, cleansing information, source data formats, and end user data formats are also included.

8 Data Access The Data Access layer is the first step when loading the warehouse with new information. Any new data since the last load must be retrieved from all existing OLTP and operational systems. Data Access contains tools that understand all the mainframe and PC database formats. Metadata is used to specify which data is included and where it resides in the operational systems. Data Cleansing There is virtually no computer-based information that is 100% accurate. The old OLTP saying garbage in, garbage out applies even more dramatically to data warehousing. Correcting data entry errors is only a small part of the cleansing process. Typically, data relationship problems are the most daunting. The simple question Who is our largest customer? may be answered incorrectly if data is not cleansed properly. OLTP systems in different divisions may have separate codes for the unrelated customers Digital Equipment Corporation, Digital, and DEC. Products and services may be just as hard to reconcile. Stephen Brown of Vality Technology, Inc. reports that 10 times the allotted resources are usually spent implementing the cleansing layer. Any rules for correcting inconsistencies should be stored in the Metadata repository for reference and ease of modification. Business Rule Application Business Rules (as defined in Metadata) are applied in the data-staging layer. Consistent periods and relationships are necessary to correlate departmental and divisional information. Data Translation While Cleansing and Business Rule applications occur, data is translated into standard formats and stored in the warehouse. Warehouse Databases The physical warehouse data store may contain one or many standardized database formats. Optimization of information access speed is determined by varying format selections. Data Normalization was once thought to be a rule for any database design, but is now known to apply to OLTP-like systems only. In a warehouse, data is frequently denormalized. For example, records may contain redundant and uncoded information; additionally, summary data is stored separately or alongside detail. The many different data formats are defined in Metadata. Querying Querying is the simple and hopefully speedy process of compiling warehouse information for delivery to an end user. In the query layer, data is gathered per user request and translated from the standardized warehouse formats into any new formats required by end user tools. Again, information about all the different formats is stored in Metadata. TOBIASGROUP MARCH 1999 PAGE 7 OF 18 02/07/01

9 Information Access On the end user s desktop are the varieties of information access tools mentioned above. Tools with built-in warehousing technology may query the warehouse directly and produce reports. Other tools can be populated with data by applications in the Querying layer. TOBIASGROUP MARCH 1999 PAGE 8 OF 18 02/07/01

10 Basic Elements of a Data Warehouse From the information process flow above, the following basic elements are derived and illustrated below:??figure 2-3 Elements of a Data Warehouse Source System An operational system of record whose function it is to capture the transactions of the business. A source system is often called a legacy system in the mainframe environment. The main priorities of the source system are uptime and availability. Queries against source systems are narrow, account-based queries that are part of the normal transaction flow and severely restricted in their demands on the legacy system. Staging Area A storage area and set of processes that clean, transform, combine, de-duplicate, household, archive, and prepare source data for use in the presentation server. In many cases, the primary objects in this area are a set of flat-file tables representing extracted (from the source systems) data, loading and transformation routines, and a resulting set of tables containing clean data Dynamic Data Store. This area does not usually provide query and presentation services. TOBIASGROUP MARCH 1999 PAGE 9 OF 18 02/07/01

11 Presentation Area The presentation area are the target physical machines on which the data warehouse data is organized and stored for direct querying by end users, report writers, and other applications. The set of presentable data, or Analytical Data Store, normally take the form of dimensionally modeled tables when stored in a relational database, and cube files when stored in an Olap database. End User Data Access Tools End user data access tools are any clients of the data warehouse. An end user access tool can be as simple as an ad hoc query tool, or can be as complex as a sophisticated data mining or modeling application. Metadata All of the information in the data warehouse environment that is not the actual data itself. This data about data is catalogued, versioned, documented, and backed up. Basic Processes of the Data Warehouse Conforming Dimensions The process of aligning business user s understanding of the dimensions used in the data warehouse. The resulting conformed dimensions are dimensions that mean the same thing with every possible fact table to which it can be joined. Examples of obvious conformed dimensions include customer, product, location, and calendar (time). Extracting The extract step is the first step of getting data into the data warehouse environment. Extracting means reading and understanding the source data, and copying the parts that are needed to the data staging area for further work. Transforming Once the data is extracted into the data staging area, there are many possible transformation steps including?? Cleaning the data by correcting misspellings, resolving domain conflicts (such as a city name that is incompatible with a postal code), dealing with missing data elements, and parsing into standard formats?? Purging selected fields from the legacy data that are not useful for the data warehouse?? Combining data sources, by matching exactly on key values or by performing fuzzy matches on non-key attributes, including looking up textual equivalents of legacy system codes?? Creating surrogate keys for each dimension record in order to avoid a dependence on legacy defined keys, where the surrogate key generation process enforces referential integrity between the dimension tables and the fact tables?? Building aggregates for boosting the performance of common queries TOBIASGROUP MARCH 1999 PAGE 10 OF 18 02/07/01

12 Loading and Indexing At the end of the transformation process, the data is in the form of load record images. Loading in the data warehouse environment usually takes the form of replicating the dimension tables and fact tables and presenting these tables to the bulk loading facilities of the presentation area servers. Quality Assurance Checking When each presentation server is loaded, indexed, and supplied with appropriate aggregates, the last step before publishing is the quality assurance step. Quality assurance can be checked by running a comprehensive exception report over the entire set of newly loaded data. All of the reporting categories must be present, and the counts and totals must be satisfactory. All reported values must be consistent with the time series of similar values that preceded them. The exception report is probably built with an end user report writing facility. All issues dealing with transformation should have already been resolved. Release/Publishing The user community is notified that the new data is ready ring the data bell. Updating Incorrect data should obviously be corrected. Changes in labels, hierarchies, status, and corporate ownership often trigger necessary changes in the original data stored in the data warehouse. In general, these are managed load updates, not transactional updates. Querying Querying is a broad term that encompasses all the activities of requesting data, report writing, complex decision support applications, requests from models, and full-fledged data mining. Querying never takes place in the staging area. Data Feedback/Feeding in Reverse When modeling tools are used in a data warehousing environment, results from these tools are sometimes loaded into the warehouse. Auditing At times it is critically important to know where the data came from and what were the calculations performed. Securing Every data warehouse has an exquisite dilemma: the need to publish the data widely to as many users as possible with the easiest-to-use interface, but at the same time protect the valuable sensitive data from hackers, snoopers, and industrial spies. Data warehouse security must be managed centrally while users must be able to access all the constituent data with a single sign-on. TOBIASGROUP MARCH 1999 PAGE 11 OF 18 02/07/01

13 Backing Up and Recovering The project team will decide where to take the necessary snapshots of the data for archival purposes and disaster recovery. TOBIASGROUP MARCH 1999 PAGE 12 OF 18 02/07/01

14 3. Terms and Definitions Data Warehousing William H Inmon in Building the Data Warehouse defines a data warehouse as a collection of integrated, subject-oriented databases designed to supply the information required for decision-making. Data warehousing then, is the process of building, creating, and maintaining a data warehouse. Below is a closer look at each of these components followed by a listing of the common terms used within a data warehousing effort. Integrated The data warehouse is comprised of data from many systems. Each system may be similar in nature or have a totally different use. For example, customer A may be an active buyer of goods and services and their information is stored in an accounting system. They may also be tracked in a marketing database used to track new construction projects. Each of these systems store data about the same customer, but neither of them knows anything about the other. A data warehouse captures information from both of these systems, integrates the information so they can be related, and provides new, meaningful ways of looking at the data. Subject Oriented The data warehouse takes a different approach than the traditional OLTP systems. It looks at subjects like customers, sales, and profits as opposed to the systems that focus on one department or process. Databases The term data warehouse refers to the entire collection of tools, processes and hardware required to plan, develop, implement, and use the system. At its core is very large, typically read-only database that collects both internal and external data and provides unique ways of viewing the data. Internal data includes the operational system within the organization. External data may come from customer, the government, research, and other organizations that sell data related to your organization. Decision-Making Traditional operational systems are typically built on normalized relational databases that are designed to maintain a high level of relational data integrity. Data warehouses are denormalized in order to make the data more meaningful to the users. It is designed for presentation and performance, allows for many different views. Product managers may be interested in sales per region while a financial manager is interested in profitability. TOBIASGROUP MARCH 1999 PAGE 13 OF 18 02/07/01

15 Common Terms TOBIASGROUP MARCH 1999 PAGE 14 OF 18 02/07/01 The purpose of the design is to enable users with different interests to slice and dice the data in ways that suit their needs. It also permits drilling through many levels from summary to detail in order to pinpoint the cause of issues. Analytical Tools This is an umbrella phrase used to connote software that employs some sort of mathematical algorithm(s) to analyze the data contained in the warehouse. Data Mining, OLAP, ROLAP and other terms are used to designate types of these tools with different functionality. In practice, however, a given analytical tool may provide more than one type of analysis procedure and may also encompass some middleware functionality. Such tools are difficult to classify. Ad Hoc Query Processing The process of extracting and reporting information from a database through the issuance of a structured query. Programmers usually write queries using special languages that are associated with database management systems. Most relational database managers use a variant of 4GL (4th Generation Language originally developed by IBM). An example of an ad hoc query might be "How many customers called the UK between the hours of 6-8 am? Several packages available that make the construction of queries user-friendlier than writing language constructs. These usually employ some sort of graphic/visualization front end. Business Intelligence (BI) A phrase coined by (or at least popularized by) Gartner Group that covers any computerized process used to extract and/or analyze business data. Data Extraction, Cleansing and Transformation Process The process by which data is extracted from an operational database, cleaned and then transformed into a format useful for a data warehouse-based application. Data Mall A data mall is a collection of data marts. Data Mart (DM) A Data Mart is a data warehouse that is restricted to dealing with a single subject or topic. The operational data that feeds a data mart generally comes from a single set or source of operational data. Data Mining Data Mining is a process by which the computer looks for trends and patterns in the data and flags potentially significant information. An example of a data-mining query might be "What are the psychological factors associated with child abusers?" Data Warehouse (DW)

16 A repository for data organized in a format that is suitable for ad hoc query processing, data mining, OLAP and/or other analytical applications. Data Warehouses are built from operational databases. The operational data is "cleaned" and transformed in such a way that it is amenable to fast retrieval and efficient analysis. A single-purpose data warehouse is sometimes referred to as a "data mart." DBMS A Database Management System manages the storage and retrieval of a collection(s) of data. Most DBMSs associated with data warehousing are based on relational technology, since relational databases are particularly amenable to the kinds of tasks associated with BI. Virtually all BI systems have a large DBMS system as its foundation. (The leaders are IBM's DB2, Oracle, Sybase, Informix and Microsoft.) However, small data warehouses or data marts may not employ a general-purpose DBMS at all. Decision Support System (DSS) DSS is an umbrella expression that encompasses ad hoc query, data mining, OLAP/ROLAP, vertical applications and, in the minds of at least some, the data warehouse as well. DSS, appears to be falling into disuse in some circles and is being replaced with Business Intelligence (BI) Enterprise Data Model Refers to a single collection of data designed to serve the diverse needs of an enterprise. The opposing concept is that of a collection of smallish databases, each designed to support a limited requirement. Enterprise Data Repository A database containing "metadata" used to control data transformations for DW/BI systems. A leading exponent of the data repository concept is a software company called Platinum Technology. Enterprise Data Warehouse A single repository holding data from several operational sources that serves many different users, typically in different divisions or departments. An enterprise data warehouse for a large company might, for example, contain data from several separate divisions, and serve the needs of both those divisions and of corporate users wishing to analyze consolidated information. Enterprise Resource Planning (ERP) ERP systems are comprised of software programs which tie together all of an enterprise's various functions -- such as finance, manufacturing, sales and human resources. This software also provides for the analysis of the data from these areas to plan production, forecast sales and analyze quality. Today many organizations are realizing that to maximize the value of the information stored in their ERP systems, it is necessary to extend the ERP architectures to include more advanced reporting, analytical and decision support capabilities. This is best accomplished through the application of data warehousing tools and techniques. TOBIASGROUP MARCH 1999 PAGE 15 OF 18 02/07/01

17 Knowledge Discovery A phrase coined by (or at least popularized by) Gartner Group defined as the process of discovering meaningful new correlations, patterns and trends by sifting through large amounts of data stored in repositories (e.g., data warehouses), using such technologies as pattern recognition, statistics and other mathematical techniques. Knowledge Discovery is really the same thing as data mining. Knowledge Management An umbrella term that is used by some in the same context as Business Intelligence. Logical Systems Architect A person or organization that designs the software for the DW/BI application. Middleware An umbrella term used to describe software that bridges various parts of a DW/DSS system. For example, software that extracts, cleans or separates data. Microsoft A software vendor based in Redmond, Washington, USA. MOLAP Multidimensional OLAP A set of user interfaces, applications, and proprietary database technologies that have a strongly dimensional flavor. Network/Traffic Pattern Analysis Application Normally associated with telecommunications, this application analyzes traffic patterns in order to discover facts about customer behaviors, project future demand or to conduct market analysis to determine need for new services. The results can also be used by both telcos and large private network operators to reduce network costs and/or to improve network efficiency by analyzing such issues as capacity and maintenance. OLAP/ROLAP/MOLAP OnLine Analytical Processing/Relational OnLine Analytical Processing/MultiDimensional OnLine Analytical Processing (OLAP/ROLAP/MOLAP) are applications that seek to verify complex hypotheses. An example of an OLAP query might be "Compare the costs of shipping to customers in the east to those in the west." OLAP OnLine Analytical Processing The general activity of querying and presenting text and number data from data warehouses, as well as a specifically dimensional style of querying and presenting that is exemplified by a number of OLAP vendors. The OLAP vendors technology is nonrelational and is almost always based on an explicit multidimentional cube of data. OLAP databases are also known as multidimensional databases, or MDDBs. TOBIASGROUP MARCH 1999 PAGE 16 OF 18 02/07/01

18 Operational Data Store (ODS) A database of operational data that is formatted as it is collected for use as a data warehouse. This is opposed to the more common process of creating a separate database out of existing operational databases designed expressly for use by the data warehouse. Operational or OLTP Database Operational data is the data collected from operations such as order processing, accounting, manufacturing, marketing, etc. Most modern companies collect most of this data using a form of OnLine Transaction Processing (OLTP). Data generated by these systems is generally not in a format that makes for efficient query processing or analysis. Relational Data Data that has been formatted and organized to work in a database designed to work using relational schema. ROLAP Relational OLAP A set of user interfaces and applications that give a relational database a dimensional flavor. TOBIASGROUP MARCH 1999 PAGE 17 OF 18 02/07/01

CHAPTER 3 Implementation of Data warehouse in Data Mining

CHAPTER 3 Implementation of Data warehouse in Data Mining CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected

More information

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing The Evolution of Data Warehousing Data Warehousing Concepts Since 1970s, organizations gained competitive advantage through systems that automate business processes to offer more efficient and cost-effective

More information

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Fig 1.2: Relationship between DW, ODS and OLTP Systems 1.4 DATA WAREHOUSES Data warehousing is a process for assembling and managing data from various sources for the purpose of gaining a single detailed view of an enterprise. Although there are several definitions

More information

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Data Warehouses Chapter 12. Class 10: Data Warehouses 1 Data Warehouses Chapter 12 Class 10: Data Warehouses 1 OLTP vs OLAP Operational Database: a database designed to support the day today transactions of an organization Data Warehouse: historical data is

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

Q1) Describe business intelligence system development phases? (6 marks)

Q1) Describe business intelligence system development phases? (6 marks) BUISINESS ANALYTICS AND INTELLIGENCE SOLVED QUESTIONS Q1) Describe business intelligence system development phases? (6 marks) The 4 phases of BI system development are as follow: Analysis phase Design

More information

Data warehouse architecture consists of the following interconnected layers:

Data warehouse architecture consists of the following interconnected layers: Architecture, in the Data warehousing world, is the concept and design of the data base and technologies that are used to load the data. A good architecture will enable scalability, high performance and

More information

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 02 Introduction to Data Warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini Advance in Electronic and Electric Engineering. ISSN 2231-1297, Volume 3, Number 6 (2013), pp. 669-674 Research India Publications http://www.ripublication.com/aeee.htm Data Warehousing Ritham Vashisht,

More information

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong Data Warehouse Asst.Prof.Dr. Pattarachai Lalitrojwong Faculty of Information Technology King Mongkut s Institute of Technology Ladkrabang Bangkok 10520 pattarachai@it.kmitl.ac.th The Evolution of Data

More information

The strategic advantage of OLAP and multidimensional analysis

The strategic advantage of OLAP and multidimensional analysis IBM Software Business Analytics Cognos Enterprise The strategic advantage of OLAP and multidimensional analysis 2 The strategic advantage of OLAP and multidimensional analysis Overview Online analytical

More information

The University of Iowa Intelligent Systems Laboratory The University of Iowa Intelligent Systems Laboratory

The University of Iowa Intelligent Systems Laboratory The University of Iowa Intelligent Systems Laboratory Warehousing Outline Andrew Kusiak 2139 Seamans Center Iowa City, IA 52242-1527 andrew-kusiak@uiowa.edu http://www.icaen.uiowa.edu/~ankusiak Tel. 319-335 5934 Introduction warehousing concepts Relationship

More information

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1) What does the term 'Ad-hoc Analysis' mean? Choice 1 Business analysts use a subset of the data for analysis. Choice 2: Business analysts access the Data

More information

Data Mining Concepts & Techniques

Data Mining Concepts & Techniques Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

Data Warehousing. Adopted from Dr. Sanjay Gunasekaran

Data Warehousing. Adopted from Dr. Sanjay Gunasekaran Data Warehousing Adopted from Dr. Sanjay Gunasekaran Main Topics Overview of Data Warehouse Concept of Data Conversion Importance of Data conversion and the steps involved Common Industry Methodology Outline

More information

DATA MINING AND WAREHOUSING

DATA MINING AND WAREHOUSING DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making

More information

Data Warehousing. Data Warehousing and Mining. Lecture 8. by Hossen Asiful Mustafa

Data Warehousing. Data Warehousing and Mining. Lecture 8. by Hossen Asiful Mustafa Data Warehousing Data Warehousing and Mining Lecture 8 by Hossen Asiful Mustafa Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information,

More information

ASG WHITE PAPER DATA INTELLIGENCE. ASG s Enterprise Data Intelligence Solutions: Data Lineage Diving Deeper

ASG WHITE PAPER DATA INTELLIGENCE. ASG s Enterprise Data Intelligence Solutions: Data Lineage Diving Deeper THE NEED Knowing where data came from, how it moves through systems, and how it changes, is the most critical and most difficult task in any data management project. If that process known as tracing data

More information

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015 Q.1 a. Briefly explain data granularity with the help of example Data Granularity: The single most important aspect and issue of the design of the data warehouse is the issue of granularity. It refers

More information

Decision Support, Data Warehousing, and OLAP

Decision Support, Data Warehousing, and OLAP Decision Support, Data Warehousing, and OLAP : Contents Terminology : OLAP vs. OLTP Data Warehousing Architecture Technologies References 1 Decision Support and OLAP Information technology to help knowledge

More information

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives Data-Driven Driven Business Intelligence Systems: Parts I Week 5 Dr. Jocelyn San Pedro School of Information Management & Systems Monash University IMS3001 BUSINESS INTELLIGENCE SYSTEMS SEM 1, 2004 Lecture

More information

Benefits of Automating Data Warehousing

Benefits of Automating Data Warehousing Benefits of Automating Data Warehousing Introduction Data warehousing can be defined as: A copy of data specifically structured for querying and reporting. In most cases, the data is transactional data

More information

Data Warehouse and Mining

Data Warehouse and Mining Data Warehouse and Mining 1. is a subject-oriented, integrated, time-variant, nonvolatile collection of data in support of management decisions. A. Data Mining. B. Data Warehousing. C. Web Mining. D. Text

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

Data Warehousing and OLAP

Data Warehousing and OLAP Data Warehousing and OLAP INFO 330 Slides courtesy of Mirek Riedewald Motivation Large retailer Several databases: inventory, personnel, sales etc. High volume of updates Management requirements Efficient

More information

INTRODUCTION. Chris Claterbos, Vlamis Software Solutions, Inc. REVIEW OF ARCHITECTURE

INTRODUCTION. Chris Claterbos, Vlamis Software Solutions, Inc. REVIEW OF ARCHITECTURE BUILDING AN END TO END OLAP SOLUTION USING ORACLE BUSINESS INTELLIGENCE Chris Claterbos, Vlamis Software Solutions, Inc. claterbos@vlamis.com INTRODUCTION Using Oracle 10g R2 and Oracle Business Intelligence

More information

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI Department of Information Technology IT6702 Data Warehousing & Data Mining Anna University 2 & 16 Mark Questions & Answers Year / Semester: IV / VII Regulation:

More information

collection of data that is used primarily in organizational decision making.

collection of data that is used primarily in organizational decision making. Data Warehousing A data warehouse is a special purpose database. Classic databases are generally used to model some enterprise. Most often they are used to support transactions, a process that is referred

More information

Introduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu

Introduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu Introduction to DWML Christian Thomsen, Aalborg University Slides adapted from Torben Bach Pedersen and Man Lung Yiu Course Structure Business intelligence Extract knowledge from large amounts of data

More information

Working with the Business to Build Effective Dimensional Models

Working with the Business to Build Effective Dimensional Models Working with the Business to Build Effective Dimensional Models Laura L. Reeves Co-Founder & Principal April, 2009 Copyright 2009 StarSoft Solutions, Inc. Slide 1 Instructor Information: Laura L. Reeves,

More information

Data Warehousing. Overview

Data Warehousing. Overview Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake

More information

Question Bank. 4) It is the source of information later delivered to data marts.

Question Bank. 4) It is the source of information later delivered to data marts. Question Bank Year: 2016-2017 Subject Dept: CS Semester: First Subject Name: Data Mining. Q1) What is data warehouse? ANS. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 05(b) : 23/10/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE

More information

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) INTRODUCTION A dimension is an attribute within a multidimensional model consisting of a list of values (called members). A fact is defined by a combination

More information

Data Strategies for Efficiency and Growth

Data Strategies for Efficiency and Growth Data Strategies for Efficiency and Growth Date Dimension Date key (PK) Date Day of week Calendar month Calendar year Holiday Channel Dimension Channel ID (PK) Channel name Channel description Channel type

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 07 Terminologies Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Database

More information

Business Intelligence An Overview. Zahra Mansoori

Business Intelligence An Overview. Zahra Mansoori Business Intelligence An Overview Zahra Mansoori Contents 1. Preference 2. History 3. Inmon Model - Inmonities 4. Kimball Model - Kimballities 5. Inmon vs. Kimball 6. Reporting 7. BI Algorithms 8. Summary

More information

Rocky Mountain Technology Ventures

Rocky Mountain Technology Ventures Rocky Mountain Technology Ventures Comparing and Contrasting Online Analytical Processing (OLAP) and Online Transactional Processing (OLTP) Architectures 3/19/2006 Introduction One of the most important

More information

Data Warehousing and OLAP Technologies for Decision-Making Process

Data Warehousing and OLAP Technologies for Decision-Making Process Data Warehousing and OLAP Technologies for Decision-Making Process Hiren H Darji Asst. Prof in Anand Institute of Information Science,Anand Abstract Data warehousing and on-line analytical processing (OLAP)

More information

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders Data Warehousing ETL Esteban Zimányi ezimanyi@ulb.ac.be Slides by Toon Calders 1 Overview Picture other sources Metadata Monitor & Integrator OLAP Server Analysis Operational DBs Extract Transform Load

More information

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database.

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database. 1. Creating a data warehouse involves using the functionalities of database management software to implement the data warehouse model as a collection of physically created and mutually connected database

More information

Chapter 6 VIDEO CASES

Chapter 6 VIDEO CASES Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:

More information

IT1105 Information Systems and Technology. BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing. Student Manual

IT1105 Information Systems and Technology. BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing. Student Manual IT1105 Information Systems and Technology BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing Student Manual Lesson 3: Organizing Data and Information (6 Hrs) Instructional Objectives Students

More information

Data Warehousing. Seminar report. Submitted in partial fulfillment of the requirement for the award of degree Of Computer Science

Data Warehousing. Seminar report.  Submitted in partial fulfillment of the requirement for the award of degree Of Computer Science A Seminar report On Data Warehousing Submitted in partial fulfillment of the requirement for the award of degree Of Computer Science SUBMITTED TO: SUBMITTED BY: www.studymafia.org www.studymafia.org Preface

More information

Application software office packets, databases and data warehouses.

Application software office packets, databases and data warehouses. Introduction to Computer Systems (9) Application software office packets, databases and data warehouses. Piotr Mielecki Ph. D. http://www.wssk.wroc.pl/~mielecki piotr.mielecki@pwr.edu.pl pmielecki@gmail.com

More information

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses Designing Data Warehouses To begin a data warehouse project, need to find answers for questions such as: Data Warehousing Design Which user requirements are most important and which data should be considered

More information

Topics covered 10/12/2015. Pengantar Teknologi Informasi dan Teknologi Hijau. Suryo Widiantoro, ST, MMSI, M.Com(IS)

Topics covered 10/12/2015. Pengantar Teknologi Informasi dan Teknologi Hijau. Suryo Widiantoro, ST, MMSI, M.Com(IS) Pengantar Teknologi Informasi dan Teknologi Hijau Suryo Widiantoro, ST, MMSI, M.Com(IS) 1 Topics covered 1. Basic concept of managing files 2. Database management system 3. Database models 4. Data mining

More information

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: How business intelligence is a comprehensive framework to support business decision making How operational

More information

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS PART A 1. What are production reporting tools? Give examples. (May/June 2013) Production reporting tools will let companies generate regular operational reports or support high-volume batch jobs. Such

More information

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts Enn Õunapuu enn.ounapuu@ttu.ee Content Oveall approach Dimensional model Tabular model Overall approach Data modeling is a discipline that has been practiced

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 03 Architecture of DW Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Basic

More information

Summary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse

Summary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse Principles of Knowledge Discovery in bases Fall 1999 Chapter 2: Warehousing and Dr. Osmar R. Zaïane University of Alberta Dr. Osmar R. Zaïane, 1999 Principles of Knowledge Discovery in bases University

More information

Managing Data Resources

Managing Data Resources Chapter 7 Managing Data Resources 7.1 2006 by Prentice Hall OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Describe how

More information

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS Assist. Prof. Dr. Volkan TUNALI Topics 2 Business Intelligence (BI) Decision Support System (DSS) Data Warehouse Online Analytical Processing (OLAP)

More information

Introduction to Federation Server

Introduction to Federation Server Introduction to Federation Server Alex Lee IBM Information Integration Solutions Manager of Technical Presales Asia Pacific 2006 IBM Corporation WebSphere Federation Server Federation overview Tooling

More information

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY CHARACTERISTICS Data warehouse is a central repository for summarized and integrated data

More information

Chapter 3. Databases and Data Warehouses: Building Business Intelligence

Chapter 3. Databases and Data Warehouses: Building Business Intelligence Chapter 3 Databases and Data Warehouses: Building Business Intelligence How Can a Business Increase its Intelligence? Summary Overview of Main Concepts Details/Design of a Relational Database Creating

More information

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 1 Database Systems

Database Systems: Design, Implementation, and Management Tenth Edition. Chapter 1 Database Systems Database Systems: Design, Implementation, and Management Tenth Edition Chapter 1 Database Systems Objectives In this chapter, you will learn: The difference between data and information What a database

More information

1 Overview of Database Management

1 Overview of Database Management 1 Overview of Database Management 1.1 Data, information and knowledge 1.1.1 Data Data is/are the facts of the World. For example, take yourself. You may be 5ft tall, have brown hair and blue eyes. All

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 04-06 Data Warehouse Architecture Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended. Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide

More information

Chapter 3: AIS Enhancements Through Information Technology and Networks

Chapter 3: AIS Enhancements Through Information Technology and Networks Accounting Information Systems: Essential Concepts and Applications Fourth Edition by Wilkinson, Cerullo, Raval, and Wong-On-Wing Chapter 3: AIS Enhancements Through Information Technology and Networks

More information

QM Chapter 1 Database Fundamentals Version 10 th Ed. Prepared by Dr Kamel Rouibah / Dept QM & IS

QM Chapter 1 Database Fundamentals Version 10 th Ed. Prepared by Dr Kamel Rouibah / Dept QM & IS QM 433 - Chapter 1 Database Fundamentals Version 10 th Ed Prepared by Dr Kamel Rouibah / Dept QM & IS www.cba.edu.kw/krouibah Dr K. Rouibah / dept QM & IS Chapter 1 (433) Database fundamentals 1 Objectives

More information

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing Learning Objectives Understand the basic definitions and concepts of data warehouses Learn different

More information

Microsoft SQL Server Training Course Catalogue. Learning Solutions

Microsoft SQL Server Training Course Catalogue. Learning Solutions Training Course Catalogue Learning Solutions Querying SQL Server 2000 with Transact-SQL Course No: MS2071 Two days Instructor-led-Classroom 2000 The goal of this course is to provide students with the

More information

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining. About the Tutorial Data Mining is defined as the procedure of extracting information from huge sets of data. In other words, we can say that data mining is mining knowledge from data. The tutorial starts

More information

Data Mining & Data Warehouse

Data Mining & Data Warehouse Data Mining & Data Warehouse Associate Professor Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology (1) 2016 2017 1 Points to Cover Why Do We Need Data Warehouses?

More information

Exploiting Key Answers from Your Data Warehouse Using SAS Enterprise Reporter Software

Exploiting Key Answers from Your Data Warehouse Using SAS Enterprise Reporter Software Eploiting Key Answers from Your Data Warehouse Using SAS Enterprise Reporter Software Donna Torrence, SAS Institute Inc., Cary, North Carolina Juli Staub Perry, SAS Institute Inc., Cary, North Carolina

More information

Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g

Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g Vlamis Software Solutions, Inc. Founded in 1992 in Kansas City, Missouri Oracle Partner and reseller since 1995 Specializes

More information

DATA MINING TRANSACTION

DATA MINING TRANSACTION DATA MINING Data Mining is the process of extracting patterns from data. Data mining is seen as an increasingly important tool by modern business to transform data into an informational advantage. It is

More information

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research Priyanshu Gupta ETL Software Developer United Health Group Abstract- In this paper, the author has focused on explaining Data Warehousing and

More information

Syllabus. Syllabus. Motivation Decision Support. Syllabus

Syllabus. Syllabus. Motivation Decision Support. Syllabus Presentation: Sophia Discussion: Tianyu Metadata Requirements and Conclusion 3 4 Decision Support Decision Making: Everyday, Everywhere Decision Support System: a class of computerized information systems

More information

Managing Data Resources

Managing Data Resources Chapter 7 OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Managing Data Resources Describe how a database management system

More information

Pro Tech protechtraining.com

Pro Tech protechtraining.com Course Summary Description This course provides students with the skills necessary to plan, design, build, and run the ETL processes which are needed to build and maintain a data warehouse. It is based

More information

Chapter 3: Data Warehousing

Chapter 3: Data Warehousing Solution Manual Business Intelligence and Analytics Systems for Decision Support 10th Edition Sharda Instant download and all chapters Solution Manual Business Intelligence and Analytics Systems for Decision

More information

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997 1 of 8 5/24/02 4:43 PM A Systems Approach to Dimensional Modeling in Data Marts By Joseph M. Firestone, Ph.D. White Paper No. One March 12, 1997 OLAP s Purposes And Dimensional Data Modeling Dimensional

More information

1. Inroduction to Data Mininig

1. Inroduction to Data Mininig 1. Inroduction to Data Mininig 1.1 Introduction Universe of Data Information Technology has grown in various directions in the recent years. One natural evolutionary path has been the development of the

More information

Capturing Your Changed Data

Capturing Your Changed Data Capturing Your Changed Data with the CONNX Data Synchronization Tool Table of Contents Executive Summary 1 Fulfilling a Need with Minimal Investment 2 Departmental Reporting Servers 3 Data Migration 4

More information

Teradata Aggregate Designer

Teradata Aggregate Designer Data Warehousing Teradata Aggregate Designer By: Sam Tawfik Product Marketing Manager Teradata Corporation Table of Contents Executive Summary 2 Introduction 3 Problem Statement 3 Implications of MOLAP

More information

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A Data Warehousing and Decision Support [R&G] Chapter 23, Part A CS 432 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support business

More information

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting.

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting. DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting April 14, 2009 Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie,

More information

MCQ IN BCOM II SEMESTER MANAGEMENT INFORMTION SYSTEM

MCQ IN BCOM II SEMESTER MANAGEMENT INFORMTION SYSTEM MCQ IN BCOM II SEMESTER MANAGEMENT INFORMTION SYSTEM Multiple choice questions 1. Relational calculus is a a. Procedural language. b. None- Procedural language. c. Data definition language. d. High level

More information

An Overview of Data Warehousing and OLAP Technology

An Overview of Data Warehousing and OLAP Technology An Overview of Data Warehousing and OLAP Technology CMPT 843 Karanjit Singh Tiwana 1 Intro and Architecture 2 What is Data Warehouse? Subject-oriented, integrated, time varying, non-volatile collection

More information

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS LECTURE: 05 (A) DATA WAREHOUSING (DW) By: Dr. Tendani J. Lavhengwa lavhengwatj@tut.ac.za 1 My personal quote:

More information

Jet Enterprise Frequently Asked Questions

Jet Enterprise Frequently Asked Questions Pg. 1 03/18/2011 Jet Enterprise Regarding Jet Enterprise What are the software requirements for Jet Enterprise? The following components must be installed to take advantage of Jet Enterprise: SQL Server

More information

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores Announcements Shumo office hours change See website for details HW2 due next Thurs

More information

Meaning & Concepts of Databases

Meaning & Concepts of Databases 27 th August 2015 Unit 1 Objective Meaning & Concepts of Databases Learning outcome Students will appreciate conceptual development of Databases Section 1: What is a Database & Applications Section 2:

More information

Xcelerated Business Insights (xbi): Going beyond business intelligence to drive information value

Xcelerated Business Insights (xbi): Going beyond business intelligence to drive information value KNOWLEDGENT INSIGHTS volume 1 no. 5 October 7, 2011 Xcelerated Business Insights (xbi): Going beyond business intelligence to drive information value Today s growing commercial, operational and regulatory

More information

A Star Schema Has One To Many Relationship Between A Dimension And Fact Table

A Star Schema Has One To Many Relationship Between A Dimension And Fact Table A Star Schema Has One To Many Relationship Between A Dimension And Fact Table Many organizations implement star and snowflake schema data warehouse The fact table has foreign key relationships to one or

More information

Dr.G.R.Damodaran College of Science

Dr.G.R.Damodaran College of Science 1 of 20 8/28/2017 2:13 PM Dr.G.R.Damodaran College of Science (Autonomous, affiliated to the Bharathiar University, recognized by the UGC)Reaccredited at the 'A' Grade Level by the NAAC and ISO 9001:2008

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

Data, Information, and Databases

Data, Information, and Databases Data, Information, and Databases BDIS 6.1 Topics Covered Information types: transactional vsanalytical Five characteristics of information quality Database versus a DBMS RDBMS: advantages and terminology

More information

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures) CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm

More information

Zusammenfassung zur Vorlesung Data Warehousing

Zusammenfassung zur Vorlesung Data Warehousing Zusammenfassung zur Vorlesung Data Warehousing gehalten im WS 2003 von O. Univ. Prof. Dr. A Min Tjoa rn Februar 2004 1 Contents 1 The Compelling Need for Data Warehousing 4 1.1 Data Warehouse Defined............................

More information

Course chapter 8: Data Warehousing - Introduction. F. Radulescu - Data warehousing - introduction 1

Course chapter 8: Data Warehousing - Introduction. F. Radulescu - Data warehousing - introduction 1 Course chapter 8: Data Warehousing - Introduction F. Radulescu - Data warehousing - introduction 1 Some definitions Wikipedia: Data warehouse is a repository of an organization's electronically stored

More information