TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended.

Size: px
Start display at page:

Download "TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended."

Transcription

1 Previews of TDWI course books are provided as an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews can not be printed. TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended. This preview shows selected pages that are representative of the entire course book. The pages shown are not consecutive. The page numbers as they appear in the actual course material are shown at the bottom of each page. All table-of-contents pages are included to illustrate all of the topics covered by a course.

2 TDWI Dimensional Data Modeling Primer from Requirements to Business Analytics The Data Warehousing Institute i

3 The Data Warehousing Institute takes pride in the educational soundness and technical accuracy of all of our courses. Please give us your comments we d like to hear from you. Address your feedback to: info@dw-institute.com Publication Date: September 2004 Copyright 2004 by The Data Warehousing Institute. All rights reserved. No part of this document may be reproduced in any form, or by any means, without written permission from The Data Warehousing Institute. ii The Data Warehousing Institute

4 TABLE OF CONTENTS Module 1 Dimensional Modeling Concepts Module 2 Requirements Gathering for Dimensional Modeling Module 3 Logical Dimensional Modeling Module 4 From Logical Model to Star Schema Module 5 Dimensional Data and Business Analytics Appendix A Bibliography and References... A-1 Exercises Exercise Instructions and Worksheets W-1 The Data Warehousing Institute iii

5 Dimensional Modeling Concepts Module 1 Dimensional Modeling Concepts Topic Page Dimensional Modeling Basics 1-2 Comparing Relational and Dimensional Models 1-10 Concepts Summary 1-22 The Data Warehousing Institute 1-1

6 Dimensional Modeling Concepts TDWI Dimensional Data Modeling Primer Dimensional Modeling Basics Dimensional Data Models Conceptual Data Model? how many xyz in abc what is the rate of kjljklj how frequently does how long does it take to xxxx what does it cost to xxxxx what is the value of abcde how many times does xxxxx an analysis process model of business requirements starting from vague and uncertain evolving to specific and certain business questions list & fact/qualifier matrix Logical Data Model EMPLOYEE EMPLOYEE empl_number EMPLOYEE SATISFACTION empl_name empl_gender satisfaction_score empl_length_of_svc empl_age job_turnover_rate avg_length_of_employ ment complaint_frequency TIME disciplinary_action_fre quency MONTH date calendar_year calendar_month fiscal_year fiscal_month JOB DEPARTMENT dept_code dept_name JOB job_id job_description job_title job_shift_code job_location_code job_status_code a business (non-technical) design process model of business solution starting from specific business requirements evolving to product specification logical dimensional data model Physical Data Model location year location_code date_yyyy location_name fiscal_yy location_address location_city_name location_state_abbr month location_zip_code date_yyyymm fiscal_yyyymm fiscal_yyyyqq labor union union_id_number union_name union_group_code employee satisfaction employee emp_id_number emp_age emp_gender emp_name emp_hire_date emp_term_date emp_status_code emp_term_reason department dept_number dept_name dept_abbr an implementation (technical) design process model of technology solution starting from product specification evolving to database specification star schema trade trade_code trade_soc_code trade_name contract_start_date contract_end_date job_change_count employment_length_months complaint_count resignation_count termination_count promotion_count demotion_count disciplinary_action_count job job_id_number job_title job_shift_code job_shift_name 1-8 The Data Warehousing Institute

7 Dimensional Modeling Concepts Dimensional Modeling Basics Dimensional Data Models LEVELS OF MODELING Dimensional data design, like any other design process, involves a transition from abstract to highly specific. Abstract models are a means to understand requirements, while implementation models must specify the solution precisely. A three-level approach to data modeling works well where: Conceptual modeling describes the business needs. Logical modeling describes the business solution. Physical modeling specifies the technology solution. CONCEPTUAL MODELS LOGICAL MODELS Conceptual models are produced through analysis of business needs, and are intended to structure business requirements such that they can be verified and used as input to logical data modeling. When designing dimensional data, conceptual models include a representative list of business questions and analysis of those questions to understand them as a collection of data facts and data qualifiers. Logical modeling is a point of transition from analysis to design. Logical models describe the design of a solution to requirements expressed by conceptual models. A logical model is analogous with a product specification, describing what is to be produced but not detailing how it will be assembled. A logical dimensional model meets this requirement for design of dimensional data. PHYSICAL MODELS Physical models are produced by technical design processes. They describe and specify technology solutions. The physical design process transforms a logical model into a specification for implementation. Physical modeling adds the details necessary to describe how a product is structured and assembled. When designing dimensional data, the most common and widely-accepted physical model is a star-schema. The Data Warehousing Institute 1-9

8 Dimensional Modeling Concepts TDWI Dimensional Data Modeling Primer Comparing Relational and Dimensional Models A Quick Review of Relational Modeling EMPLOYEE empl_number empl_name empl_gender empl_hire_date empl_birth_date performs held by JOB job_id job_description job_title job_shift_code job_location_code job_status_code owned by owns changed by changes DEPARTMENT dept_code dept_name PERSONNEL ACTION is one of PA_id_number PA_type_code PA_effective_date PA_comments hiring approval_code hire_authority_id resignation resign_reason voluntary_code termination term_authority_code for_cause_code 1-10 The Data Warehousing Institute

9 Dimensional Modeling Concepts Comparing Relational and Dimensional Models A Quick Review of Relational Modeling ENTITY- RELATIONSHIP MODELS This diagram illustrates a simple entity-relationship (ER) model. The primary components of an ER model are: Entity An entity is a subject about which the business has the need, will, and means to collect data a person, place, thing, concept, or event that is of business interest. Entities are represented as labeled boxes in the ER model. Examples in the diagram include EMPLOYEE, JOB, and DEPARTMENT. Attribute An attribute is a property or characteristic of an entity that can be collected as data. Attributes are listed inside the box of the entities that they describe. Job_description and job_title, for example, are attributes of the entity JOB. Relationship A relationship is an important association among entities that is of business interest and may be collected as data. Examples of relationships in this diagram include EMPLOYEE PERFORMS JOB and DEPARTMENT OWNS JOB. Other important ER concepts include: Cardinality which describes the number of occurrences of each entity type that may participate in an occurrence of a relationship. Cardinality options include zero or one, exactly one, one or more, and zero more. In this diagram some of the relationship cardinalities are: a PERSONNEL ACTION changes exactly one JOB, a JOB is changed by one or more PERSONNEL ACTIONs, a JOB is held by zero or one EMPLOYEEs. Specialization (also called subtyping) creates a hierarchy or parent/child relationship between an ENTITY super-type and its subtypes. Specialization makes sense when the entity sub-types have unique attributes or participate in relationships not common to all subtypes. In this diagram PERSONNEL ACTION is an entity super-type with three sub-types. The Data Warehousing Institute 1-11

10 Dimensional Modeling Concepts TDWI Dimensional Data Modeling Primer Comparing Relational and Dimensional Models Introduction to Logical Dimensional Modeling EMPLOYEE EMPLOYEE empl_number empl_name empl_gender empl_length_of_svc empl_age TIME MONTH date calendar_year calendar_month fiscal_year fiscal_month EMPLOYEE SATISFACTION satisfaction_score job_turnover_rate avg_length_of_employment complaint_frequency disciplinary_action_frequency resignation_rate termination_rate promotion_rate demotion_rate JOB DEPARTMENT dept_code dept_name JOB job_id job_description job_title job_shift_code job_location_code job_status_code 1-12 The Data Warehousing Institute

11 Dimensional Modeling Concepts Comparing Relational and Dimensional Models Introduction to Logical Dimensional Modeling COMPONENTS AND STRUCTURE OF THE DIMENSIONAL MODEL This diagram illustrates a logical dimensional model. The components of the model include: A Meter that contains related measures of business interest. Each logical dimensional model has only one meter. In this example, the meter is EMPLOYEE SATISFACTION. Related measures include satisfaction_score, job_turnover_rate, avg_length_of_employment, complaint_frequency, etc. Visually, meters in a logical dimensional model look much like entities in the ER model, and measures look much like attributes in the ER model. Dimensions that provide the means to select, sort, filter and summarize business measures. In this diagram the dimensions are EMPLOYEE, TIME, and JOB. Dimension Levels describe hierarchies that exist within dimensions. This diagram contains one multi-level dimension JOB which contains dimensions levels of DEPARTMENT and JOB. The EMPLOYEE dimension contains dimension level EMPLOYEE, and the TIME dimension contains dimension level MONTH. Visually, dimension levels in a logical dimensional model look much like entities in an ER model. Dimension Attributes include identifiers and descriptive data about dimension levels. Some of the dimension attributes of JOB are job_id, job_description, and job_title. Dimension attributes are diagrammed in the same way as attributes of entities in an ER model. Associations within a dimension describe the dimension hierarchy and are represented as one-to-many relationships from one dimension level to another for example one DEPARTMENT contains many JOBS. Dimension to meter associations are represented as one-to-many relationships from the lowest level of each dimension to the meter. In this diagram the examples are MONTH to EMPLOYEE SATISFACTION, JOB to EMPLOYEE SATISFACTION, and EMPLOYEE to EMPLOYEE SATISFACTION. What this means is that each unique set of EMPLOYEE SATISFACTION measures is for one MONTH, one JOB, and one EMPLOYEE. The Data Warehousing Institute 1-13

12 Dimensional Modeling Concepts TDWI Dimensional Data Modeling Primer Comparing Relational and Dimensional Models A Basis for Comparison How would you represent employee satisfaction in an entity-relationship model? Some business questions: 1. Which departments have the highest employee satisfaction? Which have the lowest? 2. What is the average length of employment by trade and department? 3. What is the job turnover rate by department, employee age, and gender? 4. What is the frequency of complaints by job location? By shift? By trade? 5. Is there a relationship between complaint frequency and union representation? 6. Do age and gender affect promotion & demotion rates in some departments? 7. Do rotating shift jobs have higher rates of complaints & disciplinary actions? 8. Can we profile the factors for jobs with high termination rates? 1-16 The Data Warehousing Institute

13 Dimensional Modeling Concepts Comparing Relational and Dimensional Models A Basis for Comparison STARTING WITH BUSINESS QUESTIONS Business questions are the starting place for dimensional modeling and the basis for business metrics. The questions listed here are examples of the kinds of business questions that may be of interest for business performance in the area of EMPLOYEE SATISFACTION. These questions are used as the foundation of examples throughout the course. The Data Warehousing Institute 1-17

14 Requirements Gathering for Dimensional Modeling Module 2 Requirements Gathering for Dimensional Modeling Topic Page Business Context for Data Modeling 2-2 Business Questions as Requirements Models 2-6 Fact/Qualifier Analysis 2-12 Requirements Gathering Summary 2-20 The Data Warehousing Institute 2-1

15 This page intentionally left blank.

16 Requirements Gathering for Dimensional Modeling TDWI Dimensional Data Modeling Primer Business Questions as Requirements Models Addressing People, Performance, and Processes Enterprise Metrics / KPIs executive context Process Metrics vision and strategy Growth Metrics Organizational Metrics senior manager / department head context Process Metrics line manager / process manager context Financial Metrics Customer Metrics Activity Metrics knowledge worker context business processes events / transactions activities sources inputs workforce product customers Develop a robust, representative set of business questions that represents the perspectives of: business performance metrics and measures (financial, process, customer, growth) people who will use those metrics and measures (from executives to knowledge workers) the business processes in which those people participate (all parts of the processes customers, products, activities, workforce, events, transactions, inputs, and sources) 2-6 The Data Warehousing Institute

17 Requirements Gathering for Dimensional Modeling Business Questions as Requirements Models Addressing People, Performance, and Processes PUTTING IT ALL TOGETHER PEOPLE PERFORMANCE PROCESSES Requirements modeling begins with business questions the foundation for all subsequent dimensional modeling activities. The first context-level model is a robust and representative list of business questions that are within the scope of the project. Note the key words robust and representative. It is not possible to develop a complete and exhaustive list of all questions that might be asked. The objective is to create a list of questions that is fertile enough to drive development of a highlyadaptable data structure that is capable of answers to questions not yet known. The people, performance, and process perspectives help to ensure richness in the list of questions. Who are the stakeholders of the data marts or analytic applications being developed Executives? Managers? Knowledge workers? What metrics do they use today? What unanswered questions do they have? What levels of metrics do they need? Which classes of metrics are within the scope of modeling Financial? Process? Customer? Growth? How do stakeholder questions relate to these four categories of performance management? How do they relate to the KPIs for the business? Which business processes are within the scope of modeling? What can be measured about those processes? What metrics are needed to effectively manage the processes? What metrics are needed for knowledge workers to perform the process activities? Which process components make good metrics subjects? The Data Warehousing Institute 2-7

18 Requirements Gathering for Dimensional Modeling TDWI Dimensional Data Modeling Primer Business Questions as Requirements Models A Framework for Business Questions business questions (BPM / Customer) business questions (CRM / Product) Activity Process Organization Enterprise Customer Management Disciplines CRM BPM SCM BAM etc. Subject Areas Product Supplier Workforce Cost Value etc. Duration etc. business questions (SCM / Supplier) business questions (BAM / Workforce) 2-8 The Data Warehousing Institute

19 Requirements Gathering for Dimensional Modeling Business Questions as Requirements Models A Framework for Business Questions A DIMENSIONAL VIEW OF METRICS The diagram on the facing page illustrates a three dimensional view that is useful in managing a comprehensive set of business metrics. The dimensions are: The levels of metrics enterprise, organization, process, and activity that were previously discussed. The major subject areas of your business, typically found in a subject area model and the basis for the subject-oriented data warehouse. Subjects vary depending on the industry and the things that make your business unique. They include such things as customer, supplier, product, service, finance, etc. Performance management methods that are used by your business. These include disciplines such as business performance management (BPM), customer relationship management (CRM), supply chain management (SCM), etc. INSIDE THE FRAMEWORK METRIC CLASS WORDS Each intersection of the three dimensions is an opportunity to find business questions and to manage business metrics. At the intersection of enterprise, product, and CRM for example, you may ask: What needs to be known about products at the executive level to make CRM effective? What are the KPIs (from enterprise level) for products that are important to CRM? How are product measures for CRM related to those for BPM and SCM? How are product measures for CRM related to customer measures for CRM? And many more. In this way, the framework becomes both a tool to discover questions and a tool to manage cohesion and consistency across a collection of metrics. Similar to the class words of common data elements (date, code, etc.) metrics have class words (i.e., cost, value, duration, count) that describe the kind of quantity that they express. Classification is useful both in finding metrics (what can we measure?) and in achieving consistency among a set of related metrics. The Data Warehousing Institute 2-9

20 Requirements Gathering for Dimensional Modeling TDWI Dimensional Data Modeling Primer Fact/Qualifier Analysis From Business Requirements to Data Requirements Mapping of Business Questions FACTS: Discrete items of business information. QUALIFIERS: Criteria used to access, sort, group, summarize, and present information. associations among facts & qualifiers 2-12 The Data Warehousing Institute

21 Requirements Gathering for Dimensional Modeling Fact/Qualifier Analysis From Business Requirements to Data Requirements STRUCTURING REQUIREMENTS FOR DATA Business questions are the basis of requirements analysis, but they are not the final result. Full understanding of requirements is achieved through analysis of those questions. Requirements are much more than simply knowing which data is needed. To provide a foundation for logical design you ll also need to know how it will be accessed, navigated, and used. Business questions begin the transition to more formal data models through a matrix modeling technique known as fact/qualifier analysis. Each business question is examined to determine the fact(s) to be contained in the answer and the qualifiers that give the fact(s) meaning. The terms fact and qualifier describe roles that a data item assumes in a specific instance of business usage. Facts are discrete items of business information. Qualifiers are criteria used to access, sort, group, summarize, and present information. A single data item may be a fact in one instance and a qualifier in another. Consider, for example, the business question What is the average value of inventory for each store at the end of each month? The fact inherent in this question is average value of inventory the answer that the business person wants to obtain. The qualifiers found in the question are store and month the additional data needed to give the answer context and meaning. The Data Warehousing Institute 2-13

22 Logical Dimensional Modeling Module 3 Logical Dimensional Modeling Topic Page Modeling Meters and Measures 3-2 Modeling Dimensions 3-4 More about Meters and Measures 3-12 Logical Modeling Summary 3-18 Th Data Warehousing Institute 3-1

23 This page intentionally left blank.

24 Logical Dimensional Modeling TDWI Dimensional Data Modeling Primer Modeling Dimensions Adding Dimensions from the Qualifiers labor union year employee emp_age emp_gender trade month employee satisfaction job_turnover_rate avg_length_of_employment number_of_complaints location department shift job turnover rate avg. length of employment number of complaints department employee gender employee age trade year shift month labor union location 1, The Data Warehousing Institute

25 Logical Dimensional Modeling Modeling Dimensions Adding Dimensions from the Qualifiers DIMENSIONS IN THE LOGICAL MODEL Once dimension hierarchies are known, the logical dimensional model is extended by adding dimensions and associating them with the meter. For every measure contained in the meter, all associated qualifiers must be represented by a dimension. To add dimensions to the model: 1. Include each dimension hierarchy previously identified. Associate only the lowest level of each hierarchy with the meter as a one-tomany relationship. 2. Examine the remaining qualifiers to find those that may be dimension attributes instead of dimension levels. In this example, employee gender and employee age are attributes that describe employee. Thus employee becomes the dimension level with age and gender modeled as attributes. The dimension level employee is associated with the meter as a one-to-many relationship. 3. All remaining qualifiers are mapped as flat (single- level, nonhierarchical) dimensions, and each is associated with the meter using a one-to-many relationship. The Data Warehousing Institute 3-7

26 Logical Dimensional Modeling TDWI Dimensional Data Modeling Primer Modeling Dimensions Completing the Dimensions location location_code location_name location_address location_city_name location_state_abbr location_zip_code labor union union_id_number union_name union_group_code trade trade_code trade_soc_code trade_name contract_start_date contract_end_date year date_yyyy fiscal_yy month date_yyyymm fiscal_yyyymm fiscal_yyyyqq employee satisfaction job_turnover_rate avg_length_of_employment number_of_complaints employee emp_id_number emp_age emp_gender emp_name emp_hire_date emp_term_date emp_status_code emp_term_reason department dept_number dept_name dept_abbr job job_id_number job_title job_shift_code job_shift_name 3-10 The Data Warehousing Institute

27 Logical Dimensional Modeling Modeling Dimensions Completing the Dimensions IDENTIFY THE DIMENSION ATTRIBUTES Logical modeling of the dimensions is completed by expanding the list of attributes for each dimension level. Recall the earlier statement that The analysis and design processes of dimensional modeling extend, expand, and refine the collection of measures and dimensions such that the resulting data structure has the ability to answer many questions not found on the original list. This refinement activity is one of the key areas where that expansion occurs. Examine each dimension level with the following questions: What operational system identifiers are used for the dimension level? What is the natural identifier or key for the dimension level? What attributes of the dimension level appear in operational systems reporting? What attributes of the dimension level are collected as part of business transactions? What dimension level attributes are available in the source data? What dimension attributes are useful for business analysis? The Data Warehousing Institute 3-11

28 Logical Dimensional Modeling TDWI Dimensional Data Modeling Primer More about Meters and Measures Granularity and the Measures location location_code location_name location_address location_city_name location_state_abbr location_zip_code labor union union_id_number union_name union_group_code trade trade_code trade_soc_code trade_name contract_start_date contract_end_date year date_yyyy fiscal_yy month date_yyyymm fiscal_yyyymm fiscal_yyyyqq employee satisfaction job_turnover_rate job_change_count avg_length_of_employment employment_length_months number_of_complaints complaint_count each instance represents a unique combination of 1 month, 1 employee, 1 job, 1 location, and 1 trade employee emp_id_number emp_age emp_gender emp_name emp_hire_date emp_term_date emp_status_code emp_term_reason department dept_number dept_name dept_abbr job job_id_number job_title job_shift_code job_shift_name 3-14 The Data Warehousing Institute

29 Logical Dimensional Modeling More about Meters and Measures Granularity and the Measures WHAT CAN BE MEASURES AT THE LEVEL OF DETAIL? Once the grain has been declared, examine the measures to be certain that they make sense in context of the grain. In this example the following refinements have been applied: Job_turnover_rate is replaced with job_change_count. It is not possible to determine a turnover rate for a single employee and month. It is possible, however, to count the number of job changes for a single employee in one month. And it is practical with a collection of data for many employees and multiple months, and using an OLAP tool, to derive and display job_turnover_rate. Avg_length_of_employment is replaced with the measure of employment_length_months. An average for one employee has the same numeric value as that employee s individual length of employment. Calculating the average for a group of employees is an OLAP function. Number_of_complaints is replaced with complaint_count. This is a syntactic and naming conventions preference, and is aligned with the concept of metric class words that was discussed earlier in the course. Consistency of naming between job_change_count and complaint_count makes the data more understandable and usable for the business. Apply similar standards for other classes of data values date, average, minimum, maximum, mean, rate, duration, etc. The Data Warehousing Institute 3-15

30 Physical Dimensional Modeling Module 4 From Logical Model to Star Schema Topic Page Star Schema Dimensions 4-2 Star Schema Fact Tables 4-10 Star Schema Design Challenges 4-14 Modeling Process Summary 4-30 The Data Warehousing Institute 4-1

31 This page intentionally left blank.

32 Physical Dimensional Modeling TDWI Dimensional Data Modeling Primer Star Schema Dimensions Modeling Dimension Tables location location_code location_name location_address location_city_name location_state_abbr location_zip_code labor organization union_id_number union_name union_group_code trade_code trade_soc_code trade_name contract_start_date contract_end_date time date_yyyymm fiscal_yyyymm fiscal_yyyyqq employee satisfaction job_change_count employment_length_months complaint_count resignation_count termination_count promotion_count demotion_count disciplinary_action_count satisfaction_score employee emp_id_number emp_age emp_gender emp_name emp_hire_date emp_term_date emp_status_code emp_term_reason employment organization dept_number dept_name dept_abbr job_id_number job_title job_shift_code job_shift_name 4-4 The Data Warehousing Institute

33 Physical Dimensional Modeling Star Schema Dimensions Modeling Dimension Tables ONE TABLE PER DIMENSION Up to this point in the modeling process each dimension is normalized, with dimension hierarchies intact and data redundancy eliminated. Starschema design de-normalizes the dimensions, collapsing each dimension to a single table regardless of the number of dimension levels. Dimension attributes at higher levels of a hierarchy are kept redundantly with each occurrence of the lowest level in the dimension. This design, while suboptimal for transaction processing, is highly effective for analytical processing where access and navigation not update integrity and performance are the driving factors. This is the point of transition from logical (business solution) design to physical (technical implementation) design. Although the star-schema looks much like the logical model, the components are different: Where the logical model has a meter (a business performance measurement concept) the physical model has a fact table (a database table). The fact table contains facts the data elements to physically implement business measures. Where the logical model has dimension levels and relationships within them and to the meter (the business context of measures) the physical model has dimension tables (again database tables). The dimension tables contain dimension elements the data elements to physically implement dimension attributes. The Data Warehousing Institute 4-5

34 Physical Dimensional Modeling TDWI Dimensional Data Modeling Primer Star Schema Fact Tables Defining the Fact Table Key location location_code location_name location_address location_city_name location_state_abbr location_zip_code location_key labor organization union_id_number union_name union_group_code trade_code trade_soc_code trade_name contract_start_date contract_end_date labor_org_key time date_yyyymm fiscal_yyyymm fiscal_yyyyqq time_key employee satisfaction job_change_count employment_length_months complaint_count resignation_count termination_count promotion_count demotion_count disciplinary_action_count satisfaction_score time_key emp_key location_key employment_org_key labor_org_key emp_age emp_gender employee emp_id_number emp_age emp_gender emp_name emp_hire_date emp_term_date emp_status_code emp_term_reason emp_key employment organization dept_number dept_name dept_abbr job_id_number job_title job_shift_code job_shift_name employment_org_key 4-10 The Data Warehousing Institute

35 Physical Dimensional Modeling Star Schema Fact Tables Defining the Fact Table Key DIMENSION TO FACT NAVIGATION The foundation of OLAP processing is navigation from a set of dimensions to the facts associated with those dimensions. For example, the business question 4. What is the rate of job turnover by department, shift, and location? How does it change from month to month? asks that a specific measure (rate of job turnover) be reported by a group of dimensions (department, shift, location, and month). Each unique value of job_turnover_rate is determined from a unique combination of key values for all of the participating dimensions. Note that job_turnover_rate does not exist in the star schema on the facing page. It is a measure that must be calculated based on the values of other data in the fact table. Derived measures are discussed on the next set of pages. A COMPOSITE KEY TABLELESS DIMENSIONS The fact table key is simply the composite of all dimension table keys a concatenation of dimension surrogate keys. This works well because the keys are used only for navigation by the OLAP tool and are never exposed to a business user of the tool. Note that the dimensions tables for employee age and gender have been removed. Once the keys emp_age and emp_gender are migrated into the fact table there is no need to maintain them redundantly as single column tables. It is also acceptable, but not essential, to remove those elements from the employee dimension table. The Data Warehousing Institute 4-11

36 Physical Dimensional Modeling TDWI Dimensional Data Modeling Primer Star Schema Fact Tables Supporting Calculated Measures location location_code location_name location_address location_city_name location_state_abbr location_zip_code location_key labor organization union_id_number union_name union_group_code trade_code trade_soc_code trade_name contract_start_date contract_end_date labor_org_key time date_yyyymm fiscal_yyyymm fiscal_yyyyqq time_key employee satisfaction job_change_count employment_length_months complaint_count resignation_count termination_count promotion_count demotion_count disciplinary_action_count satisfaction_score employee_count time_key emp_key location_key employment_org_key labor_org_key emp_age emp_gender employee emp_id_number emp_age emp_gender emp_name emp_hire_date emp_term_date emp_status_code emp_term_reason emp_key employment organization dept_number dept_name dept_abbr job_id_number job_title job_shift_code job_shift_name employment_org_key 4-12 The Data Warehousing Institute

37 Physical Dimensional Modeling Star Schema Fact Tables Supporting Calculated Measures FACTS NEEDED FOR CALCULATION Continuing the example from the previous pages, recall that job_turnover_rate does not exist in the star schema. It is a measure that must be calculated based on the values of other data in the fact table. It is essential then, to ensure that all data needed to calculate the measure exists in the fact table. In this example, job_turnover_rate is calculated as job_change_count divided by the number of employees in the dimension set. One of the needed facts job_turnover_rate exists in the fact table. The other necessary item number of employees does not. Depending on the capabilities of the OLAP tool(s) being used, it may be necessary to include employee_count as a fact. The star schema design on the facing page includes employee_count. Given the grain of the fact table each row represents a unique combination of one employee, one month, one job, one location, and one trade the value of employee_count will always be exactly one. Thus the value of job_turnover_rate will always be equal to the value of job_change_count at the row level. However, when data values are accumulated for several rows for a dimension set job_turnover_rate becomes a meaningful measure. The Data Warehousing Institute 4-13

38 Physical Dimensional Modeling TDWI Dimensional Data Modeling Primer Star Schema Design Challenges Slowly Changing Dimensions Type 2 Example Location Table as of January 1 location_key location_code location_name location_address location_city location_state NWR001 retail store SE Stark Portland OR NWR002 retial store S. Main Roseburg OR NWWHSE warehouse 180 N. Broadw Portland OR Location Table as of September 1 location_key location_code location_name location_address location_city location_state NWR001 retail store SE Stark Portland OR NWR002 Product Table retial store September S Main Roseburg OR NWWHSE warehouse 180 N. Broadw Portland OR NWD001 dist. center N. Broadw Portland OR NWD002 dist. center Coburg Rd Eugene OR Keep Current & History Rows in Dimension Table 4-22 The Data Warehousing Institute

39 Physical Dimensional Modeling Star Schema Design Challenges Slowly Changing Dimensions Type 2 Example KEEPING ALL OF THE HISTORY Type 2 dimensions insert a new row, with a new surrogate key value, into the dimension table whenever a change occurs to the value of a dimension element. This is the most common design choice for slowly changing dimensions. The obvious advantage is that a type 2 dimension preserves all of the history of dimension data changes. For a structure that is also dimensioned by time, there is no need to time-stamp dimension records. Each row of the fact table is associated with only one row of each dimension table, thus the time dimension serves as the time stamp. The most significant disadvantage of type 2 dimensions is rapid growth. Type 2 dimensions may result in very large dimension tables contributing to both database size and query performance concerns. The Data Warehousing Institute 4-23

40 Physical Dimensional Modeling TDWI Dimensional Data Modeling Primer Star Schema Design Challenges Semi-Additive and Non-Additive Facts location location_code location_name location_address location_city_name location_state_abbr location_zip_code location_key labor organization union_id_number union_name union_group_code trade_code trade_soc_code trade_name contract_start_date contract_end_date labor_org_key time date_yyyymm fiscal_yyyymm fiscal_yyyyqq time_key employee satisfaction job_change_count employment_length_months complaint_count resignation_count termination_count promotion_count demotion_count disciplinary_action_count satisfaction_score employee_count time_key emp_key location_key employment_org_key labor_org_key emp_age emp_gender employee emp_id_number emp_age emp_gender emp_name emp_hire_date emp_term_date emp_status_code emp_term_reason emp_key employment organization dept_number dept_name dept_abbr job_id_number job_title job_shift_code job_shift_name employment_org_key 4-28 The Data Warehousing Institute

41 Physical Dimensional Modeling Star Schema Design Challenges Semi-Additive and Non-Additive Facts MEASURES AND SUMMARIZATION RULES One of the powerful features of OLAP processing is the ability to summarize measures using hierarchical dimensions to sum monthly sales measures, for example, to report a year-to-date sales. Not all facts, however, should be summarized for all dimensions. Each measure in a fact table needs to be examined to determine whether it is: Additive Producing useful and meaningful information when summarized along any set of dimensions. Semi-Additive Useful to summarize using some dimensions but not all dimensions. Non-Additive Impractical to sum along any set of dimensions. AN EXAMPLE The employee satisfaction star schema illustrates an example of a semiadditive fact employee_count. To count the employees in a given trade for a specific month, you could sum employee_count using those two dimensions and produce a meaningful result. The same approach could be used to count employees in a specific department in any month. Yet, consider what happens if the goal is to count employees of a department during the first quarter of the year. Any employee who worked in that department throughout the quarter will have three rows one for January, one for February, and one for March. Each row has an employee_count value of one. Each of those employees would be counted three times, resulting in an incorrect total. Employee_count is a semi-additive fact because it can be summed along most dimensions, but may not be summarized up the hierarchy of the time dimension. THE SOLUTION The solution to semi-additive and non-additive facts depends on the specific OLAP tool(s) being used. Some tools have features to specify summarization constraints. When your tool has these features it is important that the constraints be specified. Without these features, business user education and describing summarization constraints in the metadata are helpful. The Data Warehousing Institute 4-29

42 Physical Dimensional Modeling TDWI Dimensional Data Modeling Primer Modeling Process Summary From Business Requirements to Star Schema business requirements define the scope (metrics for business management) develop a list of business questions map questions into fact/qualifier matrix dimensional modeling logical dimensional modeling model the meter model the dimensions refine the meter name the meter select measures from F/Q matrix select qualifiers from F/Q matrix find hierarchies in the qualifiers add dimensions & associate with meter refine the dimensions add dimension attributes declare the grain refine the measures physical dimensional modeling name the dimensions model the dimension tables define the dimension table keys define the fact table key 4-30 The Data Warehousing Institute

43 Physical Dimensional Modeling Modeling Process Summary From Business Requirements to Star Schema PROCESS OVERVIEW The facing page illustrates dimensional data design activities from requirements through physical design. This diagram provides a simple, easy-to-reference summary of the dimensional modeling process from beginning to end. The Data Warehousing Institute 4-31

44 Dimensional Data and Business Analytics Module 5 Dimensional Data and Business Analytics Topic Page Delivering Business Value 5-2 An OLAP Demonstration 5-4 The Data Warehousing Institute 5-1

45 This page intentionally left blank.

46 Dimensional Data and Business Analytics TDWI Dimensional Data Modeling Primer An OLAP Demonstration Business Metrics in Action location location_code location_name location_address location_city_name location_state_abbr location_zip_code location_key labor organization union_id_number union_name union_group_code trade_code trade_soc_code trade_name contract_start_date contract_end_date labor_org_key time date_yyyymm fiscal_yyyymm fiscal_yyyyqq time_key employee satisfaction job_change_count employment_length_months complaint_count resignation_count termination_count promotion_count demotion_count disciplinary_action_count satisfaction_score employee_count time_key emp_key location_key employment_org_key labor_org_key emp_age emp_gender employee emp_id_number emp_age emp_gender emp_name emp_hire_date emp_term_date emp_status_code emp_term_reason emp_key employment organization dept_number dept_name dept_abbr job_id_number job_title job_shift_code job_shift_name employment_org_key 5-4 The Data Warehousing Institute

47 Dimensional Data and Business Analytics An OLAP Demonstration Business Metrics in Action OLAP DEMO This OLAP demonstration completes the chain of activity from requirements to business analytics. NOTES The Data Warehousing Institute 5-5

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended. Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended. Previews of TDWI course books are provided as an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews can not be printed. TDWI strives

More information

Extended TDWI Data Modeling: An In-Depth Tutorial on Data Warehouse Design & Analysis Techniques

Extended TDWI Data Modeling: An In-Depth Tutorial on Data Warehouse Design & Analysis Techniques : An In-Depth Tutorial on Data Warehouse Design & Analysis Techniques Class Format: The class is an instructor led format using multiple learning techniques including: lecture to present concepts, principles,

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

FROM A RELATIONAL TO A MULTI-DIMENSIONAL DATA BASE

FROM A RELATIONAL TO A MULTI-DIMENSIONAL DATA BASE FROM A RELATIONAL TO A MULTI-DIMENSIONAL DATA BASE David C. Hay Essential Strategies, Inc In the buzzword sweepstakes of 1997, the clear winner has to be Data Warehouse. A host of technologies and techniques

More information

Data Warehousing. Overview

Data Warehousing. Overview Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake

More information

Data Strategies for Efficiency and Growth

Data Strategies for Efficiency and Growth Data Strategies for Efficiency and Growth Date Dimension Date key (PK) Date Day of week Calendar month Calendar year Holiday Channel Dimension Channel ID (PK) Channel name Channel description Channel type

More information

This module presents the star schema, an alternative to 3NF schemas intended for analytical databases.

This module presents the star schema, an alternative to 3NF schemas intended for analytical databases. Topic 3.3: Star Schema Design This module presents the star schema, an alternative to 3NF schemas intended for analytical databases. Star Schema Overview The star schema is a simple database architecture

More information

Data warehouse architecture consists of the following interconnected layers:

Data warehouse architecture consists of the following interconnected layers: Architecture, in the Data warehousing world, is the concept and design of the data base and technologies that are used to load the data. A good architecture will enable scalability, high performance and

More information

Basics of Dimensional Modeling

Basics of Dimensional Modeling Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimension

More information

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended. Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide

More information

Information Management Fundamentals by Dave Wells

Information Management Fundamentals by Dave Wells Information Management Fundamentals by Dave Wells All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks

More information

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses Designing Data Warehouses To begin a data warehouse project, need to find answers for questions such as: Data Warehousing Design Which user requirements are most important and which data should be considered

More information

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com Objectives Explain the basics of: 1. Data

More information

Chapter 3. The Multidimensional Model: Basic Concepts. Introduction. The multidimensional model. The multidimensional model

Chapter 3. The Multidimensional Model: Basic Concepts. Introduction. The multidimensional model. The multidimensional model Chapter 3 The Multidimensional Model: Basic Concepts Introduction Multidimensional Model Multidimensional concepts Star Schema Representation Conceptual modeling using ER, UML Conceptual modeling using

More information

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting.

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting. DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting April 14, 2009 Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie,

More information

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997 1 of 8 5/24/02 4:43 PM A Systems Approach to Dimensional Modeling in Data Marts By Joseph M. Firestone, Ph.D. White Paper No. One March 12, 1997 OLAP s Purposes And Dimensional Data Modeling Dimensional

More information

Oracle Database 11g: Data Warehousing Fundamentals

Oracle Database 11g: Data Warehousing Fundamentals Oracle Database 11g: Data Warehousing Fundamentals Duration: 3 Days What you will learn This Oracle Database 11g: Data Warehousing Fundamentals training will teach you about the basic concepts of a data

More information

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394 Data Mining Data warehousing Hamid Beigy Sharif University of Technology Fall 1394 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1394 1 / 22 Table of contents 1 Introduction 2 Data warehousing

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information

Business Intelligence. You can t manage what you can t measure. You can t measure what you can t describe. Ahsan Kabir

Business Intelligence. You can t manage what you can t measure. You can t measure what you can t describe. Ahsan Kabir Business Intelligence You can t manage what you can t measure. You can t measure what you can t describe Ahsan Kabir A broad category of applications and technologies for gathering, storing, analyzing,

More information

Cognos also provides you an option to export the report in XML or PDF format or you can view the reports in XML format.

Cognos also provides you an option to export the report in XML or PDF format or you can view the reports in XML format. About the Tutorial IBM Cognos Business intelligence is a web based reporting and analytic tool. It is used to perform data aggregation and create user friendly detailed reports. IBM Cognos provides a wide

More information

DATA MINING AND WAREHOUSING

DATA MINING AND WAREHOUSING DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making

More information

Data Modeling: Beginning and Advanced HDT825 Five Days

Data Modeling: Beginning and Advanced HDT825 Five Days Five Days Prerequisites Students should have experience designing databases. Who Should Attend This course is targeted at database designers, data modelers, database analysts, and anyone else who needs

More information

Lectures for the course: Data Warehousing and Data Mining (IT 60107)

Lectures for the course: Data Warehousing and Data Mining (IT 60107) Lectures for the course: Data Warehousing and Data Mining (IT 60107) Week 1 Lecture 1 21/07/2011 Introduction to the course Pre-requisite Expectations Evaluation Guideline Term Paper and Term Project Guideline

More information

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Data Warehouses Chapter 12. Class 10: Data Warehouses 1 Data Warehouses Chapter 12 Class 10: Data Warehouses 1 OLTP vs OLAP Operational Database: a database designed to support the day today transactions of an organization Data Warehouse: historical data is

More information

QM Chapter 1 Database Fundamentals Version 10 th Ed. Prepared by Dr Kamel Rouibah / Dept QM & IS

QM Chapter 1 Database Fundamentals Version 10 th Ed. Prepared by Dr Kamel Rouibah / Dept QM & IS QM 433 - Chapter 1 Database Fundamentals Version 10 th Ed Prepared by Dr Kamel Rouibah / Dept QM & IS www.cba.edu.kw/krouibah Dr K. Rouibah / dept QM & IS Chapter 1 (433) Database fundamentals 1 Objectives

More information

An Overview of Data Warehousing and OLAP Technology

An Overview of Data Warehousing and OLAP Technology An Overview of Data Warehousing and OLAP Technology CMPT 843 Karanjit Singh Tiwana 1 Intro and Architecture 2 What is Data Warehouse? Subject-oriented, integrated, time varying, non-volatile collection

More information

Dta Mining and Data Warehousing

Dta Mining and Data Warehousing CSCI6405 Fall 2003 Dta Mining and Data Warehousing Instructor: Qigang Gao, Office: CS219, Tel:494-3356, Email: q.gao@dal.ca Teaching Assistant: Christopher Jordan, Email: cjordan@cs.dal.ca Office Hours:

More information

Application software office packets, databases and data warehouses.

Application software office packets, databases and data warehouses. Introduction to Computer Systems (9) Application software office packets, databases and data warehouses. Piotr Mielecki Ph. D. http://www.wssk.wroc.pl/~mielecki piotr.mielecki@pwr.edu.pl pmielecki@gmail.com

More information

Essentials of Database Management

Essentials of Database Management Essentials of Database Management Jeffrey A. Hoffer University of Dayton Heikki Topi Bentley University V. Ramesh Indiana University PEARSON Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

Call: SAS BI Course Content:35-40hours

Call: SAS BI Course Content:35-40hours SAS BI Course Content:35-40hours Course Outline SAS Data Integration Studio 4.2 Introduction * to SAS DIS Studio Features of SAS DIS Studio Tasks performed by SAS DIS Studio Navigation to SAS DIS Studio

More information

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Fig 1.2: Relationship between DW, ODS and OLTP Systems 1.4 DATA WAREHOUSES Data warehousing is a process for assembling and managing data from various sources for the purpose of gaining a single detailed view of an enterprise. Although there are several definitions

More information

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 07 : 06/11/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter

More information

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures) CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 05 Data Modeling Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Data Modeling

More information

IT1105 Information Systems and Technology. BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing. Student Manual

IT1105 Information Systems and Technology. BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing. Student Manual IT1105 Information Systems and Technology BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing Student Manual Lesson 3: Organizing Data and Information (6 Hrs) Instructional Objectives Students

More information

Data Warehousing Fundamentals by Mark Peco

Data Warehousing Fundamentals by Mark Peco Data Warehousing Fundamentals by Mark Peco All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks of their

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

ten mistakes to avoid In Dimensional Modeling

ten mistakes to avoid In Dimensional Modeling EXCLUSIVELY FOR TDWI PREMIUM MEMBERS Fourth Quarter 2011 ten mistakes to avoid In Dimensional Modeling By Christopher Adamson 1 7 4 8 tdwi.org 9 2 3 5 10 6 ten mistakes to avoid In Dimensional Modeling

More information

Data Mining Concepts & Techniques

Data Mining Concepts & Techniques Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

Course Number : SEWI ZG514 Course Title : Data Warehousing Type of Exam : Open Book Weightage : 60 % Duration : 180 Minutes

Course Number : SEWI ZG514 Course Title : Data Warehousing Type of Exam : Open Book Weightage : 60 % Duration : 180 Minutes Birla Institute of Technology & Science, Pilani Work Integrated Learning Programmes Division M.S. Systems Engineering at Wipro Info Tech (WIMS) First Semester 2014-2015 (October 2014 to March 2015) Comprehensive

More information

Chapter 1: The Database Environment

Chapter 1: The Database Environment Chapter 1: The Database Environment Modern Database Management 6 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Fred R. McFadden Prentice Hall, 2002 1 Definitions Data: Meaningful facts, text, graphics,

More information

Account Payables Dimension and Fact Job Aid

Account Payables Dimension and Fact Job Aid Contents Introduction... 2 Financials AP Overview Subject Area:... 10 Financials AP Holds Subject Area:... 13 Financials AP Voucher Accounting Subject Area:... 15 Financials AP Voucher Line Distrib Details

More information

Star Schema Design (Additonal Material; Partly Covered in Chapter 8) Class 04: Star Schema Design 1

Star Schema Design (Additonal Material; Partly Covered in Chapter 8) Class 04: Star Schema Design 1 Star Schema Design (Additonal Material; Partly Covered in Chapter 8) Class 04: Star Schema Design 1 Star Schema Overview Star Schema: A simple database architecture used extensively in analytical applications,

More information

The strategic advantage of OLAP and multidimensional analysis

The strategic advantage of OLAP and multidimensional analysis IBM Software Business Analytics Cognos Enterprise The strategic advantage of OLAP and multidimensional analysis 2 The strategic advantage of OLAP and multidimensional analysis Overview Online analytical

More information

Advanced Data Management Technologies Written Exam

Advanced Data Management Technologies Written Exam Advanced Data Management Technologies Written Exam 02.02.2016 First name Student number Last name Signature Instructions for Students Write your name, student number, and signature on the exam sheet. This

More information

Government of Ontario IT Standard (GO ITS)

Government of Ontario IT Standard (GO ITS) Government of Ontario IT Standard (GO ITS) GO-ITS Number 56.3 Information Modeling Standard Version # : 1.5 Status: Approved Prepared under the delegated authority of the Management Board of Cabinet Queen's

More information

Data about data is database Select correct option: True False Partially True None of the Above

Data about data is database Select correct option: True False Partially True None of the Above Within a table, each primary key value. is a minimal super key is always the first field in each table must be numeric must be unique Foreign Key is A field in a table that matches a key field in another

More information

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts Enn Õunapuu enn.ounapuu@ttu.ee Content Oveall approach Dimensional model Tabular model Overall approach Data modeling is a discipline that has been practiced

More information

CHAPTER 6 DATABASE MANAGEMENT SYSTEMS

CHAPTER 6 DATABASE MANAGEMENT SYSTEMS CHAPTER 6 DATABASE MANAGEMENT SYSTEMS Management Information Systems, 10 th edition, By Raymond McLeod, Jr. and George P. Schell 2007, Prentice Hall, Inc. 1 Learning Objectives Understand the hierarchy

More information

DATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK)

DATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK) DATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK) Release 2.2 August 2013. This document was created in collaboration of the leading experts and educators in the field and members of the Certified Data Steward

More information

Evolution of Database Systems

Evolution of Database Systems Evolution of Database Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Intelligent Decision Support Systems Master studies, second

More information

Product Documentation SAP Business ByDesign August Analytics

Product Documentation SAP Business ByDesign August Analytics Product Documentation PUBLIC Analytics Table Of Contents 1 Analytics.... 5 2 Business Background... 6 2.1 Overview of Analytics... 6 2.2 Overview of Reports in SAP Business ByDesign... 12 2.3 Reports

More information

Government of Ontario IT Standard (GO ITS) GO-ITS Number 56.3 Information Modeling Standard

Government of Ontario IT Standard (GO ITS) GO-ITS Number 56.3 Information Modeling Standard Government of Ontario IT Standard (GO ITS) GO-ITS Number 56.3 Information Modeling Standard Version # : 1.6 Status: Approved Prepared under the delegated authority of the Management Board of Cabinet Queen's

More information

UNIVERSITY OF CINCINNATI

UNIVERSITY OF CINCINNATI UNIVERSITY OF CINCINNATI, 20 I,, hereby submit this as part of the requirements for the degree of: in: It is entitled: Approved by: Migrating an Operational Database Schema to Data Warehouse Schemas A

More information

Figure 1-1a Data in context. Context helps users understand data

Figure 1-1a Data in context. Context helps users understand data Chapter 1: The Database Environment Modern Database Management 9 th Edition Jeffrey A. Hoffer, Mary B. Prescott, Heikki Topi 2009 Pearson Education, Inc. Publishing as Prentice Hall 1 Definition of terms

More information

SAS. Information Map Studio 3.1: Creating Your First Information Map

SAS. Information Map Studio 3.1: Creating Your First Information Map SAS Information Map Studio 3.1: Creating Your First Information Map The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Information Map Studio 3.1: Creating Your

More information

A Multi-Dimensional Data Model

A Multi-Dimensional Data Model A Multi-Dimensional Data Model A Data Warehouse is based on a Multidimensional data model which views data in the form of a data cube A data cube, such as sales, allows data to be modeled and viewed in

More information

1Z0-526

1Z0-526 1Z0-526 Passing Score: 800 Time Limit: 4 min Exam A QUESTION 1 ABC's Database administrator has divided its region table into several tables so that the west region is in one table and all the other regions

More information

Management Information Systems

Management Information Systems Foundations of Business Intelligence: Databases and Information Management Lecturer: Richard Boateng, PhD. Lecturer in Information Systems, University of Ghana Business School Executive Director, PearlRichards

More information

Computers Are Your Future

Computers Are Your Future Computers Are Your Future Twelfth Edition Chapter 12: Databases and Information Systems Copyright 2012 Pearson Education, Inc. Publishing as Prentice Hall 1 Databases and Information Systems Copyright

More information

1/12/2018. APPA Institute Dallas, TX Feb DATA INTEGRATION PURPOSE OF TODAY S PRESENTATION

1/12/2018. APPA Institute Dallas, TX Feb DATA INTEGRATION PURPOSE OF TODAY S PRESENTATION DATA INTEGRATION APPA Institute for Facilities Management January 23, 2018 Portland, OR PURPOSE OF TODAY S PRESENTATION To provide a broad understanding of: Data as a utility How various units of Facilities

More information

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT MANAGING THE DIGITAL FIRM, 12 TH EDITION Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT VIDEO CASES Case 1: Maruti Suzuki Business Intelligence and Enterprise Databases

More information

MaintScape Training Course Table of Contents

MaintScape Training Course Table of Contents MaintScape Training Course Table of Contents Table of Contents... 1 Training Course Requirements... 3 Overview and Main Modules... 3 Search Window... 4 Reports are produced from the Search Window... 6

More information

Decision Support Systems aka Analytical Systems

Decision Support Systems aka Analytical Systems Decision Support Systems aka Analytical Systems Decision Support Systems Systems that are used to transform data into information, to manage the organization: OLAP vs OLTP OLTP vs OLAP Transactions Analysis

More information

Question Bank. 4) It is the source of information later delivered to data marts.

Question Bank. 4) It is the source of information later delivered to data marts. Question Bank Year: 2016-2017 Subject Dept: CS Semester: First Subject Name: Data Mining. Q1) What is data warehouse? ANS. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile

More information

NOTES ON OBJECT-ORIENTED MODELING AND DESIGN

NOTES ON OBJECT-ORIENTED MODELING AND DESIGN NOTES ON OBJECT-ORIENTED MODELING AND DESIGN Stephen W. Clyde Brigham Young University Provo, UT 86402 Abstract: A review of the Object Modeling Technique (OMT) is presented. OMT is an object-oriented

More information

Index. Symbols = (equal) operator, 87

Index. Symbols = (equal) operator, 87 riordan.book Page 343 Thursday, December 16, 2004 2:23 PM Index Symbols = (equal) operator, 87 A abstract entities, 14 abstract relations, 51 accelerator keys, 321 322 Access (application), 7 access keys,

More information

Factors in the Design and Development of a Data Warehouse for Academic Data

Factors in the Design and Development of a Data Warehouse for Academic Data Factors in the Design and Development of a Data Warehouse for Academic Data A thesis presented to the faculty of the Department of Computer Science East Tennessee State University In partial fulfillment

More information

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) INTRODUCTION A dimension is an attribute within a multidimensional model consisting of a list of values (called members). A fact is defined by a combination

More information

The DBMS accepts requests for data from the application program and instructs the operating system to transfer the appropriate data.

The DBMS accepts requests for data from the application program and instructs the operating system to transfer the appropriate data. Managing Data Data storage tool must provide the following features: Data definition (data structuring) Data entry (to add new data) Data editing (to change existing data) Querying (a means of extracting

More information

Managing Data Resources

Managing Data Resources Chapter 7 OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Managing Data Resources Describe how a database management system

More information

SAS Data Integration Studio 3.3. User s Guide

SAS Data Integration Studio 3.3. User s Guide SAS Data Integration Studio 3.3 User s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Data Integration Studio 3.3: User s Guide. Cary, NC: SAS Institute

More information

Vertica Knowledge Base Article. Vertica QuickStart for Tableau

Vertica Knowledge Base Article. Vertica QuickStart for Tableau Vertica Knowledge Base Article Vertica QuickStart for Tableau Document Release Date: 6/14/2018 Legal Notices Warranty The only warranties for Micro Focus International plc products and services are set

More information

Microsoft SharePoint Server 2013 Plan, Configure & Manage

Microsoft SharePoint Server 2013 Plan, Configure & Manage Microsoft SharePoint Server 2013 Plan, Configure & Manage Course 20331-20332B 5 Days Instructor-led, Hands on Course Information This five day instructor-led course omits the overlap and redundancy that

More information

A Road Map for Advancing Your Career. Distinguish yourself professionally. Get an edge over the competition. Advance your career with CBIP.

A Road Map for Advancing Your Career. Distinguish yourself professionally. Get an edge over the competition. Advance your career with CBIP. TDWI Certification A Road Map for Advancing Your Career Distinguish yourself professionally. Get an edge over the competition. Advance your career with CBIP. www.tdwi.org/cbip TDWI s Certified Business

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining. About the Tutorial Data Mining is defined as the procedure of extracting information from huge sets of data. In other words, we can say that data mining is mining knowledge from data. The tutorial starts

More information

OBJECTIVES DEFINITIONS CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS. Figure 1-1a Data in context

OBJECTIVES DEFINITIONS CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS. Figure 1-1a Data in context OBJECTIVES CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS Modern Database Management 11 th Edition Jeffrey A. Hoffer, V. Ramesh, Heikki Topi! Define terms! Name limitations of conventional

More information

Oracle Field Sales/Laptop

Oracle Field Sales/Laptop Oracle Field Sales/Laptop Concepts and Procedures Release 11i May 2002 Part No. A95947-02 Oracle Field Sales/Laptop Concepts and Procedures, Release 11i Part No. A95947-02 Copyright 2002 Oracle Corporation.

More information

COGNOS (R) 8 GUIDELINES FOR MODELING METADATA FRAMEWORK MANAGER. Cognos(R) 8 Business Intelligence Readme Guidelines for Modeling Metadata

COGNOS (R) 8 GUIDELINES FOR MODELING METADATA FRAMEWORK MANAGER. Cognos(R) 8 Business Intelligence Readme Guidelines for Modeling Metadata COGNOS (R) 8 FRAMEWORK MANAGER GUIDELINES FOR MODELING METADATA Cognos(R) 8 Business Intelligence Readme Guidelines for Modeling Metadata GUIDELINES FOR MODELING METADATA THE NEXT LEVEL OF PERFORMANCE

More information

The COSMIC Functional Size Measurement Method Version 4.0.1

The COSMIC Functional Size Measurement Method Version 4.0.1 The COSMIC Functional Size Measurement Method Version 4.0.1 Guideline for sizing Data Warehouse Application Software Version 1.1 April 2015 Acknowledgements Reviewers of v1.1 (alphabetical order) Diana

More information

Employer Reporting. User Guide

Employer Reporting. User Guide Employer Reporting This user guide provides an overview of features supported by the Employer Reporting application and instructions for viewing, customizing, and exporting reports. System Requirements

More information

DATA MINING TRANSACTION

DATA MINING TRANSACTION DATA MINING Data Mining is the process of extracting patterns from data. Data mining is seen as an increasingly important tool by modern business to transform data into an informational advantage. It is

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE

More information

by Prentice Hall

by Prentice Hall Chapter 6 Foundations of Business Intelligence: Databases and Information Management 6.1 2010 by Prentice Hall Organizing Data in a Traditional File Environment File organization concepts Computer system

More information

OLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube

OLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube OLAP2 outline Multi Dimensional Data Model Need for Multi Dimensional Analysis OLAP Operators Data Cube Demonstration Using SQL Multi Dimensional Data Model Multi dimensional analysis is a popular approach

More information

Department of Industrial Engineering. Sharif University of Technology. Operational and enterprises systems. Exciting directions in systems

Department of Industrial Engineering. Sharif University of Technology. Operational and enterprises systems. Exciting directions in systems Department of Industrial Engineering Sharif University of Technology Session# 9 Contents: The role of managers in Information Technology (IT) Organizational Issues Information Technology Operational and

More information

SOLUTIONS TO REVIEW QUESTIONS AND EXERCISES FOR PART 3 - DATABASE ANALYSIS AND DESIGN (CHAPTERS 10 15)

SOLUTIONS TO REVIEW QUESTIONS AND EXERCISES FOR PART 3 - DATABASE ANALYSIS AND DESIGN (CHAPTERS 10 15) Instant download and all chapters Solutions Manual Database Systems A Practical Approach to Design, Implementation, and Management 6th Edition Thomas Connolly https://testbankdata.com/download/solutions-manual-database-systems-practicalapproach-design-implementation-management-6th-edition-thomas-connolly/

More information

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS PART A 1. What are production reporting tools? Give examples. (May/June 2013) Production reporting tools will let companies generate regular operational reports or support high-volume batch jobs. Such

More information

Managing Data Resources

Managing Data Resources Chapter 7 Managing Data Resources 7.1 2006 by Prentice Hall OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Describe how

More information

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database.

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database. 1. Creating a data warehouse involves using the functionalities of database management software to implement the data warehouse model as a collection of physically created and mutually connected database

More information

Search for a Target Company

Search for a Target Company Search for a Target Company Search for a company in the Quick Search section by typing the desired company name. Use an asterisk if you are unsure of the exact spelling; i.e.: Reeb* for Reebok. The next

More information

FAQ: Relational Databases in Accounting Systems

FAQ: Relational Databases in Accounting Systems Question 1: What is the definition of a schema as it relates to a database? What are the three levels? Answer 1: A schema describes the logical structure of a database. The three levels of schemas are

More information

Guide Users along Information Pathways and Surf through the Data

Guide Users along Information Pathways and Surf through the Data Guide Users along Information Pathways and Surf through the Data Stephen Overton, Overton Technologies, LLC, Raleigh, NC ABSTRACT Business information can be consumed many ways using the SAS Enterprise

More information

Sql Fact Constellation Schema In Data Warehouse With Example

Sql Fact Constellation Schema In Data Warehouse With Example Sql Fact Constellation Schema In Data Warehouse With Example Data Warehouse OLAP - Learn Data Warehouse in simple and easy steps using Multidimensional OLAP (MOLAP), Hybrid OLAP (HOLAP), Specialized SQL

More information

Oracle Financial Analyzer Oracle General Ledger

Oracle Financial Analyzer Oracle General Ledger Oracle Financial Analyzer Oracle General Ledger Integrating Oracle Financial Analyzer with Oracle General Ledger Release 11i October 2000 Part No. A86564-01 Integrating Oracle Financial Analyzer with Oracle

More information