DATA WAREHOUSE 03 COMMON DWH ARCHITECTURES ANDREAS BUCKENHOFER, DAIMLER TSS
|
|
- Audra Ray
- 5 years ago
- Views:
Transcription
1 A company of Daimler AG DATA WAREHOUSE 03 COMMON DWH ARCHITECTURES ANDREAS BUCKENHOFER, DAIMLER TSS
2 ABOUT ME Andreas Buckenhofer Senior DB Professional Since 2009 at Daimler TSS Department: Big Data Business Unit: Analytics
3 ANDREAS BUCKENHOFER, DAIMLER TSS GMBH Forming good abstractions and avoiding complexity is an essential part of a successful data architecture Data has always been my main focus during my long-time occupation in the area of data integration. I work for Daimler TSS as Database Professional and Data Architect with over 20 years of experience in Data Warehouse projects. I am working with Hadoop and NoSQL since I keep my knowledge up-to-date - and I learn new things, experiment, and program every day. I share my knowledge in internal presentations or as a speaker at international conferences. I'm regularly giving a full lecture on Data Warehousing and a seminar on modern data architectures at Baden-Wuerttemberg Cooperative State University DHBW. I also gained international experience through a two-year project in Greater London and several business trips to Asia. I m responsible for In-Memory DB Computing at the independent German Oracle User Group (DOAG) and was honored by Oracle as ACE Associate. I hold current certifications such as "Certified Data Vault 2.0 Practitioner (CDVP2)", "Big Data Architect, Oracle Database 12c Administrator Certified Professional, IBM InfoSphere Change Data Capture Technical Professional, etc. Contact/Connect Daimler TSS Data Warehouse / DHBW 3 DOAG DHBW xing
4 NOT JUST AVERAGE: OUTSTANDING. As a 100% Daimler subsidiary, we give 100 percent, always and never less. We love IT and pull out all the stops to aid Daimler's development with our expertise on its journey into the future. Our objective: We make Daimler the most innovative and digital mobility company. Daimler TSS
5 INTERNAL IT PARTNER FOR DAIMLER + Holistic solutions according to the Daimler guidelines + IT strategy + Security + Architecture + Developing and securing know-how + TSS is a partner who can be trusted with sensitive data As subsidiary: maximum added value for Daimler + Market closeness + Independence + Flexibility (short decision making process, ability to react quickly) Daimler TSS 5
6 LOCATIONS Daimler TSS Germany 7 locations 1000 employees* Ulm (Headquarters) Stuttgart Berlin Karlsruhe * as of August 2017 Daimler TSS India Hub Bangalore 22 employees Daimler TSS China Hub Beijing 10 employees Daimler TSS Malaysia Hub Kuala Lumpur 42 employees Daimler TSS Data Warehouse / DHBW 6
7 DWH LECTURE - LEARNING TARGETS Describe different DWH architectures Explain DWH data modeling methods and design logical models Name DB techniques that are well-suited for DWHs Explain ETL processes Specify reporting & project management & meta data requirements Name current DWH trends Daimler TSS Data Warehouse / DHBW 7
8 EXERCISE: CLASSICAL DWH ARCHITECTURES The article compares THE two classic DWH architectures. Read the paper and complete the table / questions on the next slide. (Caution: The paper is biased / favors one approach; you may want to read other/more papers for a neutral view.) Daimler TSS Data Warehouse / DHBW 8
9 EXERCISE: CLASSICAL DWH ARCHITECTURES How are the approaches called? Who invented the approach? How many layers are used and how are the layers called? Which data modeling approaches are used in which layer? In which layer are atomic detail data stored? In which layer are aggregated / summary data stored? List at least 2 advantages List at least 2 disadvantages Daimler TSS Data Warehouse / DHBW 9
10 EXERCISE: CLASSICAL DWH ARCHITECTURES How are the approaches called? Who invented the approach? How many layers are used and how are the layers called? Which data modeling approaches are used in which layer? In which layer are atomic detail data stored? In which layer are aggregated / summary data stored? Kimball Bus Architecture Ralph Kimball Bill Inmon Data Staging Dimensional Data Warehouse Data Staging: variable, corresponds to source system Dimensional Data Warehouse: Dimensional Model Corporate Information Factory Data Acquisition Normalized Data Warehouse Data Delivery / Dimensional Mart Data Acquisition: variable, corresponds to source system Normalized Data Warehouse: 3NF Data Delivery: Dimensional Model Dimensional Data Warehouse Normalized Data Warehouse Dimensional Data Warehouse Data Delivery / Dimensional Mart Daimler TSS Data Warehouse / DHBW 10
11 EXERCISE: CLASSICAL DWH ARCHITECTURES Kimball Bus Architecture Advantages Two layers only mean faster development and less work Rather simple approach to make data fast and easily accessible Lower startup costs (but higher subsequent development costs) Corporate Information Factory Separation of concerns: long-term enterprise data storage separated from data presentation Changes in requirements and scope are easier to manage Lower subsequent development costs (but higher startup costs) Disadvantages If table structures change (instable source systems), high effort to implement the changes and reload data, especially conformed dimensions ( Dimensionitis desease) Non-metric data not optimal for dimensional model Dimensional model (esp. Star Schema) contains data redundancy Data model transformations from 3NF to Dimensional model required More complex as two different data models are required Larger team(s) of specialists required Daimler TSS Data Warehouse / DHBW 11
12 OTHER ARCHITECTURES Kimball Bus Architecture (Central data warehouse based on data marts) Inmon Corporate Information Factory Data Vault 2.0 Architecture (Dan Linstedt) DW 2.0: The Architecture for the Next Generation of Data Warehousing Virtual Data Warehouse Operational Data Store (ODS) Daimler TSS Data Warehouse / DHBW 12
13 KIMBALL BUS ARCHITECTURE (CENTRAL DATA WAREHOUSE BASED ON DATA MARTS) Source: Daimler TSS Data Warehouse / DHBW 13
14 KIMBALL BUS ARCHITECTURE (CENTRAL DATA WAREHOUSE BASED ON DATA MARTS) Internal data sources Data Warehouse OLTP OLTP External data sources Staging Layer (Input Layer) Backend Core Warehouse Layer = Mart Layer Data Mart 1 Data Mart 2 Data Mart 3 More Businessprocess oriented than subjectoriented, integrated, time-variant, non-volatile Frontend Metadata Management Security DWH Manager incl. Monitor Daimler TSS Data Warehouse / DHBW 14
15 KIMBALL BUS ARCHITECTURE (CENTRAL DATA WAREHOUSE BASED ON DATA MARTS) Bottom-up approach Dimensional model with denormalized data Sum of the data marts constitute the Enterprise DWH Enterprise Service Bus / conformed dimensions for integration purposes (don t confuse with ESB as middleware/communication system between applications) Kimball describes that agreeing on conformed dimensions is a hard job and it s expected that the team will get stuck from time to time trying to align the incompatible original vocabularies of different groups Data marts need to be redesigned if incompatibilities exist Daimler TSS Data Warehouse / DHBW 15
16 Core Warehouse Layer DATA INTEGRATION WITH AND WITHOUT CORE WAREHOUSE LAYER Daimler TSS Data Warehouse / DHBW 16
17 INMON CORPORATE INFORMATION FACTORY Source: Daimler TSS Data Warehouse / DHBW 17
18 INMON CORPORATE INFORMATION FACTORY Internal data sources Data Warehouse Backend Frontend OLTP OLTP Staging Layer (Input Layer) Core Warehouse Layer (Storage Layer) subjectoriented, integrated, timevariant, nonvolatile Mart Layer (Output Layer) (Reporting Layer) External data sources Metadata Management Security DWH Manager incl. Monitor Daimler TSS Data Warehouse / DHBW 18
19 INMON CORPORATE INFORMATION FACTORY Top-down approach (Normalized) Core Warehouse is essential for subject-oriented, integrated, time-variant and nonvolatile data storage Create (departmental) Data Marts as subsets of Core Enterprise DWH as needed Data Marts can be designed with Dimensional model The logical standard architecture is more general compared to CIF, but was mainly influenced by CIF Daimler TSS Data Warehouse / DHBW 19
20 DATA VAULT 2.0 ARCHITECTURE TODAY S WORLD (DAN LINSTEDT) Daimler TSS Data Warehouse / DHBW
21 DATA VAULT 2.0 ARCHITECTURE (DAN LINSTEDT) Michael Olschimke, Dan Linstedt: Building a Scalable Data Warehouse with Data Vault 2.0, Morgan Kaufmann, 2015, Chapter 2.2 Daimler TSS Data Warehouse / DHBW 21
22 DATA VAULT 2.0 ARCHITECTURE (DAN LINSTEDT) Internal data sources Data Warehouse Backend Frontend OLTP OLTP External data sources Staging Layer (Input Layer) Hard Rules only Raw Data Vault Metadata Management Security DWH Manager incl. Monitor Business Data Vault Soft Rules Mart Layer (Output Layer) (Reporting Layer) Daimler TSS Data Warehouse / DHBW 22
23 DATA VAULT 2.0 ARCHITECTURE (DAN LINSTEDT) Core Warehouse Layer is modeled with Data Vault and integrates data by BK (business key) only (Data Vault modeling is a separate lecture) Business rules (Soft Rules) are applied from Raw Data Vault Layer to Mart Layer and not earlier Alternatively from Raw Data Vault to additional layer called Business Data Vault Hard Rules don t change data Data is fully auditable Real-time capable architecture Architecture got very popular recently; also applicable to BigData, NoSQL Daimler TSS Data Warehouse / DHBW 23
24 DATA VAULT 2.0 ARCHITECTURE (DAN LINSTEDT) In the classical DWHs, the Core Warehouse Layer is regarded as single version of the truth Integrates + cleanses data from different sources and eliminates contradiction Produces consistent results/reports across Data Marts But: cleansing is (still) objective, Enterprises change regularly, paradigm does not scale as more and more systems exist Data in Raw Data Vault Layer is regarded as Single version of the facts 100% of data is loaded 100% of time Data is not cleansed and bad data is not removed in the Core Layer (Raw Vault) Daimler TSS Data Warehouse / DHBW 24
25 DATA VAULT 2.0 ARCHITECTURE (DAN LINSTEDT) Data Vault is optimized for the following requirements: Flexibility Agility Data historization Data integration Auditability Bill Inmon wrote in 2008: Data Vault is the optimal approach for modeling the EDW in the DW2.0 framework. (DW2.0) Daimler TSS Data Warehouse / DHBW 25
26 Data Lifecycle DW 2.0: THE ARCHITECTURE FOR THE NEXT GENERATION OF DATA WAREHOUSING Operational application data model Integrated corporate data model Integrated corporate data model Archival data model Source: W.H. Inmon, Dan Linstedt: Data Architecture: A Primer for the Data Scientist, Morgan Kaufmann, 2014, chapter 3.1 Daimler TSS Data Warehouse / DHBW 26
27 DW 2.0: THE ARCHITECTURE FOR THE NEXT GENERATION OF DATA WAREHOUSING Main characteristics: Structured and unstructured data, not just metrics Life Cycle of data with different storage areas Hot data: High speed, expensive storage (RAM, SSDs) for most recent data Cold data: Low speed, inexpensive storage (e.g. hard disks) for old data; archival data model with high compression Metadata is an integral part of the DWH and not an afterthought Daimler TSS Data Warehouse / DHBW 27
28 VIRTUAL DATA WAREHOUSE Internal data sources Data Warehouse Backend Frontend OLTP OLTP External data sources Query Management Weakly+partly subject-oriented, Weakly+partly integrated, Not time-variant, Not non-volatile Daimler TSS Data Warehouse / DHBW 28
29 VIRTUAL DATA WAREHOUSE Data not extracted from operational systems and stored separately Standardized interface for all operational data sources One "GUI" for all existing data Generates combined queries Query Processor joins query result data from different sources Can also access data in Hadoop (Polybase, Big SQL, BigData SQL, etc) Daimler TSS Data Warehouse / DHBW 29
30 VIRTUAL DATA WAREHOUSE Query Management manages metadata about all operational systems (physical) location of data and algorithms for extracting data from OLTP system Implementation easier Low cost: can use existing hardware infrastructure Queries cause significant performance problems in operational systems Known problems when analyzing operational data directly Same query is processed multiple times (if queried multiple times) Same query delivers different results when processed at different times Daimler TSS Data Warehouse / DHBW 30
31 OPERATIONAL DATA STORE (ODS) Internal data sources Data Warehouse Backend Frontend OLTP OLTP Staging Layer (Input Layer) Core Warehouse Layer (Storage Layer) subjectoriented, integrated, timevariant, nonvolatile Mart Layer (Output Layer) (Reporting Layer) External data sources Operational Data Store Metadata Management Security DWH Manager incl. Monitor Daimler TSS Data Warehouse / DHBW 31
32 OPERATIONAL DATA STORE (ODS) ODS: Real-time/Right-time layer Replication techniques used to transport data from source database to ODS layer with minimal impact on source system Data in the ODS has no history and is stored without any cleansing and without any integration (1:1 copy from single source) DWH performance not optimal as data model is suited for OLTP and not for reporting requirements ODS normally additionally to Staging / Core DWH / Mart Layer but can exist alone without other layers Daimler TSS Data Warehouse / DHBW 32
33 EXAMPLE DWH FOR STATE OF CONSTRUCTION DOCU Daimler TSS Data Warehouse / DHBW 33
34 ARCHITECTURE FROM AN ACTUAL PROJECT IN THE AUTOMOTIVE INDUSTRY ETL Engine Datastore Source IIDR ReplEngine Source OLTP DB Logs Backend DWH DB Datastore DWH IIDR ReplEngine DWH Mart Layer Raw + Business Data Vault Staging Layer Standard Reports Frontend AdHoc Reports TSM Datastore Mirror IIDR ReplEngine Mirror Mirror DB (Operational Data Store) Daimler TSS Data Warehouse / DHBW 34
35 END USER SAMPLE QUESTIONS Which vehicles or aggregates are documented incompletely? (Data quality) Which vehicles / which control units require SW updates? Which interiors are most common by region? Supply data for external simulations, customs clearance, spare part planning, etc. Daimler TSS Data Warehouse / DHBW 35
36 EXERCISE: RECOMMEND AN ARCHITECTURE Review the presented data warehouse architectures. Which architecture would you recommend for A holding of 3 telecommunication companies An online store with real/right-time data integration needs Marketing department of a bank List advantages and drawbacks of your proposal. Daimler TSS Data Warehouse / DHBW 36
37 EXERCISE: RECOMMEND AN ARCHITECTURE A holding of 3 telecommunication companies Architecture: Virtual Data Warehouse + Companies may not want to provide their data to a new storage + Can easily be extended if new companies join the holding or reduced if a company leaves the holding - Bad performance - Not really data integration achieved, low Data Quality - Firewalls have to be opened Daimler TSS Data Warehouse / DHBW 37
38 EXERCISE: RECOMMEND AN ARCHITECTURE An online store with real-time/right-time data integration needs Architecture: Data Vault Integration of many internal and external source systems (e.g. integrate social media data about the online store) + Fast data delivery in Raw Vault Layer (Real-time/Right-time data integration). Complex data cleansing / transformation / soft rules are delayed downstream towards Mart Layer - Transformation overhead (Source system data model to Data Vault data model to Dimensional data model) Daimler TSS Data Warehouse / DHBW 38
39 EXERCISE: RECOMMEND AN ARCHITECTURE Marketing department of a bank Architecture: Kimball Bus architecture + Start small for a department. If other departments are interested, new data and new Marts can be added on demand - High risk to loose the Enterprise view and several DWHs are built That s still quite a common scenario nowadays. A single Enterprise DWH is often not achieved (e.g. Mergers & Acquisitions, inflexibility due to a single centralized DWH, rapidly changing conditions, etc.) and therefore very often several DWHs with different architectures exist in parallel within a company. Daimler TSS Data Warehouse / DHBW 39
40 EXERCISE - INTRODUCTION AND DWH ARCHITECTURE GROUP TASK Now imagine that you prepare an exam. Identify 1-3 questions about DWH architecture (and/or DWH introduction) that you would ask in an exam. Write down the questions on stick-it cards. Daimler TSS Data Warehouse / DHBW 40
41 SUMMARY Which layers does the logical standard architecture have? Staging (Input), Integration (Cleansing), Core Warehouse (Storage), Aggregation, Mart (Reporting, Output) and additionally Metadata, Security, DWH Manager, Monitor Which other architectures exist? Kimball Bus Architecture (Central data warehouse based on data marts) Inmon Corporate Information Factory Data Vault 2.0 Architecture (Dan Linstedt) DW 2.0: The Architecture for the Next Generation of Data Warehousing Virtual Data Warehouse Operational Data Store (ODS) Daimler TSS Data Warehouse / DHBW 41
42 THANK YOU Daimler TSS GmbH Wilhelm-Runge-Straße 11, Ulm / Telefon / Fax tss@daimler.com / Internet: Domicile and Court of Registry: Ulm / HRB-Nr.: 3844 / Management: Christoph Röger (CEO), Steffen Bäuerle Daimler TSS Data Warehouse / DHBW 42
DATA WAREHOUSE PART LX: PROJECT MANAGEMENT ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE PART LX: PROJECT MANAGEMENT ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com Since
More informationDATA WAREHOUSE PART XXI: DB SPECIFICS ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE PART XXI: DB SPECIFICS ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com Since 2009
More informationDATA WAREHOUSE CASE STUDY ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE CASE STUDY ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com Since 2009 at Daimler
More informationDATA PRODUCTS FROM POC INTO PRODUCTION
Ein Unternehmen der Daimler AG DATA PRODUCTS FROM POC INTO PRODUCTION Andreas Buckenhofer, DOAG K&A, Nuremberg 2018 ANDREAS BUCKENHOFER, DAIMLER TSS GMBH Forming good abstractions and avoiding complexity
More informationDWH REFACTORING WITH DATA VAULT ANDREAS BUCKENHOFER DOAG K&A 2017, NUREMBERG
A company of Daimler AG DWH REFACTORING WITH DATA VAULT ANDREAS BUCKENHOFER DOAG K&A 2017, NUREMBERG ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com Since 2009 at Daimler
More informationDATA WAREHOUSE 01 DWH INTRODUCTION AND DEFINITION ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE 01 DWH INTRODUCTION AND DEFINITION ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com
More informationIS THE DATA CATALOG A METADATA MANAGEMENT RELOADED?
Ein Unternehmen der Daimler AG IS THE DATA CATALOG A METADATA MANAGEMENT RELOADED? Andreas Buckenhofer, DOAG Big Data Days, Dresden 2018 ANDREAS BUCKENHOFER, DAIMLER TSS GMBH Forming good abstractions
More informationDATA WAREHOUSE PART IV: FRONTEND, METADATA, PROJECTS, ADVANCED TOPICS ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE PART IV: FRONTEND, METADATA, PROJECTS, ADVANCED TOPICS ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com
More informationData Vault Brisbane User Group
Data Vault Brisbane User Group 26-02-2013 Agenda Introductions A brief introduction to Data Vault Creating a Data Vault based Data Warehouse Comparisons with 3NF/Kimball When is it good for you? Examples
More informationBusiness Intelligence and Decision Support Systems
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing Learning Objectives Understand the basic definitions and concepts of data warehouses Learn different
More informationNext Generation DWH Modeling. An overview of DWH modeling methods
Next Generation DWH Modeling An overview of DWH modeling methods Ronald Kunenborg www.grundsatzlich-it.nl Topics Where do we stand today Data storage and modeling through the ages Current data warehouse
More informationKent Graziano
Agile Data Warehouse Modeling: Introduction to Data Vault Modeling Kent Graziano Twitter @KentGraziano Agenda Bio What is a Data Vault? Where does it fit in an DW/BI architecture? How to design a Data
More informationDecision Guidance. Data Vault in Data Warehousing
Decision Guidance Data Vault in Data Warehousing DATA VAULT IN DATA WAREHOUSING Today s business environment requires data models, which are resilient to change and enable the integration of multiple data
More informationDATA WAREHOUSE PART III: ETL AND DB SPECIFICS ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE PART III: ETL AND DB SPECIFICS ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com
More informationDATA VAULT CDVDM. Certified Data Vault Data Modeler Course. Sydney Australia December In cooperation with GENESEE ACADEMY, LLC
DATA VAULT CDVDM Certified Data Vault Data Modeler Course Sydney Australia December 3-5 2012 In cooperation with GENESEE ACADEMY, LLC Course Description and Outline DATA VAULT CDVDM Certified Data Vault
More informationFull file at
Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits
More informationARE DATA LAKES THE NEW CORE DWHS? ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG ARE DATA LAKES THE NEW CORE DWHS? ANDREAS BUCKENHOFER, DAIMLER TSS DOAG BIG DATA, REPORTING, GEODATA DAYS - KASSEL 2017 ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com
More informationFig 1.2: Relationship between DW, ODS and OLTP Systems
1.4 DATA WAREHOUSES Data warehousing is a process for assembling and managing data from various sources for the purpose of gaining a single detailed view of an enterprise. Although there are several definitions
More informationData Warehouses Chapter 12. Class 10: Data Warehouses 1
Data Warehouses Chapter 12 Class 10: Data Warehouses 1 OLTP vs OLAP Operational Database: a database designed to support the day today transactions of an organization Data Warehouse: historical data is
More informationSTRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa
STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS LECTURE: 05 (A) DATA WAREHOUSING (DW) By: Dr. Tendani J. Lavhengwa lavhengwatj@tut.ac.za 1 My personal quote:
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationData warehouse architecture consists of the following interconnected layers:
Architecture, in the Data warehousing world, is the concept and design of the data base and technologies that are used to load the data. A good architecture will enable scalability, high performance and
More informationDATA VAULT MODELING GUIDE
DATA VAULT MODELING GUIDE Introductory Guide to Data Vault Modeling GENESEE ACADEMY, LLC 2012 Authored by: Hans Hultgren DATA VAULT MODELING GUIDE Introductory Guide to Data Vault Modeling Forward Data
More informationturning data into dollars
turning data into dollars Tom s Ten Data Tips December 2008 ETL ETL stands for Extract, Transform, Load. This process merges and integrates information from source systems in the data warehouse (DWH).
More informationModeling Pattern Awareness
Modeling Pattern Awareness Modeling Pattern Awareness 2014 Authored by: Hans Hultgren Modeling Pattern Awareness The importance of knowing your pattern Forward Over the past decade Ensemble Modeling has
More informationHow Insurers are Realising the Promise of Big Data
How Insurers are Realising the Promise of Big Data Jason Hunter, CTO Asia-Pacific, MarkLogic A Big Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies
More informationA Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective
A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India
More informationData Warehouse and Mining
Data Warehouse and Mining 1. is a subject-oriented, integrated, time-variant, nonvolatile collection of data in support of management decisions. A. Data Mining. B. Data Warehousing. C. Web Mining. D. Text
More informationPřehled novinek v SQL Server 2016
Přehled novinek v SQL Server 2016 Martin Rys, BI Competency Leader martin.rys@adastragrp.com https://www.linkedin.com/in/martinrys 20.4.2016 1 BI Competency development 2 Trends, modern data warehousing
More informationBusiness Intelligence An Overview. Zahra Mansoori
Business Intelligence An Overview Zahra Mansoori Contents 1. Preference 2. History 3. Inmon Model - Inmonities 4. Kimball Model - Kimballities 5. Inmon vs. Kimball 6. Reporting 7. BI Algorithms 8. Summary
More informationFrom Data Challenge to Data Opportunity
From Data Challenge to Data Opportunity Jason Hunter, CTO Asia-Pacific, MarkLogic A Big Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub
More informationData Mining Concepts & Techniques
Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro
More informationOverview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?
Introduction to Data Warehousing and Business Intelligence Overview Why Business Intelligence? Data analysis problems Data Warehouse (DW) introduction A tour of the coming DW lectures DW Applications Loosely
More information5 Fundamental Strategies for Building a Data-centered Data Center
5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse
More informationCombine Native SQL Flexibility with SAP HANA Platform Performance and Tools
SAP Technical Brief Data Warehousing SAP HANA Data Warehousing Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools A data warehouse for the modern age Data warehouses have been
More informationChapter 3: Data Warehousing
Solution Manual Business Intelligence and Analytics Systems for Decision Support 10th Edition Sharda Instant download and all chapters Solution Manual Business Intelligence and Analytics Systems for Decision
More informationBuilding a Data Strategy for a Digital World
Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service
More informationSimplifying your upgrade and consolidation to BW/4HANA. Pravin Gupta (Teklink International Inc.) Bhanu Gupta (Molex LLC)
Simplifying your upgrade and consolidation to BW/4HANA Pravin Gupta (Teklink International Inc.) Bhanu Gupta (Molex LLC) AGENDA What is BW/4HANA? Stepping stones to SAP BW/4HANA How to get your system
More informationDATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research
DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research Priyanshu Gupta ETL Software Developer United Health Group Abstract- In this paper, the author has focused on explaining Data Warehousing and
More informationInstant Data Warehousing with SAP data
Instant Data Warehousing with SAP data» Extracting your SAP data to any destination environment» Fast, simple, user-friendly» 8 different SAP interface technologies» Graphical user interface no previous
More informationData Warehousing Methods and its Applications
International Journal of Engineering Science Invention (IJESI) ISSN (Online): 2319 6734, ISSN (Print): 2319 6726 www.ijesi.org PP. 12-19 Data Warehousing Methods and its Applications 1 Dr. C. Suba 1 (Department
More informationIntroduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu
Introduction to DWML Christian Thomsen, Aalborg University Slides adapted from Torben Bach Pedersen and Man Lung Yiu Course Structure Business intelligence Extract knowledge from large amounts of data
More informationMastering Data Warehouse Design: Relational And Dimensional Techniques Read Free Books and Download ebooks
Mastering Data Warehouse Design: Relational And Dimensional Techniques Read Free Books and Download ebooks A cutting-edge response to Ralph Kimball's challenge to the data warehouse community that answers
More informationDrawing the Big Picture
Drawing the Big Picture Multi-Platform Data Architectures, Queries, and Analytics Philip Russom TDWI Research Director for Data Management August 26, 2015 Sponsor 2 Speakers Philip Russom TDWI Research
More informationChapter 6. Foundations of Business Intelligence: Databases and Information Management VIDEO CASES
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
More informationInformation Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 05(b) : 23/10/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter
More informationThe Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing
The Evolution of Data Warehousing Data Warehousing Concepts Since 1970s, organizations gained competitive advantage through systems that automate business processes to offer more efficient and cost-effective
More informationDHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI
DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI Department of Information Technology IT6702 Data Warehousing & Data Mining Anna University 2 & 16 Mark Questions & Answers Year / Semester: IV / VII Regulation:
More informationData Warehousing Introduction. Toon Calders
Data Warehousing Introduction Toon Calders toon.calders@ulb.ac.be Course Organization Lectures on Tuesday 14:00 and Friday 16:00 Check http://gehol.ulb.ac.be/ for room Most exercises in computer class
More informationCHAPTER 3 Implementation of Data warehouse in Data Mining
CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected
More informationIBM Data Replication for Big Data
IBM Data Replication for Big Data Highlights Stream changes in realtime in Hadoop or Kafka data lakes or hubs Provide agility to data in data warehouses and data lakes Achieve minimum impact on source
More informationSAP BW/4HANA the next generation Data Warehouse
SAP BW/4HANA the next generation Data Warehouse Lothar Henkes, VP Product Management SAP EDW (BW/HANA) July 25 th, 2017 Disclaimer This presentation is not subject to your license agreement or any other
More informationCopyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue
Datalynx Enterprise Data Management Solution Catalogue About Datalynx Vendor of the world s most versatile Enterprise Data Management software Licence our software to clients & partners Partner-based sales
More informationTechnology Note. Data Vault Modeling with ER/Studio Data Architect
Technology Note Data Vault Modeling with ER/Studio Data Architect Dr. Sultan Shiffa March 28, 2018 Data Vault Modeling with ER/Studio Data Architect Overview I have been asked multiple times if ER/Studio
More informationData Warehousing in the Age of In-Memory Computing and Real-Time Analytics. Erich Schneider, Daniel Rutschmann June 2014
Data Warehousing in the Age of In-Memory Computing and Real-Time Analytics Erich Schneider, Daniel Rutschmann June 2014 Disclaimer This presentation outlines our general product direction and should not
More informationBI/DWH Test specifics
BI/DWH Test specifics Jaroslav.Strharsky@s-itsolutions.at 26/05/2016 Page me => TestMoto: inadequate test scope definition? no problem problem cold be only bad test strategy more than 16 years in IT more
More informationPositioning of CML of SAP s LSA with NLS SAND as Archiving Technology
Positioning of CML of SAP s LSA with NLS SAND as Archiving Technology Applies to EDW, SAP BIW 3.5, SAP NetWeaver 7.0. For more information, visit the EDW homepage. Summary This document may help you in
More informationChapter 6 VIDEO CASES
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
More informationData Quality Architecture and Options
Data Quality Architecture and Options Nita Khare Alliances & Technology Team - Solution Architect nita.khare@tcs.com * IBM IM Champion 2013 * December 3, 2013 0 Agenda Pain Areas / Challenges of DQ Solution
More informationData Warehousing. Overview
Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake
More informationIntroductory Guide to Data Vault Modeling GENESEE ACADEMY, LLC
Introductory Guide to Data Vault Modeling GENESEE ACADEMY, LLC 2016 Authored by: Hans Hultgren Introductory Guide to Data Vault Modeling Forward Data Vault modeling is most compelling when applied to an
More informationProposal of a new Data Warehouse Architecture Reference Model
Proposal of a new Data Warehouse Architecture Reference Model Dariusz Dymek Wojciech Komnata Piotr Szwed AGH University of Science and Technology Department of Applied Computer Science e-mail: eidymek@kinga.cyf-kr.edu.pl,
More informationData Warehousing Concepts
Data Warehousing Concepts Data Warehousing Definition Basic Data Warehousing Architecture Transaction & Transactional Data OLTP / Operational System / Transactional System OLAP / Data Warehouse / Decision
More informationFrom Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019
From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways
More informationThe EDW: An Overview For Foothill-De Anza Community College District
The EDW: An Overview For Foothill-De Anza Community College District R. Joanne Keys, SunGard Higher Education October, 2009 1 The Agenda Objective: Set the stage for a successful implementation of the
More information1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar
1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1) What does the term 'Ad-hoc Analysis' mean? Choice 1 Business analysts use a subset of the data for analysis. Choice 2: Business analysts access the Data
More informationCS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)
CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm
More informationDC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting.
DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting April 14, 2009 Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie,
More informationDecision Support. applied data warehousing and business intelligence. Paul Boal Sisters of Mercy Health System April 5, 2010
Decision Support applied data warehousing and business intelligence. Paul Boal Sisters of Mercy Health System April 5, 2010 Opening Questions What is one concept that you think businesses have a difficult
More informationDATA MINING AND WAREHOUSING
DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making
More informationData Vault Modeling and its Evolution DECISION SCIENCES INSTITUTE. Conceptual Data Vault Modeling and its Opportunities for the Future
DECISION SCIENCES INSTITUTE Conceptual Data Vault Modeling and its Opportunities for the Future Aarthi Raman, Active Network, Dallas, TX, 75201 itz.aarthi@gmail.com Teuta Cata, Northern Kentucky University,
More informationAppliances and DW Architecture. John O Brien President and Executive Architect Zukeran Technologies 1
Appliances and DW Architecture John O Brien President and Executive Architect Zukeran Technologies 1 OBJECTIVES To define an appliance Understand critical components of a DW appliance Learn how DW appliances
More informationData Warehouse and Data Mining
Data Warehouse and Data Mining Lecture No. 03 Architecture of DW Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Basic
More informationManaging Data Resources
Chapter 7 OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Managing Data Resources Describe how a database management system
More informationVirtuoso Infotech Pvt. Ltd.
Virtuoso Infotech Pvt. Ltd. About Virtuoso Infotech Fastest growing IT firm; Offers the flexibility of a small firm and robustness of over 30 years experience collectively within the leadership team Technology
More informationOracle In-Memory & Data Warehouse: The Perfect Combination?
: The Perfect Combination? UKOUG Tech17, 6 December 2017 Dani Schnider, Trivadis AG @dani_schnider danischnider.wordpress.com BASEL BERN BRUGG DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR. GENEVA HAMBURG COPENHAGEN
More informationTop Five Reasons for Data Warehouse Modernization Philip Russom
Top Five Reasons for Data Warehouse Modernization Philip Russom TDWI Research Director for Data Management May 28, 2014 Sponsor Speakers Philip Russom TDWI Research Director, Data Management Steve Sarsfield
More informationAgent Based Architecture in Distributed Data Warehousing
International Journal of Scientific and Research Publications, Volume 2, Issue 5, May 2012 1 Agent Based Architecture in Distributed Data Warehousing Bindia, Jaspreet Kaur Sahiwal Department of Computer
More informationMichael Roedeske. Query performance monitoring and graphical analysis [EN]
Michael Roedeske Query performance monitoring and graphical analysis [EN] Michael Roedeske CEO and Technical Architect DBPLUS Germany c/o webtelligence IT consulting GmbH Michael graduated from the State
More informationHeisenberg and the uncertainty laws of BI. Zoltan Vago, Senior DWH Consultant 03-June-2015
Heisenberg and the uncertainty laws of BI Zoltan Vago, Senior DWH Consultant zoltan.vago@teradata.com 03-June-2015 The uncerainty principle The more precisely the position of some particle is determined,
More informationturning data into dollars
turning data into dollars Tom s Ten Data Tips November 2012 Data warehouse automation Data warehouse (DWH) automation is a relatively new and burgeoning field. Design patterns have emerged that enable
More informationCustomer SAP BW/4HANA. Salvador Gimeno 7 December SAP SE or an SAP affiliate company. All rights reserved. Customer
SAP BW/4HANA Customer Salvador Gimeno 7 December 2016 2016 SAP SE or an SAP affiliate company. All rights reserved. Customer 1 DISCLAIMER This presentation is not subject to your license agreement or any
More informationTeradata Certified Professional Program Teradata V2R5 Certification Guide
Professional Program Teradata Certification Guide The Professional Program team welcomes you to the Teradata Certification Guide. The guide provides comprehensive information about Teradata certification
More informationTop of Minds Report series Data Warehouse The six levels of integration
Top of Minds Report series Data Warehouse The six levels of integration Recommended reading Before reading this report it is recommended to read ToM Report Series on Data Warehouse Definitions for Integration
More informationData 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.
Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020
More informationDATA WAREHOUSE PART II: DWH DATA MODELING & OLAP ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE PART II: DWH DATA MODELING & OLAP ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com
More informationChapter 3. Foundations of Business Intelligence: Databases and Information Management
Chapter 3 Foundations of Business Intelligence: Databases and Information Management THE DATA HIERARCHY TRADITIONAL FILE PROCESSING Organizing Data in a Traditional File Environment Problems with the traditional
More informationThe Data Organization
C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE
More information1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda
Agenda Oracle9i Warehouse Review Dulcian, Inc. Oracle9i Server OLAP Server Analytical SQL Mining ETL Infrastructure 9i Warehouse Builder Oracle 9i Server Overview E-Business Intelligence Platform 9i Server:
More informationWhen, Where & Why to Use NoSQL?
When, Where & Why to Use NoSQL? 1 Big data is becoming a big challenge for enterprises. Many organizations have built environments for transactional data with Relational Database Management Systems (RDBMS),
More informationMETADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE
UDC:681.324 Review paper METADATA INTERCHANGE IN SERVICE BASED ARCHITECTURE Alma Butkovi Tomac Nagravision Kudelski group, Cheseaux / Lausanne alma.butkovictomac@nagra.com Dražen Tomac Cambridge Technology
More informationDesigning Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses
Designing Data Warehouses To begin a data warehouse project, need to find answers for questions such as: Data Warehousing Design Which user requirements are most important and which data should be considered
More informationLow Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR
Low Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR Table of Contents Foreword... 2 New Era of Rapid Data Warehousing... 3 Eliminating Slow Reporting and Analytics Pains... 3 Applying 20 Years
More informationData transfer, storage and analysis for data mart enlargement
Data transfer, storage and analysis for data mart enlargement PROKOPOVA ZDENKA, SILHAVY PETR, SILHAVY RADEK Department of Computer and Communication Systems Faculty of Applied Informatics Tomas Bata University
More informationMaking Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST
Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0 WEBINAR MAY 15 th, 2018 1PM EST 10AM PST Welcome and Logistics If you have problems with the sound on your computer, switch
More informationCHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)
CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) INTRODUCTION A dimension is an attribute within a multidimensional model consisting of a list of values (called members). A fact is defined by a combination
More informationStages of Data Processing
Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,
More informationTaming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems
1 Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems The Defacto Choice For Convergence 2 ABSTRACT & SPEAKER BIO Dealing with enormous data growth is a key challenge for
More informationSyllabus. Syllabus. Motivation Decision Support. Syllabus
Presentation: Sophia Discussion: Tianyu Metadata Requirements and Conclusion 3 4 Decision Support Decision Making: Everyday, Everywhere Decision Support System: a class of computerized information systems
More informationThis tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.
About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This
More information