Integrating evolving MDM and EDW systems by Data Vault based System Catalog

Size: px
Start display at page:

Download "Integrating evolving MDM and EDW systems by Data Vault based System Catalog"

Transcription

1 Integrating evolving MDM and EDW systems by Data Vault based System Catalog D. Jakšić *, V. Jovanović ** and P. Poščić * * Department of informatics-university of Rijeka/ Rijeka, Croatia ** Georgia Southern University/ Statesboro, GA, USA dsubotic@inf.uniri.hr, vladan@georgiasouthern.edu, patrizia@inf.uniri.hr Abstract - The paper presents results of a research on integration of enterprise data warehouse (EDW) and a master data management (MDM) system. The primary goal was solving a schema evolution problem, and the corner stone of our approach was utilization of a data vault modeling of an integrated meta-model of EDW and MDM as an expansion of a traditional relational database system catalog. The main contributions of this paper are: a) common integration architecture, b) new system catalog based on a meta-model for DW and MDM integration, and c) research prototype used for empirical validation of the effectiveness of the proposed solution. I. INTRODUCTION A data warehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data in support of management's decision making process [15]. This means that a data warehouse (DW) can be used to analyze a particular subject area (such as sales, marketing, finance, etc.), it integrates data from multiple heterogeneous data sources, it keeps the history of data, and it never alters the data once it enters a DW. A simpler form of a DW is a data mart (DM). DM is focused on a single subject area and it draws its data not from all the DW data sources, but from a limited number of them (such as retail sales applications with daily transactions). For the logical representation of a DW (or DM), we traditionally use a denormalized dimensional model [12][20] in a combination with a 3NF model for modelling a central and integrated enterprise data warehouse (also called operational data store) [12][15][16]. DW environment (namely its data sources) nowadays is in a state of constant structural (schema) change. Master data management (MDM) comprises the processes, governance, policies, standards and tools that consistently define and manage the critical data of an organization to provide a single point of reference [3][23]. Some of the fundamental tasks of MDM system are duplicate removal, data standardization, and rule implementation and incorporation - all with a goal to eliminate incorrect data from entering the system and to create an authoritative source of master data for further distribution. It is a method of enabling a business organization to link all of its critical data to a common point of reference, which is further shared throughout all the departments and relevant employees. This way, a data quality of an organization greatly improves and the organization can better serve its clients, as well as improve their business by running a more accurate and efficient business analysis and reporting (based on a single version of the truth data pool [15]). Dimensional model can also be used here - for the logical representation of MDM data (master data are represented as dimensions here). Master data are the key business entities and their descriptive attributes (e.g., the customer has a name, address, etc., the product has a name, color, weight, category, etc.), which are used by multiple systems, applications, and business processes of the organization as a unique source of data. Reference data can be internal or external and are used for the validation of other data. MDM environment is also in a state of constant change data sources nowadays often change their structure and content. In this work our focus was on an enterprise size data warehouse (DW) and a master data management (MDM) system integration. We approached the problem through a development of a common system catalog meta-model, with the goal of solving a common schema evolution problem. Schema evolution occurs in both of these systems and is traditionally resolved separately. DW integrates current and historical data from many data sources and serves for business reporting and data analysis. MDM is traditionally used (by other business systems, applications, databases and data warehouses) as a physically independent database of master and reference data (also collected from many data sources). Data sources are often the same ones for DW and MDM and they often change their structure. These changes have to be implemented into both systems, so that they could accurately reflect the current (and historical) state of the real world - so that the DW could provide for effective business analysis and the MDM could achieve the optimal data quality throughout all the systems involved. We aim to resolve this problem on a common level - we state that the schema evolution problem can be viewed as a double issue: at a DW level and at the MDM level. From the DW perspective every event (fact) that is associated with dimensions is monitored, but from the MDM perspective the master data (dimensions) are monitored and the events (facts) give them context. And in both of these cases schema evolution problem to solve exists. The paper is organized as follows: section II gives a brief overview of a related work, section III describes our DW/MDM integration research (general research idea, our common integration architecture and integration part of a system catalog meta-model), section IV presents our research prototype and some relevant test results, and in section V we conclude our work with a brief summary and some plans and guidelines for the future. MIPRO 2017/miproBIS 1633

2 II. RELATED WORK With respect to the literature, the DW evolution can generally be traced through three approaches - schema evolution [5][10][29], schema versioning [1][9][25] and view maintenance [2][7][14]. The first two approaches are more interesting to our research because they are based on a DW defined as a multidimensional schema, and the third is based on a DW that is defined as a set of materialized views. Our extensive and comprehensive state-of-the-art on this problem can be found in [28], but from analyzing the related work we can generally conclude that the process of schema evolution and versioning is still demanding in terms of invested time and resources. It is necessary to balance the resource requirements and the quality of schema evolution process. Perhaps the biggest problem here is the preservation of schema consistency and data integrity (there is still a lack of an integrated system-of-records), as well as the simultaneous performance of temporal queries against multiple versions of the schema. Also, migration and transformation of data is still slow and expensive, the loss of information during these processes is still present and there is a lack of effective integration, organization and management of metadata. On the other hand, the view maintenance process can cause network saturation, depending on the amount of updated views and the amount of information they contain. The problems of anomalies and inconsistent changes in the views are still unresolved, the proposed approaches for view maintenance are still limited in terms of efficiency and performance, and the ETL processes, which are an integral part of most of today's DWs are completely ignored. Different approaches to solving the DW schema evolution problem are presented in literature (including a variety of techniques, algorithms, algebras, models, prototypes, methodologies and frameworks), but there is still no widely accepted solution and general framework for managing DW schema changes. More importantly, previous research does not emphasize the fact that the DW requirements, in this day and age, are increasing in the data and meta-data scope and structure (the growing number of data sources and more new and different types of data), which additionally requires developing some new approaches and solutions to the schema evolution problem. III. MDM AND DW INTEGRATION RESEARCH A. General research idea As we already mentioned, the DW needs to preserve the history of data and metadata changes, as well as the history of schema and scope changes, for a very long time period [19][26]. On the other hand, the MDM needs to preserve a data quality of an organization by achieving and maintaining a single version of the truth data pool as a basis for accurate and efficient business analysis [3][23]. Seeing that the DW and MDM both integrate basically the same data sources and have the same schema evolution problem, we will integrate those two systems into one and address the evolution problem at the common level. Our main research question was: Can our new system catalog model serve to successfully integrate DW and MDM systems?. In order to answer this question, we developed a new, integrated architecture for these systems, as well as a system catalog model based on a Data Vault modelling methodology [21][22]. The data vault (DV) is a data modeling method designed for supporting the long-term storage of historical data collected from various data sources and tracking the origin of data contained in the database [21][22]. The DV model (due to structural separation, usage of empiric meta-data and addition-only policy, which relation model does not implement) is able to track the data source values and the history of changes, which is a vital function of a DW system-of-records [19][26][27]. Furthermore, we developed and tested a research prototype based on said architecture and system catalog model. B. Common integration architecture Fig. 1 shows our common integration architecture. Central and integrated enterprise DW/MDM (purple section in Fig. 1) is in focus of our research and consists of two parts - the raw copy of the source data (SDV) and the synchronized master and business data (PMDV). Both parts are based on the Data Vault (DV) method, in contrast to traditional approaches based on the relational model [12][15][20]. SDV is focused on obtaining and preserving the original and unchanged copy of data sources for the purpose of governance and auditing, and PMDV is focused on obtaining and preserving the data that has been modified according to business and master rules the data that later feeds DMs and master dimensions (MDMs) and is oriented to user requirements. The integration of SDV and PMDV is carried out over an extended DBMS system catalog (i.e. our new meta-data repository, MDV), based on the DV model. MDV repository serves to integrate the two parts and to monitor history of changes of meta-data and their schemas, of the business rules and transformations and of mappings between the two systems. MDV contains the history of data sources meta-data (their domains and schemas), of central DW/MDM integration and schema changes, of DMs and MDMs schema changes and of security schema changes (user access rights). We can say that MDV repository represents the DW on DW and is a key part of our new architecture. In materialized DMs and MDMs (blue section in Fig. 1) master data and business analysis and reporting data is stored. Data is stored according to the user requirements and a dimensional model is used for representation (data are summarized, aggregated and calculated). A traditional DMs are integrated here with the traditional MDMs the PMDV forwards the data to MDMs so the MDMs could then forward the data to DMs and data sources. The end-user can then access the DM/MDMs and analyze the data through the selected data analysis and reporting tools. Additionally, the master data is returned from enterprise DW/MDM to the data sources in order to harmonize and refine the data within business organization. Master data collected from PMDV is maintained in one central MDM location and all source systems and applications, as well as DMs, are using this data. This directly affects the quality of data, which later re-enters the raw SDV and passes again through the described layers of the architecture. Also, by reducing the amount of data over which was necessary to make "heavy" transformations we thus speed up the process of 1634 MIPRO 2017/miproBIS

3 Figure 1. Common integration DW/MDM system architecture data integration [19]. However, these issues are out of the scope of this paper. C. Meta Data Vault (MDV) model for SDV/PMDV integration Fig. 2 shows a simplified meta-data-vault (MDV) model for DW/MDM integration. Due to complexity and size of the full MDV system catalog model and the scope of this work, we will show here one small, but relevant part of the MDV model the one for SDV/PMDV integration. The whole model is much more extensive because it serves for schema evolution purpose - it incorporates data sources, data marts, materialized views and issues of security - but those are out of scope of this paper. The model (as well as other models in this work) is made in IDEF1X method [17] with the use of the CA Erwin Data Modeler 9.5 modelling software [8]. In our MDV model, SDV/PMDV integration is managed through the same-as-links on four main hubs (H_HUB, H_LINK, H_SATELLITE, and H_ATTRIBUTE) which store metadata about three main DV concepts in a data model (hubs, links and satellites respectively additionally H_ATTRIBUTE stores data about satellite attributes; column meta-data). For example, if we have two different relational data sources which both have the table CUSTOMER, we integrate these sources into a DV based central EDW (EDV). This means that the relational table CUSTOMER from both sources becomes one hub H_CUSTOMER (stores business keys for CUSTOMER) with its satellites S_CUSTOMER1 (stores columns from CUSTOMER in Source1) and S_CUSTOMER2 (stores columns from CUSTOMER in Source2). This is considered as a copy of the original data sources (SDV), Figure 2. MDV system catalog model for SDV/PMDV integration MIPRO 2017/miproBIS 1635

4 which is then copied into a PMDV where the business and master rules are applied. For this example it means that hub H_CUSTOMER gets only one satellite S_CUSTOMER by the rules those two satellites are integrated into common one and the data is cleansed. Also, data in the H_CUSTOMER is integrated and deduplicated. The corresponding meta-data is then stored in our MDV system catalog from Fig. 2. In the H_HUB and its satellites S_BUSINESS_KEY and S_HUB_DEF the meta-data about hub H_CUSTOMER (and generally about all the hubs in a enterprise DW/MDM from both SDV and PMDV) is stored, including the data about names of the hubs, names of their keys, hub types and their general descriptions, as well as their load dates and record sources. Also, in the H_SATELLITE and its satellite S_SAT_DEF the meta-data about satellites S_CUSTOMER1, S_CUSTOMER2 and S_CUSTOMER is stored. The same is true for all other MDV structures (links, attributes, ). However, at this point the DW and MDM (SDV and PMDV) hubs are not yet integrated (list of hubs with their descriptive meta-data is just stored in a corresponding meta-hub and its meta-satellites - H_HUB, S_BUSINESS_KEY, S_HUB_DEF). For the integration we use the same-as link SAL_MASTER_HUB which relates a single hub from a list of hubs in H_HUB to another hub from that list, in a base-master relation (SDV hub is base one, PMDV hub is master). In the example it would mean that the hub H_CUSTOMER form PMDV becomes master hub to hub H_CUSTOMER in SDV. For the satellites, their integration is stored in the SAL_MASTER_SATELLITE link for example, the S_CUSTOMER1 and S_CUSTOMER2 meta-data records stored in H_SATELLITE can have (separate) base-master relationship with S_CUSTOMER record. This relationship is stored as a record in SAL_MASTER_SATELLITE where S_CUSTOMER is master structure on S_CUSTOMER1 and S_CUSTOMER2. Additionally, SAL_MASTER_HUB (as well as other master links) relates to H_RULE and H_TRANSFORMATION, the two hub meta-concepts that store the data about business rules that can be applied to data and (ETL) transformations needed to apply those rules, respectively. This way, we get the complete history of SDV/PMDV integration we can keep track of all the business rules and transformations applied to the specific integration, with SAL_MASTER_HUB storing the load date and record source data about that specific integration. In that way we can create and manage master hubs and store the history of their changes. SAL_MASTER_HUB has its satellite S_BDV_CODE_HUB, with descriptive data about the code (ETL or other) used for that specific integration. This satellite will later serve as a basis for automating the process of integration. The same can be applied to the other three main hubs in a MDV model (SAL_MASTER_LINK for H_LINK, SAL_MASTER_SATELLITE for H_SATELLITE and SAL_MASTER_ATTRIBUTE for H_ATTRIBUTE), so we can create complete and historicized golden copy of business and master data in PMDV, from the simple SDV raw copy of data. Regarding the reference table concept, as it exists only in a data model (it represents external or internal reference data and as such it is already consolidated and organized by internal business rules or external data standards), we have no need to further make the master concepts in the meta-model. We will simply store its meta-data in a H_REFERENCE concept in MDV. IV. RESEARCH PROTOTYPE AND RESULTS A. Business case example For the purpose of theoretical validation of the proposed architecture and DW/MDM integration, as well as building a prototype, we developed a simple business case (in a similar way as in [18]), which monitors the work of the employees on projects and their participation in job trainings and educations. We have two data sources (JobDB and TrainingDB) which we integrate into a common SDV and PMDV database. This integrated DW/MDM further feeds the MDM and local DMs with relevant data. JobDB data source schema is describing employees, their bosses and projects they are currently working. TrainingDB data source schema also describes employees, but this time in the context of business trainings in which they participate and competences they achieve through them. SdvDB and PmdvDB databases integrate these two data sources into a single schema, through the use of Data Vault model. However, due to size restrictions, business case data models will not be shown in this paper. B. Prototype description Research prototype has been developed according to the business case examples and has been tested by running a set of queries against SDV, PMDV and MDV databases, as well as the original relational system catalog. Fig. 3 shows the architecture of a prototype. Corresponding to business case models shown in the previous section, there are two source databases (JobDB and TrainingDB) which are then integrated into a raw copy of the source data (SdvDB). A business and master PmdvDB is built and loaded from the SdvDB and it keeps "purified" data. The MdmDB is loaded from PmdvDB and it practically represents the MDM system which keeps the master data. In MdmDB the same data as in PmdvDB is stored, but organized according to a dimensional model - in the dimensional database. MdmDB is then the basis for loading the DmDB, which is also based on a dimensional model and represents a local DM. This way, DW and MDM are integrated - MdmDB serves as the employee master data and the basis of loading the local DMs, and Figure 3. Prototype s architecture diagram 1636 MIPRO 2017/miproBIS

5 has the ability to return 'golden copy' of employee master data back to data sources. The prototype (with all of its separate databases) is developed and implemented on the same Windows 10 Education x64 operating system and is made in Microsoft SQL Server 2012 [24] database management system using Microsoft SQL Server 2012 Integration Services (SSIS) and SQL Server data Tool for Visual Studio 2012 for extracting, loading and transforming the data between specific databases. To generate the data with which we initially loaded the source databases JobDB and TrainingDB, we used the web tool FreeDataGenerator [11]. C. Testing the sustainability of the integration As we already stated, our main research question was Can our new system catalog model serve to successfully integrate DW and MDM systems?. We presume that the successful integration is a sustainabile one if a MDV model for integration is developed (which it is, shown in Fig. 2) and if: a) a new system catalog (MdvDB in the prototype) collects and stores historical meta-data about SdvDB and PmdvDB schema mappings, and b) queries defined over MdmDB and DmDB return the same results. In order to prove the integration was successful we developed and conducted two simple tests on a prototype: a) run a query on MdvDB system catalog that can return information about SdvDB and PmdvDB schema mappings, and b) run equivalent queries over MdmDB and DmDB that return the same results. For the test A we developed 8 queries on the new MdvDB system catalog that return information on SdvDB and PmdvDB structures (hubs, links, satellites and attributes) and their mappings, as well as the mappings between data sources and SdvDB structures, or mappings between PmdvDB and MdmDB structures. Fig. 4 shows one of those queries a query which serves to monitor the mappings and integration between base hubs in SdvDB and their corresponding master hubs in PmdvDB. This way we can monitor the transformations and the origin of master data, as well as structural changes (Change Type and LEDTS columns in Fig. 4). Fig. 5 shows the query that returns a similar information on the history of mappings between data sources tables and SdvDB hubs. All the queries that we have created and run by MdvDB system catalog successfully returned needed information on creation of PmdvDB structures from SdvDB structures, as well as the historical state of integration thus we conclude the test A was successfully conducted. For the test B we made a data change in the JobDB Figure 4. SdvDB and PmdvDB hub mappings stored in MdvDB system catalog Figure 5. SdvDB and data sources hub mappings stored in MdvDB system catalog data source, but only after loading with start data all the databases in the prototype (SdvDB, PmdvDB, MdmDB and DmDB). An employee Desiree Farmer became Desiree Farmer-Matthews we changed her last name. Then we propagated this data change to all the levels in the prototype architecture. Fig. 6 shows a pair of equivalent queries on both MdmDB and DmDB queries that return the number of rows in Employee master table after the data change. We can see they have returned the same results. Fig. 7 shows the data state in the MdmDB and DmDB after the change. Two equivalent queries have been made on those databases and we can also see that the results are identical. From the results from test B (as well as test A) we have gotten the answer to our research Figure 6. Number of rows in DmDB and MdmDB after a data change in data sources Figure 7. DmDB and MdmDB state after a data change in data sources MIPRO 2017/miproBIS 1637

6 question yes, it is possible to successfully integrate DW and MDM systems through the usage of a new system catalog and new DW/MDM system architecture. V. CONCLUSION AND FUTURE WORK In this paper we presented our research on a DW/MDM integration into one common business intelligence system. Seeing that the DW and MDM both integrate basically the same data sources and have the same evolution problem (preserving the history of data and structure), our goal was to integrate those two systems into one so we could in the future address the evolution problem at the common level. We have presented our MDV meta-model which serves to integrate those two systems and to provide the means for managing their evolution at a mutual level. In our proposed common solution, the data repository preserves the history of raw (DW) and master (MDM) data, as well as their schemas. Meta-data repository (i.e. system catalog) preserves the history of meta-data for the data repository. This way, the problem of the DW and the MDM evolution could be addressed at the general level and a permanent general solution on a meta-level could be developed. The end result could be a flexible, modular solution which will be able to track and manage changes in both data and metadata, as well as their schemas. In the paper we described our common integration architecture, which includes the new system catalog based on a Data Vault modelling method. We further described the integration part of underlying data model for the new system catalog and we presented our research prototype used for empirical validation of the sustainability of the proposed solution. Also, we described a couple of tests that we conducted in order to prove this kind of integration is possible and can be successfully deployed. These are also the main contributions of this paper. The benefits of this approach could be various, the least of all a development of a simpler common solution which can effectively manage data and schema evolution in both of those systems. The next step of our research is the main one - proving that the schema evolution problem can be solved more effortlessly and efficiently through the usage of our solution. In order to do so, we plan to define a final set of structural change cases, together with evolution operations and test them against the prototype, as well as the traditional RDBMS system catalog. REFERENCES [1] B. Bebel, J. Eder, C. Koncilia, T. Morzy and R. Wrembel, Creation and Management of Versions in Multiversion Data Warehouse, 19th ACM Symposium on Applied Computing, Nicosia, Cyprus, [2] Z. Bellahsene, Schema Evolution in Data Warehouses, Knowledge and Information Systems, pp , [3] A. Berson and L. Dubbov, Master data management and data governance, 2nd ed. New York: McGraw Hill, [4] P. Chen, "The Entity-Relationship Model - Toward a Unified View of Data", ACM Transactions on Database Systems, vol. 1 pp. 9 36, [5] J. Chen, S. Chen and E. Rundensteiner, A transactional model for data warehouse maintenance, In: Spaccapietra, S., March, S.T., Kambayashi, Y. (eds.) ER LNCS, vol. 2503, pp Springer, Heidelberg, [6] E. F. Codd, "Relational database: a practical foundation for productivity", Communications of the ACM, vol.25, pp , [7] Y. Cui and J. Widom, Practical Lineage Tracing in Data Warehouses, Proceedings of the 16th International Conference on Data Engineering, San Diego, California, [8] (2015) The CA ERwin Data Modeler Site [Online]. Available: [9] J. Eder and C. Koncilla, Evolution of Dimension Data in Temporal Data Warehouses, Technical Report, [10] H. Fan and A. Poulovassilis, Schema Evolution in Data Warehousing Environments A Schema Transformation-based Approach, Proceedings of 23rd International Conference on Conceptual Modeling, Shanghai, China [11] FreeDataGenerator (June 2016). Available at: [12] M. Golfarelli and S. Rizzi, Data warehouse design, New York: McGraw Hill, [13] M. Golfarelli, J. Lechtenbörger,S. Rizzi and G.Vossen, Schema Versioning in Data Warehouses, In: ER Workshops, LNCS Springer, vol. 3289, pp , [14] A. Gupta and I. Mumick, Maintenance of Materialized Views: Problems, Techniques, and Applications, Data Engineering Bulletin, [15] W. H. Inmon, Building the data warehouse, 4th ed., Indianapolis: Wiley Publishing, [16] W. H. Inmon, D. Strauss and G. Neushloss, DW 2.0: The Architecture for the Next Generation of Data Warehousing, Burlington: Morgan Kaufmann Publishers, [17] Integration Definition for Information Modeling (IDEF1X) Standard, FIPS Publication 184, Computer Systems Laboratory of the National Institute of Standards and Technology (NIST), [18] D. Jakšić and P. Poščić, Data Warehouse Models in Higher Education Courses, International Conference on Advanced Technology & Sciences, Antalya, Turkey, [19] V. Jovanovic, S. Bojicic, C. Knowles and M. Pavlic, Persistent staging area models for data warehouses, Issues in Information Systems, vol.13, pp , [20] R. Kimball and M. Ross, The data warehouse toolkit, 3rd ed., Indianapolis: Wiley Publishing, [21] D. Linstedt, SuperCharge Your Data Warehouse: Invaluable Data Modeling Rules to Implement Your Data Vault, USA: CreateSpace Independent Publishing Platform, [22] D. Linstedt, and M. Olschimke, Building a Scalable Data Warehouse with Data Vault 2.0: Implementation Guide for Microsoft SQL Server Morgan Kaufmann, [23] D. Loshin, Master Data Management, San Francisco: Morgan Kaufmann, [24] Microsoft SQL Server 2012 (October 2016). Available at: [25] T. Morzy and R. Wrembel, On Querying Versions of Multiversion Data Warehouse, In Proceedings of the International Workshop on Data Warehousing and OLAP, DOLAP 04, Washington, USA, [26] D. Subotić, Data Warehouse Schema Evolution Perspectives, Advances in Intelligent Systems and Computing, Springer, vol. 312, pp , [27] D. Subotić, V. Jovanović, and P. Poščić, Data Warehouse and Master Data Management Evolution-A Meta-Data-Vault Approach, Issues in Information Systems, vol.15, pp , [28] D. Subotić, V. Jovanovic, P. Poščić, "Data Warehouse Schema Evolution: State of the Art". 25th Central European Conference on Information and Intelligent Systems CECIIS, Varaždin, Croatia, [29] C. Quix, Repository Support for Data Warehouse Evolution, Proceedings of the Workshop DMDW, Germany, MIPRO 2017/miproBIS

Managing Changes to Schema of Data Sources in a Data Warehouse

Managing Changes to Schema of Data Sources in a Data Warehouse Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2001 Proceedings Americas Conference on Information Systems (AMCIS) December 2001 Managing Changes to Schema of Data Sources in

More information

Modelling Data Warehouses with Multiversion and Temporal Functionality

Modelling Data Warehouses with Multiversion and Temporal Functionality Modelling Data Warehouses with Multiversion and Temporal Functionality Waqas Ahmed waqas.ahmed@ulb.ac.be Université Libre de Bruxelles Poznan University of Technology July 9, 2015 ITBI DC Outline 1 Introduction

More information

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online):

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): 2321-0613 Tanzeela Khanam 1 Pravin S.Metkewar 2 1 Student 2 Associate Professor 1,2 SICSR, affiliated

More information

Data Mining & Data Warehouse

Data Mining & Data Warehouse Data Mining & Data Warehouse Asso. Profe. Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of Information Technology 2016 2017 (1) Points to Cover Problem:

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 05(b) : 23/10/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter

More information

Data Vault Brisbane User Group

Data Vault Brisbane User Group Data Vault Brisbane User Group 26-02-2013 Agenda Introductions A brief introduction to Data Vault Creating a Data Vault based Data Warehouse Comparisons with 3NF/Kimball When is it good for you? Examples

More information

DATA VAULT MODELING GUIDE

DATA VAULT MODELING GUIDE DATA VAULT MODELING GUIDE Introductory Guide to Data Vault Modeling GENESEE ACADEMY, LLC 2012 Authored by: Hans Hultgren DATA VAULT MODELING GUIDE Introductory Guide to Data Vault Modeling Forward Data

More information

Rocky Mountain Technology Ventures

Rocky Mountain Technology Ventures Rocky Mountain Technology Ventures Comparing and Contrasting Online Analytical Processing (OLAP) and Online Transactional Processing (OLTP) Architectures 3/19/2006 Introduction One of the most important

More information

Decision Guidance. Data Vault in Data Warehousing

Decision Guidance. Data Vault in Data Warehousing Decision Guidance Data Vault in Data Warehousing DATA VAULT IN DATA WAREHOUSING Today s business environment requires data models, which are resilient to change and enable the integration of multiple data

More information

Semantics, Metadata and Identifying Master Data

Semantics, Metadata and Identifying Master Data Semantics, Metadata and Identifying Master Data A DataFlux White Paper Prepared by: David Loshin, President, Knowledge Integrity, Inc. Once you have determined that your organization can achieve the benefits

More information

CHAPTER 3 Implementation of Data warehouse in Data Mining

CHAPTER 3 Implementation of Data warehouse in Data Mining CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected

More information

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS LECTURE: 05 (A) DATA WAREHOUSING (DW) By: Dr. Tendani J. Lavhengwa lavhengwatj@tut.ac.za 1 My personal quote:

More information

Evolution of Database Systems

Evolution of Database Systems Evolution of Database Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Intelligent Decision Support Systems Master studies, second

More information

A Data Warehouse Engineering Process

A Data Warehouse Engineering Process A Data Warehouse Engineering Process Sergio Luján-Mora and Juan Trujillo D. of Software and Computing Systems, University of Alicante Carretera de San Vicente s/n, Alicante, Spain {slujan,jtrujillo}@dlsi.ua.es

More information

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Fig 1.2: Relationship between DW, ODS and OLTP Systems 1.4 DATA WAREHOUSES Data warehousing is a process for assembling and managing data from various sources for the purpose of gaining a single detailed view of an enterprise. Although there are several definitions

More information

Data Warehousing and OLAP Technologies for Decision-Making Process

Data Warehousing and OLAP Technologies for Decision-Making Process Data Warehousing and OLAP Technologies for Decision-Making Process Hiren H Darji Asst. Prof in Anand Institute of Information Science,Anand Abstract Data warehousing and on-line analytical processing (OLAP)

More information

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India

More information

An Automatic Tool for Anchor Model Data Warehouse Development

An Automatic Tool for Anchor Model Data Warehouse Development Vol.8/.2 (2016) INTERNETWORKING INDONESIA JOURNAL 33 An Automatic Tool for Anchor Model Data Warehouse Development Humasak Simanjuntak, Marcella Tambunan, Kamaruddin Manullang, Hanna Panjaitan Abstract

More information

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue Datalynx Enterprise Data Management Solution Catalogue About Datalynx Vendor of the world s most versatile Enterprise Data Management software Licence our software to clients & partners Partner-based sales

More information

Question Bank. 4) It is the source of information later delivered to data marts.

Question Bank. 4) It is the source of information later delivered to data marts. Question Bank Year: 2016-2017 Subject Dept: CS Semester: First Subject Name: Data Mining. Q1) What is data warehouse? ANS. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE

More information

Data Mining. Asso. Profe. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of CS (1)

Data Mining. Asso. Profe. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of CS (1) Data Mining Asso. Profe. Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of CS 2016 2017 (1) Points to Cover Problem: Heterogeneous Information Sources

More information

Data Vault Modeling and its Evolution DECISION SCIENCES INSTITUTE. Conceptual Data Vault Modeling and its Opportunities for the Future

Data Vault Modeling and its Evolution DECISION SCIENCES INSTITUTE. Conceptual Data Vault Modeling and its Opportunities for the Future DECISION SCIENCES INSTITUTE Conceptual Data Vault Modeling and its Opportunities for the Future Aarthi Raman, Active Network, Dallas, TX, 75201 itz.aarthi@gmail.com Teuta Cata, Northern Kentucky University,

More information

Designing and Implementing an Object Relational Data Warehousing System

Designing and Implementing an Object Relational Data Warehousing System Designing and Implementing an Object Relational Data Warehousing System Abstract Bodgan Czejdo 1, Johann Eder 2, Tadeusz Morzy 3, Robert Wrembel 3 1 Department of Mathematics and Computer Science, Loyola

More information

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing Learning Objectives Understand the basic definitions and concepts of data warehouses Learn different

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini Advance in Electronic and Electric Engineering. ISSN 2231-1297, Volume 3, Number 6 (2013), pp. 669-674 Research India Publications http://www.ripublication.com/aeee.htm Data Warehousing Ritham Vashisht,

More information

A New Approach of Extraction Transformation Loading Using Pipelining

A New Approach of Extraction Transformation Loading Using Pipelining A New Approach of Extraction Transformation Loading Using Pipelining Dr. Rajender Singh Chhillar* (Professor, CS Department, M.D.U) Barjesh Kochar (Head(MCA,IT),GNIM) Abstract Companies have lots of valuable

More information

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research Priyanshu Gupta ETL Software Developer United Health Group Abstract- In this paper, the author has focused on explaining Data Warehousing and

More information

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER ABOUT THIS COURSE The focus of this five-day instructor-led course is on creating managed enterprise BI solutions. It describes how to implement multidimensional and tabular data models, deliver reports

More information

Introductory Guide to Data Vault Modeling GENESEE ACADEMY, LLC

Introductory Guide to Data Vault Modeling GENESEE ACADEMY, LLC Introductory Guide to Data Vault Modeling GENESEE ACADEMY, LLC 2016 Authored by: Hans Hultgren Introductory Guide to Data Vault Modeling Forward Data Vault modeling is most compelling when applied to an

More information

Research Article ISSN:

Research Article ISSN: Research Article [Srivastava,1(4): Jun., 2012] IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY An Optimized algorithm to select the appropriate Schema in Data Warehouses Rahul

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

Adnan YAZICI Computer Engineering Department

Adnan YAZICI Computer Engineering Department Data Warehouse Adnan YAZICI Computer Engineering Department Middle East Technical University, A.Yazici, 2010 Definition A data warehouse is a subject-oriented integrated time-variant nonvolatile collection

More information

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing The Evolution of Data Warehousing Data Warehousing Concepts Since 1970s, organizations gained competitive advantage through systems that automate business processes to offer more efficient and cost-effective

More information

Managing Data Resources

Managing Data Resources Chapter 7 Managing Data Resources 7.1 2006 by Prentice Hall OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Describe how

More information

A Step towards Centralized Data Warehousing Process: A Quality Aware Data Warehouse Architecture

A Step towards Centralized Data Warehousing Process: A Quality Aware Data Warehouse Architecture A Step towards Centralized Data Warehousing Process: A Quality Aware Data Warehouse Architecture Maqbool-uddin-Shaikh Comsats Institute of Information Technology Islamabad maqboolshaikh@comsats.edu.pk

More information

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system A Data Warehouse Implementation Using the Star Schema For an outpatient hospital information system GurvinderKaurJosan Master of Computer Application,YMT College of Management Kharghar, Navi Mumbai ---------------------------------------------------------------------***----------------------------------------------------------------

More information

Implementing a Data Warehouse with Microsoft SQL Server 2014

Implementing a Data Warehouse with Microsoft SQL Server 2014 Course 20463D: Implementing a Data Warehouse with Microsoft SQL Server 2014 Page 1 of 5 Implementing a Data Warehouse with Microsoft SQL Server 2014 Course 20463D: 4 days; Instructor-Led Introduction This

More information

Implement a Data Warehouse with Microsoft SQL Server

Implement a Data Warehouse with Microsoft SQL Server Implement a Data Warehouse with Microsoft SQL Server 20463D; 5 days, Instructor-led Course Description This course describes how to implement a data warehouse platform to support a BI solution. Students

More information

DATA WAREHOUSING IN LIBRARIES FOR MANAGING DATABASE

DATA WAREHOUSING IN LIBRARIES FOR MANAGING DATABASE DATA WAREHOUSING IN LIBRARIES FOR MANAGING DATABASE Dr. Kirti Singh, Librarian, SSD Women s Institute of Technology, Bathinda Abstract: Major libraries have large collections and circulation. Managing

More information

A Data warehouse within a Federated database architecture

A Data warehouse within a Federated database architecture Association for Information Systems AIS Electronic Library (AISeL) AMCIS 1997 Proceedings Americas Conference on Information Systems (AMCIS) 8-15-1997 A Data warehouse within a Federated database architecture

More information

After completing this course, participants will be able to:

After completing this course, participants will be able to: Designing a Business Intelligence Solution by Using Microsoft SQL Server 2008 T h i s f i v e - d a y i n s t r u c t o r - l e d c o u r s e p r o v i d e s i n - d e p t h k n o w l e d g e o n d e s

More information

02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests

02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests Course Code Course Name Teaching Scheme Credits Assigned Theory Practical Tutorial Theory Practical/Oral Tutorial Total TEITC504 Database Management Systems 04 Hr/week 02 Hr/week --- 04 01 --- 05 Examination

More information

Data warehouse architecture consists of the following interconnected layers:

Data warehouse architecture consists of the following interconnected layers: Architecture, in the Data warehousing world, is the concept and design of the data base and technologies that are used to load the data. A good architecture will enable scalability, high performance and

More information

Managing Data Resources

Managing Data Resources Chapter 7 OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Managing Data Resources Describe how a database management system

More information

Proposal of a new Data Warehouse Architecture Reference Model

Proposal of a new Data Warehouse Architecture Reference Model Proposal of a new Data Warehouse Architecture Reference Model Dariusz Dymek Wojciech Komnata Piotr Szwed AGH University of Science and Technology Department of Applied Computer Science e-mail: eidymek@kinga.cyf-kr.edu.pl,

More information

Duration: 5 Days. EZY Intellect Pte. Ltd.,

Duration: 5 Days. EZY Intellect Pte. Ltd., Implementing a SQL Data Warehouse Duration: 5 Days Course Code: 20767A Course review About this course This 5-day instructor led course describes how to implement a data warehouse platform to support a

More information

Exam /Course 20767B: Implementing a SQL Data Warehouse

Exam /Course 20767B: Implementing a SQL Data Warehouse Exam 70-767/Course 20767B: Implementing a SQL Data Warehouse Course Outline Module 1: Introduction to Data Warehousing This module describes data warehouse concepts and architecture consideration. Overview

More information

Business Intelligence An Overview. Zahra Mansoori

Business Intelligence An Overview. Zahra Mansoori Business Intelligence An Overview Zahra Mansoori Contents 1. Preference 2. History 3. Inmon Model - Inmonities 4. Kimball Model - Kimballities 5. Inmon vs. Kimball 6. Reporting 7. BI Algorithms 8. Summary

More information

Data Mining & Data Warehouse

Data Mining & Data Warehouse Data Mining & Data Warehouse Associate Professor Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology (1) 2016 2017 1 Points to Cover Why Do We Need Data Warehouses?

More information

Novel Materialized View Selection in a Multidimensional Database

Novel Materialized View Selection in a Multidimensional Database Graphic Era University From the SelectedWorks of vijay singh Winter February 10, 2009 Novel Materialized View Selection in a Multidimensional Database vijay singh Available at: https://works.bepress.com/vijaysingh/5/

More information

Anchor Modeling A Technique for Information under Evolution

Anchor Modeling A Technique for Information under Evolution Anchor Modeling A Technique for Information under Evolution Lars Rönnbäck @Ordina 6/12, 2011 Anchor Modeling... Pitches has a solid theoretical foundation. is based on well known principles. shortens implementation

More information

Training 24x7 DBA Support Staffing. MCSA:SQL 2016 Business Intelligence Development. Implementing an SQL Data Warehouse. (40 Hours) Exam

Training 24x7 DBA Support Staffing. MCSA:SQL 2016 Business Intelligence Development. Implementing an SQL Data Warehouse. (40 Hours) Exam MCSA:SQL 2016 Business Intelligence Development Implementing an SQL Data Warehouse (40 Hours) Exam 70-767 Prerequisites At least 2 years experience of working with relational databases, including: Designing

More information

Schema Evolution for Data Warehouse: A Survey

Schema Evolution for Data Warehouse: A Survey Schema Evolution for Data Warehouse: A Survey Meenakshi Arora University School of Information Technology, Guru Gobind Singh Indraprastha University Delhi, India ABSTRACT Data warehouse is considered as

More information

Improving the Data Warehouse Architecture Using Design Patterns

Improving the Data Warehouse Architecture Using Design Patterns Association for Information Systems AIS Electronic Library (AISeL) MWAIS 2011 Proceedings Midwest (MWAIS) 5-20-2011 Improving the Data Warehouse Architecture Using Design Patterns Weiwen Yang Colorado

More information

A Layered Architecture for Enterprise Data Warehouse Systems

A Layered Architecture for Enterprise Data Warehouse Systems A Layered Architecture for Enterprise Warehouse Systems Thorsten Winsemann 1,2, Veit Köppen 2, and Gunter Saake 2 1 SAP Deutschland AG & Co. KG, Großer Grasbrook 17, 22047 Hamburg, Germany Thorsten.Winsemann@t-online.de

More information

Kent Graziano

Kent Graziano Agile Data Warehouse Modeling: Introduction to Data Vault Modeling Kent Graziano Twitter @KentGraziano Agenda Bio What is a Data Vault? Where does it fit in an DW/BI architecture? How to design a Data

More information

Introduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu

Introduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu Introduction to DWML Christian Thomsen, Aalborg University Slides adapted from Torben Bach Pedersen and Man Lung Yiu Course Structure Business intelligence Extract knowledge from large amounts of data

More information

Q1) Describe business intelligence system development phases? (6 marks)

Q1) Describe business intelligence system development phases? (6 marks) BUISINESS ANALYTICS AND INTELLIGENCE SOLVED QUESTIONS Q1) Describe business intelligence system development phases? (6 marks) The 4 phases of BI system development are as follow: Analysis phase Design

More information

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT MANAGING THE DIGITAL FIRM, 12 TH EDITION Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT VIDEO CASES Case 1: Maruti Suzuki Business Intelligence and Enterprise Databases

More information

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives Data-Driven Driven Business Intelligence Systems: Parts I Week 5 Dr. Jocelyn San Pedro School of Information Management & Systems Monash University IMS3001 BUSINESS INTELLIGENCE SYSTEMS SEM 1, 2004 Lecture

More information

Towards Development of Solution for Business Process-Oriented Data Analysis

Towards Development of Solution for Business Process-Oriented Data Analysis Towards Development of Solution for Business Process-Oriented Data Analysis M. Klimavicius Abstract This paper proposes a modeling methodology for the development of data analysis solution. The Author

More information

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting.

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting. DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting April 14, 2009 Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie,

More information

Information Modeling and Relational Databases

Information Modeling and Relational Databases Information Modeling and Relational Databases Second Edition Terry Halpin Neumont University Tony Morgan Neumont University AMSTERDAM» BOSTON. HEIDELBERG LONDON NEW YORK OXFORD PARIS SAN DIEGO SAN FRANCISCO

More information

Building a Data Warehouse step by step

Building a Data Warehouse step by step Informatica Economică, nr. 2 (42)/2007 83 Building a Data Warehouse step by step Manole VELICANU, Academy of Economic Studies, Bucharest Gheorghe MATEI, Romanian Commercial Bank Data warehouses have been

More information

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile Course Content Course Description: This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse 2014, implement ETL with

More information

Data Warehouse Design Using Row and Column Data Distribution

Data Warehouse Design Using Row and Column Data Distribution Int'l Conf. Information and Knowledge Engineering IKE'15 55 Data Warehouse Design Using Row and Column Data Distribution Behrooz Seyed-Abbassi and Vivekanand Madesi School of Computing, University of North

More information

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Data Warehouses Chapter 12. Class 10: Data Warehouses 1 Data Warehouses Chapter 12 Class 10: Data Warehouses 1 OLTP vs OLAP Operational Database: a database designed to support the day today transactions of an organization Data Warehouse: historical data is

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

Top of Minds Report series Data Warehouse The six levels of integration

Top of Minds Report series Data Warehouse The six levels of integration Top of Minds Report series Data Warehouse The six levels of integration Recommended reading Before reading this report it is recommended to read ToM Report Series on Data Warehouse Definitions for Integration

More information

CHAPTER 3 BUILDING ARCHITECTURAL DATA WAREHOUSE FOR CANCER DISEASE

CHAPTER 3 BUILDING ARCHITECTURAL DATA WAREHOUSE FOR CANCER DISEASE 32 CHAPTER 3 BUILDING ARCHITECTURAL DATA WAREHOUSE FOR CANCER DISEASE 3.1 INTRODUCTION Due to advanced technology, increasing number of hospitals are using electronic medical records to accumulate substantial

More information

Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a Data Warehouse with Microsoft SQL Server 2012 10777 - Implementing a Data Warehouse with Microsoft SQL Server 2012 Duration: 5 days Course Price: $2,695 Software Assurance Eligible Course Description 10777 - Implementing a Data Warehouse with Microsoft

More information

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures) CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm

More information

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server Page 1 of 6 Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: 4 days; Instructor-Led Introduction This course

More information

Data Vault Modeling & Methodology. Technical Side and Introduction Dan Linstedt, 2010,

Data Vault Modeling & Methodology. Technical Side and Introduction Dan Linstedt, 2010, Data Vault Modeling & Methodology Technical Side and Introduction Dan Linstedt, 2010, http://danlinstedt.com Technical Definition The Data Vault is a detail oriented, historical tracking and uniquely linked

More information

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended. Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide

More information

Data Warehousing & Mining Techniques

Data Warehousing & Mining Techniques Data Warehousing & Mining Techniques Wolf-Tilo Balke Kinda El Maarry Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de 2. Summary Last week: What is a Data

More information

The GOLD Model CASE Tool: an environment for designing OLAP applications

The GOLD Model CASE Tool: an environment for designing OLAP applications The GOLD Model CASE Tool: an environment for designing OLAP applications Juan Trujillo, Sergio Luján-Mora, Enrique Medina Departamento de Lenguajes y Sistemas Informáticos. Universidad de Alicante. Campus

More information

A MAS Based ETL Approach for Complex Data

A MAS Based ETL Approach for Complex Data A MAS Based ETL Approach for Complex Data O. Boussaid, F. Bentayeb, J. Darmont Abstract : In a data warehousing process, the phase of data integration is crucial. Many methods for data integration have

More information

Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data

Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data June 2006 Note: This document is for informational purposes. It is not a commitment to deliver any material, code, or functionality,

More information

Data Warehousing & Mining Techniques

Data Warehousing & Mining Techniques 2. Summary Data Warehousing & Mining Techniques Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Last week: What is a Data

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information

Efficiency Gains in Inbound Data Warehouse Feed Implementation

Efficiency Gains in Inbound Data Warehouse Feed Implementation Efficiency Gains in Inbound Data Warehouse Feed Implementation Simon Eligulashvili simon.e@gamma-sys.com Introduction The task of building a data warehouse with the objective of making it a long-term strategic

More information

DATA WAREHOUSE 03 COMMON DWH ARCHITECTURES ANDREAS BUCKENHOFER, DAIMLER TSS

DATA WAREHOUSE 03 COMMON DWH ARCHITECTURES ANDREAS BUCKENHOFER, DAIMLER TSS A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE 03 COMMON DWH ARCHITECTURES ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com Since

More information

Measuring the functional size of a data warehouse application using COSMIC-FFP

Measuring the functional size of a data warehouse application using COSMIC-FFP Measuring the functional size of a data warehouse application using COSMIC-FFP Harold van Heeringen Abstract A data warehouse system is not the kind of traditional system that is easily sized with FPA,

More information

Meaning & Concepts of Databases

Meaning & Concepts of Databases 27 th August 2015 Unit 1 Objective Meaning & Concepts of Databases Learning outcome Students will appreciate conceptual development of Databases Section 1: What is a Database & Applications Section 2:

More information

FedDW Global Schema Architect

FedDW Global Schema Architect UML based Design Tool for the Integration of Data Mart Schemas Dr. Stefan Berger Department of Business Informatics Data & Knowledge Engineering Johannes Kepler University Linz ACM 15 th DOLAP 12 November

More information

Information Management Fundamentals by Dave Wells

Information Management Fundamentals by Dave Wells Information Management Fundamentals by Dave Wells All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks

More information

Implementing a SQL Data Warehouse

Implementing a SQL Data Warehouse Implementing a SQL Data Warehouse 20767B; 5 days, Instructor-led Course Description This 4-day instructor led course describes how to implement a data warehouse platform to support a BI solution. Students

More information

Comparing Anchor Modeling with Data Vault Modeling

Comparing Anchor Modeling with Data Vault Modeling PLACE PHOTO HERE, OTHERWISE DELETE BOX Comparing Anchor Modeling with Data Vault Modeling Lars Rönnbäck & Hans Hultgren SUMMER 2013 lars.ronnback@anchormodeling.com www.anchormodeling.com Hans@GeneseeAcademy.com

More information

Data Warehouse and Mining

Data Warehouse and Mining Data Warehouse and Mining 1. is a subject-oriented, integrated, time-variant, nonvolatile collection of data in support of management decisions. A. Data Mining. B. Data Warehousing. C. Web Mining. D. Text

More information

Proceed Requirements Meta-Model For Adequate Business Intelligence Using Workflow

Proceed Requirements Meta-Model For Adequate Business Intelligence Using Workflow International Journal of Research in Engineering and Science (IJRES) ISSN (Online): 2320-9364, ISSN (Print): 2320-9356 Volume 1 Issue 5 ǁ Sep. 2013 ǁ PP.46-50 Proceed Requirements Meta-Model For Adequate

More information

Practicing for Business Intelligence Application with SQL Server 2008 Zhijun Ren

Practicing for Business Intelligence Application with SQL Server 2008 Zhijun Ren Applied Mechanics and Materials Vols. 20-23 (2010) pp 1499-1503 Online available since 2010/Jan/12 at www.scientific.net (2010) Trans Tech Publications, Switzerland doi:10.4028/www.scientific.net/amm.20-23.1499

More information

International Journal of Computer Engineering and Applications, REQUIREMENT GATHERING FOR MODEL DRIVEN DESIGN OF DATAWAREHOUSE

International Journal of Computer Engineering and Applications, REQUIREMENT GATHERING FOR MODEL DRIVEN DESIGN OF DATAWAREHOUSE International Journal of Computer Engineering and Applications, Volume XII, Issue I, Jan. 18, www.ijcea.com ISSN 2321-3469 REQUIREMENT GATHERING FOR MODEL DRIVEN DESIGN OF DATAWAREHOUSE Kuldeep Deshpande

More information

2. Summary. 2.1 Basic Architecture. 2. Architecture. 2.1 Staging Area. 2.1 Operational Data Store. Last week: Architecture and Data model

2. Summary. 2.1 Basic Architecture. 2. Architecture. 2.1 Staging Area. 2.1 Operational Data Store. Last week: Architecture and Data model 2. Summary Data Warehousing & Mining Techniques Wolf-Tilo Balke Kinda El Maarry Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Last week: What is a Data

More information

20767B: IMPLEMENTING A SQL DATA WAREHOUSE

20767B: IMPLEMENTING A SQL DATA WAREHOUSE ABOUT THIS COURSE This 5-day instructor led course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse with Microsoft SQL Server

More information

Next Generation DWH Modeling. An overview of DWH modeling methods

Next Generation DWH Modeling. An overview of DWH modeling methods Next Generation DWH Modeling An overview of DWH modeling methods Ronald Kunenborg www.grundsatzlich-it.nl Topics Where do we stand today Data storage and modeling through the ages Current data warehouse

More information

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT Dalton Cervo Author, Consultant, Data Management Expert March 2016 This presentation contains extracts from books that are: Copyright 2011 John Wiley & Sons,

More information

BD - Databases

BD - Databases Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2018 270 - FIB - Barcelona School of Informatics 747 - ESSI - Department of Service and Information System Engineering BACHELOR'S

More information