Year. <Sales> Product. Supplier

Size: px
Start display at page:

Download "Year. <Sales> Product. Supplier"

Transcription

1 Fragmentation of Multidimensional Databases Derek Munneke Kirsten Wahlstrom Mukesh Mohania Advanced Computing Research Centre School of Computer and Information Science University of South Australia, The Levels Campus Mawson Lakes 595 SA, Australia fd.munneke, K.Wahlstrom, Abstract. As diverse demands are placed on On-Line Analytical Processing (OLAP) systems, distribution of the supporting multidimensional database (MDDB), and thus fragmentation, become an issue. We consider a recent multidimensional data model, and show that two basic strategies of fragmentation are possible: slice 'n dice and sever. The operators that realise these methods are composed from the basic set given in the data model, and the strategies are shown to satisfy the fragmentation correctness rules. 1 Introduction Multidimensional databases (MDDB) exploit the multidimensional nature of On-Line Analytical Processing (OLAP) [Codd et al., 1993]. As the popularity of OLAP grows, an increasingly diverse group of users are making demands of the systems, possibly with conicting goals. In order to meet the performance requirements of the systems, implementation as distributed databases will become an important consideration. Fragmentation is an important part of distributed design [ Ozsu and Valduriez, 1991]. On-Line Analytical Processing is a decision support tool which enables the sophisticated analysis of an organisation's performance by providing access to views of data which characterise the multidimensional nature of the enterprise [Codd et al., 1993]. Contemporary relational database systems are designed for On-Line Transaction Processing (OLTP), whose operational demands make them unsuitable for OLAP applications[chaudhuri and Dayal, 1997]. A multidimensional database stores data as hypercubes which enable the OLAP operations to be implemented directly over the data structure. Consequently, several Proceedings of the Tenth Australasian Database Conference, Auckland, New Zealand, January 18{ Copyright Springer-Verlag, Singapore. Permission to copy this work for personal or classroom use is granted without fee provided that copies are not made or distributed for prot or personal advantage; and this copyright notice, the title of the publication, and its date appear. Any other use or copying of this document requires specic prior permission from Springer-Verlag.

2 multidimensional database systems have been developed commercially, for example [ARBOR, 1998; IRI, 1997], and multidimensional databases have become a topic of active research [Colliat, 1995; Li and Wang, 1996; Gyssens and Lakshmanan, 1996; Agrawal et al., 1997; Hacid and Sattler, 1997; Lehner et al., 1998]. One issue which has seen little investigation is the fragmentation of a multidimensional database. Fragmentation plays an important role in the design of a distributed database system, it enables the denition of appropriate units of distribution which enhance query performance by enabling concurrent and parallel execution of queries, the requirement for parallel processing of OLAP operations is argued in [Datta et al., 1998; Liang and Orlowska, 1998], and by minimising communication costs. An existing MDDB fragmentation strategy applies the common OLAP operation `slice 'n dice' to partition the hypercube into smaller subcubes. This strategy is currently employed by vendors of multidimensional databases that support fragmentation, for example the Essbase OLAP Server Partitioning option [ARBOR, 1998]. We propose an additional strategy called sever that fragments the original data by removing dimensions from a hypercube to produce fragments with fewer dimensions than the original data cube. In the next section we give an example data set that will help to illustrate following discussion, and then in Section 3 we present a multidimensional model recently proposed by Agrawal, Gupta and Sarawagi [Agrawal et al., 1997]. We proceed in Section 4 to present the two fragmentation strategies, slice 'n dice and sever, and their derived forms, by drawing analogies with relational fragmentation. We present the operations that realise the two fragmentation methods and both are shown to obey the correctness rules. 2 An example data set Year Product Supplier Sales 1997 a X 1997 a Y b X b Y a X a Y b X b Y 4 Table 1. A small subset of data for a retail company. Consider a retail company that buys products from suppliers to sell them in its own stores. A data warehouse is constructed to collect sales gures for each

3 product over time, including information about the supplier of the products sold. The information may be used by store managers to track store sales over time, the company marketing department to assess campaign strategies, and inventory to calculate when orders need to be made. A small subset of example data is shown in Table 1. This data may be represented equivalently by a three-dimensional cube shown in Figure 1. Year <Sales> X a 2 1 b Product Supplier Y Fig. 1. A data cube with dimensions `Year', `Product' and `Supplier', and measure `Sales' representing the example data set given in Table 1. 3 Multidimensional databases 3.1 Terminology A multidimensional database uses a hypercube representation of the data, often referred to as a data cube. Determining attributes, such as `Year', `Product' and `Supplier' in Figure 1, are referred to as dimensions whose values denote positions on the dimension. The determined attributes, such as `Sales', are called measures. The values of a measure are the cell values in the data cube, called elements. The choice of which attributes should be determined and which are determining is a database design decision [Agrawal et al., 1997]. Operations are dened which make use of the visual image of the data cube. Pivot rotates the cube to `show' a particular face and slice 'n dice selects some subset of the data by `cutting up' the cube. In addition, OLAP operations such as roll up and drill down change the granularity of the dimensions by aggregating over the dened hierarchies and expanding a hierarchy to display further detail, respectively.[chaudhuri and Dayal, 1997]

4 In addition to the above terminology, we dene the notion of a cube's degree and introduce the concept of key dimensions: Degree of a data cube C is the number of dimensions in the cube, expressed as deg(c). Key dimensions is a dimension or a set of dimensions whose positions map to a unique element, with a dened value, on every other dimension. The data cube in Figure 1 has degree 3, and the set fyear, Supplier, Productg is the key as positions on these dimensions uniquely map to elements containing Sales information. There may be multiple choices of key dimensions so, as in the Entity-Relationship (ER) model, the possible choices are called candidate keys, of which one may be chosen as the primary key. 3.2 A Data model Agrawal, Gupta and Sarawagi [Agrawal et al., 1997] recently proposed a data model to act as a framework for research in multidimensional databases. A pertinent feature of the model is the symmetric treatment of dimensions and measures that allows the operators to act on both equivalently. This permits a general specication of the fragmentation strategies without restricting their applicability. In the logical model the data is organised in one or more data cubes C j with the following components: { k dimensions, each with a name D i and a domain dom i = dom(d i ) over which the position values are dened. We denote dims(c) = hd 1 ; : : : ; D k i as a k-tuple containing the dimension names of the cube in the order specied by the element mapping function E(C). { Elements, dened as a mapping E(C) from dom 1 : : :dom k to either, 1 or an n-tuple he 1 ; : : : ; e n i. E(C)(d 1 ; : : : ; d k ) refers to an element in cell location (d 1 ; : : : ; d k ) of cube C. The d i refer to values, not positions per se, so the dimensions are not required to have ranked or discrete domains. { An n-tuple of names that describes the attributes the n-tuple elements of the cube. We denote this as attributes(c) = ha 1 ; : : : ; A n i. The cell values of the cube may be either, 1 or an n-tuple. The value for element E(C)(d 1 ; : : : ; d k ) indicates that no value exists in the database for the combination of dimension positions (d 1 ; : : : ; d k ). If the combination is present, then the element may be either a 1 to indicate a value exists, or an n-tuple that provides the additional data available for that combination. If any element of a cube is a n-tuple then none of the elements can be a 1, and vice versa. If all elements of a cube are then the cube is empty, and if the domain dom i of a dimension contains no values then no element can be dened, thus the cube is considered empty also. The operators support the symmetric treatment of dimensions and measures to provide a minimal set of operators that provides all the functionality of current

5 multidimensional products [Agrawal et al., 1997]. We refer the reader to [Agrawal et al., 1997] for a description of the operators. The operators in this model support a query model, as opposed to the `oneoperation-at-a-time' computational model presently used by many existing products. A query model allows complex queries to be composed without having to generate the often meaningless intermediate cubes, and supports optimisation of complex multidimensional queries. As there is no distinction in the data model between measures and dimensions, the `Sales' in Figure 1 may be transformed into just another dimension, as illustrated in Figure 2. Year Sales a b Product X Y Supplier Fig. 2. The data cube from Figure 1 may be represented as a 4-dimensional data cube in which `Sales' is also on a dimension. The positions on the `Sales' dimension consist of all possible sales gures. Representative lines of the binary data is shown corresponding to Supplier `X' and Product `a', but other values cannot be shown in this representation of 4-dimensional space. Other models have also been proposed by Li and Wang [Li and Wang, 1996], and Gyssens and Lakshmanan [Gyssens and Lakshmanan, 1996] but neither of these treated dimensions and measures symmetrically. Note that this is a logical model and does not force or imply any storage mechanism. 4 Multidimensional fragmentation Fragmentation of a multidimensional database divides a global data cube, into fragment cubes containing a subset of the data. The slice 'n dice operation is a method of fragmentation for multidimensional databases. Slice 'n dice is analogous to horizontal fragmentation in the relational model as it partitions the data into subsets without altering the structure of the

6 database. The dimensions remain unchanged by slice 'n dice, just as attributes are not are not altered by horizontal fragmentation of a relational database. We dene an additional fragmentation strategy called sever. Sever is analogous to relational vertical fragmentation in that it reduces the degree of a data cube just as vertical fragmentation reduces the degree of a relation. The methods for fragmentation and reconstruction for these two strategies are given using the data model and operators described in the previous section. Derived and hybrid forms follow naturally from the basic schemes. There are three rules which ensure the semantic validity of a fragmentation strategy which are referred to as the correctness rules. The three rules verify completeness to ensure no data is lost during fragmentation, check the original data representation can be reconstructed, and test for disjointness so replication is controlled explicitly by allocation [ Ozsu and Valduriez, 1991; Ceri and Pelagatti, 1983]. We show that slice 'n dice and sever obey these rules. 4.1 Slice 'n dice fragmentation Fig. 3. Slice 'n dice fragmentation is realised by the restrict operator. The restrict operator realises the multidimensional slice 'n dice operation, acting to partition the data into smaller subcubes. Figure 3 shows an example of slice 'n dice fragmentation where the data cube has been partitioned into eight subcubes; in general l dimensions, each divided m j (1 j l) ways will produce m 1 m 2 : : : m l subcubes. The general expressions to fragment a data cube C into m subcubes C j that satisfy a simple predicate P j i may be specied as: C 1 = restrict(c; f[d 1 ; P 1 1 ]; : : : ; [D l ; P 1 l ]g). C m = restrict(c; f[d 1 ; P m 1 ]; : : : ; [D l ; Pl m ]g) (1)

7 allowing restrictions on multiple dimensions 1 D i (1 i l) and multiple restrictions across a dimension. A simpler scheme allows restrictions on only a single dimension D i at a time, corresponding to just slice, so that m restrictions produces m partitions: C 1 = restrict(c; f[d i ; P 1 ]g). C m = restrict(c; f[d i ; P m ]g): (2) The more general case can then be built up by iteratively applying this operation across all the dimensions to be fragmented on. To guarantee completeness, the predicates P j that dene the restrictions must be cover the entire domain of the dimension: P 1 (x) _ : : : _ P m (x) = true 8x 2 dom i : (3) To ensure disjointness the predicates must be mutually exclusive: P 1 (x) ^ : : : ^ P m (x) = false 8x 2 dom i : (4) The data cube may be reconstructed by applying the join operator along all dimensions, using the identity function id as the mapping functions and the function f unique as the element merging function: C = join(c i ; [D 1 ; id]; : : : ; [D k ; id]g; f unique ) for 1 i m where id(d) = d; 8 < e 1 if e 2 = or e 1 = e 2, f unique (e 1 ; e 2 ) = e : 2 if e 1 =, error otherwise. (5) The function f unique checks if each element is uniquely determined by its position in the cube, that is it determines if data inconsistencies exist. For disjoint fragments f unique will always return a value, however if the fragments are not disjoint f unique eectively tests for violation of referential integrity. By using the identity mapping function the join operator is analogous to the relational union operator. To join in this way the subcubes must be `union compatible', that is they must have the same degree and each corresponding domain dom i be type compatible. Fragments produced by the slice 'n dice are intrinsicly union compatible, and thus this constraint is always satised. Fragmentation on one dimension followed by fragmentation on another dimension, corresponding to a slice then dice, is analogous to derived horizontal fragmentation of a relational database. 1 The action of restrict on multiple dimensions is an extension we have added that diers from the original specication by [Agrawal et al., 1997], but is similar to how the merge operator is expressed.

8 Example Fragmentation of the example data from Section 2 by slicing on the `Product' dimension is performed by the following operations: C 1 C 2 = restrict(c; f[product; position = `a']g) = restrict(c; f[product; position = `b']g): This produces two subcubes C 1 and C 2 dened by the predicates position = a and position = b respectively. The fragmentation is complete as the two predicates cover the entire domain of `Product': (position = `a') _ (position = `b') = true; 8position 2 dom Product ; and the fragmentation is disjoint as the predicates are exclusive: (position = `a') ^ (position = `b') = false; 8position 2 dom Product : To reconstruct the cube we join C 1 and C 2 with the functions specied in equation (5): C = join( C 1 ; f[year; id]; [Supplier; id]; [Product; id]g; C 2 ; f[year; id]; [Supplier; id]; [Product; id]g; f unique ): Fragmentation by slice 'n dice can be useful for organisations that are geographically distributed, in which case the data cube can be divided into the subcubes of data most appropriate for a location. For example the two dierent products, `a' and `b', may only be sold in locations `A' and `B' respectively, thus location B does not require the gures for product a, and vice versa. 4.2 Sever fragmentation Fig. 4. Sever fragmentation reduces the dimensionality of the data cube. Sever produces fragments with less dimensions than the original cube by merging a dimension and then destroying it. This is analogous to relational projection

9 in that the new data set has less determining attributes, that is the degree is reduced. In order to perform fragmentation by sever more than one non-key dimension must exist. This is necessary in order to produce severed fragment from which the original data cube can be reconstructed. To sever the cube C, rst a set of l non-key dimensions fd 1 1 ; : : : ; D1 l g are merged the by merge operator, using the function f exists that maps each domain to a single position, labelled `9', and the function f unique, as dened by equation (5), to combine the elements: C 1 = merge(c; f[d 1 1 ; f exists]; : : : ; [D 1 l ; f exists]g; f unique ) where f exists (d i ) = `9' 8d i 2 dom i : (6) Once the dimensions to be severed are merged successfully to a single position `9', which indicates if a value existed corresponding to that position, the dimension can be destroyed: C 1 = destroy(c 1 ; D 1 1);. (7) C 1 = destroy(c 1 ; Dl 1): This results in a fragment C 1 of degree k? l, where deg(c) = k. If the primary key, that is the key maintained across all fragments, consists of a set of j dimensions fd key 1 ; : : : ; D key j g, then C 1 contains a set of k? l? j non-key dimensions D 1 : D 1 = fd 1 ; : : : ; D k g? (fd 1 ; : : : ; 1 D1 l g) [ fdkey 1 ; : : : ; D key j g where fd 1 ; : : : ; 1 D1 l g \ fdkey ; : : : ; D key 1 j g = ;: Repeat the above steps for m disjoint sets D i ; 1 i m; of non-key dimensions to produce m fragments C 1 ; : : : ; C m. The fragmentation is complete if every dimension is contained in at least one fragment. That is guaranteed if D 1 [ : : : [ D m [ fd key 1 ; : : : ; D key j g = fd 1 ; : : : ; D k g: (8) Disjointness is dened only on the non-key dimensions, as the key dimensions are necessarily replicated to satisfy reconstruction. Disjointness is ensured if the fragment's set of non-key dimensions D i are chosen to be disjoint: D 1 \ : : : \ D m = ;: (9) Reconstruction of the cube is achieved by joining the fragments on the key dimensions using the identity mapping function and the element merging function f unique to check referential integrity. C = join(c i ; [D key 1 ; id]; : : : ; [D key j ; id]g; f unique ) for 1 i m: (1)

10 C 1 C C 2 Sever C 1.1 Dice C Slice C C 2.2 Fig. 5. Hybrid fragmentation produces a tree structure. If the fragmentation is complete, then the data cube can be properly reconstructed. Slice 'n dice and sever may be used in combination to produce hybrid fragmentation which forms a tree structure, as illustrated in Figure 5. Completeness and disjointness follow naturally if the constituent fragmentation schemes are complete and disjoint. Reconstruction of the original or intermediate data cubes is achieved by reconstructing fragments from the leaves of the tree and then progressively reconstructing up the tree. Example The data cube C from Figure 1 cannot be fragmented by sever as only one non-key dimension exists, namely `Sales'. If we introduce an enterprise constraint that \each product has a unique supplier", then `Supplier' becomes a determined attribute, and `Year' and `Product' are the key dimensions. A new example data cube C illustrates this in Figure 6. Before performing fragmentation of C by sever, the `Sales' measure in Figure 6 is converted to a dimension, as in Figure 2, by using the pull operator. Then the 4 dimensional cube is severed into two 3 dimensional cubes by the following operations, where the functions are dened in equations (5) & (6). C 1 = merge(c ; f[sales; f exists ]g; f unique ) C 1 = destroy(c 1 ; Sales) produces a subcube C 1 with dimensions Year, Product, and Supplier. C 2 = merge(c ; f[supplier; f exists ]g; f unique ) C 2 = destroy(c 2 ; Supplier) produces a subcube C 2 with dimensions Year, Product, and Sales.

11 Year <Sales> a 1 b Product X Y Supplier Fig. 6. The data cube from Figure 1 with the enterprise constraint \each product has a unique supplier". Note that the data cube now contains zero elements() that indicate no corresponding value, rather than a zero value. The fragmentation is complete as every dimension (Year, Product, Supplier and Sales) is in at least one of the cubes, and disjoint as the non-key dimensions of the cubes (Supplier in C 1 and Sales in C 2 ) are disjoint. To reconstruct the data cube the subcubes are joined on key dimensions: C = join( C 1 ; f[year; id]; [Supplier; id]g; C 2 ; f[year; id]; [Supplier; id]g; f unique ): The sever fragmentation strategy is useful when departments within an organisation require dierent information. For example the marketing department is concerned with the sales gures and does not need to know information about suppliers, whilst the ordering department need to know the suppliers details but do not require sales gures. 5 Conclusion and Further work Two basic fragmentation strategies are possible in multidimensional databases: slice 'n dice and sever. Slice 'n dice partitions the data cube into subcubes by dividing the domains on the dimensions, in a manner analogous to relational horizontal fragmentation. Sever fragments the data cube by reducing its degree, which is comparable to vertical fragmentation of a relational database. Fragmentation by slice 'n dice is well known, though perhaps not formally stated, but fragmentation by sever is a new strategy. The relevance of sever will emerge as demands on multidimensional OLAP systems diverge. We have only considered fragmentation strategies and not discussed algorithms that determine how the data should be partitioned. Algorithms similar

12 to those developed for relational systems [ Ozsu and Valduriez, 1991] can be applied to a multidimensional system, however we believe additional algorithms that exploit the multidimensional characteristics of the system may be possible. References Agrawal, Rakesh, Gupta, Ashish, and Sarawagi, Suntina (1997). Modeling multidimensional databases. In Proc. 13th Int'l Conference on Data Engineering, pages 232{243, Los Alamitos, CA. IEEE Comput. Soc. Press. ARBOR (1998). Arbor Essbase OLAP Server. Arbor Software Corporation, Sunnyvale CA. available via (25 July 1998). Ceri, Stefano and Pelagatti, Giuseppe (1983). Distributed Databases: Principles and Systems. McGraw-Hill Book Co., Singapore. Chaudhuri, Surajit and Dayal, Umeshwar (1997). An overview of data warehousing and olap technology. SIGMOD Record, 26(1):65{74. Codd, E. F., Codd, S. B., and Salley, C. T. (1993). Providing olap (on-line analytical processing) to user-analysts: An it mandate. Technical report, E.F.Codd & Associates. Colliat, George (1995). Olap, relational and multidimensional database systems. Technical report, Arbor Software Corporation, Sunnyvale, CA. Datta, Anindya, Moon, Bongki, and Thomas, Helen (1998). A case for parallelism in data warehousing and olap. In Proc. of IEEE First Int'l Workshop on Data Wharehouse design and OLAP Technology. Gyssens, Marc and Lakshmanan, Laks V.S. (1996). A foundation for multidimensional databases. In Proc. of 22nd VLDB Conference. Hacid, Mohand-Said and Sattler, Ulrike (1997). An object-centered multidimensional data model with hierarchically structured dimensions. In Proc. IEEE Knowledge and Data Engineering Exchange Workshop, pages 65{72. IRI (1997). QScan. IRI Software Information Resources, Inc., Waltham MA. available via (25 July 1998). Lehner, W., Albrect, J., and Wedekind, H. (1998). Normal forms for multidimensional databases. In 1th Int'l Conf. on Scientic and Statistical Data Dase Management. Li, Chang and Wang, X. Sean (1996). A data model for supporting on-line analytical processing. In Proc. 5th Int'l Conf. on Information and Knowledge Management, pages 81{88. Liang, Weifa and Orlowska, Maria (1998). Computing multidimensional aggregates in parallel. In International Conf. on Parallel and Distributed Systems. IEEE Comput. Soc. Press. Ozsu, M. Tamer and Valduriez, Patrick (1991). Principles of Distributed Database Systems. Prentice Hall, Englewood Clis, NJ.

Computing Appropriate Representations for Multidimensional Data

Computing Appropriate Representations for Multidimensional Data Computing Appropriate Representations for Multidimensional Data Yeow Wei Choong LI - Université F Rabelais HELP Institute - Malaysia choong yw@helpedumy Dominique Laurent LI - Université F Rabelais Tours

More information

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) INTRODUCTION A dimension is an attribute within a multidimensional model consisting of a list of values (called members). A fact is defined by a combination

More information

Rocky Mountain Technology Ventures

Rocky Mountain Technology Ventures Rocky Mountain Technology Ventures Comparing and Contrasting Online Analytical Processing (OLAP) and Online Transactional Processing (OLTP) Architectures 3/19/2006 Introduction One of the most important

More information

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY CHARACTERISTICS Data warehouse is a central repository for summarized and integrated data

More information

An Overview of Data Warehousing and OLAP Technology

An Overview of Data Warehousing and OLAP Technology An Overview of Data Warehousing and OLAP Technology CMPT 843 Karanjit Singh Tiwana 1 Intro and Architecture 2 What is Data Warehouse? Subject-oriented, integrated, time varying, non-volatile collection

More information

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India

More information

Novel Materialized View Selection in a Multidimensional Database

Novel Materialized View Selection in a Multidimensional Database Graphic Era University From the SelectedWorks of vijay singh Winter February 10, 2009 Novel Materialized View Selection in a Multidimensional Database vijay singh Available at: https://works.bepress.com/vijaysingh/5/

More information

Constructing Object Oriented Class for extracting and using data from data cube

Constructing Object Oriented Class for extracting and using data from data cube Constructing Object Oriented Class for extracting and using data from data cube Antoaneta Ivanova Abstract: The goal of this article is to depict Object Oriented Conceptual Model Data Cube using it as

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS PART A 1. What are production reporting tools? Give examples. (May/June 2013) Production reporting tools will let companies generate regular operational reports or support high-volume batch jobs. Such

More information

International Journal of Modern Trends in Engineering and Research e-issn: p-issn:

International Journal of Modern Trends in Engineering and Research  e-issn: p-issn: International Journal of Modern Trends in Engineering and Research www.ijmter.com Fragmentation as a Part of Security in Distributed Database: A Survey Vaidik Ochurinda 1 1 External Student, MCA, IGNOU.

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

Basics of Dimensional Modeling

Basics of Dimensional Modeling Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimension

More information

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini Advance in Electronic and Electric Engineering. ISSN 2231-1297, Volume 3, Number 6 (2013), pp. 669-674 Research India Publications http://www.ripublication.com/aeee.htm Data Warehousing Ritham Vashisht,

More information

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:-

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:- UNIT III: Data Warehouse and OLAP Technology: An Overview : What Is a Data Warehouse? A Multidimensional Data Model, Data Warehouse Architecture, Data Warehouse Implementation, From Data Warehousing to

More information

Data Warehousing and Decision Support

Data Warehousing and Decision Support Data Warehousing and Decision Support Chapter 23, Part A Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1 Introduction Increasingly, organizations are analyzing current and historical

More information

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A Data Warehousing and Decision Support [R&G] Chapter 23, Part A CS 432 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support business

More information

ETL and OLAP Systems

ETL and OLAP Systems ETL and OLAP Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, first semester

More information

Indexing OLAP data. Sunita Sarawagi. IBM Almaden Research Center. Abstract

Indexing OLAP data. Sunita Sarawagi. IBM Almaden Research Center. Abstract Indexing OLAP data Sunita Sarawagi IBM Almaden Research Center sunita@almaden.ibm.com Abstract In this paper we discuss indexing methods for On-Line Analytical Processing (OLAP) databases. We start with

More information

DATA MINING AND WAREHOUSING

DATA MINING AND WAREHOUSING DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making

More information

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: How business intelligence is a comprehensive framework to support business decision making How operational

More information

Data Warehousing and Decision Support

Data Warehousing and Decision Support Data Warehousing and Decision Support [R&G] Chapter 23, Part A CS 4320 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support business

More information

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015 Q.1 a. Briefly explain data granularity with the help of example Data Granularity: The single most important aspect and issue of the design of the data warehouse is the issue of granularity. It refers

More information

DATABASE DEVELOPMENT (H4)

DATABASE DEVELOPMENT (H4) IMIS HIGHER DIPLOMA QUALIFICATIONS DATABASE DEVELOPMENT (H4) December 2017 10:00hrs 13:00hrs DURATION: 3 HOURS Candidates should answer ALL the questions in Part A and THREE of the five questions in Part

More information

Lectures for the course: Data Warehousing and Data Mining (IT 60107)

Lectures for the course: Data Warehousing and Data Mining (IT 60107) Lectures for the course: Data Warehousing and Data Mining (IT 60107) Week 1 Lecture 1 21/07/2011 Introduction to the course Pre-requisite Expectations Evaluation Guideline Term Paper and Term Project Guideline

More information

Horizontal Aggregations for Mining Relational Databases

Horizontal Aggregations for Mining Relational Databases Horizontal Aggregations for Mining Relational Databases Dontu.Jagannadh, T.Gayathri, M.V.S.S Nagendranadh. Department of CSE Sasi Institute of Technology And Engineering,Tadepalligudem, Andhrapradesh,

More information

A taxonomy of race. D. P. Helmbold, C. E. McDowell. September 28, University of California, Santa Cruz. Santa Cruz, CA

A taxonomy of race. D. P. Helmbold, C. E. McDowell. September 28, University of California, Santa Cruz. Santa Cruz, CA A taxonomy of race conditions. D. P. Helmbold, C. E. McDowell UCSC-CRL-94-34 September 28, 1994 Board of Studies in Computer and Information Sciences University of California, Santa Cruz Santa Cruz, CA

More information

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS Assist. Prof. Dr. Volkan TUNALI Topics 2 Business Intelligence (BI) Decision Support System (DSS) Data Warehouse Online Analytical Processing (OLAP)

More information

Data Warehousing and OLAP

Data Warehousing and OLAP Data Warehousing and OLAP INFO 330 Slides courtesy of Mirek Riedewald Motivation Large retailer Several databases: inventory, personnel, sales etc. High volume of updates Management requirements Efficient

More information

Dynamic Optimization of Generalized SQL Queries with Horizontal Aggregations Using K-Means Clustering

Dynamic Optimization of Generalized SQL Queries with Horizontal Aggregations Using K-Means Clustering Dynamic Optimization of Generalized SQL Queries with Horizontal Aggregations Using K-Means Clustering Abstract Mrs. C. Poongodi 1, Ms. R. Kalaivani 2 1 PG Student, 2 Assistant Professor, Department of

More information

Chapter 18: Data Analysis and Mining

Chapter 18: Data Analysis and Mining Chapter 18: Data Analysis and Mining Database System Concepts See www.db-book.com for conditions on re-use Chapter 18: Data Analysis and Mining Decision Support Systems Data Analysis and OLAP 18.2 Decision

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 04-06 Data Warehouse Architecture Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

2 Data Reduction Techniques The granularity of reducible information is one of the main criteria for classifying the reduction techniques. While the t

2 Data Reduction Techniques The granularity of reducible information is one of the main criteria for classifying the reduction techniques. While the t Data Reduction - an Adaptation Technique for Mobile Environments A. Heuer, A. Lubinski Computer Science Dept., University of Rostock, Germany Keywords. Reduction. Mobile Database Systems, Data Abstract.

More information

Analyzing a Greedy Approximation of an MDL Summarization

Analyzing a Greedy Approximation of an MDL Summarization Analyzing a Greedy Approximation of an MDL Summarization Peter Fontana fontanap@seas.upenn.edu Faculty Advisor: Dr. Sudipto Guha April 10, 2007 Abstract Many OLAP (On-line Analytical Processing) applications

More information

Evolution of Database Systems

Evolution of Database Systems Evolution of Database Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Intelligent Decision Support Systems Master studies, second

More information

REPORTING AND QUERY TOOLS AND APPLICATIONS

REPORTING AND QUERY TOOLS AND APPLICATIONS Tool Categories: REPORTING AND QUERY TOOLS AND APPLICATIONS There are five categories of decision support tools Reporting Managed query Executive information system OLAP Data Mining Reporting Tools Production

More information

On-Line Application Processing

On-Line Application Processing On-Line Application Processing WAREHOUSING DATA CUBES DATA MINING 1 Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more time-consuming,

More information

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures) CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm

More information

Building Fuzzy Blocks from Data Cubes

Building Fuzzy Blocks from Data Cubes Building Fuzzy Blocks from Data Cubes Yeow Wei Choong HELP University College Kuala Lumpur MALAYSIA choongyw@help.edu.my Anne Laurent Dominique Laurent LIRMM ETIS Université Montpellier II Université de

More information

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores Announcements Shumo office hours change See website for details HW2 due next Thurs

More information

Data Warehouse Design Using Row and Column Data Distribution

Data Warehouse Design Using Row and Column Data Distribution Int'l Conf. Information and Knowledge Engineering IKE'15 55 Data Warehouse Design Using Row and Column Data Distribution Behrooz Seyed-Abbassi and Vivekanand Madesi School of Computing, University of North

More information

Inference in Hierarchical Multidimensional Space

Inference in Hierarchical Multidimensional Space Proc. International Conference on Data Technologies and Applications (DATA 2012), Rome, Italy, 25-27 July 2012, 70-76 Related papers: http://conceptoriented.org/ Inference in Hierarchical Multidimensional

More information

MAC: Conceptual Data Modeling for OLAP

MAC: Conceptual Data Modeling for OLAP MAC: Conceptual Data Modeling for OLAP Aris Tsois Knowledge and Database Systems Laboratory, NTUA atsois@dblab.ece.ntua.gr Nikos Karayannidis Knowledge and Database Systems Laboratory, NTUA nikos@dblab.ece.ntua.gr

More information

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

An Overview of various methodologies used in Data set Preparation for Data mining Analysis An Overview of various methodologies used in Data set Preparation for Data mining Analysis Arun P Kuttappan 1, P Saranya 2 1 M. E Student, Dept. of Computer Science and Engineering, Gnanamani College of

More information

Heuristic Horizontal XML Fragmentation

Heuristic Horizontal XML Fragmentation Heuristic Horizontal XML Fragmentation Hui Ma, Klaus-Dieter Schewe Massey University, Information Science Research Centre Private Bag 11 222, Palmerston North, New Zealand [h.ma k.d.schewe]@massey.ac.nz

More information

Big Trend in Business Intelligence: Data Mining over Big Data Web Transaction Data. Fall 2012

Big Trend in Business Intelligence: Data Mining over Big Data Web Transaction Data. Fall 2012 Big Trend in Business Intelligence: Data Mining over Big Data Web Transaction Data Fall 2012 Data Warehousing and OLAP Introduction Decision Support Technology On Line Analytical Processing Star Schema

More information

02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests

02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests Course Code Course Name Teaching Scheme Credits Assigned Theory Practical Tutorial Theory Practical/Oral Tutorial Total TEITC504 Database Management Systems 04 Hr/week 02 Hr/week --- 04 01 --- 05 Examination

More information

CHAPTER 3 Implementation of Data warehouse in Data Mining

CHAPTER 3 Implementation of Data warehouse in Data Mining CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected

More information

Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20

Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20 Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20 Slides based on Database Management Systems 3 rd ed, Ramakrishnan and Gehrke, Chapter 25 Introduction Increasingly,

More information

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT MANAGING THE DIGITAL FIRM, 12 TH EDITION Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT VIDEO CASES Case 1: Maruti Suzuki Business Intelligence and Enterprise Databases

More information

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997 1 of 8 5/24/02 4:43 PM A Systems Approach to Dimensional Modeling in Data Marts By Joseph M. Firestone, Ph.D. White Paper No. One March 12, 1997 OLAP s Purposes And Dimensional Data Modeling Dimensional

More information

. The problem: ynamic ata Warehouse esign Ws are dynamic entities that evolve continuously over time. As time passes, new queries need to be answered

. The problem: ynamic ata Warehouse esign Ws are dynamic entities that evolve continuously over time. As time passes, new queries need to be answered ynamic ata Warehouse esign? imitri Theodoratos Timos Sellis epartment of Electrical and Computer Engineering Computer Science ivision National Technical University of Athens Zographou 57 73, Athens, Greece

More information

Adnan YAZICI Computer Engineering Department

Adnan YAZICI Computer Engineering Department Data Warehouse Adnan YAZICI Computer Engineering Department Middle East Technical University, A.Yazici, 2010 Definition A data warehouse is a subject-oriented integrated time-variant nonvolatile collection

More information

Data Warehousing and OLAP Technologies for Decision-Making Process

Data Warehousing and OLAP Technologies for Decision-Making Process Data Warehousing and OLAP Technologies for Decision-Making Process Hiren H Darji Asst. Prof in Anand Institute of Information Science,Anand Abstract Data warehousing and on-line analytical processing (OLAP)

More information

Algebraic Properties of CSP Model Operators? Y.C. Law and J.H.M. Lee. The Chinese University of Hong Kong.

Algebraic Properties of CSP Model Operators? Y.C. Law and J.H.M. Lee. The Chinese University of Hong Kong. Algebraic Properties of CSP Model Operators? Y.C. Law and J.H.M. Lee Department of Computer Science and Engineering The Chinese University of Hong Kong Shatin, N.T., Hong Kong SAR, China fyclaw,jleeg@cse.cuhk.edu.hk

More information

CT75 (ALCCS) DATA WAREHOUSING AND DATA MINING JUN

CT75 (ALCCS) DATA WAREHOUSING AND DATA MINING JUN Q.1 a. Define a Data warehouse. Compare OLTP and OLAP systems. Data Warehouse: A data warehouse is a subject-oriented, integrated, time-variant, and 2 Non volatile collection of data in support of management

More information

Sql Fact Constellation Schema In Data Warehouse With Example

Sql Fact Constellation Schema In Data Warehouse With Example Sql Fact Constellation Schema In Data Warehouse With Example Data Warehouse OLAP - Learn Data Warehouse in simple and easy steps using Multidimensional OLAP (MOLAP), Hybrid OLAP (HOLAP), Specialized SQL

More information

On the Integration of Autonomous Data Marts

On the Integration of Autonomous Data Marts On the Integration of Autonomous Data Marts Luca Cabibbo and Riccardo Torlone Dipartimento di Informatica e Automazione Università di Roma Tre {cabibbo,torlone}@dia.uniroma3.it Abstract We address the

More information

OLAP Theory-English version On-Line Analytical processing (Buisness Intzlligence)

OLAP Theory-English version On-Line Analytical processing (Buisness Intzlligence) OLAP Theory-English version On-Line Analytical processing (Buisness Intzlligence) [Ing.Skorkovský,CSc] KPH_ESF_MU Agenda The Market Why OLAP Introduction to OLAP OLAP Terms and Concepts Summary OLAP market

More information

MIS2502: Data Analytics Dimensional Data Modeling. Jing Gong

MIS2502: Data Analytics Dimensional Data Modeling. Jing Gong MIS2502: Data Analytics Dimensional Data Modeling Jing Gong gong@temple.edu http://community.mis.temple.edu/gong Where we are Now we re here Data entry Transactional Database Data extraction Analytical

More information

Questions about the contents of the final section of the course of Advanced Databases. Version 0.3 of 28/05/2018.

Questions about the contents of the final section of the course of Advanced Databases. Version 0.3 of 28/05/2018. Questions about the contents of the final section of the course of Advanced Databases. Version 0.3 of 28/05/2018. 12 Decision support systems How would you define a Decision Support System? What do OLTP

More information

CS6302- DATABASE MANAGEMENT SYSTEMS- QUESTION BANK- II YEAR CSE- III SEM UNIT I

CS6302- DATABASE MANAGEMENT SYSTEMS- QUESTION BANK- II YEAR CSE- III SEM UNIT I CS6302- DATABASE MANAGEMENT SYSTEMS- QUESTION BANK- II YEAR CSE- III SEM UNIT I 1.List the purpose of Database System (or) List the drawback of normal File Processing System. 2. Define Data Abstraction

More information

International Journal of Computer Sciences and Engineering. Research Paper Volume-6, Issue-1 E-ISSN:

International Journal of Computer Sciences and Engineering. Research Paper Volume-6, Issue-1 E-ISSN: International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-6, Issue-1 E-ISSN: 2347-2693 Precomputing Shell Fragments for OLAP using Inverted Index Data Structure D. Datta

More information

Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g

Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g Vlamis Software Solutions, Inc. Founded in 1992 in Kansas City, Missouri Oracle Partner and reseller since 1995 Specializes

More information

Decision Support, Data Warehousing, and OLAP

Decision Support, Data Warehousing, and OLAP Decision Support, Data Warehousing, and OLAP : Contents Terminology : OLAP vs. OLTP Data Warehousing Architecture Technologies References 1 Decision Support and OLAP Information technology to help knowledge

More information

OLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube

OLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube OLAP2 outline Multi Dimensional Data Model Need for Multi Dimensional Analysis OLAP Operators Data Cube Demonstration Using SQL Multi Dimensional Data Model Multi dimensional analysis is a popular approach

More information

The GOLD Model CASE Tool: an environment for designing OLAP applications

The GOLD Model CASE Tool: an environment for designing OLAP applications The GOLD Model CASE Tool: an environment for designing OLAP applications Juan Trujillo, Sergio Luján-Mora, Enrique Medina Departamento de Lenguajes y Sistemas Informáticos. Universidad de Alicante. Campus

More information

Introduction to MDDBs

Introduction to MDDBs 3 CHAPTER 2 Introduction to MDDBs What Is OLAP? 3 What Is SAS/MDDB Server Software? 4 What Is an MDDB? 4 Understanding the MDDB Structure 5 How Can I Use MDDBs? 7 Why Should I Use MDDBs? 8 What Is OLAP?

More information

collection of data that is used primarily in organizational decision making.

collection of data that is used primarily in organizational decision making. Data Warehousing A data warehouse is a special purpose database. Classic databases are generally used to model some enterprise. Most often they are used to support transactions, a process that is referred

More information

Data Mining Concepts & Techniques

Data Mining Concepts & Techniques Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

A Boolean Expression. Reachability Analysis or Bisimulation. Equation Solver. Boolean. equations.

A Boolean Expression. Reachability Analysis or Bisimulation. Equation Solver. Boolean. equations. A Framework for Embedded Real-time System Design? Jin-Young Choi 1, Hee-Hwan Kwak 2, and Insup Lee 2 1 Department of Computer Science and Engineering, Korea Univerity choi@formal.korea.ac.kr 2 Department

More information

Parallel Processing of Multi-join Expansion_aggregate Data Cube Query in High Performance Database Systems

Parallel Processing of Multi-join Expansion_aggregate Data Cube Query in High Performance Database Systems Parallel Processing of Multi-join Expansion_aggregate Data Cube Query in High Performance Database Systems David Taniar School of Business Systems Monash University, Clayton Campus Victoria 3800, AUSTRALIA

More information

DATA WAREHOUING UNIT I

DATA WAREHOUING UNIT I BHARATHIDASAN ENGINEERING COLLEGE NATTRAMAPALLI DEPARTMENT OF COMPUTER SCIENCE SUB CODE & NAME: IT6702/DWDM DEPT: IT Staff Name : N.RAMESH DATA WAREHOUING UNIT I 1. Define data warehouse? NOV/DEC 2009

More information

Most database operations involve On- Line Transaction Processing (OTLP).

Most database operations involve On- Line Transaction Processing (OTLP). Data Warehouse 1 Data Warehouse Most common form of data integration. Copy data from one or more sources into a single DB (warehouse) Update: periodic reconstruction of the warehouse, perhaps overnight.

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

The i com Tool for Intelligent Conceptual Modelling

The i com Tool for Intelligent Conceptual Modelling The i com Tool for Intelligent Conceptual Modelling Enrico Franconi and Gary Ng Department of Computer Science, University of Manchester, UK {franconi ngg}@cs.man.ac.uk http://www.cs.man.ac.uk/ franconi/

More information

DATA WAREHOUSE- MODEL QUESTIONS

DATA WAREHOUSE- MODEL QUESTIONS DATA WAREHOUSE- MODEL QUESTIONS 1. The generic two-level data warehouse architecture includes which of the following? a. At least one data mart b. Data that can extracted from numerous internal and external

More information

by Prentice Hall

by Prentice Hall Chapter 6 Foundations of Business Intelligence: Databases and Information Management 6.1 2010 by Prentice Hall Organizing Data in a Traditional File Environment File organization concepts Computer system

More information

Distributed Database Management System UNIT-2. Concurrency Control. Transaction ACID rules. MCA 325, Distributed DBMS And Object Oriented Databases

Distributed Database Management System UNIT-2. Concurrency Control. Transaction ACID rules. MCA 325, Distributed DBMS And Object Oriented Databases Distributed Database Management System UNIT-2 Bharati Vidyapeeth s Institute of Computer Applications and Management, New Delhi-63,By Shivendra Goel. U2.1 Concurrency Control Concurrency control is a method

More information

Fast Discovery of Sequential Patterns Using Materialized Data Mining Views

Fast Discovery of Sequential Patterns Using Materialized Data Mining Views Fast Discovery of Sequential Patterns Using Materialized Data Mining Views Tadeusz Morzy, Marek Wojciechowski, Maciej Zakrzewicz Poznan University of Technology Institute of Computing Science ul. Piotrowo

More information

The strategic advantage of OLAP and multidimensional analysis

The strategic advantage of OLAP and multidimensional analysis IBM Software Business Analytics Cognos Enterprise The strategic advantage of OLAP and multidimensional analysis 2 The strategic advantage of OLAP and multidimensional analysis Overview Online analytical

More information

~ Ian Hunneybell: DWDM Revision Notes (31/05/2006) ~

~ Ian Hunneybell: DWDM Revision Notes (31/05/2006) ~ Useful reference: Microsoft Developer Network Library (http://msdn.microsoft.com/library). Drill down to Servers and Enterprise Development SQL Server SQL Server 2000 SDK Documentation Creating and Using

More information

C=(FS) 2 : Cubing by Composition of Faceted Search

C=(FS) 2 : Cubing by Composition of Faceted Search C=(FS) : Cubing by Composition of Faceted Search Ronny Lempel Dafna Sheinwald IBM Haifa Research Lab Introduction to Multifaceted Search and to On-Line Analytical Processing (OLAP) Intro Multifaceted Search

More information

BUSINESS INTELLIGENCE. SSAS - SQL Server Analysis Services. Business Informatics Degree

BUSINESS INTELLIGENCE. SSAS - SQL Server Analysis Services. Business Informatics Degree BUSINESS INTELLIGENCE SSAS - SQL Server Analysis Services Business Informatics Degree 2 BI Architecture SSAS: SQL Server Analysis Services 3 It is both an OLAP Server and a Data Mining Server Distinct

More information

Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator

Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator Horizontal Aggregations in SQL to Prepare Data Sets Using PIVOT Operator R.Saravanan 1, J.Sivapriya 2, M.Shahidha 3 1 Assisstant Professor, Department of IT,SMVEC, Puducherry, India 2,3 UG student, Department

More information

Decision Support. Chapter 25. CS 286, UC Berkeley, Spring 2007, R. Ramakrishnan 1

Decision Support. Chapter 25. CS 286, UC Berkeley, Spring 2007, R. Ramakrishnan 1 Decision Support Chapter 25 CS 286, UC Berkeley, Spring 2007, R. Ramakrishnan 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support

More information

Using the Holey Brick Tree for Spatial Data. in General Purpose DBMSs. Northeastern University

Using the Holey Brick Tree for Spatial Data. in General Purpose DBMSs. Northeastern University Using the Holey Brick Tree for Spatial Data in General Purpose DBMSs Georgios Evangelidis Betty Salzberg College of Computer Science Northeastern University Boston, MA 02115-5096 1 Introduction There is

More information

Improving the Performance of OLAP Queries Using Families of Statistics Trees

Improving the Performance of OLAP Queries Using Families of Statistics Trees To appear in Proceedings of the 3rd International Conference on Data Warehousing and Knowledge Discovery (DaWaK 2001), Technical University of München, München, Germany, September 2001. Improving the Performance

More information

Stored Relvars 18 th April 2013 (30 th March 2001) David Livingstone. Stored Relvars

Stored Relvars 18 th April 2013 (30 th March 2001) David Livingstone. Stored Relvars Stored Relvars Introduction The purpose of a Stored Relvar (= Stored Relational Variable) is to provide a mechanism by which the value of a real (or base) relvar may be partitioned into fragments and/or

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

Generating Data Sets for Data Mining Analysis using Horizontal Aggregations in SQL

Generating Data Sets for Data Mining Analysis using Horizontal Aggregations in SQL Generating Data Sets for Data Mining Analysis using Horizontal Aggregations in SQL Sanjay Gandhi G 1, Dr.Balaji S 2 Associate Professor, Dept. of CSE, VISIT Engg College, Tadepalligudem, Scholar Bangalore

More information

Welcome to the topic of SAP HANA modeling views.

Welcome to the topic of SAP HANA modeling views. Welcome to the topic of SAP HANA modeling views. 1 At the end of this topic, you will be able to describe the three types of SAP HANA modeling views and use the SAP HANA Studio to work with views in the

More information

SAS Web Report Studio 3.1

SAS Web Report Studio 3.1 SAS Web Report Studio 3.1 User s Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Web Report Studio 3.1: User s Guide. Cary, NC: SAS

More information

Decision Support Systems aka Analytical Systems

Decision Support Systems aka Analytical Systems Decision Support Systems aka Analytical Systems Decision Support Systems Systems that are used to transform data into information, to manage the organization: OLAP vs OLTP OLTP vs OLAP Transactions Analysis

More information

Information Integration

Information Integration Chapter 11 Information Integration While there are many directions in which modern database systems are evolving, a large family of new applications fall undei the general heading of information integration.

More information

Extending Visual OLAP for Handling Irregular Dimensional Hierarchies

Extending Visual OLAP for Handling Irregular Dimensional Hierarchies Extending Visual OLAP for Handling Irregular Dimensional Hierarchies Svetlana Mansmann and Marc H. Scholl University of Konstanz P.O. Box D188 78457 Konstanz Germany {Svetlana.Mansmann Marc.Scholl}@uni-konstanz.de

More information

TIM 50 - Business Information Systems

TIM 50 - Business Information Systems TIM 50 - Business Information Systems Lecture 15 UC Santa Cruz May 20, 2014 Announcements DB 2 Due Tuesday Next Week The Database Approach to Data Management Database: Collection of related files containing

More information

8) A top-to-bottom relationship among the items in a database is established by a

8) A top-to-bottom relationship among the items in a database is established by a MULTIPLE CHOICE QUESTIONS IN DBMS (unit-1 to unit-4) 1) ER model is used in phase a) conceptual database b) schema refinement c) physical refinement d) applications and security 2) The ER model is relevant

More information

Henning Koch. Dept. of Computer Science. University of Darmstadt. Alexanderstr. 10. D Darmstadt. Germany. Keywords:

Henning Koch. Dept. of Computer Science. University of Darmstadt. Alexanderstr. 10. D Darmstadt. Germany. Keywords: Embedding Protocols for Scalable Replication Management 1 Henning Koch Dept. of Computer Science University of Darmstadt Alexanderstr. 10 D-64283 Darmstadt Germany koch@isa.informatik.th-darmstadt.de Keywords:

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information