IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

Similar documents
DATA WAREHOUING UNIT I

Data Mining Concepts & Techniques

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:-

REPORTING AND QUERY TOOLS AND APPLICATIONS

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

Lectures for the course: Data Warehousing and Data Mining (IT 60107)

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

CHAPTER 3 Implementation of Data warehouse in Data Mining

An Overview of Data Warehousing and OLAP Technology

Syllabus. Syllabus. Motivation Decision Support. Syllabus

Data Warehouse and Data Mining

Evolution of Database Systems

Tribhuvan University Institute of Science and Technology MODEL QUESTION

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

UNIT -1 UNIT -II. Q. 4 Why is entity-relationship modeling technique not suitable for the data warehouse? How is dimensional modeling different?

GUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Call: SAS BI Course Content:35-40hours

Decision Support Systems aka Analytical Systems

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

On-Line Analytical Processing (OLAP) Traditional OLTP

Data Mining & Data Warehouse

Summary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

Decision Support, Data Warehousing, and OLAP

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

Sql Fact Constellation Schema In Data Warehouse With Example

CSPP 53017: Data Warehousing Winter 2013! Lecture 7! Svetlozar Nestorov! Class News!

DATA MINING AND WAREHOUSING

D Daaatta W Waaarrreeehhhooouuusssiiinng B I R L A S O F T

Create Cube From Star Schema Grouping Framework Manager

What is a Data Warehouse?

Basics of Dimensional Modeling

Unit 7: Basics in MS Power BI for Excel 2013 M7-5: OLAP

Information Management course

Teradata Aggregate Designer

ETL and OLAP Systems

Deccansoft Software Services Microsoft Silver Learning Partner. SSAS Syllabus

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

Adnan YAZICI Computer Engineering Department

Managing Information Resources

Data Warehouse and Data Mining

Data Warehouse and Mining

Oracle Database 11g: Data Warehousing Fundamentals

Data Mining. Associate Professor Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology

Rocky Mountain Technology Ventures

CT75 (ALCCS) DATA WAREHOUSING AND DATA MINING JUN

Chapter 4, Data Warehouse and OLAP Operations

A Multi-Dimensional Data Model

Data warehouse architecture consists of the following interconnected layers:

collection of data that is used primarily in organizational decision making.

Data Warehousing & OLAP

UNIT -1 UNIT -II. Q. 4 Why is entity-relationship modeling technique not suitable for the data warehouse? How is dimensional modeling different?

R07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis.

Q1) Describe business intelligence system development phases? (6 marks)

Data Warehouse and Data Mining

Question Bank. 4) It is the source of information later delivered to data marts.

Data Warehousing and OLAP


1. SQL Server Integration Services. What Is Microsoft BI? Core concept BI Introduction to SQL Server Integration Services

International Journal of Scientific & Engineering Research, Volume 7, Issue 11, November ISSN

Data Science. Data Analyst. Data Scientist. Data Architect

Dta Mining and Data Warehousing

Information Management course

OLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube

Data Warehousing (1)

Introduction to DWH / BI Concepts

Cognos also provides you an option to export the report in XML or PDF format or you can view the reports in XML format.

Big Data 13. Data Warehousing

MICROSOFT BUSINESS INTELLIGENCE

Data Warehousing. Data Warehousing and Mining. Lecture 8. by Hossen Asiful Mustafa

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Chapter 18: Data Analysis and Mining

Data Warehouses. Yanlei Diao. Slides Courtesy of R. Ramakrishnan and J. Gehrke

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu

02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests

Time: 3 hours. Full Marks: 70. The figures in the margin indicate full marks. Answers from all the Groups as directed. Group A.

Foundations of SQL Server 2008 R2 Business. Intelligence. Second Edition. Guy Fouche. Lynn Lang it. Apress*

PeopleTools 8.51 PeopleBook: PeopleSoft Cube Manager

MOLAP Data Warehouse of a Software Products Servicing Call Center

Analytic Workspace Manager and Oracle OLAP 10g. An Oracle White Paper November 2004

Data Warehousing and OLAP Technologies for Decision-Making Process

Table Of Contents: xix Foreword to Second Edition

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis

Data Warehousing & OLAP

1. Attempt any two of the following: 10 a. State and justify the characteristics of a Data Warehouse with suitable examples.

A Benchmarking Criteria for the Evaluation of OLAP Tools

DATA WAREHOUSING & DATA MINING. by: Prof. Asha Ambhaikar

Proceedings of the IE 2014 International Conference AGILE DATA MODELS

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini

Course Contents: 1 Business Objects Online Training

Improving the Performance of OLAP Queries Using Families of Statistics Trees

After completing this course, participants will be able to:

The strategic advantage of OLAP and multidimensional analysis

Transcription:

PART A 1. What are production reporting tools? Give examples. (May/June 2013) Production reporting tools will let companies generate regular operational reports or support high-volume batch jobs. Such as calculating and printing pay checks. Examples: Third generation languages such as COBOL Specialized fourth generation languages such as Information builders, Inc s Focus High-end client/server tools such as MITI s SQL. 2. Define data cube. (May/June 2013) Data cube consists of a large set of facts or measures and a number of dimensions. Facts are numerical measures that are quantities by which we can analyze the relationship between dimensions. Dimensions are the entities or perspectives with respect to an organization for keeping records and are hierarchical nature. 3. What is a Reporting tool? List out the two different types of reporting tools. (May/June 2014,Nov/Dec 2012) Reporting tools are software applications that make data extracted in a query accessible to the user. That is it used for to generate the various types of reports. It can be divided into 2 types: 1. Production reporting tools 2. Desktop reporting tools 4. Define OLAP. (May/June 2014) OLAP (online analytical processing) is computer processing that enables a user to easily and selectively extract and view data from different points of view. OLAP is becoming an architecture that an increasing number of enterprises are implementing to support analytical applications. 5. Briefly discuss the schemas for multidimensional databases. (May/June 2010, Nov/Dec 2014, May/June 2011) Stars schema: The most common modeling paradigm is the star schema, in which the data warehouse contains (1) a large central table (fact table) containing the bulk of the data, with no redundancy, and (2) a set of smaller attendant tables (dimension tables), one for each dimension. Snowflakes schema: The snowflake schema is a variant of the star schema model, where some dimension tables are normalized, thereby further splitting

the data into additional tables. The resulting schema graph forms a shape similar to a snowflake. Fact Constellations: Sophisticated applications may require multiple fact tables to share dimension tables. This kind of schema can be viewed as a collection of stars, and hence is called a galaxy schema or a fact constellation. 6. Define the categories of tools in business analysis. (Nov/Dec 2014) There are 5 categories of tools in business analysis. i) Reporting tools it can be used to generate the reports. ii) Managed query tools it can be used to SQL queries for accessing the databases. iii) Executive information systems It allow developers to build customized, graphical decision support applications or briefing books. iv) On-line analytical processing these tools aggregate data along common business subjects or dimensions and then let users navigate the hierarchies and dimensions with the click of a mouse button. v) Data mining It use a variety of statistical and artificial intelligence algorithm to analyze the correlation of variables in the data and extract interesting patterns and relationship to investigate. 7. Differentiate between MOLAP, ROLAP and HOLAP. (Nov/Dec 2013) MOLAP ROLAP HOLAP MOLAP stands for Multidimensional Online Analytical Processing The MOLAP storage mode causes the aggregations of the partition and a copy of its source data to be stored in a multidimensional structure in Analysis Services when the partition is processed. ROLAP stands for Relational Online Analytical Processing The ROLAP storage mode causes the aggregations of the partition to be stored in indexed views in the relational database that was specified in the partition s data source. HOLAP stands for Hybrid Online Analytical Processing The HOLAP storage mode combines attributes of both MOLAP and ROLAP. Like MOLAP, HOLAP causes the aggregations of the partition to be stored in a multidimensional structure in an SQL Server Analysis Services instance. 8. List any four tools for performing OLAP. (Nov/Dec 2013) Arbor Essbase Web Information advantage web OLAP Micro strategy DSS web

Brio technology 9. Classify OLAP Tools. (Apr/May 2011) MOLAP Multidimensional Online Analytical Processing ROLAP Multirelational Online Analytical Processing MQE Managed Query Environment 10. Define how the complex aggregation at multiple granularities is achieved using multi-feature cubes? (May/June 2012) Multi-feature cubes, which compute complex queries involving multiple dependent aggregates at multiple granularity. These cubes are very useful in practice. Many complex data mining queries can be answered by multi-feature cubes without any significant increase in computational cost, in comparison to cube computation for simple queries with standard data cubes. 11. Give examples for managed query tools. (Nov/Dec 2012) IQ software s IQ objects Andyne Computing Ltd s GQL IBM s Decision server Oracle Corp s Discoverer/2000 12. What is Apex cuboid? (Apr/May 2011,Nov/Dec 2011) Apex cuboid or 0-D cuboid which holds the highest level of summarization. The Apex cuboid is typically denoted by all. 13. What is multidimensional database? (Nov/Dec 2011) Data warehouses and OLAP tools are based on a multidimensional data model. This model is used for the design of corporate data warehouses and department data marts. This model contains a star schema, snowflake schema and fact constellation schemas. The core of multidimensional model is the data cube. 14. What are the applications of query tools? (Nov/Dec 2014) The applications of query tools are Multidimensional analysis Decision making In-depth analysis such as data classification Clustering. 15. Compare OLTP and OLAP. (Apr/May 2008,May/June 2010) Data Warehouse (OLAP) Involves historical processing of information. OLAP systems are used by knowledge workers such as executives, managers and analysts. Operational Database (OLTP) Involves day-to-day processing. OLTP systems are used by clerks, DBAs, or database professionals.

Useful in analyzing the business. It focuses on Information out. Based on Star Schema, Snowflake, Schema and Fact Constellation Schema. Contains historical data. Provides summarized and consolidated data. Provides summarized and multidimensional view of data. Number or users is in hundreds. Number of records accessed is in millions. Database size is from 100 GB to 1 TB Highly flexible. Useful in running the business. It focuses on Data in. Based on Entity Relationship Model. Contains current data. Provides primitive and highly detailed data. Provides detailed and flat relational view of data. Number of users is in thousands. Number of records accessed is in tens. Database size is from 100 MB to 1 GB. Provides high performance. 16. List out OLAP operations in multidimensional data model. (May/June 2009) Roll-up - performs aggregation on a data cube Drill-down - is the reverse operation of roll-up. Slice and dice Slice operation selects one particular dimension from a given cube and provides a new sub-cube. Dice selects two or more dimensions from a given cube and provides a new sub-cube. Pivot (or) rotate - The pivot operation is also known as rotation. It rotates the data axes in view in order to provide an alternative presentation of data. 17. Mention the functions of OLAP servers in the data warehousing architecture. (Nov/Dec 2010) The OLAP server performs multidimensional queries of data and stores the results in its multidimensional storage. It speeds the analysis of fact tables into cubes, stores the cubes until needed, and then quickly returns the data to clients. 18. What is Impromptu? Impromptu from Cognos Corporation is positioned as an enterprise solution for interactive database reporting that delivers 1 to 100+ seat scalability. 19. Mention some supported databases of Impromptu. ORACLE Microsoft SQL Server SYBASE Omni SQL Gateway SYBASE Net Gateway 20. What is enterprise warehouse?

An enterprise warehouse collects all the information s about subjects spanning the entire organization. It provides corporate-wide data integration, usually from one or more operational systems or external information providers. It contains detailed data as well as summarized data and can range in size from a few giga bytes to hundreds of giga bytes, tera bytes or beyond. 21. Write note on Report writers. Report writers are inexpensive desktop tools designed for end users. Report writers have graphical interfaces and built-in charting functions; they can pull groups of data from variety of data sources and integrate them in a single report. Leading report writers include Crystal Reports, Actuate and Platinum technology, Inc s Info reports. PART B 1. Explain in detail about the reporting and query tools. (May/June 2014) 2. Describe in detail about COGNOS IMPROMTU. (May/June 2014) 3. Explain the categorization of OLAP tools with necessary diagrams.(may/june 2014) 4. i) List and explain the OLAP operation in multidimensional data model. (Nov/Dec 2014) ii) Differentiate between OLTP and OLAP. (Nov/Dec 2014) 5. i)list and discuss the features of Cognos Impromptu. (Nov/Dec 2012) ii)list and discuss the basic features data provided by reporting and query tools used for business analysis. (Apr/May 2011) 6. i) What is a Multidimensional data model? Explain star schema with an example. (May/June 2014) ii) Write the difference between multi-dimensional OLAP (MOLAP) and Multirelational OLAP (ROLAP). (May/June 2014, Nov/Dec 2012) 7. Explain the following: (May/June 2012) i) Different schemas for multidimensional databases.

ii) OLAP guidelines. 8. i) Write in detail about Managed Query Environment (MQE). ii) Explain about how to use OLAP tools on the Internet.