Data warehousing in telecom Industry

Similar documents
Rocky Mountain Technology Ventures

Data Warehousing and OLAP Technologies for Decision-Making Process

DATA MINING TRANSACTION

DATA WAREHOUSING IN LIBRARIES FOR MANAGING DATABASE

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar

Question Bank. 4) It is the source of information later delivered to data marts.

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Data Mining & Data Warehouse

Improving Resource Management And Solving Scheduling Problem In Dataware House Using OLAP AND OLTP Authors Seenu Kohar 1, Surender Singh 2

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Data Warehouse Testing. By: Rakesh Kumar Sharma

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

1. Inroduction to Data Mininig

Evolution of Database Systems

Time: 3 hours. Full Marks: 70. The figures in the margin indicate full marks. Answers from all the Groups as directed. Group A.

Information Management course

CHAPTER 3 Implementation of Data warehouse in Data Mining

DATA MINING AND WAREHOUSING

Data Mining. Associate Professor Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology

Data warehouse architecture consists of the following interconnected layers:

Generating Cross level Rules: An automated approach

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini

Data Warehousing (1)

Data Mining Concepts & Techniques

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa

Discovering the Association Rules in OLAP Data Cube with Daily Downloads of Folklore Materials *

Research Article ISSN:

CHAPTER 3 BUILDING ARCHITECTURAL DATA WAREHOUSE FOR CANCER DISEASE

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online):

Novel Materialized View Selection in a Multidimensional Database

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system

Decision Support, Data Warehousing, and OLAP

This proposed research is inspired by the work of Mr Jagdish Sadhave 2009, who used

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

Data Warehousing & Mining. Data integration. OLTP versus OLAP. CPS 116 Introduction to Database Systems

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

Data Warehouse and Data Mining

Full file at

Summary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse

BUSINESS INTELLIGENCE FOR PROFILING OF TELECOMMUNICATION CUSTOMERS

Decision Support Systems aka Analytical Systems

Statistics Warehouse for Intelligent Networks Applying data warehouse technology in the telecommunications domain

Knowledge Modelling and Management. Part B (9)

Data Warehousing on the MPE Platform Presentation #272

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Business Intelligence An Overview. Zahra Mansoori

Syllabus. Syllabus. Motivation Decision Support. Syllabus

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

DKMS Brief No. Five: Is Data Staging Relational? A Comment

GUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV

Chapter 1, Introduction

Database Vs. Data Warehouse

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

Data Mining and Warehousing

Power Distribution Analysis For Electrical Usage In Province Area Using Olap (Online Analytical Processing)

Data Warehouse and Mining

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

Oracle Database 11g: Data Warehousing Fundamentals

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

Data Warehousing and Data Mining. Announcements (December 1) Data integration. CPS 116 Introduction to Database Systems

Q1) Describe business intelligence system development phases? (6 marks)

Data warehouse and its Applications in Agriculture Dr. Anil Rai

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?

Dynamic Optimization of Generalized SQL Queries with Horizontal Aggregations Using K-Means Clustering

Databases and Data Warehouses

The Data Warehouse Toolkit: The Complete Guide To Dimensional Modeling By Ralph Kimball;Margy Ross READ ONLINE

QUALITY MONITORING AND

Real-World Performance Training Dimensional Queries

Class Organization. CSPP 53017: Data Warehousing Winter 2013" Lecture 1" Svetlozar Nestorov"

DATA WAREHOUSING DEVELOPING OPTIMIZED ALGORITHMS TO ENHANCE THE USABILITY OF SCHEMA IN DATA MINING AND ALLIED DATA INTELLIGENCE MODELS

A Step towards Centralized Data Warehousing Process: A Quality Aware Data Warehouse Architecture

Handout 12 Data Warehousing and Analytics.

Course Objective. Data Warehousing & Mining. Course Outline. Course Outline

OLAP Introduction and Overview

Enterprise Data-warehouse (EDW) In Easy Steps

The Use of Soft Systems Methodology for the Development of Data Warehouses

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses

Seminars of Software and Services for the Information Society. Data Warehousing Design Issues

collection of data that is used primarily in organizational decision making.

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

Research on Data Mining Technology Based on Business Intelligence. Yang WANG

Database Technologies for E-Business. Dongmei CUI

SCHEME OF COURSE WORK. Data Warehousing and Data mining

Data Warehouse and Data Mining

Table of Contents. Knowledge Management Data Warehouses and Data Mining. Introduction and Motivation

Knowledge Management Data Warehouses and Data Mining

Improved Frequent Pattern Mining Algorithm with Indexing

Deccansoft Software Services Microsoft Silver Learning Partner. SSAS Syllabus

Data Warehousing & On-Line Analytical Processing

BUSINESS INTELLIGENCE AND OLAP

Unit 1.

What is a Data Warehouse?

Applying Objective Interestingness Measures. in Data Mining Systems. Robert J. Hilderman and Howard J. Hamilton. Department of Computer Science

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:-

International Journal of Computer Engineering and Applications, ICCSTAR-2016, Special Issue, May.16

BI Assignment ISDS 4180 Fall 2016

The Data Organization

Transcription:

Data warehousing in telecom Industry Dr. Sanjay Srivastava, Kaushal Srivastava, Avinash Pandey, Akhil Sharma Abstract: Data Warehouse is termed as the storage for the large heterogeneous data collected from different sources and departments for the one organization. It is collected and maintained at a separate location for the analysis purpose by the expert and particular concern team to understand and predict the support and confidence of product and services of any organization. By this we can predict and place the particular item at profitable place.it with the help of data mining tools and techniques, Make us to understand the knowledge of customer and its requirements. Telecom industries have many departmental databases which are integrated to form large databases or Data warehouses. The aim of this paper is to describe about the use of data warehouse in managing and storing for telecom industry. The telecom industry contains very large amount of data, so it is difficult and time consuming to retrieve data related to particular field from such a large databases. So for this, one of the aim of this thesis is to show how data warehouse efficiently manage and store large data marts and how query performance improves due to this. Keywords: Data warehouses, Design prediction. I. INTRODUCTION A data warehouse is a relational database that is designed for query and analysis rather than for transaction processing. It usually contains historical data derived from transaction data, but it can include data from other sources. It separates analysis workload from transaction workload and enables an organization to consolidate data from several sources. In addition to a relational database, a data warehouse environment includes an extraction, transportation, transformation, and loading (ETL) solution, an online analytical processing (OLAP) engine, client analysis tools, and other applications that manage the process of gathering data and delivering it to business users. The concept of data warehousing has evolved out of the need for easy access to a structured store of quality data that can be used for decision making. It is globally accepted that information is a very powerful asset that can provide significant benefits to any organization and a competitive advantage in the business world. Organizations have vast amounts of data but have found it increasingly difficult to access it and make use of it. This is because it is in many different formats, exists on many different platforms, and resides in many different file and database structures developed by different vendors. Thus organizations have had to write and maintain perhaps hundreds of programs that are used to extract, prepare, and consolidate data for use by many different applications for analysis and reporting. Also, decision makers often want to dig deeper into the data once initial findings are made. This would typically require modification of the extract programs or development of new ones. This process is costly, inefficient, and very time consuming. Data warehousing offers a better approach. Data warehousing implements the process to access heterogeneous data sources clean, filter, and transform the data and store the data in a structure that is easy to access, understand, and use. The data is then used for query, Dr. Sanjay Srivastava, Professor,Computer Science, MGMCoET, Noida, India, E-mail: san.sri2k@gmail.com Kaushal Srivastava, Scholar,Computer Science, MGMCoET, Noida, India, E-mail: lightkaushal@gmail.com Avinash Pandey, Scholar,Computer Science, MGMCoET, Noida, India, E-mail:avipandey1508@gmail.com Akhill Sharma, Scholar,Computer Science, MGMCoET, Noida, India, E-mail: Akhil.sharma0612@gmail.com www.ijcrd.com Page 26

reporting, and data analysis. As such, the access, use, technology and performance requirements are completely different from those in a Transaction-oriented operational environment. The volume of data in data warehousing can be very high, particularly when considering the requirements for historical data analysis. Data analysis programs are often required to scan vast amounts of that data, which could result in a negative impact on operational applications, which are more performance sensitive. Therefore, there is a requirement to separate the two environments to minimize conflicts and Degradation of performance in the operational environment. II. LITERARUTE SURVEY The origin of the concept of data warehousing can be traced back to the early 80 sthe concept of data warehousing has evolved out of the need for easy access to a structured store of quality data that can be used for decision making. It is globally accepted that information is a very powerful asset that can provide significant benefits to any organization and a competitive advantage in the business world. Organizations have vast amounts of data but have found it increasingly difficult to access it and make use of it. This is because it is in many different formats, exists on many different platforms, and resides in many different file and database structures developed by different vendors. Thus organizations have had to write and maintain perhaps hundreds of programs that are used to extract, prepare, and consolidate data for use by many different applications for analysis and reporting. Also, decision makers often want to dig deeper into the data once initial findings are made. This would typically require modification of the extract programs or development of new ones. This process is costly, inefficient, and very time consuming. Data warehousing offers a better approach. III.CHARACTERISTICS OF DATA WARE HOUSE Following are the characteristics given by data warehousing: Subject Oriented Data warehouses are designed to help you analyze data. For example, to learn more about our company's sales data, we can build a warehouse that concentrates on sales. Using this warehouse, we can answer questions like "Who was our best customer for this item last year?" This ability to define a data warehouse by subject matter, sales in this case, makes the data warehouse subject oriented. Integrated Integration is closely related to subject orientation. Data warehouses must put data from disparate sources into a consistent format. They must resolve such problems as naming conflicts and inconsistencies among units of measure. Non volatile Non volatile means that, once entered into the warehouse, data should not change. This is logical because the purpose of a warehouse is to enable you to analyse what has occurred. Time Variant In order to discover trends in business, analysts need large amounts of data. This is very much in contrast to online transaction processing (OLTP) systems, where performance requirements demand that historical data be moved to an archive. A data warehouse's focus on change over time is what is meant by the term time variant Contrasting OLTP and Data Warehousing Environments IV. COMPARISON BETWEEN OLAP AND DATA WAREHOUSE www.ijcrd.com Page 27

The comparison between OLAP and OLTP data warehouse is mentioned below: Fig4.1 Comparison figure S. No Basis of comparison OLTP Data warehouse 1 Historical data OLTP systems usually store data from only a few weeks or months. The OLTP system stores only historical data as needed to successfully meet the requirements of the current transaction Data warehouses usually store many months or years of data. This is to support historical analysis 2 Schema design It uses fully normalized schemas to optimize update/insert/delete performance and guarantee data consistency 3 Typical operations A typical OLTP operation accesses It uses denormalized or partially denormalized schemas to optimize query performance. Data warehouse query scans www.ijcrd.com Page 28

only a handful of records. 4 Data modifications The end users routinely issue individual data modification statements to the database. The OLTP database is always up to date, and reflects the current state of each business transaction 5 Workload OLTP systems support only predefined operations. thousands or millions of rows. A data warehouse is updated on a regular basis by the ETL process using bulk data modification techniques. The end users of a data warehouse do not directly update the data warehouse. Data warehouses are designed to accommodate ad hoc queries. Table 5.1 Table of comparison Conclusion The data warehouse is build to provide an easy to access source of high quality data from the easy to access heterogeneous source situated at different location. The data warehouse provide an easy way to model or represent large databases.in telecom industries large amount of data bases are represented using star schema and snow flake schema model. The data warehouse provide best way to access data concerning to particular field. The data ware house is best way organise and maintain data in large data ware houses. The telecom industry have various analytical transaction per day and these queries can be easily processed if the data is represented using star schema and snowflake schema. In star schema more than one dimension table are there and each dimension table is connected to fact table using foreign keys. But in the case of snow flake schema dimension tables are further extend to other dimension table and the dimension table are connected to fact tables but extended dimension table are not connected to fact table. www.ijcrd.com Page 29

REFERENCES [1] J.-L. Amat, Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management, J. Telecommun. Inform. Technol., no. 3, pp. 11 16, 2002. [2] K. Morik and M. Scholz, The Minin gmart approach to knowledge discovery in databases in Handbook of Intelligent IT, Ning Zhong and Jiming Liu, Eds. IOS Press, 2003 [3]. Rosset, S., Neumann, E., Eick, U., & Vatnik (2003). Customer lifetime value models for decision support. Data Mining and Knowledge Discovery, 7(3), 321-339. [4] Data mining in Telecommunication by Gray M. Weiss, Fordham University [5] Cortes, C., Pregibon, D. Giga-mining. Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining; 174-178, 1998 August 27-31; New York, NY:AAAI Press, 1998 [6] Ezawa, K., Norton, S. Knowledge discovery in telecommunication services data using Bayesian network models. Proceedings of the First International Conference on Knowledge Discovery and Data Mining; 1995 August 20-21. Montreal Canada. AAAI Press: Menlo Park, CA, 1995 [7] The data warehouse toolkit: the complete guide to dimensional modeling by Ralph Kimball, Margy Ross. 2nd ed. (ISBN 0-471-20024-7). [8] Berry, M. and Linoff, G. Data Mining Techniques for Marketing, Sales and Customer Support. Wiley (1997). [9] Chaudhri, S. and Dayal, U. An overview of Data Warehousing and OLAP Technology. ACM SIGMOD Record 26 (1), (1997), 65-74. [10] Hilderman, R.J., Carter, C.L., Hamilton, H.J., and Cercone, N. Mining Association Rules from Market Basket Data Using Share Measures and Characterized Itemsets. International Journal of Artificial Intelligence Tools. 7 (2), (1998), 189-220. [11] Inmon, W. H. Building the Data Warehouse. Wiley (1996). [12] Kimball, R., Reeves, L., Ross, M., and Thornthwhite, W.The Data Warehouse Lifecycle Toolkit. Wiley (1998). [13] Leavitt, N. Data Mining for the Corporate Masses. IEEE Computer. 35 (5), (2002), 22-24. [14] S. Sarawagi, S. Thomas, R. Agrawal. Integrating Mining with Relational Database Systems: Alternatives and Implications. Proceedings of ACM SIGMOD Conference (1998), 343-354 [15] S. Tsur, J. Ullman, S. Abiteboul, C. Clifton, R. Motwani, S. Nestorov, A. Rosenthal. Query Flocks: A Generalization of Association-Rule Mining. Proceedings of ACM SIGMOD Conference, (1998), 1-12. [16] Wang, K., He Y., and Han J. Mining Frequent Itemsets Using Support Constraints. Proceedings of International Conference on Very Large Databases VLDB, (2000), 43-52 [17] Watson, H. J., Annino, D. A., and Wixom, B. H. Current Practices in Data Warehousing. Information Systems Management. 18 (1), (2001). www.ijcrd.com Page 30