Meaning & Concepts of Databases

Similar documents
Database Systems: Learning Outcomes. Examples of Database Application. Introduction

DATABASE MANAGEMENT SYSTEMS. UNIT I Introduction to Database Systems

Lecture 01. Fall 2018 Borough of Manhattan Community College

Introduction Database Technology [DBTECO601]

Databases and Database Systems

Introduction: Databases and. Database Users

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 1-1

Strategic Information Systems Systems Development Life Cycle. From Turban et al. (2004), Information Technology for Management.

Chapter 1. Types of Databases and Database Applications. Basic Definitions. Introduction to Databases

Chapter 1. Introduction of Database (from ElMasri&Navathe and my editing)

DATABASE DEVELOPMENT (H4)

Introduction to Databases

Introduction: Databases and Database Users. Copyright 2007 Ramez Elmasri and Shamkant B. Navathe Slide 1

Fundamentals of Database Systems (INSY2061)

Database Systems. A Practical Approach to Design, Implementation, and Management. Database Systems. Thomas Connolly Carolyn Begg

LECTURE1: PRINCIPLES OF DATABASES

QM Chapter 1 Database Fundamentals Version 10 th Ed. Prepared by Dr Kamel Rouibah / Dept QM & IS

Outline. Definitions History Basic concepts of DBMS Data Models Relational database Normalization

DATA MINING TRANSACTION

IT1105 Information Systems and Technology. BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing. Student Manual

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Chapter 1 Introduction

CSE 3241: Database Systems I Databases Introduction (Ch. 1-2) Jeremy Morris

Databases and Database Management Systems

Databases 1. Daniel POP

Quick Facts about the course. CS 2550 / Spring 2006 Principles of Database Systems. Administrative. What is a Database Management System?

MIT Database Management Systems Lesson 01: Introduction

+ Data-Management for Data-intensive Computing

Database Management System. Fundamental Database Concepts

Chapter 6. Foundations of Business Intelligence: Databases and Information Management VIDEO CASES

What is Data? ANSI definition: Volatile vs. persistent data. Data. Our concern is primarily with persistent data

What is Data? Volatile vs. persistent data Our concern is primarily with persistent data

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting.

Module-01 Introduction to Database Concepts

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Introduction Database Concepts

Question Bank. 4) It is the source of information later delivered to data marts.

Database Fundamentals Chapter 1

UNIT I. Introduction

Sample Answers to Discussion Questions

Introduction to Oracle

Database System Concepts and Architecture

Fundamentals of Information Systems, Seventh Edition

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer DBMS


Top 88 Question Asked in Part I of MIS 150 EXAM #1 (Chapter 1-4, Appendix C) Exams questions gathered from old tests dating back to Fall 2000

The functions performed by a typical DBMS are the following:

CSCU9Q5. Administrivia & Topics to be covered. Traditional File-Based Systems. Problems with Manual Filing Systems. CSCU9Q5- Database P&A

Chapter 1: Introduction. Chapter 1: Introduction

CHAPTER 3 Implementation of Data warehouse in Data Mining

CS425 Fall 2016 Boris Glavic Chapter 1: Introduction

TIM 50 - Business Information Systems

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe Slide 2-1

BIS Database Management Systems.

1 Overview of Database Management

DATA MINING AND WAREHOUSING

Chapter 6 VIDEO CASES

Course Outline Faculty of Computing and Information Technology

Database Systems Concepts *

Chapter 3. Databases and Data Warehouses: Building Business Intelligence

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

CHAPTER 2: DATA MODELS

Chapter 1: Introduction

TIM 50 - Business Information Systems

Managing Data Resources

DBM/500 COURSE NOTES

MIS Database Systems.

Data analysis and design Unit number: 23 Level: 5 Credit value: 15 Guided learning hours: 60 Unit reference number: H/601/1991.

Database Management Systems MIT Lesson 01 - Introduction By S. Sabraz Nawaz

Chapter 1: Introduction

Evolution of Database Systems

Chapter 1 Chapter-1

CS 4400 Introduction to Database Systems 2002 Spring Term Project (Section A)

Test bank for accounting information systems 1st edition by richardson chang and smith

Chapter 1: Introduction

Data Mining & Data Warehouse

Fig 1.2: Relationship between DW, ODS and OLTP Systems

5/23/2014. Limitations of File-based Approach. Limitations of File-based Approach CS235/CS334 DATABASE TECHNOLOGY CA 40%

4/28/2014. File-based Systems. Arose because: Result

Introduction: Database Concepts Slides by: Ms. Shree Jaswal

Outline. Database Management Systems (DBMS) Database Management and Organization. IT420: Database Management and Organization

DEC Computer Technology LESSON 6: DATABASES AND WEB SEARCH ENGINES

Database Design. 1-3 History of the Database. Copyright 2015, Oracle and/or its affiliates. All rights reserved.

Application software office packets, databases and data warehouses.

Timeless Theory vs. Changing Users: Reconsidering Database Education

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

CHAPTER 2: DATA MODELS

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT

Summary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse

A collection of persistent data that can be shared and interrelated. A system or application that must be operational for a company to function.

Data Warehouse and Data Mining

Chapter 1: Introduction

CS102B: Introduction to Information Systems. Minerva A. Lagarde

KNGX NOTES INFS1603 [INFS1603] KEVIN NGUYEN

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini

Guide Users along Information Pathways and Surf through the Data

Data Mining. Associate Professor Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

Introduction. Example Databases

Transcription:

27 th August 2015

Unit 1 Objective Meaning & Concepts of Databases Learning outcome Students will appreciate conceptual development of Databases

Section 1: What is a Database & Applications Section 2: Evolutions & Developments in Databases Section 3: Database System Architecture Section 4: Roles in Database Environments Section 5: Database Emerging Trends & Future (BI) Review Questions Next week (3): Unit 1B Content Preview

Database: A shared collection of logically related data and a description of the data, designed to meet the information needs of an organization. Database: A logically coherent collection of related data that (i) describes the entities and their interrelationships, and (ii) it is designed, built & populated for a specific reason.

Purchases from the supermarket: When you purchase goods from your local supermarket, it is likely that a database is accessed. The checkout assistant uses a bar code reader to scan each of your purchased items. Purchases using your credit card: When you purchase goods using your credit card, the assistant normally checks that you have sufficient credit left to make the purchase. Booking a holiday at the travel agents: When you make inquiries about a holiday, the travel agent may access several databases containing holiday and flight details. Using the local library: Your local library probably has a database containing details of the books in the library, details of the readers, reservations, and so on. There will be a computerized index that allows readers to find a book based on its title, or its authors, or its subject area.

Applications cont d Taking out insurance: Whenever you wish to take out insurance, for example personal insurance, building, and contents insurance for your house, or car insurance, your broker may access several databases containing figures for various insurance organizations. Using the Internet: Many of the sites on the Internet are driven by database applications. For example, you may visit an online bookstore that allows you to browse and buy books, such as Amazon.com. Studying at University: If you are at university, there will be a database system containing information about yourself, the course you are enrolled in, details about your grant, the modules you have taken in previous years or are taking this year, and details of all your examination results.

In an organization a manual file is set up to hold all external and internal correspondence relating to a project, product, task, client, or employee. There are many such files, and for safety they are labeled and stored in one or more cabinets. For security, the cabinets have locks or located in secure areas of the building. In our own home, we probably have some sort of filing system which contains receipts, guarantees, invoices, bank statements, and such like. When we need to look something up, we go to the filing system and search through the system starting from the first entry until we find what we want. The manual filing system works well whiles the number of items to be stored is small. However, the manual filing system breaks down when we have to cross-reference or process the information in the files.

File-based System: A collection of application programs that perform services for the system end-users such as the production of reports. Each program defines and manages its own data. File-based systems were an early attempt to computerize the manual filing system that we are all familiar with. Limitations: 1. Separation and isolation of data: When data is isolated in separate files, it is difficult to access data that should be available. 2. Duplication of data: Owing to the decentralized approach taken by each department, the file-based approach encouraged, if not necessitated, the uncontrolled duplication of data. 3. Data dependence: Physical structure and storage of the data files and records are defined in the User-end application code. This means that changes to an existing structure are difficult.

Incompatible file formats: Because the structure of files it is embedded in the application programs, the structures are dependent on the application programming language. For example, the structure of a file generated by a COBOL program may be different from the structure of a file generated by a C program. The direct incompatibility of such files makes them difficult to process jointly. Fixed queries/proliferation of application programs: From the enduser s point of view, file-based systems proved to be a great improvement over manual systems. Consequently, the requirement for new or modified queries grew. Turning Point: The limitations listed above puts pressure on data processing employees. Often, certain types of functionality were omitted: data security, data recovery during hardware/software failure, access to files was restricted to one user at a time; no multi-shared access like today, first come-first serve. Another solution was required-hence Databases

Limitations of the file-based approach are attributed to two factors: (1) Definition of the data is embedded/stored as part of the application programs, rather than data having independent camp or storage base. (2) No control over the access and manipulation of data beyond that imposed by the application programs. To become effective, a new approach was required. What emerged were the database and the Database Management System (DBMS). Where the DBMS (query and search engine) separates application program at the user end interface from the data stored in the database. 1960s: Computerized database started in the 1960s, when the use of computers became a more cost-effective option for private organizations. There were two popular data models in this decade: a network model called CODASYL and a hierarchical model called IMS. One database system that proved to be a commercial success was the SABRE system that was used by IBM to help American Airlines manage its reservations data.

1970 to 1972: E.F. Codd published an important paper to propose the use of a relational database model, and his ideas changed the way people thought about databases. In his model, the database s schema, or logical organization, is disconnected from physical information storage, and this became the standard principle for database systems. 1976: A new database model called Entity-Relationship, or ER, was proposed by P. Chen this year. This model made it possible for designers to focus on data application, instead of logical table structure.

1980s: Structured Query Language, or SQL, became the standard query language. Mid 1990s: The advent of the Internet led to exponential growth of the database industry. Average desktop users began to use client-server database systems to access computer systems that contained legacy data. 2000s: Although the Internet industry experienced a decline in the early 2000s, database applications continue to grow. New interactive applications were developed for PDAs, pointof-sale transactions, and consolidation of vendors. Presently, the three leading database companies in the western world are Microsoft, IBM, and Oracle. Database System=Database+ DBMS+ User Application Program+ User

Stages Components Remarks Independence High level transaction stage 1. User/Human /Robot 2. Application Program for user Query & Interaction Components 1&2 forms the User Application Program This stage Is independent and can be changed without major effect to other levels ( from VB to Java programming language) Mid level transaction stage 1. Software to accept User queries from Application program 2. Software to Access stored data for a User Component 1& 2 forms the DBMS Active engine Independent & can be changed; however you need to modify data definition for interlinking the high level & low level Low level transaction stage 1. Meta-data defining the database operations 2. Stored data of day-today users transaction Components 1&2 forms the passive Database/or Data_Camp or Data_Containers Independent storage base/camp; on like the file-based system where we embedded as part of the Application program at High level

The Data Administrator (DA) is responsible for the management of the data resource including database planning, development and maintenance of standards, policies and procedures, and conceptual/logical database design. The DA consults with and advises senior managers, ensuring that the direction of database development will ultimately support corporate objectives. NB: The database and the DBMS are corporate resources that must be managed like any other resource. Data and database administration are the roles generally associated with the management and control of a DBMS and its data.

Database Designers: In large database design projects, we can distinguish between two types of designer such as logical database designers and physical database designers. 1. The logical database designer is concerned with identifying the data (that is, the entities and attributes), the relationships between the data, and the constraints on the data that is to be stored in the database. The logical database designer must have a thorough and complete understanding of the organization s data and any constraints on this data (the constraints are sometimes called business rules). 2. The physical database designer decides how the logical database design is to be physically realized. This involves: mapping the logical database design into a set of tables and integrity constraints; selecting specific storage structures and access methods for the data to achieve good performance; designing any security measures required on the data.

Application Developers: Once the database has been implemented, the application programs that provide the required functionality for the end-users must be implemented. This is the responsibility of the application developers. Typically, the application developers work from a specification produced by systems analysts. Each program contains statements that request the DBMS to perform some operation on the database. This includes retrieving data, inserting, updating, and deleting data. The programs may be written in a third-generation programming language or a fourth-generation language, as discussed in the previous section.

End-Users: The end-users are clients for the database, Endusers are classified according to the way they use the system: 1. Naïve users are typically unaware of the DBMS. They access the database through specially written application programs that attempt to make the operations as simple as possible. They invoke database operations by entering simple commands or choosing options from a menu. This means that they do not need to know anything about the database or the DBMS. 2. Sophisticated users. At the other end of the spectrum, the sophisticated end-user is familiar with the structure of the database and the facilities offered by the DBMS. Sophisticated end-users may use a high-level query language such as SQL to perform the required operations. Some sophisticated end-users may even write application programs for their own use.

There is no such thing as business-as-usual. Business Intelligence (BI) is the current trend in the database environment. we have moved beyond database traditional functions such as Adding, updating a content, Insertion and deletion to a new world where we derive business intelligence from a stored data using statistical analysis tools to support management decisions 1. Data Warehousing In computing, a data warehouse (DW or DWH, EDW), is a system used for reporting and data analysis. DWs are central repositories of integrated data from one or more disparate sources. They store current and historical data and are used for creating analytical reports for knowledge workers throughout the enterprise. Examples of reports could range from annual and quarterly comparisons and trends to detailed daily sales analyses.

Generally, data mining (sometimes called data or knowledge discovery) is the process of analyzing data from different perspectives and summarizing it into useful information - information that can be used to increase revenue, cuts costs, or both. Data mining software is one of a number of analytical tools for analyzing data. It allows users to analyze data from many different dimensions or angles, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding correlations or patterns among dozens of fields in large relational databases.

Data analytics (DA) is the science of examining raw data with the purpose of drawing conclusions about that information. Data analytics is used in many industries to allow companies and organization to make better business decisions and in the sciences to verify or disprove existing models or theories.

Principle 1: Acid properties & State of Transaction Principle 2: Dr. CODDS 12-Rules in Relational Model Principle 3: Anomalies & Normalization Principle 4: Deadlock Expectation & Prevention Principle 5: Database Backup Principle 6: Data recovery Summary: Review Questions