Data Mining and Data Warehousing Introduction to Data Mining Quiz Easy Q1. Which of the following is a data warehouse? a. Can be updated by end users. b. Contains numerous naming conventions and formats. c. Organized around important subject areas. d. Contains only current data. Q2. An operational system is which of the following? a. A system that is used to run the business in real time and is based on historical data. b. A system that is used to run the business in real time and is based on current data. c. A system that is used to support decision making and is based on current data. d. A system that is used to support decision making and is based on historical data. Q3. A goal of data mining includes which of the following? a. To explain some observed event or condition b. To confirm that data exists c.to analyze data for expected relationships d. To create a new data warehouse Q4. The @active data warehouse architecture includes which of the following? a. At least one data mart b. Data that can extracted from numerous internal and external sources c. near real-time updates.
Q5. Which of the following is data scrubbing? a. A process to reject data from the data warehouse and to create the necessary indexes b. A process to load the data in the data warehouse and to create the necessary indexes c. A process to upgrade the quality of data after it is moved into a data warehouse d. A process to upgrade the quality of data before it is moved into a data warehouse Q6. Which of the following types of tables is a snowflake schema? a. Fact b. Dimension c. Helper Q7. Which of the following includes the generic two-level data warehouse architecture? a. At least one data mart b. Data that can extracted from numerous internal and external sources c. Near real-time updates. Q8. Which of the following are fact tables? a. Completely de-normalized b. Partially de-normalized c. Completely normalized d. Partially normalized Q9. which of the following includes data transformation? a. A process to change data from a detailed level to a summary level b. A process to change data from a summary level to a detailed level c. Joining data from one source into various sources of data d. Separating data from one source into various sources of data
Q10. Which of the following is a reconciled data? a. Data stored in the various operational systems throughout the organization. b. Current data intended to be the single source for all decision support systems. c. Data stored in one operational system in the organization. d. Data that has been selected and formatted for end-user support applications. Q11. Which of the following is not a data mining task? a. Frequent pattern mining c. Data warehousing d. Clustering Q12. Data mining can be applied to which type of data? a. Transactional data b. Multimedia data c. Web data Medium Q1. What data mining task will be performed to clean the data? d. Frequent Pattern Mining Q2. What data mining task will be performed for grouping similar data>
d. Frequent Pattern Mining Q3. Which data mining task is performed for identifying new data based on previous data? d. Frequent Pattern Mining Q4. Which data mining task is performed for getting interesting trends in data? d. Frequent Pattern Mining Q5. Which data mining task is performed for summarizing data in graphical form? a. Data Visualization Q6. Which data mining task is performed for predicting stock market trends? a. Data Visualization Q7. Data mining is useful for? a. Data scientist b. Market trend analyst c. Both a and b d. Neither a nor b
Q8. Data mining is useful for? a. Supermarket chain b. Stock Market c. Spam detection Q9. What type of data mining task will a market analyst use for predicting future market trend? a. Pre-processing Q10. What type of data mining task will a spam detector use? a. Regression b. Pre-processing d. Clustering Q11. What type of data mining task will a data scientist use for describing the company performance? a. Data Visualization c. Regression Q12. What type of data mining task will be used by a supermarket chain to identify customer purchase trends? a. Regression b. Classification
d. Frequent Pattern Mining Hard Q1. What are the privacy issues of data mining? a. Confidentiality of data should be maintained b. User anonymity should be provided c. Controlled access of data should be performed Q2. Why is data mining required for biological data? a. To aid in medical diagnosis b. To find disease patterns c. Both a and b Q3. Why is data mining required for multimedia data? a. To help in understanding the different types of multimedia data b. To get a collection of multimedia data c. Because multimedia data has no content Q4. What are the issues in mining multimedia data? a. Size of the data is large b. Curse of dimensionality c. Both a and b Q5. What are the issues in mining biological data? a. Curse of dimensionality b. Large size of data c. Privacy issues of biological data
Q6. What are the issues in mining stream data? a. Small data size b. Dynamic in nature c. All of the above