Improvement and Simulation of Data Mining Algorithm Based on Association Rules

Similar documents
Yunfeng Zhang 1, Huan Wang 2, Jie Zhu 1 1 Computer Science & Engineering Department, North China Institute of Aerospace

Data Mining in the Application of E-Commerce Website

O2O E-Commerce Data Mining in Big Data Era

The application of OLAP and Data mining technology in the analysis of. book lending

Multisource Remote Sensing Data Mining System Construction in Cloud Computing Environment Dong YinDi 1, Liu ChengJun 1

Study on the Application Analysis and Future Development of Data Mining Technology

Research and Application of E-Commerce Recommendation System Based on Association Rules Algorithm

Dynamic Clustering of Data with Modified K-Means Algorithm

Introduction to Data Mining

A Test Sequence Generation Method Based on Dependencies and Slices Jin-peng MO *, Jun-yi LI and Jian-wen HUANG

Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing Environments

RETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2

Research on Data Mining and Statistical Analysis Xiaoyao Lu1, a

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

Research and application on the Data mining technology in medical information systems. Jinhai Zhang

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

APRIORI ALGORITHM FOR MINING FREQUENT ITEMSETS A REVIEW

Exploration of Fault Diagnosis Technology for Air Compressor Based on Internet of Things

An Evolutionary Algorithm for Mining Association Rules Using Boolean Approach

Design of student information system based on association algorithm and data mining technology. CaiYan, ChenHua

Research on Software Scheduling Technology Based on Multi-Buffered Parallel Encryption

Open Access Research on the Prediction Model of Material Cost Based on Data Mining

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

The Establishment of Large Data Mining Platform Based on Cloud Computing. Wei CAI

Creating Time-Varying Fuzzy Control Rules Based on Data Mining

Efficiently Mining Positive Correlation Rules

Power Load Forecasting Based on ABC-SA Neural Network Model

Study on the Quantitative Vulnerability Model of Information System based on Mathematical Modeling Techniques. Yunzhi Li

A Data Classification Algorithm of Internet of Things Based on Neural Network

An Improved Apriori Algorithm for Association Rules

Data Mining and Business Process Management of Apriori Algorithm

Design and Implementation of Inspection System for Lift Based on Android Platform Yan Zhang1, a, Yanping Hu2,b

Design of Physical Education Management System Guoquan Zhang

Comprehensive analysis and evaluation of big data for main transformer equipment based on PCA and Apriority

Research and Improvement of Apriori Algorithm Based on Hadoop

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

A Software Testing Optimization Method Based on Negative Association Analysis Lin Wan 1, Qiuling Fan 1,Qinzhao Wang 2

Related Work The Concept of the Signaling. In the mobile communication system, in addition to transmit the necessary user information (usually voice

An Indian Journal FULL PAPER. Trade Science Inc. Research on data mining clustering algorithm in cloud computing environments ABSTRACT KEYWORDS

A Technical Analysis of Market Basket by using Association Rule Mining and Apriori Algorithm

An Efficient Algorithm for finding high utility itemsets from online sell

Analytical model A structure and process for analyzing a dataset. For example, a decision tree is a model for the classification of a dataset.

Data Structures. Notes for Lecture 14 Techniques of Data Mining By Samaher Hussein Ali Association Rules: Basic Concepts and Application

Improved Frequent Pattern Mining Algorithm with Indexing

Housing Estates Information Management System Based on.net. Jianliang Min

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. Study on secure data storage based on cloud computing ABSTRACT KEYWORDS

Research on the value of search engine optimization based on Electronic Commerce WANG Yaping1, a

Analysis on the technology improvement of the library network information retrieval efficiency

IMPROVING APRIORI ALGORITHM USING PAFI AND TDFI

An Overview of Web Accessibility Evaluation of Government Websites in China Liang-cheng LI, Jia-jun BU*, Zhi YU, Wei WANG and Can WANG

ANU MLSS 2010: Data Mining. Part 2: Association rule mining

A Study of Cloud Computing Scheduling Algorithm Based on Task Decomposition

A PSO-based Generic Classifier Design and Weka Implementation Study

Analyzing Working of FP-Growth Algorithm for Frequent Pattern Mining

Two Algorithms of Image Segmentation and Measurement Method of Particle s Parameters

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

Pattern Discovery Using Apriori and Ch-Search Algorithm

Remotely Sensed Image Processing Service Automatic Composition

ITEM ARRANGEMENT PATTERN IN WAREHOUSE USING APRIORI ALGORITHM (GIANT KAPASAN CASE STUDY)

Efficient Algorithm for Frequent Itemset Generation in Big Data

An Efficient Algorithm for Finding the Support Count of Frequent 1-Itemsets in Frequent Pattern Mining

RESEARCH AND APPLICATION OF DATA MINING TECHNOLOGY ON CONSTRUCTION PROJECT COST CONTROL SYSTEM

Correlation Based Feature Selection with Irrelevant Feature Removal

A Comparative Study of Association Mining Algorithms for Market Basket Analysis

A Study on Mining of Frequent Subsequences and Sequential Pattern Search- Searching Sequence Pattern by Subset Partition

Mining Temporal Association Rules in Network Traffic Data

INTELLIGENT SUPERMARKET USING APRIORI

Bézier Surface Modeling for Neutrosophic Data Problems

Information Push Service of University Library in Network and Information Age

Research on friction parameter identification under the influence of vibration and collision

Data- and Rule-Based Integrated Mechanism for Job Shop Scheduling

Double Self-Organizing Maps to Cluster Gene Expression Data

ONLINE INDEXING FOR DATABASES USING QUERY WORKLOADS

Towards New Heterogeneous Data Stream Clustering based on Density

Ancient Kiln Landscape Evolution Based on Particle Swarm Optimization and Cellular Automata Model

Pattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42

DISCOVERING INFORMATIVE KNOWLEDGE FROM HETEROGENEOUS DATA SOURCES TO DEVELOP EFFECTIVE DATA MINING

DMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE

AN APPROACH FOR LOAD BALANCING FOR SIMULATION IN HETEROGENEOUS DISTRIBUTED SYSTEMS USING SIMULATION DATA MINING

Improved Apriori Algorithms- A Survey

Chapter 28. Outline. Definitions of Data Mining. Data Mining Concepts

Prediction of traffic flow based on the EMD and wavelet neural network Teng Feng 1,a,Xiaohong Wang 1,b,Yunlai He 1,c

Value Added Association Rules

A Novel method for Frequent Pattern Mining

Association mining rules

Texture Image Segmentation using FCM

Research Article Apriori Association Rule Algorithms using VMware Environment

Rapid Modeling of Digital City Based on Sketchup

5th International Conference on Information Engineering for Mechanics and Materials (ICIMM 2015)

K-coverage prediction optimization for non-uniform motion objects in wireless video sensor networks

Predicting User Ratings Using Status Models on Amazon.com

Design and Implementation of Agricultural Information Resources Vertical Search Engine Based on Nutch

Text Clustering Incremental Algorithm in Sensitive Topic Detection

CMPUT 391 Database Management Systems. Data Mining. Textbook: Chapter (without 17.10)

EFFICIENT ATTRIBUTE REDUCTION ALGORITHM

Clustering of Data with Mixed Attributes based on Unified Similarity Metric

A Survey Of Issues And Challenges Associated With Clustering Algorithms

The Gene Modular Detection of Random Boolean Networks by Dynamic Characteristics Analysis

Support and Confidence Based Algorithms for Hiding Sensitive Association Rules

Research on An Electronic Map Retrieval Algorithm Based on Big Data

Transcription:

TELKOMNIKA, Vol., No. A, September, pp. ~ ISSN: -, accredited A by DIKTI, Decree No: /DIKTI/Kep/ DOI: ./TELKOMNIKA.viA.

Improvement and Simulation of Data Mining Algorithm Based on Association Rules

Li Tian*, Dai Xiao-Xia, Chen Guan-Lin, Weng Wen-Yong
Department of Computer and Computer Science, Zhejiang University City College, Hangzhou, P. R. China
*Corresponding author, e-mail: lit@zucc.edu.cn

Abstract
Association rules are a widely studied topic in data mining and have produced good results. In practical applications, however, the growth of database scale and data volume reduces mining efficiency, which limits their use. This paper modifies the data mining algorithm based on association rules and explores its simulation, with the aim of providing a reference for the improvement and simulation of association rule data mining algorithms.

Keywords: association rules; data mining; algorithm; simulation

Copyright Universitas Ahmad Dahlan. All rights reserved.

1. Introduction
Data mining has distinctive characteristics: the data are huge in scale, incomplete, and largely random. Generally, people are not aware in advance of what should be extracted. In addition, the large-scale, networked nature of database data and its dynamic and heterogeneous characteristics make extraction more difficult. Under such circumstances, an effective algorithm is needed to solve this problem and improve computing capability. In recent years, with the continuous development of information technology, the amount of data people hold has kept expanding, and how to find valuable information in such large amounts of data has become a topic of great concern. This has driven the emergence and rapid development of data mining technology.
Since data mining is an automated and intelligent technology that transforms data into valuable information, the analysis of the association rule data mining algorithm in this paper is significant for the improvement and simulation of data mining algorithms based on association rules.

2. Improvement of Association Rule Data Mining Algorithm
Two main aspects of improving the association rule data mining algorithm are discussed in this section. The strategy needs to be modified to improve the efficiency of the algorithm and thus enhance its practical value. The specific content is as follows.

2.1. Researches on Data Mining Algorithm
The most important role of a data mining algorithm is to obtain valuable information from large amounts of data, including structured, semi-structured and unstructured data sources such as audio [], video and other data. The first step is to create a good model and standardize the priority-search algorithm. Common data mining algorithms include decision trees, genetic algorithms (bionic global optimization), neural networks, statistical analysis, and the exclusive-counterexample method. To improve the efficiency of data mining and the use of the resulting information, a cloud computing system should be used to uncover more of the useful information hidden in large-scale data [].

Received March, ; Revised July, ; Accepted August,

Association rules find the relationships and dependences between things. Let $I = \{i_1, i_2, \ldots, i_m\}$ be a set of items, and let the task-relevant data be a database $D$ of transactions, where each transaction $T$ is a set of items with $T \subseteq I$ and has an identifier TID. An association rule is an implication of the form $A \Rightarrow B$, where $A \subset I$, $B \subset I$ and $A \cap B = \emptyset$. The rule $A \Rightarrow B$ holds in the transaction set with support $s$, where $s$ is the percentage of transactions in $D$ that contain $A \cup B$.

2.2. Objects and Props of Data Mining
At present, many statisticians and econometricians, particularly research staff of the National Bureau of Statistics, have focused their attention on building models and on inquiry based on data models. They point out that data analysis and modeling are indispensable tasks. In their view, a model is a summary and a more complete description of the relationships between variables []. Such a description makes it possible to use the model's results and helps in understanding the process that generated the data. For this reason, linear simultaneous-equation models are widely used in practical analysis. However, the actual process is usually based on a specific and simplified theoretical model (especially from economic theory). Besides being unrealistic, such models are also difficult to replace with mathematical models that have no closed form. Therefore, the use of data mining algorithms is more feasible.

3. LP Rule of Data Mining Algorithm
The LP data mining algorithm follows certain rules. It scans the data set to obtain frequency counts and mines on that basis. Its mechanism is as follows: suppose there is a group with different characteristics, and each characteristic forms a group of items. The elements, each a non-empty subset of the item set, are shown in Table 1.

Table 1.
Sequence database

| Sequence ID | Sequence |
|---|---|
| S1 | ACAABC |
| S2 | ACABCE |
| S3 | ACABC |
| S4 | ABCAB |

The relationship between the sample variance and the sample variance ratio is established in equation (1). The proof follows from the definition of the sample variance, which gives equation (2).
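The frequency scan over a sequence database can be illustrated with a short sketch. It is not the LP algorithm itself, whose details are not given above; it only counts how many sequences of the table contain a given pattern as an ordered subsequence. The sequence IDs, the function names `is_subsequence` and `support_count`, and the choice to treat each letter as a single element are assumptions made for illustration.

```python
# Sequences from the sequence database table; the IDs S1..S4 are assumed labels.
SEQUENCES = {
    "S1": "ACAABC",
    "S2": "ACABCE",
    "S3": "ACABC",
    "S4": "ABCAB",
}


def is_subsequence(pattern, sequence):
    """Return True if the symbols of pattern occur in sequence in the same order."""
    remaining = iter(sequence)
    # "symbol in remaining" advances the iterator, so order is respected.
    return all(symbol in remaining for symbol in pattern)


def support_count(pattern, sequences=SEQUENCES):
    """Count the sequences that contain pattern as an ordered subsequence."""
    return sum(is_subsequence(pattern, s) for s in sequences.values())


if __name__ == "__main__":
    for pattern in ("AB", "CAB", "CE", "BCA"):
        print(pattern, support_count(pattern))
```

For instance, the pattern CE occurs in only one of the four sequences, so it would be discarded under any frequency threshold above 1, whereas AB and CAB occur in all four.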

The LP algorithm rests on the precondition that the raw data have been processed many times and that the information contained in the raw data is used effectively. Research on data mining algorithms is of great significance, and the effectiveness of data processing continues to improve. User information needs to be extracted from large amounts of data to support its further development. Wang [] notes that cloud computing is a relatively new computing model, and that its application in data mining needs further research to improve its efficiency and the way the resulting information is applied.

4. Results and Analysis
Simulation can provide technical support for improving and refining the association rule data mining algorithm. The simulation is examined in detail in three parts: data preparation, association rule mining, and mining results. The main contents are as follows.

4.1. Data Preparation
This data mining simulation is based on applying the algorithm in marketing. Over years of operation, shopping malls have recorded a large number of customer purchases, which constitute a customer purchase information database. At present these data are not used reasonably or effectively and remain a resource waiting to be discovered. To run a store better, customers' purchases and purchasing habits must be studied. Association rule mining is widely applied, and relating the algorithm closely to practical problems is the main research goal in this area. This paper integrates a Boolean-matrix-based association rule mining algorithm into the mall's marketing decision support system, so as to obtain from customers' item data useful information that improves business returns and competitiveness.
Because transaction information differs from day to day [], copies of transactions from different times were randomly selected, and the membership numbers and the corresponding purchase information were entered into database D. Table 2 shows part of the transaction records in the database (ten records).

Table 2. Transaction database D

| Customer membership (i) | Purchased goods |
|---|---|
| 1 | Milk, bread, shampoo, instant noodles, fruit drinks |
| 2 | Razor, battery, gum |
| 3 | Shampoo, towel, toothbrush, fruit drinks |
| 4 | Milk, bread, fruit drinks |
| 5 | Fruit drinks, battery, razor |
| 6 | Milk, bread, shampoo, towel, toothbrush |
| 7 | Fruit drinks, gum |
| 8 | Fruit drinks, bread, milk |
| 9 | Towel, toothbrush, shampoo, instant noodles |
| 10 | Bread, milk, shampoo, towel, toothbrush |

Given an ordinary two-dimensional table, the non-discrete attributes must be discretized to obtain the tables required for association rule analysis. In Table 2, the customer membership numbers are labeled i (i = 1, 2, ..., n), and milk, bread, shampoo, towel, toothbrush, fruit drinks, razor, battery, chewing gum and instant noodles are recorded in sequence as j (j = 1, 2, ..., n). The association rule mining algorithm based on the Boolean matrix is applied to discretize the transaction database, and the discrete results are shown in Table 3.
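The separation of the transaction table into discrete Boolean attributes can be sketched in Python as follows. This is a minimal illustration rather than the authors' program: the names `ITEMS`, `TRANSACTIONS` and `to_boolean_matrix` are hypothetical, "fruit drinks" is treated as a single item, and only the first four purchase records are used to keep the output short.

```python
# Items in column order j, following the order in which the text lists them.
ITEMS = [
    "milk", "bread", "shampoo", "towel", "toothbrush",
    "fruit drinks", "razor", "battery", "gum", "instant noodles",
]

# Assumed sample: the first four purchase records of the transaction table.
TRANSACTIONS = [
    {"milk", "bread", "shampoo", "instant noodles", "fruit drinks"},
    {"razor", "battery", "gum"},
    {"shampoo", "towel", "toothbrush", "fruit drinks"},
    {"milk", "bread", "fruit drinks"},
]


def to_boolean_matrix(transactions, items):
    """Build an m x n matrix: entry [i][j] is 1 if transaction i contains item j."""
    return [[1 if item in t else 0 for item in items] for t in transactions]


MATRIX = to_boolean_matrix(TRANSACTIONS, ITEMS)

if __name__ == "__main__":
    for row in MATRIX:
        print(row)
```

Each row of the resulting matrix corresponds to one customer i and each column to one item j, which is the form on which the Boolean-matrix algorithm operates.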

Table 3. The discrete relational table D (item sets)

4.2. Association Rule Mining
We set the minimum support and the minimum confidence as percentages: mins = %; minc = %. The minimum support count is obtained as min_sup × m, where m is the number of transactions.

First, the transaction database D is transformed into a Boolean matrix $A_{m \times n}$. The frequent itemsets of this matrix are then calculated by a corresponding computer program. The operation yields six frequent itemsets, five 2-itemsets and one 3-itemset (the frequent 1-itemsets are not considered). Potential association rules can be listed from these six frequent itemsets, and the confidence C of each rule can be calculated. Finally, the association rules are listed in Table 4.
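Support counting on a Boolean matrix reduces to a logical AND of item columns, and the confidence of a rule X ⇒ Y is the support count of X ∪ Y divided by that of X. The sketch below illustrates this on the ten purchase records of the transaction table. It is an assumption-laden illustration, not the authors' program: `MIN_COUNT = 4` and `MIN_CONF = 0.75` are values chosen here rather than the paper's settings, all function names are hypothetical, and candidates are generated by simple unions without the subset-pruning step of Apriori.

```python
from itertools import combinations

ITEMS = ["milk", "bread", "shampoo", "towel", "toothbrush",
         "fruit drinks", "razor", "battery", "gum", "instant noodles"]

TRANSACTIONS = [
    {"milk", "bread", "shampoo", "instant noodles", "fruit drinks"},
    {"razor", "battery", "gum"},
    {"shampoo", "towel", "toothbrush", "fruit drinks"},
    {"milk", "bread", "fruit drinks"},
    {"fruit drinks", "battery", "razor"},
    {"milk", "bread", "shampoo", "towel", "toothbrush"},
    {"fruit drinks", "gum"},
    {"fruit drinks", "bread", "milk"},
    {"towel", "toothbrush", "shampoo", "instant noodles"},
    {"bread", "milk", "shampoo", "towel", "toothbrush"},
]

# Boolean matrix stored by column: COLUMNS[item][i] is True if transaction i has item.
COLUMNS = {item: [item in t for t in TRANSACTIONS] for item in ITEMS}

MIN_COUNT = 4    # assumed minimum support count (40% of 10 transactions)
MIN_CONF = 0.75  # assumed minimum confidence


def support_count(itemset):
    """AND the Boolean columns of the items and count the rows that stay True."""
    cols = [COLUMNS[item] for item in itemset]
    return sum(all(bits) for bits in zip(*cols))


def frequent_itemsets(min_count):
    """Level-wise search: keep itemsets whose support count reaches min_count."""
    frequent = {}
    candidates = [frozenset([item]) for item in ITEMS]
    size = 1
    while candidates:
        level = {c: support_count(c) for c in candidates}
        level = {c: n for c, n in level.items() if n >= min_count}
        frequent.update(level)
        size += 1
        # Next candidates: unions of two frequent itemsets that have the next size.
        candidates = list({a | b for a, b in combinations(level, 2)
                           if len(a | b) == size})
    return frequent


def association_rules(frequent, min_conf):
    """List rules X => Y with confidence = count(X u Y) / count(X) >= min_conf."""
    found = []
    for itemset, count in frequent.items():
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                conf = count / frequent[lhs]
                if conf >= min_conf:
                    found.append((lhs, itemset - lhs, conf))
    return found


FREQUENT = frequent_itemsets(MIN_COUNT)
RULES = association_rules(FREQUENT, MIN_CONF)

if __name__ == "__main__":
    for lhs, rhs, conf in RULES:
        print(sorted(lhs), "=>", sorted(rhs), round(conf, 2))
```

With these assumed thresholds the sketch finds four frequent 2-itemsets and one frequent 3-itemset (shampoo, towel, toothbrush); it is not expected to reproduce the paper's rule table exactly.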

Table 4. Association rules

| No. | Potential association rule X ⇒ Y | C ≥ minc (association rule?) |
|---|---|---|
| 1 | {} ⇒ {} | Yes |
| 2 | {} ⇒ {} | Yes |
| 3 | {} ⇒ {} | No |
| 4 | {} ⇒ {} | Yes |
| 5 | {} ⇒ {} | No |
| 6 | {} ⇒ {} | Yes |
| 7 | {} ⇒ {} | Yes |
| 8 | {} ⇒ {} | Yes |
| 9 | {} ⇒ {} | Yes |
| 10 | {} ⇒ {} | Yes |
| 11 | {} ⇒ {,} | No |
| 12 | {} ⇒ {,} | Yes |
| 13 | {} ⇒ {,} | Yes |
| 14 | {,} ⇒ {} | Yes |
| 15 | {,} ⇒ {} | Yes |
| 16 | {,} ⇒ {} | Yes |

Finally, the association rules obtained are the ones marked "Yes" in Table 4.

4.3. Mining Results
Using the Boolean-matrix-based association rule analysis above, some simple association rules can be obtained. For example, milk, bread, shampoo, towels and toothbrushes are strongly correlated, and selling these strongly correlated items together can increase sales and provide data support for product placement []. However, this is only a minor aspect of mall marketing.

5. Conclusion
As this paper shows, improving the association rule data mining algorithm is significant, and so is strengthening its simulation. However, this is systematic work that cannot be accomplished overnight; it needs to be improved in many respects, such as data preparation and systematic collection. The authors believe that data mining, as a new discipline, needs further study to improve its efficiency and capability. A data cube environment and the corresponding data warehouse technology should be provided so that data mining can meet practical demands. Once these problems are solved, the application of the association rule data mining algorithm will be strengthened, laying a solid foundation for its further improvement and simulation [, ].

References
[1] Lifeng Liu. A mining algorithm of financial data cleaning based on association rules. Microelectronics and Computer. ; : -.
[2] Jie Ma. Association rule data mining algorithm research on cloud computing environment. Journal of Industrial and Commercial University of Chongqing (Natural Science Edition). ; : -.
[3] Jun Li, Wenliang Song, Kuan Zhu. A novel association rule data mining algorithm. Liaoning Engineering Technology University Journal (Natural Science Edition). ; : -.
[4] Gang Wang. A Novel Data Mining Algorithm for Mathematics Teaching Evaluation. TELKOMNIKA Indonesian Journal of Electrical Engineering. ; (): -.
[5] M Abdar, S.R.N. Kalhori, T. Sutikno, I.M. Subroto, G. Arji. Comparing Performance of Data Mining Algorithms in Prediction Heart Diseases. International Journal of Electrical and Computer Engineering (IJECE). ; (): -.
[6] Ali SH, El Desouky A, Saleh A. A New Profile Learning Model for Recommendation System based on Machine Learning Technique. Indonesian Journal of Electrical Engineering and Informatics. ; (): -.
[7] Juanqin Wang, Shuqin Li. Matrix based association rule mining algorithm research and improvement. Computer Measurement and Control. ; : -.
[8] Xue-Feng Jiang. Application of Parallel Annealing Particle Clustering Algorithm in Data Mining. TELKOMNIKA Indonesian Journal of Electrical Engineering. ; (): -.