An Overview of Mining Fuzzy Association Rules

Size: px
Start display at page:

Download "An Overview of Mining Fuzzy Association Rules"

Transcription

1 An Overview of Mining Fuzzy Association Rules Tzung-Pei Hong and Yeong-Chyi Lee Abstract Data mining is the process of extracting desirable knowledge or interesting patterns from existing databases for specific purposes. Many types of knowledge and technology have been proposed for data mining. Among them, finding association rules from transaction data is most commonly seen. Most studies have shown how binary valued transaction data may be handled. Transaction data in real-world applications, however, usually consist of fuzzy and quantitative values, so designing sophisticated data-mining algorithms able to deal with various types of data presents a challenge to workers in this research field. This chapter thus surveys some fuzzy mining concepts and techniques related to association-rule discovery. The motivation from crisp mining to fuzzy mining will be first described. Some crisp mining techniques for handling quantitative data will then be briefly reviewed. Several fuzzy mining techniques, including mining fuzzy association rules, mining fuzzy generalized association rules, mining both membership functions and fuzzy association rules, will then be described. The advantages and the limitations of fuzzy mining will also be discussed. 1 Introduction Knowledge discovery in databases (KDD) has thus become a process of considerable interest in recent years, as the amounts of data in many databases have grown tremendously large. KDD means the application of nontrivial procedures for identifying effective, coherent, potentially useful, and previously unknown patterns in large databases [20, 45]. The KDD process generally consists of three phases: pre-processing, data mining and post-processing [17, 2]. Among them, data mining plays a critical role to KDD. Depending on the types of databases to be processed, mining approaches may be classified as working on transaction databases, temporal databases, relational databases, and multimedia databases, among others [31]. On the other hand, depending on the classes of knowledge derived, mining approaches may be classified as finding association rules, classification rules, clustering rules, and sequential patterns [14], among others. Among them, finding association rules in transaction databases is most commonly seen in data mining [41, 50, 18, 1, 42, 25, 48, 51]. H. Bustince et al., (eds.), Fuzzy Sets and Their Extensions: Representation, Aggregation and Models. C Springer

2 398 T.-P. Hong, Y.-C. Lee An association rule is an expression X Y,whereX is a set of items and Y is usually a single item. It means in the set of transactions, if all the items in X exist in a transaction, then Y is also in the transaction with a high probability. Most studies have shown how binary valued transaction data may be handled. Transactiondata in real-world applications, however, usually consist of fuzzy and quantitative values. Designing sophisticated data-mining algorithms able to deal with various types of data presents a challenge to workers in this research field. Fuzzy set theory was first proposed by Zadeh in 1965 [58]. It has been used more and more frequently in intelligent systems because of its simplicity and similarity to human reasoning [35]. It is primarily concerned with quantifying and reasoning using natural language in which words can have ambiguous meanings. This can be thought of as an extension of traditional crisp sets, in which each element must either be in or not in a set. The theory has been applied in many fields such as manufacturing, engineering, diagnosis, economics, among others [35, 37, 59]. Several fuzzy learning algorithms for inducing rules from given sets of data have been designed and used with good effects in specific domains [33, 34, 19, 24, 46, 29, 53]. Strategies based on decision trees [40] were proposed in [16, 47, 21]. Wang et al. also proposed a fuzzy version space learning strategy for managing vague information [53]. Recently, the fuzzy set theory has also been applied to data mining to find interesting association rules or sequential patterns in transaction data with quantitative values [23, 29, 28, 8, 6, 3, 57]. This chapter thus attempts to survey some fuzzy mining concepts and techniques about association rules. The remaining part of this chapter is organized as follows. Some related crisp mining approaches for association rules are first reviewed in Sect. 2. Some fuzzy mining techniques, including mining fuzzy association rules, mining fuzzy generalized association rules, mining both membership functions and fuzzy association rules, integrating clustering with fuzzy association rules, are then described in Sects. 3 to 5. The advantages and the limitations of fuzzy mining are discussed and conclusions are given in Sect Some Crisp Data-mining Approaches for Association Rules The goal of data mining is to discover important associations among items such that the presence of some items in a transaction will imply the presence of some other items. To achieve this purpose, Agrawal and his co-workers proposed the famous Apriori algorithm that based on the concept of large itemsets to find association rules in transaction data [41, 50]. They divided the mining process into two phases. In the first phase, candidate itemsets were generated and counted by scanning the transaction data. If the number of an itemset appearing in the transactions (called the support of an itemset) was larger than a pre-defined threshold value (called minimum support), the itemset was considered a large itemset. Itemsets containing only one item were processed first. Large itemsets containing only single items were then combined to form candidate itemsets containing two items. This process was repeated until all large itemsets had been found. In the second phase, association rules were induced from the large itemsets found in the first

3 An Overview of Mining Fuzzy Association Rules 399 phase. All possible association rules from each large itemset were formed, and their confidence values were calculated. The confidence was defined as the support of the itemset in the left hand side of the rule over the support of the itemset in the whole rule. Those rules with calculated confidence values larger than a predefined threshold (called minimum confidence) were output as association rules. Han et al. proposed the Frequent-Pattern-tree structure (FP-tree) for efficiently mining association rules without generation of candidate itemsets [22]. The approach compressed a database into a tree structure storing only large items. Three steps were involved in FP-tree construction. The database was first scanned to find all items with their frequency. The items with their supports larger than a predefined minimum support were selected as large 1-itemsets (items). Next, the large items were sorted in descending frequency. At last, the database was scanned again to construct the FP-tree according to the sorted order of large items. The construction process was executed tuple by tuple, from the first transaction to the last one. After all transactions were processed, the FP-tree was completely constructed. After the FP-tree was constructed from a database, a mining procedure called FP-Growth [22] was executed to find all large itemsets. FP-Growth did not need to generate candidate itemsets for mining, but derived frequent patterns directly from the FP-tree. It was a recursive process, handling the frequent items one by one. A conditional FP-tree was generated for each frequent item, and from the tree the large itemsets with the processed item could be recursively derived. Many mining methods for finding association rules based on the FP-tree structure have also been proposed [32, 4, 49, 60]. Cai et al. proposed weighted mining to reflect different importance on items [36]. Each item was attached a numerical weight given by users. Weighted supports and weighted confidenceswere then defined to determine interesting association rules. In the above approach, the items in transactions are binary. That is, the value of an item is treated as 1 if the item is present in a transaction, and as 0 if it is not present. Items in real applications may, however, be presented with rich data types, such as quantitative and categorical types. Some approaches were then proposed for handling items with quantitative and categorical values while discovering association rules. Piatetsky-Shapiro proposed a mining approach, which partitioned quantitative attributes into intervals [27]. The value of each quantitative attribute could be presented as a range rather than a single value, and was allowed to appear in the left-hand side (antecedent) of a rule. Srikant and Agrawal then proposed a method for mining association rules from transactions with quantitative attributes [51]. Their proposed method first determined the number of partitions for each quantitative attribute, and then mapped all possible values of each attribute into a set of consecutive integers. The number of intervals was determined by a partial completeness factor, which was used to evaluate the lost information from partition. Adjacent intervals were also allowed to be merged into a large one. Their approach then found large itemsets whose support values were greater than the user-specified minimum-support levels. These large itemsets were then processed to generate association rules. An interest measure was also introduced to get interesting association rules.

4 400 T.-P. Hong, Y.-C. Lee Fukuda et al. proposed a mining approach to discover optimized association rules from quantitative data [18]. Two problems were discussed, one for mining optimized support rules and the other for mining optimized conference rules. Mining optimized support rules focused on finding an association rule with a maximum support value on the condition that the confidence value of the rule had to be larger than a given minimum confidence. On the other hand, mining optimized confidence rules emphasized on finding an association rule with a maximum confidence value on the condition that the support value of the rule had to be larger than a given minimum support. The authors first found the ranges of quantitative attributes by sampling, sorting and filling data into buckets before the mining process proceeded. Association rules containing quantitative attributes in the left-hand side were then found by an effective search approach. In addition, Rastogi and Shim extended Fukuda et al. s approach to find the association rules with disjunction conditions [30]. Lent et al. proposed a geometry-based algorithm to deal with the problem of clustering association rules from quantitative attributes [11]. Each association rule was initially represented by a primitive attribute value (or range). Adjacent association rules were then clustered together to form generalized rules. Many other researches are still in progress. 3 Mining Fuzzy Association Rules As mentioned above, the fuzzy set theory is concerned with quantifying and reasoning using natural language. It is thus very suitable to handle quantitative values by fuzzy sets. Several fuzzy mining approaches have thus been proposed to find interesting association rules or sequential patterns in transaction data with quantitative values. In this section, we focus on the data mining approaches for finding association rules with predefined membership functions. Chan and Au proposed an F-APACS algorithm to mine fuzzy association rules [23]. They first transformed quantitative attribute values into linguistic terms and then used the adjusted difference analysis to find interesting associations among attributes. It had the advantage that the user-specified thresholds were not needed since the statistical analysis was used. In addition, both positive and negative associations could be found. Kuok et al. proposed a fuzzy mining approach to handle numerical data in databases with attributes and derived fuzzy association rules [3]. At nearly the same time, Hong et al. proposed a fuzzy mining algorithm to mine fuzzy rules from quantitative transaction data [29]. Basically, the fuzzy mining algorithms first used membership functions to transform each quantitative value into a fuzzy set in linguistic terms. The algorithm then calculated the scalar cardinality of each linguistic term on all the transaction data. The mining process based on fuzzy counts was then performed to find fuzzy association rules. Hong et al. described the fuzzy mining steps in details. Their fuzzy mining algorithm is described below as a good reference [8].

5 An Overview of Mining Fuzzy Association Rules 401 The Fuzzy Data Mining Algorithm: INPUT: A set of n training data, each with m attribute values, a set of membership functions, a predefined minimum support value α, and a predefined confidence value λ. OUTPUT: A set of fuzzy association rules. STEP 1: Transform the quantitative value v (i) j of each transaction datum D (i) for each attribute A j, j=1 to m, into a fuzzy set f (i) j represented as ( (i) ) f j 1 R j1 + f (i) j 2 R j2 + + f (i) j l R jl using the given membership functions, where i presents the current transaction number that is processed, R jk is the k-th fuzzy region (linguistic term) of attribute A j, f (i) jk is the value of the membership function in R jk for the value v (i) j,andl (= A j )isthe number of fuzzy regions for A j. STEP 2: Calculate the count of each attribute region (linguistic term) R jk in the transaction data: count jk = n i=1 f (i) jk. STEP 3: Collect each attribute region (linguistic term) to form the candidate set C 1. STEP 4: Check whether count jk of each R jk (1 j m and 1 k A j )is larger than or equal to the predefined minimum support value α. IfR jk satisfies the above condition, put it in the set of large 1-itemsets (L 1 ). That is: L 1 ={R jk count jk α, 1 j m and 1 k A j }. STEP 5: IF L 1 is not null, then do the next step; otherwise, exit the algorithm. STEP 6: Set r=1, where r is used to represent the number of items kept in the current large itemsets. STEP 7: Join the large itemsets L r to generate the candidate set C r+1 in a way similar to that in the apriori algorithm [41] except that two regions (linguistic terms) belonging to the same attribute cannot simultaneously exist in an itemset in C r+1. Restated, the algorithm first joins L r and L r under the condition that r-1 items in the two itemsets are the same and the other one is different. It then keeps in C r+1 the itemsets which have all their sub-itemsets of r items existing in L r and do not have any two items R jp and R jq (p q) of the same attribute R j. STEP 8: Do the following substeps for each newly formed (r+1)-itemset s with items (s 1, s 2,, s r+1 )inc r+1 :

6 402 T.-P. Hong, Y.-C. Lee (a) Calculate the fuzzy value of each transaction data D (i) in s as f s (i) = f s (i) 1 f s (i) 2 f s (i) r+1,where f s (i) j is the membership value of D (i) in region s j. If the minimum operator is used for the intersection, then: f (i) s = Min r+1 j=1 f (i) s j. (b) Calculate the count of s in the transactions as: count s = (c) If count s is larger than or equal to the predefined minimum support value α, put s in L r+1. STEP 9: IF L r+1 is null, then do the next step; otherwise, set r = r+1 and repeat STEPs 6 to 8. STEP 10: Collect the large itemsets together. STEP 11: Construct association rules for each large q-itemset s with items (s 1, s 2,...,s q ), q 2, using the following substeps: n i=1 f (i) s (a) Form each possible association rule as follows: s 1 s k 1 s k+1 s q, k = 1 to q. (b) Calculate the confidence values of all association rules using: ni=1 ( f (i) s 1 ni=1 f s (i) f s (i) k 1 f s (i) k+1 f s (i) q ). STEP 12: Output the association rules with confidence values larger than or equal to the predefined confidence threshold λ. In the above fuzzy mining approach, all the linguistic terms are used. As an alternative, each item can use only the linguistic term with the maximum cardinality in later mining processes [28]. It can thus keep the same number of items as the original attributes. The alternative therefore focuses on the most important linguistic terms and can reduce its time complexity. Its derived set of association rules is, however, more incomplete than that by considering all the linguistic items. Trade-off thus exists between the rule completeness and the time complexity. In addition, items may have different importance. Yue et al. thus extended the above concept to find fuzzy association rules with weighted items from transaction data [57]. Each item was given a weight to represent the importance of an item, and each weight was in a range of [0, 1]. They also adopted Kohonen self-organized mapping to derive fuzzy sets for numerical attributes. Weighted supports and weighted conferences were utilized to discover weighted fuzzy association rules.

7 An Overview of Mining Fuzzy Association Rules 403 Hong et al. then proposed a mining approach for extracting interesting weighted fuzzy association rules from transactions, with the parameters (minimum support and minimum confidence) needed in the mining process were given in linguistic terms [6]. Items were evaluated by managers as linguistic terms to reflect their importance, which were then transformed as fuzzy sets of weights. The approach then transformed linguistic weighted items, minimum supports and minimum confidences into fuzzy sets, then filtered weighted large itemsets out by fuzzy operations. Weighted association rules with linguistic supports and confidences were then derived from the weighted large itemsets. 4 Mining Fuzzy Generalized Association Rules Most mining algorithms for association rules focused on single-concept levels. However, mining multiple-concept-level rules may lead to discovery of more specific and important knowledge from data. Relevant item taxonomies are usually predefined in real-world applications and can be represented by hierarchy trees. Terminal nodes on the trees represent actual items appearing in transactions; internal nodes represent classes or concepts formed by lower-level nodes. A simple example is given in Fig. 1. In this example, food falls into two classes: brink and bread. Brink can be further classified into milk and juice. Similarly, clothes are divided into jackets and T-shirts. Only the terminal items (milk, juice, bread, jacket and T-shirt) can appear in transactions. Srikant and Agrawal thus proposed a method for finding generalized association rules, which included generalized items (the terminal nodes in taxonomy) [48]. Their mining process could be divided into four phases. In the first phase, ancestors of items in each given transaction were added according to the predefined taxonomy. In the second phase, large itemsets were generated in a way similar to the Apriori algorithm. In the third phase, all possible generalized association rules were induced from the large itemsets found in the second phase. The rules with calculated confidence values larger than the minimum confidence were kept. In the fourth phase, Food Clothes Drink Bread Jackets T-shirts Fig. 1 An example of predefined taxonomic structures Milk Juice

8 404 T.-P. Hong, Y.-C. Lee uninteresting association rules were pruned and interest rules were output according to the following interest requirements: A rule that had no ancestor rules (by replacing the items in a rule with their ancestors in the taxonomy) was mined out; The support of a rule was R times larger than the expected supports of its ancestor rules, where R was called a minimum interest; The confidence of a rule was R times larger than the expected confidences of its ancestor rules. In addition, Han and Fu also proposed a method for finding level-crossing association rules at multiple levels [1]. Nodes in predefined taxonomies were first encoded using sequences of numbers and the symbol * according to their positions in the hierarchy tree. A top-down progressively deepening search approach was then used to explore level-crossing association relationships. Wei et al. considered the partial relationships possibly existing in taxonomy [56]. An item may partially belong to more than one parent item. For instance, tomato may partially belong to both fruit and vegetable with different degrees. Wei et al. thus defined a fuzzy taxonomic structure and considered the extended degrees of support, conference and interest measures for mining generalized association rules. Hong et al. proposed a fuzzy multiple-level mining algorithm for extracting implicit knowledge from transactions stored as quantitative values [5]. The proposed generalized fuzzy mining algorithm was based on Srikant and Agrawal s approach to find fuzzy interesting rules from quantitative data. The quantitative items may be from any level of the given taxonomy. They also proposed another fuzzy multiple-level mining algorithm which adopted a top-down progressively deepening approach [28]. Shen et al. devised an algorithm for mining generalized association rules with fuzzy taxonomic structure from quantitative databases based on Wei s algorithm [39]. They handled the quantitative data by Srikant and Agrawal s algorithm [51], which partitioned quantitative attributes into several intervals. Kaya et al. then extended Hong et al s approaches [6, 43] for discovering multi-cross-level fuzzy weighted association rules [9]. In summary, mining fuzzy generalized association rules is a little different from mining fuzzy association rules in the following ways. 1. Non-terminal items must be processed. These items are usually considered expanded items. 2. The candidate set C 2 must be processed in a particular way. The two items in an itemset in C 2 must not have relationships in the taxonomy. This check needs to be done only for C 2. The other candidate sets derived from C 2 will not have hierarchical relationships due to the characteristic of sub-itemset checking. 3. Cross-level association rules need to be found. 4. The interestingness of association rules must consider generalization criteria from the taxonomy. Interest requirements are checked to remove uninteresting rules.

9 An Overview of Mining Fuzzy Association Rules Mining both Membership Functions and Fuzzy Association Rules The proposed approaches in the above sections mined fuzzy rules under a given set of membership functions. The given membership functions had a critical influence on the final mining results. Although many approaches for learning membership functions were proposed [12, 13, 38, 54], most of them were usually used for classification or control problems. Recently, researches about fuzzy data-mining algorithms for extracting both association rules and membership functions from quantitative transactions have been proposed. For example, Wang et al. tuned membership functions for intrusion detection systems based on similarity of association rules [55]. Kaya et al. proposed a GA-based clustering method to derive a predefined number of membership functions for getting a maximum profit within an interval of user specified minimum support values [7]. In their approach, the membership functions of quantitative attributes were obtained by GAs and were then used to discover fuzzy association rules. Its goal was to output the membership functions which would generate the maximized number of large itemsets. For this purpose, the parameter values of membership functions of the quantitative attributes were encoded into a real-valued string for evolving. Hong et al. proposed several algorithms to dynamically adapt membership functions by genetic algorithms and used them to fuzzify the quantitative transactions [44, 26]. They proposed a GA-based framework for searching membership functions suitable for mining problems and then used the final best set of membership functions to mine association rules. The proposed framework is shown in Fig. 2. The proposed framework maintained a population of sets of membership functions, and used the genetic algorithm to automatically derive the resulting one. It first transformed each set of membership functions into a fixed-length string. It then chose appropriate strings for mating, gradually creating good offspring membership function sets. The offspring membership function sets then underwent recursive evolution until a good set of membership functions has been obtained. The fitness was evaluated by the number of large 1-itemsets and the suitability of membership functions. The suitability of membership functions was composed of two terms, overlap and coverage. The overlap ratio of two membership functions was defined as the overlap length divided by half the minimum span of the two functions. If the overlap length was larger than half the span, then these two membership functions were thought of as a little redundant. Appropriate punishment was thus considered in this case. The coverage ratio of a set of membership functions for an item was defined as the coverage range of the functions divided by the maximum quantity of that item in the transactions. The more the coverage ratio was, the better the derived membership functions were. Besides, a larger number of 1-itemsets would usually result in a larger number of all itemsets with a higher probability, which would thus usually imply more interesting association rules. The evaluation by 1-itemsets was, however, faster than that by all itemsets or interesting association rules. Using the number of large 1-itemsets could thus achieve a trade-off between execution time and rule interestingness.

10 406 T.-P. Hong, Y.-C. Lee Linguistic terms Minimum support Minimum confidence Mining Membership Functions PC Membership Function Set 1 Membership Function Set 2 Membership Function Set 3... Membership Function Set q Population Chromosome 1 Chromosome 2 Chromosome 3... Chromosome q Genetic Fuzzy MF Acquisition process Fuzzy Mining for Large 1 -itemsets Transaction Database Mining Fuzzy Association Rules Final membership function Set Fuzzy Mining Fuzzy Association Rules Fig. 2 GA-based framework for searching membership functions Hong et al. also proposed an enhanced approach, called cluster-based fuzzy-ga mining algorithm, to speed up the evaluation process and keep nearly the same quality of solutions [15]. The proposed algorithm first divided the chromosomes in a population into k clusters by using the k-means clustering approach. All the chromosomes in a cluster then used the number of large 1-itemsets derived from the representative chromosome in the cluster and their own suitability of membership functions to calculate the fitness values. Since the chromosomes with similar coverage and overlap factors would form a cluster, they would have nearly the same shape of membership functions and induce about the same number of large 1-itemsets. For each cluster, the chromosome which was the nearest to the cluster center was thus chosen to derive its number of large 1-itemsets. All chromosomes in the same cluster then used the number of large 1-itemsets derived from the representative chromosome as their own. Finally, each chromosome was evaluated by this number of large 1-itemsets divided by its own suitability value. The evaluation cost could thus be greatly reduced due to the time-saving in finding 1-itemsets. 6 Discussion and Conclusion This chapter has given a wide survey about fuzzy data mining for association rules. Several fuzzy mining techniques, including mining fuzzy association rules, mining

11 An Overview of Mining Fuzzy Association Rules 407 fuzzy generalized association rules, mining both membership functions and fuzzy association rules, have been described. Some crisp mining techniques for handling quantitative data have also been briefly reviewed. Verlinde et al. investigated the difference between the rules by fuzzy mining and crisp mining through experiments and claimed the difference was small from the experimental results [52]. They constrained the rules with only a single item in both the antecedent and the consequence. The experiments were made in the environments where the data were normally distributed and the data boundaries were at the sides of the normal distributions. It is apparent that the rule difference in this situation is small since the data used by fuzzy mining and by crisp mining are not different very much. The boundary effect reduced by the fuzzy mining approaches on data in large databases and with normal distribution is thus not significant. However, as we know, the fuzzy set is usually used for modeling human perception on a concept. The membership functions defined are not necessarily corresponding to the distribution of a data. It is also the reason why the fuzzy set was proposed and not replaced with the probability theory. In summary, when compared to conventional crisp-set mining methods for quantitative data, fuzzy-mining approaches can get smoother mining results due to the fuzzy membership characteristics. The mined rules are expressed in linguistic terms, which are more natural and understandable for human beings. Besides, nearly all the fuzzy mining approaches can be easily degraded into crisp ones by assigning membership functions with values always equal to 1 or 0. These make fuzzy mining promising in real applications. Acknowledgment The authors would like to thank Mr. Chien-Shing Chen for his help in making the experiments. This research was supported by the National Science Council of the Republic of China under contract NSC E References 1. Agrawal R, Imielinksi T and Swami A (1993) Mining association rules between sets of items in large database. In: Proceedings of the ACM SIGMOD Conference, Washington DC, USA, pp Agrawal R, Srikant R (1994) Fast algorithm for mining association rules. In: Proceedings of the International Conference on Very Large Databases, pp Agrawal R, Srikant R (1995) Mining sequential patterns. In: Proceedings of the 8th IEEE International Conference on Data Engineering, pp Blishun AF (1987) Fuzzy learning models in expert systems. Fuzzy Sets and Systems, 22: Cai CH, Fu WC, Cheng CH, Kwong WW (1998) Mining association rules with weighted items. In: Proceedings of the International Database Engineering and Applications Symposium. Cardiff, Wales, UK, pp decampos LM, Moral S (1993) Learning rules for a fuzzy inference model. Fuzzy Sets and Systems, 59: Chan CC, Au WH (1997) Mining fuzzy association rules. Conference on Information and Knowledge Management. Las Vegas, pp

12 408 T.-P. Hong, Y.-C. Lee 8. Chang RLP, Pavliddis T (1977) Fuzzy decision tree algorithms. IEEE Transactions on Systems, Man and Cybernetics, 7: Chen MS, Han J, Yu PS (1996) Data mining: an overview from database perspective. IEEE Transaction on Knowledge and Data Engineering, 8(6): Chen CH, Hong TP, Tseng VSM (2006) Cluster-based fuzzy-genetic mining approach for association rules and membership functions. In: Proceedings of the 2006 IEEE International Conference on Fuzzy Systems. 11. Clair C, Liu C, Pissinou N (1998) Attribute weighting: a method of applying domain knowledge in the decision tree process. In: Proceedings of the 7th International Conference on Information and Knowledge Management, pp Clark P, Niblett T (1989) The CN2 induction algorithm. Machine Learning, 3: Cordón O, Herrera F, Villar P (2001) Generating the knowledge base of a fuzzy rule-based system by the genetic learning of the data base. In: Proceedings of IEEE Transactions on Fuzzy Systems, 9(4): Cordón O, Herrera F, Hoffmann F, Magdalena L (2001) Evolutionary tuning and learning of fuzzy knowledge bases. Genetic Fuzzy Systems: Evolutionary Tuning and Learning of Fuzzy Knowledge Bases. World Scientific Publishing Company, Singapore. 15. Delgado M, Gonzalez A (1993) An inductive learning procedure to identify fuzzy systems. Fuzzy Sets and Systems, 55: Ezeife CI (2002) Mining incremental association rules with generalized FP-tree. In: Proceedings of the 15th Conference of the Canadian Society for Computational Studies of Intelligence on Advances in Artificial Intelligence, pp Famili A, Shen W, Weber R, Simoudis E (1997) Data preprocessing and intelligent data analysis. Intelligent Data Analysis, 1(1): Fayyad U, Piatesky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery in databases. In: Fayyad U, Piatesky-Shapiro G, Smyth P (eds): Advances in Knowledge Discovery & Data Mining. AAAI/MIT, pp Frawley WJ, Piatetsky-Shapiro G, Matheus CJ (1991) Knowledge discovery in databases: an overview. The AAAI Workshop on Knowledge Discovery in Databases, pp Fukuda T, Morimoto Y, Morishita S, Tokuyama T (1996) Mining optimized association rules for numeric attributes. The ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp Gonzalez A (1995) A learning methodology in uncertain and imprecise environments. International Journal of Intelligent Systems, 10: Han J, Fu Y (1995) Discovery of multiple-level association rules from large database. In: Proceedings of the 21st International Conference on Very Large Data Bases, Zurich, Switzerland, pp Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. The 2000 ACM SIGMOD International Conference on Management of Data, 29(2): Hong TP, Lee CY (1996) Induction of fuzzy rules and membership functions from training examples. Fuzzy Sets and Systems, 84: Hong TP, Kuo CS, Chi SC (1999) A data mining algorithm for transaction data with quantitative values. In: Proceedings of the 8th International Fuzzy Systems Association World Congress, pp Hong TP, Kuo CS, Chi SC (1999) Mining association rules from quantitative data. Intelligent Data Analysis, 3(5): Hong TP, Kuo CS, Chi SC (2001) Trade-off between computation time and number of rules for fuzzy mining from quantitative data. International Journal of Uncertainty. Fuzziness and Knowledge-based Systems, 9(5): Hong TP, Chiang MJ, Wang SL (2002) Mining from quantitative data with linguistic minimum supports and confidences. In: Proceedings of the 8th IEEE International Conference on Fuzzy Systems, pp Hong TP, Lin KY, Chien BC (2003) Mining fuzzy multiple-level association rules from quantitative data. Applied Intelligence, 18(1):

13 An Overview of Mining Fuzzy Association Rules Hong TP, Lin KY, Wang SL (2003) Fuzzy data mining for interesting generalized association rules. Fuzzy Sets and Systems, 138(2): Hong TP, Chen CH, Wu YL, Lee YC (2003) Mining membership functions and fuzzy association rules. In: Proceedings of 2003 The Joint Conference on AI, Fuzzy System, and Grey System. 32. Hong TP, Chen CH, Wu YL, Lee YC (2006) A GA-based fuzzy mining approach to achieve a trade-off between number of rules and suitability of membership functions. Soft Computing, 10(11): Hong TP, Lin JW, Wu YL (2006) A Fast Updated Frequent Pattern Tree. In: Proceedings of the 17th IEEE International Conference on Systems, Man, and Cybernetics. 34. Kandel A (1992) Fuzzy Expert Systems. CRC Press, Boca Raton, pp Kaya M, Alhajj R (2003) A clustering algorithm with genetically optimized membership functions for fuzzy association rules mining. In: Proceedings of The IEEE International Conference on Fuzzy Systems, pp Kaya M, Alhajj R (2004) Mining multi-cross-level fuzzy weighted association rules. In: Proceedings of IEEE International Conference Intelligent Systems, Varna, Bulgaria, pp Kuok C, Fu A, Wong M (1998) Mining fuzzy association rules in databases. SIGMOD Record, 27(1): Lent B, Swami A, and Widom J (1997) Clustering association rules. In: Proceedings of International Conference on Data Engineering (ICDE 97), Birmingham, England, pp Mamdani EH (1974) Applications of fuzzy algorithms for control of simple dynamic plants. Proceedings of IEEE, 121: Mannila H, Toivonen H, Verkamo AI (1994) Efficient algorithm for discovering association rules. The AAAI Workshop on Knowledge Discovery in Databases, pp Mannila H (1997) Methods and problems in data mining. In: Proceedings of the 6th International Conference on Database Theory (ICDT 97), LNCS, Springer-Verlag, 1186: Park JS, Chen MS, Yu PS (1997) Using a hash-based method with transaction trimming for mining association rules. IEEE Transactions on Knowledge and Data Engineering, 9(5): Piatetsky-Shapiro G (1991) Discovery, analysis, and presentation of strong rules. Knowledge Discovery in Databases, AAAI/MIT Press, pp Qiu Y, Lan YJ, Xie QS (2004) An improved algorithm of mining from FP-tree. In: Proceedings of the 3rd International Conference on Machine Learning and Cybernetics, Shanghai, pp Quinlan JR (1987) Decision tree as probabilistic classifier. In:Proceedings of the 4th International Machine Learning Workshop, Morgan Kaufmann, San Mateo, CA., pp Rastogi R, Shim K (1998) Mining optimized association rules with categorical and numeric attributes. In: Proceedings of the 14th IEEE International Conference on Data Engineering, Orlando, pp Rives J (1990) FID3: fuzzy induction decision tree. In: Proceedings of the 1st International symposium on Uncertainty, Modeling and Analysis, pp Setnes M, Roubos H (2000) GA-fuzzy modeling and classification: Complexity and performance. IEEE Transaction on Fuzzy System, 8(5): Shen H, Wang S, Yang J (2004) Fuzzy taxonomic, quantitative database and mining generalized association rules. In: Proceedings of the 4th International Conference on Rough Sets and Current Trends in Computing (RSCTC 2004), Uppsala, Sweden, pp Srikant R, Agrawal R (1995) Mining generalized association rules. In: Proceedings of the 21st International Conference on Very Large Data Bases, pp Srikant R, Agrawal R (1996) Mining quantitative association rules in large relational tables. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, Monreal, Canada, pp Verlinde H, Cock MD, Boute R (2006) Fuzzy versus quantitative association rules: a fair data driven comparison. IEEE Transactions on Systems, Man and Cybernetics, 36(3): Wang CH, Hong TP, Tseng SS (1996) Inductive learning from fuzzy examples. In: Proceedings of the fifth IEEE International Conference on Fuzzy Systems, New Orleans, pp

14 410 T.-P. Hong, Y.-C. Lee 54. Wang CH, Hong TP, Tseng SS (1998) Integrating fuzzy knowledge by genetic algorithms. IEEE Transactions on Evolutionary Computation, 2(4): Wang W, Bridges SM (2000) Genetic algorithm optimization of membership functions for mining fuzzy association rules. In: Proceedings of the International Joint Conference on Information Systems, Fuzzy Theory and Technology, pp Wei Q, Chen G (1999) Mining generalized association rules with fuzzy taxonomic structures. In: Proceedings of the 18th International Conference of the North American Fuzzy Information Processing Society (NAFIPS), NY, USA, pp Yue S, Tsand E, Yeung D, Shi D (2000) Mining fuzzy association rules with weighted items. In: Proceedings of the IEEE International Conference on System, Man and Cybernetics, pp Zadeh LA (1965) Fuzzy sets. Information and Control, 8(3): Zadeh LA (1988) Fuzzy logic. IEEE Computer, pp Zaïane OR, Mohammed EH (2003) COFI-tree mining: A new approach to pattern growth with reduced candidacy generation. In: Proceedings of the ICDM 2003 Workshop on Frequent Itemset Mining Implementations.

Maintenance of fast updated frequent pattern trees for record deletion

Maintenance of fast updated frequent pattern trees for record deletion Maintenance of fast updated frequent pattern trees for record deletion Tzung-Pei Hong a,b,, Chun-Wei Lin c, Yu-Lung Wu d a Department of Computer Science and Information Engineering, National University

More information

A GA-based fuzzy mining approach to achieve a trade-off between number of rules and suitability of membership functions

A GA-based fuzzy mining approach to achieve a trade-off between number of rules and suitability of membership functions Soft Comput (26 : 9 DOI.7/s5-6-46-x ORIGINAL PAPER Tzung-Pei Hong Chun-Hao Chen Yu-Lung Wu Yeong-Chyi Lee A GA-based fuzzy mining approach to achieve a trade-off between number of rules and suitability

More information

Maintenance of Generalized Association Rules for Record Deletion Based on the Pre-Large Concept

Maintenance of Generalized Association Rules for Record Deletion Based on the Pre-Large Concept Proceedings of the 6th WSEAS Int. Conf. on Artificial Intelligence, Knowledge Engineering and ata Bases, Corfu Island, Greece, February 16-19, 2007 142 Maintenance of Generalized Association Rules for

More information

Fuzzy Weighted Data Mining from Quantitative Transactions with Linguistic Minimum Supports and Confidences

Fuzzy Weighted Data Mining from Quantitative Transactions with Linguistic Minimum Supports and Confidences International Journal of Fuzzy Systems, Vol. 8, No. 4, December 26 73 Fuzzy Weighted Data Mining from Quantitative Transactions with Linguistic Minimum Supports and Confidences Tzung-Pei Hong, Ming-Jer

More information

The fuzzy data mining generalized association rules for quantitative values

The fuzzy data mining generalized association rules for quantitative values The fuzzy data mining generalized association rules for quantitative values Tzung-Pei Hong* and Kuei-Ying Lin Graduate School of Information Engineering I-Shou University Kaohsiung, 84008, Taiwan, R.O.C.

More information

Appropriate Item Partition for Improving the Mining Performance

Appropriate Item Partition for Improving the Mining Performance Appropriate Item Partition for Improving the Mining Performance Tzung-Pei Hong 1,2, Jheng-Nan Huang 1, Kawuu W. Lin 3 and Wen-Yang Lin 1 1 Department of Computer Science and Information Engineering National

More information

An Efficient Tree-based Fuzzy Data Mining Approach

An Efficient Tree-based Fuzzy Data Mining Approach 150 International Journal of Fuzzy Systems, Vol. 12, No. 2, June 2010 An Efficient Tree-based Fuzzy Data Mining Approach Chun-Wei Lin, Tzung-Pei Hong, and Wen-Hsiang Lu Abstract 1 In the past, many algorithms

More information

Shaikh Nikhat Fatma Department Of Computer, Mumbai University Pillai s Institute Of Information Technology, New Panvel, Mumbai, India

Shaikh Nikhat Fatma Department Of Computer, Mumbai University Pillai s Institute Of Information Technology, New Panvel, Mumbai, India Genetic Approach for Fuzzy Mining Using Modified K-Means Clustering Shaikh Nikhat Fatma Department Of Computer, Mumbai University Pillai s Institute Of Information Technology, New Panvel, Mumbai, India

More information

Maintenance of the Prelarge Trees for Record Deletion

Maintenance of the Prelarge Trees for Record Deletion 12th WSEAS Int. Conf. on APPLIED MATHEMATICS, Cairo, Egypt, December 29-31, 2007 105 Maintenance of the Prelarge Trees for Record Deletion Chun-Wei Lin, Tzung-Pei Hong, and Wen-Hsiang Lu Department of

More information

UNCORRECTED PROOF ARTICLE IN PRESS. 1 Expert Systems with Applications. 2 Mining knowledge from object-oriented instances

UNCORRECTED PROOF ARTICLE IN PRESS. 1 Expert Systems with Applications. 2 Mining knowledge from object-oriented instances 1 Expert Systems with Applications Expert Systems with Applications xxx (26) xxx xxx wwwelseviercom/locate/eswa 2 Mining knowledge from object-oriented instances 3 Cheng-Ming Huang a, Tzung-Pei Hong b,

More information

M.Kannan et al IJCSET Feb 2011 Vol 1, Issue 1,30-34

M.Kannan et al IJCSET Feb 2011 Vol 1, Issue 1,30-34 Genetic Data Mining With Divide-And- Conquer Strategy M.Kannan, P.Yasodha, V.Srividhya CSA Dept., SCSVMV University, Enathur, Kanchipuram - 631 561. Abstract: Data mining is most commonly used in attempts

More information

AN EFFICIENT GRADUAL PRUNING TECHNIQUE FOR UTILITY MINING. Received April 2011; revised October 2011

AN EFFICIENT GRADUAL PRUNING TECHNIQUE FOR UTILITY MINING. Received April 2011; revised October 2011 International Journal of Innovative Computing, Information and Control ICIC International c 2012 ISSN 1349-4198 Volume 8, Number 7(B), July 2012 pp. 5165 5178 AN EFFICIENT GRADUAL PRUNING TECHNIQUE FOR

More information

Applying Data Mining to Wireless Networks

Applying Data Mining to Wireless Networks Applying Data Mining to Wireless Networks CHENG-MING HUANG 1, TZUNG-PEI HONG 2 and SHI-JINN HORNG 3,4 1 Department of Electrical Engineering National Taiwan University of Science and Technology, Taipei,

More information

An Evolutionary Algorithm for Mining Association Rules Using Boolean Approach

An Evolutionary Algorithm for Mining Association Rules Using Boolean Approach An Evolutionary Algorithm for Mining Association Rules Using Boolean Approach ABSTRACT G.Ravi Kumar 1 Dr.G.A. Ramachandra 2 G.Sunitha 3 1. Research Scholar, Department of Computer Science &Technology,

More information

Mining Association Rules From Time Series Data Using Hybrid Approaches

Mining Association Rules From Time Series Data Using Hybrid Approaches International Journal Of Computational Engineering Research (ijceronline.com) Vol. Issue. ining Association Rules From Time Series Data Using ybrid Approaches ima Suresh 1, Dr. Kumudha Raimond 2 1 PG Scholar,

More information

An Improved Apriori Algorithm for Association Rules

An Improved Apriori Algorithm for Association Rules Research article An Improved Apriori Algorithm for Association Rules Hassan M. Najadat 1, Mohammed Al-Maolegi 2, Bassam Arkok 3 Computer Science, Jordan University of Science and Technology, Irbid, Jordan

More information

Mining High Average-Utility Itemsets

Mining High Average-Utility Itemsets Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Mining High Itemsets Tzung-Pei Hong Dept of Computer Science and Information Engineering

More information

To Enhance Projection Scalability of Item Transactions by Parallel and Partition Projection using Dynamic Data Set

To Enhance Projection Scalability of Item Transactions by Parallel and Partition Projection using Dynamic Data Set To Enhance Scalability of Item Transactions by Parallel and Partition using Dynamic Data Set Priyanka Soni, Research Scholar (CSE), MTRI, Bhopal, priyanka.soni379@gmail.com Dhirendra Kumar Jha, MTRI, Bhopal,

More information

Genetic-PSO Fuzzy Data Mining With Divide and Conquer Strategy

Genetic-PSO Fuzzy Data Mining With Divide and Conquer Strategy Genetic-PSO Fuzzy Data Mining With Divide and Conquer Strategy Amin Jourabloo Department of Computer Engineering, Sharif University of Technology, Tehran, Iran E-mail: jourabloo@ce.sharif.edu Abstract

More information

Pattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42

Pattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42 Pattern Mining Knowledge Discovery and Data Mining 1 Roman Kern KTI, TU Graz 2016-01-14 Roman Kern (KTI, TU Graz) Pattern Mining 2016-01-14 1 / 42 Outline 1 Introduction 2 Apriori Algorithm 3 FP-Growth

More information

Discovering fuzzy time-interval sequential patterns in sequence databases

Discovering fuzzy time-interval sequential patterns in sequence databases Discovering fuzzy time-interval sequential patterns in sequence databases Yen-Liang Chen Department of Information Management National Central University ylchen@mgt.ncu.edu.tw Cheng-Kui Huang Department

More information

International Journal of Advance Engineering and Research Development. Fuzzy Frequent Pattern Mining by Compressing Large Databases

International Journal of Advance Engineering and Research Development. Fuzzy Frequent Pattern Mining by Compressing Large Databases Scientific Journal of Impact Factor(SJIF): 3.134 e-issn(o): 2348-4470 p-issn(p): 2348-6406 International Journal of Advance Engineering and Research Development Volume 2,Issue 7, July -2015 Fuzzy Frequent

More information

Graph Based Approach for Finding Frequent Itemsets to Discover Association Rules

Graph Based Approach for Finding Frequent Itemsets to Discover Association Rules Graph Based Approach for Finding Frequent Itemsets to Discover Association Rules Manju Department of Computer Engg. CDL Govt. Polytechnic Education Society Nathusari Chopta, Sirsa Abstract The discovery

More information

Algorithm for Efficient Multilevel Association Rule Mining

Algorithm for Efficient Multilevel Association Rule Mining Algorithm for Efficient Multilevel Association Rule Mining Pratima Gautam Department of computer Applications MANIT, Bhopal Abstract over the years, a variety of algorithms for finding frequent item sets

More information

Association Rule Mining from XML Data

Association Rule Mining from XML Data 144 Conference on Data Mining DMIN'06 Association Rule Mining from XML Data Qin Ding and Gnanasekaran Sundarraj Computer Science Program The Pennsylvania State University at Harrisburg Middletown, PA 17057,

More information

Efficient Mining of Generalized Negative Association Rules

Efficient Mining of Generalized Negative Association Rules 2010 IEEE International Conference on Granular Computing Efficient Mining of Generalized egative Association Rules Li-Min Tsai, Shu-Jing Lin, and Don-Lin Yang Dept. of Information Engineering and Computer

More information

DISCOVERING ACTIVE AND PROFITABLE PATTERNS WITH RFM (RECENCY, FREQUENCY AND MONETARY) SEQUENTIAL PATTERN MINING A CONSTRAINT BASED APPROACH

DISCOVERING ACTIVE AND PROFITABLE PATTERNS WITH RFM (RECENCY, FREQUENCY AND MONETARY) SEQUENTIAL PATTERN MINING A CONSTRAINT BASED APPROACH International Journal of Information Technology and Knowledge Management January-June 2011, Volume 4, No. 1, pp. 27-32 DISCOVERING ACTIVE AND PROFITABLE PATTERNS WITH RFM (RECENCY, FREQUENCY AND MONETARY)

More information

Interestingness Measurements

Interestingness Measurements Interestingness Measurements Objective measures Two popular measurements: support and confidence Subjective measures [Silberschatz & Tuzhilin, KDD95] A rule (pattern) is interesting if it is unexpected

More information

Mining Quantitative Association Rules on Overlapped Intervals

Mining Quantitative Association Rules on Overlapped Intervals Mining Quantitative Association Rules on Overlapped Intervals Qiang Tong 1,3, Baoping Yan 2, and Yuanchun Zhou 1,3 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China {tongqiang,

More information

An overview of interval encoded temporal mining involving prioritized mining, fuzzy mining, and positive and negative rule mining

An overview of interval encoded temporal mining involving prioritized mining, fuzzy mining, and positive and negative rule mining An overview of interval encoded temporal mining involving prioritized mining, fuzzy mining, and positive and negative rule mining # C. Balasubramanian 1, K. Duraiswamy 2, V.Palanisamy 3 ISSN:0975-9646

More information

Generalized Knowledge Discovery from Relational Databases

Generalized Knowledge Discovery from Relational Databases 148 IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.6, June 2009 Generalized Knowledge Discovery from Relational Databases Yu-Ying Wu, Yen-Liang Chen, and Ray-I Chang Department

More information

Improved Frequent Pattern Mining Algorithm with Indexing

Improved Frequent Pattern Mining Algorithm with Indexing IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VII (Nov Dec. 2014), PP 73-78 Improved Frequent Pattern Mining Algorithm with Indexing Prof.

More information

Interestingness Measurements

Interestingness Measurements Interestingness Measurements Objective measures Two popular measurements: support and confidence Subjective measures [Silberschatz & Tuzhilin, KDD95] A rule (pattern) is interesting if it is unexpected

More information

A Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study

A Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study A Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study Mirzaei.Afshin 1, Sheikh.Reza 2 1 Department of Industrial Engineering and

More information

Mining Fuzzy Association Rules Using Mutual Information

Mining Fuzzy Association Rules Using Mutual Information Mining Fuzzy Association Rules Using Mutual Information S. Lotfi, M.H. Sadreddini Abstract Quantitative Association Rule (QAR) mining has been recognized as an influential research problem over the last

More information

Discovery of Multi-level Association Rules from Primitive Level Frequent Patterns Tree

Discovery of Multi-level Association Rules from Primitive Level Frequent Patterns Tree Discovery of Multi-level Association Rules from Primitive Level Frequent Patterns Tree Virendra Kumar Shrivastava 1, Parveen Kumar 2, K. R. Pardasani 3 1 Department of Computer Science & Engineering, Singhania

More information

DATA MINING II - 1DL460

DATA MINING II - 1DL460 Uppsala University Department of Information Technology Kjell Orsborn DATA MINING II - 1DL460 Assignment 2 - Implementation of algorithm for frequent itemset and association rule mining 1 Algorithms for

More information

A Hierarchical Document Clustering Approach with Frequent Itemsets

A Hierarchical Document Clustering Approach with Frequent Itemsets A Hierarchical Document Clustering Approach with Frequent Itemsets Cheng-Jhe Lee, Chiun-Chieh Hsu, and Da-Ren Chen Abstract In order to effectively retrieve required information from the large amount of

More information

Association Rules Mining:References

Association Rules Mining:References Association Rules Mining:References Zhou Shuigeng March 26, 2006 AR Mining References 1 References: Frequent-pattern Mining Methods R. Agarwal, C. Aggarwal, and V. V. V. Prasad. A tree projection algorithm

More information

Temporal Weighted Association Rule Mining for Classification

Temporal Weighted Association Rule Mining for Classification Temporal Weighted Association Rule Mining for Classification Purushottam Sharma and Kanak Saxena Abstract There are so many important techniques towards finding the association rules. But, when we consider

More information

Keywords Fuzzy, Set Theory, KDD, Data Base, Transformed Database.

Keywords Fuzzy, Set Theory, KDD, Data Base, Transformed Database. Volume 6, Issue 5, May 016 ISSN: 77 18X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Fuzzy Logic in Online

More information

Generation of Potential High Utility Itemsets from Transactional Databases

Generation of Potential High Utility Itemsets from Transactional Databases Generation of Potential High Utility Itemsets from Transactional Databases Rajmohan.C Priya.G Niveditha.C Pragathi.R Asst.Prof/IT, Dept of IT Dept of IT Dept of IT SREC, Coimbatore,INDIA,SREC,Coimbatore,.INDIA

More information

Association Rule Mining for Multiple Tables With Fuzzy Taxonomic Structures

Association Rule Mining for Multiple Tables With Fuzzy Taxonomic Structures Association Rule Mining for Multiple Tables With Fuzzy Taxonomic Structures Praveen Arora, R. K. Chauhan and Ashwani Kush Abstract Most of the existing data mining algorithms handle databases consisting

More information

Optimized Weighted Association Rule Mining using Mutual Information on Fuzzy Data

Optimized Weighted Association Rule Mining using Mutual Information on Fuzzy Data International Journal of scientific research and management (IJSRM) Volume 2 Issue 1 Pages 501-505 2014 Website: www.ijsrm.in ISSN (e): 2321-3418 Optimized Weighted Association Rule Mining using Mutual

More information

A SURVEY OF DIFFERENT ASSOCIATIVE CLASSIFICATION ALGORITHMS

A SURVEY OF DIFFERENT ASSOCIATIVE CLASSIFICATION ALGORITHMS Asian Journal Of Computer Science And Information Technology 3 : 6 (2013)88-93. Contents lists available at www.innovativejournal.in Asian Journal of Computer Science And Information Technology Journal

More information

An Efficient Algorithm for finding high utility itemsets from online sell

An Efficient Algorithm for finding high utility itemsets from online sell An Efficient Algorithm for finding high utility itemsets from online sell Sarode Nutan S, Kothavle Suhas R 1 Department of Computer Engineering, ICOER, Maharashtra, India 2 Department of Computer Engineering,

More information

Discovering interesting rules from financial data

Discovering interesting rules from financial data Discovering interesting rules from financial data Przemysław Sołdacki Institute of Computer Science Warsaw University of Technology Ul. Andersa 13, 00-159 Warszawa Tel: +48 609129896 email: psoldack@ii.pw.edu.pl

More information

An Optimization of Association Rule Mining Algorithm using Weighted Quantum behaved PSO

An Optimization of Association Rule Mining Algorithm using Weighted Quantum behaved PSO An Optimization of Association Rule Mining Algorithm using Weighted Quantum behaved PSO S.Deepa 1, M. Kalimuthu 2 1 PG Student, Department of Information Technology 2 Associate Professor, Department of

More information

Categorization of Sequential Data using Associative Classifiers

Categorization of Sequential Data using Associative Classifiers Categorization of Sequential Data using Associative Classifiers Mrs. R. Meenakshi, MCA., MPhil., Research Scholar, Mrs. J.S. Subhashini, MCA., M.Phil., Assistant Professor, Department of Computer Science,

More information

A New Approach for Handling the Iris Data Classification Problem

A New Approach for Handling the Iris Data Classification Problem International Journal of Applied Science and Engineering 2005. 3, : 37-49 A New Approach for Handling the Iris Data Classification Problem Shyi-Ming Chen a and Yao-De Fang b a Department of Computer Science

More information

A Literature Review of Modern Association Rule Mining Techniques

A Literature Review of Modern Association Rule Mining Techniques A Literature Review of Modern Association Rule Mining Techniques Rupa Rajoriya, Prof. Kailash Patidar Computer Science & engineering SSSIST Sehore, India rprajoriya21@gmail.com Abstract:-Data mining is

More information

Mining High Order Decision Rules

Mining High Order Decision Rules Mining High Order Decision Rules Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 e-mail: yyao@cs.uregina.ca Abstract. We introduce the notion of high

More information

UAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA

UAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA UAPRIORI: AN ALGORITHM FOR FINDING SEQUENTIAL PATTERNS IN PROBABILISTIC DATA METANAT HOOSHSADAT, SAMANEH BAYAT, PARISA NAEIMI, MAHDIEH S. MIRIAN, OSMAR R. ZAÏANE Computing Science Department, University

More information

Product presentations can be more intelligently planned

Product presentations can be more intelligently planned Association Rules Lecture /DMBI/IKI8303T/MTI/UI Yudho Giri Sucahyo, Ph.D, CISA (yudho@cs.ui.ac.id) Faculty of Computer Science, Objectives Introduction What is Association Mining? Mining Association Rules

More information

AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE

AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE Vandit Agarwal 1, Mandhani Kushal 2 and Preetham Kumar 3

More information

USING FREQUENT PATTERN MINING ALGORITHMS IN TEXT ANALYSIS

USING FREQUENT PATTERN MINING ALGORITHMS IN TEXT ANALYSIS INFORMATION SYSTEMS IN MANAGEMENT Information Systems in Management (2017) Vol. 6 (3) 213 222 USING FREQUENT PATTERN MINING ALGORITHMS IN TEXT ANALYSIS PIOTR OŻDŻYŃSKI, DANUTA ZAKRZEWSKA Institute of Information

More information

Mining of Web Server Logs using Extended Apriori Algorithm

Mining of Web Server Logs using Extended Apriori Algorithm International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational

More information

A Comparative study of CARM and BBT Algorithm for Generation of Association Rules

A Comparative study of CARM and BBT Algorithm for Generation of Association Rules A Comparative study of CARM and BBT Algorithm for Generation of Association Rules Rashmi V. Mane Research Student, Shivaji University, Kolhapur rvm_tech@unishivaji.ac.in V.R.Ghorpade Principal, D.Y.Patil

More information

Transforming Quantitative Transactional Databases into Binary Tables for Association Rule Mining Using the Apriori Algorithm

Transforming Quantitative Transactional Databases into Binary Tables for Association Rule Mining Using the Apriori Algorithm Transforming Quantitative Transactional Databases into Binary Tables for Association Rule Mining Using the Apriori Algorithm Expert Systems: Final (Research Paper) Project Daniel Josiah-Akintonde December

More information

Association Rules. Berlin Chen References:

Association Rules. Berlin Chen References: Association Rules Berlin Chen 2005 References: 1. Data Mining: Concepts, Models, Methods and Algorithms, Chapter 8 2. Data Mining: Concepts and Techniques, Chapter 6 Association Rules: Basic Concepts A

More information

Hierarchical Online Mining for Associative Rules

Hierarchical Online Mining for Associative Rules Hierarchical Online Mining for Associative Rules Naresh Jotwani Dhirubhai Ambani Institute of Information & Communication Technology Gandhinagar 382009 INDIA naresh_jotwani@da-iict.org Abstract Mining

More information

Genetic Tuning for Improving Wang and Mendel s Fuzzy Database

Genetic Tuning for Improving Wang and Mendel s Fuzzy Database Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Genetic Tuning for Improving Wang and Mendel s Fuzzy Database E. R. R. Kato, O.

More information

Multiobjective Association Rule Mining

Multiobjective Association Rule Mining Multiobjective Association Rule Mining Hisao Ishibuchi, Isao Kuwajima, and Yusuke Nojima Department of Computer Science and Intelligent Systems, Graduate School of Engineering, Osaka Prefecture University,

More information

GRANULAR COMPUTING AND EVOLUTIONARY FUZZY MODELLING FOR MECHANICAL PROPERTIES OF ALLOY STEELS. G. Panoutsos and M. Mahfouf

GRANULAR COMPUTING AND EVOLUTIONARY FUZZY MODELLING FOR MECHANICAL PROPERTIES OF ALLOY STEELS. G. Panoutsos and M. Mahfouf GRANULAR COMPUTING AND EVOLUTIONARY FUZZY MODELLING FOR MECHANICAL PROPERTIES OF ALLOY STEELS G. Panoutsos and M. Mahfouf Institute for Microstructural and Mechanical Process Engineering: The University

More information

A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set

A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set Renu Vashist School of Computer Science and Engineering Shri Mata Vaishno Devi University, Katra,

More information

An Algorithm for Frequent Pattern Mining Based On Apriori

An Algorithm for Frequent Pattern Mining Based On Apriori An Algorithm for Frequent Pattern Mining Based On Goswami D.N.*, Chaturvedi Anshu. ** Raghuvanshi C.S.*** *SOS In Computer Science Jiwaji University Gwalior ** Computer Application Department MITS Gwalior

More information

EFFICIENT MINING OF FUZZY ASSOCIATION RULES FROM THE PRE-PROCESSED DATASET

EFFICIENT MINING OF FUZZY ASSOCIATION RULES FROM THE PRE-PROCESSED DATASET Computing and Informatics, Vol. 31, 2012, 331 347 EFFICIENT MINING OF FUZZY ASSOCIATION RULES FROM THE PRE-PROCESSED DATASET Zahra Farzanyar, Mohammadreza Kangavari Deprtment of Computer Engineering, Iran

More information

Performance Analysis of Apriori Algorithm with Progressive Approach for Mining Data

Performance Analysis of Apriori Algorithm with Progressive Approach for Mining Data Performance Analysis of Apriori Algorithm with Progressive Approach for Mining Data Shilpa Department of Computer Science & Engineering Haryana College of Technology & Management, Kaithal, Haryana, India

More information

A mining method for tracking changes in temporal association rules from an encoded database

A mining method for tracking changes in temporal association rules from an encoded database A mining method for tracking changes in temporal association rules from an encoded database Chelliah Balasubramanian *, Karuppaswamy Duraiswamy ** K.S.Rangasamy College of Technology, Tiruchengode, Tamil

More information

WIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity

WIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity WIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity Unil Yun and John J. Leggett Department of Computer Science Texas A&M University College Station, Texas 7783, USA

More information

Weighted Association Rule Mining from Binary and Fuzzy Data

Weighted Association Rule Mining from Binary and Fuzzy Data Weighted Association Rule Mining from Binary and Fuzzy Data M. Sulaiman Khan 1, Maybin Muyeba 1, and Frans Coenen 2 1 School of Computing, Liverpool Hope University, Liverpool, L16 9JD, UK 2 Department

More information

An Approach for Privacy Preserving in Association Rule Mining Using Data Restriction

An Approach for Privacy Preserving in Association Rule Mining Using Data Restriction International Journal of Engineering Science Invention Volume 2 Issue 1 January. 2013 An Approach for Privacy Preserving in Association Rule Mining Using Data Restriction Janakiramaiah Bonam 1, Dr.RamaMohan

More information

Efficient SQL-Querying Method for Data Mining in Large Data Bases

Efficient SQL-Querying Method for Data Mining in Large Data Bases Efficient SQL-Querying Method for Data Mining in Large Data Bases Nguyen Hung Son Institute of Mathematics Warsaw University Banacha 2, 02095, Warsaw, Poland Abstract Data mining can be understood as a

More information

Challenges and Interesting Research Directions in Associative Classification

Challenges and Interesting Research Directions in Associative Classification Challenges and Interesting Research Directions in Associative Classification Fadi Thabtah Department of Management Information Systems Philadelphia University Amman, Jordan Email: FFayez@philadelphia.edu.jo

More information

Mining Temporal Association Rules in Network Traffic Data

Mining Temporal Association Rules in Network Traffic Data Mining Temporal Association Rules in Network Traffic Data Guojun Mao Abstract Mining association rules is one of the most important and popular task in data mining. Current researches focus on discovering

More information

Mining Frequent Itemsets for data streams over Weighted Sliding Windows

Mining Frequent Itemsets for data streams over Weighted Sliding Windows Mining Frequent Itemsets for data streams over Weighted Sliding Windows Pauray S.M. Tsai Yao-Ming Chen Department of Computer Science and Information Engineering Minghsin University of Science and Technology

More information

An Approach to Intensional Query Answering at Multiple Abstraction Levels Using Data Mining Approaches

An Approach to Intensional Query Answering at Multiple Abstraction Levels Using Data Mining Approaches An Approach to Intensional Query Answering at Multiple Abstraction Levels Using Data Mining Approaches Suk-Chung Yoon E. K. Park Dept. of Computer Science Dept. of Software Architecture Widener University

More information

Incrementally mining high utility patterns based on pre-large concept

Incrementally mining high utility patterns based on pre-large concept Appl Intell (2014) 40:343 357 DOI 10.1007/s10489-013-0467-z Incrementally mining high utility patterns based on pre-large concept Chun-Wei Lin Tzung-Pei Hong Guo-Cheng Lan Jia-Wei Wong Wen-Yang Lin Published

More information

Fuzzy Partitioning with FID3.1

Fuzzy Partitioning with FID3.1 Fuzzy Partitioning with FID3.1 Cezary Z. Janikow Dept. of Mathematics and Computer Science University of Missouri St. Louis St. Louis, Missouri 63121 janikow@umsl.edu Maciej Fajfer Institute of Computing

More information

Discovering Numeric Association Rules via Evolutionary Algorithm

Discovering Numeric Association Rules via Evolutionary Algorithm Discovering Numeric Association Rules via Evolutionary Algorithm Jacinto Mata 1,José-Luis Alvarez 1,andJosé-Cristobal Riquelme 2 1 Dpto. Ingeniería Electrónica, Sistemas Informáticos y Automática Universidad

More information

MINING ASSOCIATION RULES WITH UNCERTAIN ITEM RELATIONSHIPS

MINING ASSOCIATION RULES WITH UNCERTAIN ITEM RELATIONSHIPS MINING ASSOCIATION RULES WITH UNCERTAIN ITEM RELATIONSHIPS Mei-Ling Shyu 1, Choochart Haruechaiyasak 1, Shu-Ching Chen, and Kamal Premaratne 1 1 Department of Electrical and Computer Engineering University

More information

Data Mining with Linguistic Thresholds

Data Mining with Linguistic Thresholds Int. J. Contemp. Math. Science, Vol. 7, 22, no. 35, 7-725 Data Mining with Linguitic Threhold Tzung-Pei Hong Department of Electrical Engineering National Univerity of Kaohiung Kaohiung, Taiwan, R.O.C.

More information

Predicting Missing Items in Shopping Carts

Predicting Missing Items in Shopping Carts Predicting Missing Items in Shopping Carts Mrs. Anagha Patil, Mrs. Thirumahal Rajkumar, Assistant Professor, Dept. of IT, Assistant Professor, Dept of IT, V.C.E.T, Vasai T.S.E.C, Bandra Mumbai University,

More information

Data Mining: Concepts and Techniques. Chapter 5. SS Chung. April 5, 2013 Data Mining: Concepts and Techniques 1

Data Mining: Concepts and Techniques. Chapter 5. SS Chung. April 5, 2013 Data Mining: Concepts and Techniques 1 Data Mining: Concepts and Techniques Chapter 5 SS Chung April 5, 2013 Data Mining: Concepts and Techniques 1 Chapter 5: Mining Frequent Patterns, Association and Correlations Basic concepts and a road

More information

An Approach for Finding Frequent Item Set Done By Comparison Based Technique

An Approach for Finding Frequent Item Set Done By Comparison Based Technique Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

Improved Algorithm for Frequent Item sets Mining Based on Apriori and FP-Tree

Improved Algorithm for Frequent Item sets Mining Based on Apriori and FP-Tree Global Journal of Computer Science and Technology Software & Data Engineering Volume 13 Issue 2 Version 1.0 Year 2013 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals

More information

Lecture Topic Projects 1 Intro, schedule, and logistics 2 Data Science components and tasks 3 Data types Project #1 out 4 Introduction to R,

Lecture Topic Projects 1 Intro, schedule, and logistics 2 Data Science components and tasks 3 Data types Project #1 out 4 Introduction to R, Lecture Topic Projects 1 Intro, schedule, and logistics 2 Data Science components and tasks 3 Data types Project #1 out 4 Introduction to R, statistics foundations 5 Introduction to D3, visual analytics

More information

ANALYSIS OF DENSE AND SPARSE PATTERNS TO IMPROVE MINING EFFICIENCY

ANALYSIS OF DENSE AND SPARSE PATTERNS TO IMPROVE MINING EFFICIENCY ANALYSIS OF DENSE AND SPARSE PATTERNS TO IMPROVE MINING EFFICIENCY A. Veeramuthu Department of Information Technology, Sathyabama University, Chennai India E-Mail: aveeramuthu@gmail.com ABSTRACT Generally,

More information

Fast Algorithm for Mining Association Rules

Fast Algorithm for Mining Association Rules Fast Algorithm for Mining Association Rules M.H.Margahny and A.A.Mitwaly Dept. of Computer Science, Faculty of Computers and Information, Assuit University, Egypt, Email: marghny@acc.aun.edu.eg. Abstract

More information

A Quantified Approach for large Dataset Compression in Association Mining

A Quantified Approach for large Dataset Compression in Association Mining IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 15, Issue 3 (Nov. - Dec. 2013), PP 79-84 A Quantified Approach for large Dataset Compression in Association Mining

More information

A Conflict-Based Confidence Measure for Associative Classification

A Conflict-Based Confidence Measure for Associative Classification A Conflict-Based Confidence Measure for Associative Classification Peerapon Vateekul and Mei-Ling Shyu Department of Electrical and Computer Engineering University of Miami Coral Gables, FL 33124, USA

More information

Multi Objective Fuzzy Association Rule Mining With ABC Algorithm: A Survey

Multi Objective Fuzzy Association Rule Mining With ABC Algorithm: A Survey Multi Objective Fuzzy Association Rule Mining With ABC Algorithm: A Survey Alok B. Patel 1, Mr. Premkumar Shivakumar 2 ME Student, Dept. of CE, Silveroak Colleage of Engineering and Technology, Gujarat,

More information

Optimization using Ant Colony Algorithm

Optimization using Ant Colony Algorithm Optimization using Ant Colony Algorithm Er. Priya Batta 1, Er. Geetika Sharmai 2, Er. Deepshikha 3 1Faculty, Department of Computer Science, Chandigarh University,Gharaun,Mohali,Punjab 2Faculty, Department

More information

Efficient Remining of Generalized Multi-supported Association Rules under Support Update

Efficient Remining of Generalized Multi-supported Association Rules under Support Update Efficient Remining of Generalized Multi-supported Association Rules under Support Update WEN-YANG LIN 1 and MING-CHENG TSENG 1 Dept. of Information Management, Institute of Information Engineering I-Shou

More information

EFFICIENT ALGORITHM FOR MINING FREQUENT ITEMSETS USING CLUSTERING TECHNIQUES

EFFICIENT ALGORITHM FOR MINING FREQUENT ITEMSETS USING CLUSTERING TECHNIQUES EFFICIENT ALGORITHM FOR MINING FREQUENT ITEMSETS USING CLUSTERING TECHNIQUES D.Kerana Hanirex Research Scholar Bharath University Dr.M.A.Dorai Rangaswamy Professor,Dept of IT, Easwari Engg.College Abstract

More information

Structure of Association Rule Classifiers: a Review

Structure of Association Rule Classifiers: a Review Structure of Association Rule Classifiers: a Review Koen Vanhoof Benoît Depaire Transportation Research Institute (IMOB), University Hasselt 3590 Diepenbeek, Belgium koen.vanhoof@uhasselt.be benoit.depaire@uhasselt.be

More information

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET)

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 6367(Print) ISSN 0976 6375(Online)

More information

SEQUENTIAL PATTERN MINING FROM WEB LOG DATA

SEQUENTIAL PATTERN MINING FROM WEB LOG DATA SEQUENTIAL PATTERN MINING FROM WEB LOG DATA Rajashree Shettar 1 1 Associate Professor, Department of Computer Science, R. V College of Engineering, Karnataka, India, rajashreeshettar@rvce.edu.in Abstract

More information

Item Set Extraction of Mining Association Rule

Item Set Extraction of Mining Association Rule Item Set Extraction of Mining Association Rule Shabana Yasmeen, Prof. P.Pradeep Kumar, A.Ranjith Kumar Department CSE, Vivekananda Institute of Technology and Science, Karimnagar, A.P, India Abstract:

More information

Formal Concept Analysis and Hierarchical Classes Analysis

Formal Concept Analysis and Hierarchical Classes Analysis Formal Concept Analysis and Hierarchical Classes Analysis Yaohua Chen, Yiyu Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {chen115y, yyao}@cs.uregina.ca

More information