Mining Association rules with Dynamic and Collective Support Thresholds
|
|
- Coleen Warner
- 5 years ago
- Views:
Transcription
1 Mining Association rules with Dynamic and Collective Suort Thresholds C S Kanimozhi Selvi and A Tamilarasi Abstract Mining association rules is an imortant task in data mining. It discovers the hidden, interesting relationshis (associations) between items in the database based on the user-secified suort and confidence thresholds. In order to find relevant associations one has to secify an aroriate suort threshold. The suort threshold lays an imortant role in deciding the number of aroriate rules found. The rare associations will not aear if a high threshold is set. Some uninteresting associations may aear if a low threshold is set. This aer rooses an aroach to obtain the aroriate suort thresholds at each level of the level-wise mining aroach. It sets the suort threshold by analyzing the of items and their associations in the database at each level. Exerimental results show that this aroach roduces the interesting rules without secifying the user secified suort threshold. Index Terms Association rules, Collective Suort, Dynamic Suort, Frequent itemset I. INTRODUCTION Data mining is widely used in various alication areas such as banking, marketing and retail industry and so on. Association rule mining is one technique used in data mining to discover hidden associations that occur between various data items [1]. An association rule [1] is an exression of the form X Y, where X, Y are itemsets. It reveals the relationshi between the itemsets X and Y. The ortion of transactions containing X also containing Y, i.e., P(Y X) = P( X Y ) / P( X ) is called the confidence (conf) of the rule. The suort (su) of the rule is the ortion of the transactions that contain all items both in X and Y, i.e., su(x Y) = P (XUY). To generate an interesting association rule, the suort and confidence of the rule should satisfy a user-secified minimum suort called MINSUP and minimum confidence called MINCON, resectively. Mining association rules consists of two stes [1]: 1) Finding all the frequent itemsets having adequate suort. 2) Producing association rules from these frequent itemsets. The outcome des uon the results of ste1. Manuscrit received May 4, C S Kanimozhiselvi, Assistant Professor, Deartment of Comuter Science and Engineering, Kongu Engineering College, Perundurai Erode, Tamilnadu, India Phone: ; A Tamilarasi, Professor, Deartment of Comuter Science and Engineering, Kongu Engineering College, Perundurai Erode, Tamilnadu, India. According to ste 1, the itemsets whose suort value is greater than the MINSUP are labeled as frequent itemsets. The result of this ste lays an imortant role in roducing the association rules. Consequently, the concentration has been given only on the first ste by many researchers. However, without secific knowledge, users will have difficulties in setting the MINSUP to obtain their required results. If MINSUP secified by the user is not aroriate, the user may find many meaningless rules or may miss some interesting rules. Hence, the user has to try many ossibilities for secifying the MINSUP in order to find the aroriate one. To overcome these roblems, a technique is required to generate the rules of high confidence without having user secified suort constraint. To meet out this task, the aer rooses an aroach to secify suort for frequent itemset generation without consulting the users. This aroach finds an initial MINSUP value by analyzing the itemsets and their. It also rooses a collective suort threshold on the subsequent levels based on the revious level suort and the items considered in the current level. The rest of the aer is organized as follows: Section II revisits the roblem of association rule mining and exlores the need for dynamic and collective suort thresholds for the generation of association rules. Section III rovides a detailed insight into the modified association rule framework and exlains the suort secification and mining rocess for this model. Section IV reorts the exerimental results on the IBM synthetic data uon rule set size. Finally, the conclusions are ointed out in Section V. II. RELATED WORK Many researchers considered association rule mining as an interesting research area and studied widely [1-11]. Most of these studies address the issue of finding the association rules that satisfy user-secified minimum suort and minimum confidence constraints. Aroaches like Ariori [1,12] and FP-Growth [13] emloy the uniform minimum suort at all levels. These aroaches assume that all items in the data are of the same kind and have similar frequencies in the database but this assumtion is not alicable for real-life alications. In many alications, some items aear very frequently in the database, while others hardly ever aear. One cannot claim that the frequent itemsets are alone interesting, but the rare items would also matter. To identify the frequent and rare items, an aroriate minimum suort has to be secified. Otherwise the user would face two roblems [4]
2 9) Generation of fewer rules uon secifying the high minimum suort. 10) Generation of too many rules sometimes uninteresting uon secifying low minimum suort. This may lead to the develoment of stronger rule runing techniques. The aer [4] argues that the single MINSUP for the whole database is inadequate, because it cannot cature the inherent nature and / or differences of the items in the database. Therefore it suggests a model with multile minimum suorts for items at each level. Even if the user secifies multile minimum suorts, the rules generated may not be more aroriate and interesting. A better solution lies in develoing suort constraints, which secify minimum suort required for the itemsets, so that only the necessary itemsets are generated. If more than one suort constraint is satisfied by an itemset, then the one with minimum suort value should be adoted. This has been studied in the aer [6]. This aroach also has some roblems: 1) This aroach deals with only the secific roblems but not in general. Moreover it assumes that the user has adequate domain knowledge. 2) Using the aroach, one can analyze the nature of the items but can t able to know of the items until all the items in the database are scanned and candidate items are generated. Correlation based framework [10] without the use of suort thresholds has also been studied in the ast. It uses contingency table to find ositively correlated items. Although the correlation framework discovers strongly correlated items, the suort threshold is still imortant. Without the suort threshold, the comutation cost will be too high and many ineffectual itemsets would be generated [10]. As such, users still face the roblem of aroriate suort secification. In [11], the authors avoided the use of suort measure to find the interesting associations. This aroach is not suitable for alications that adhere to the traditional asymmetric confidence measure as ointed out by [11]. Although substantial effort has been made to lighten these roblems, such as adding the lift or conviction measure, which derives the minimum suort dynamically from the item suort, it also leaves out some interesting rules comosed of more than two items because the secification is derived from frequent 2-itemsets [16]. Another effort is made by [14, 15] for finding the N-most frequent itemsets. It allows the user to control the result by secifying the value N, thus leading to user intervention. This aer aims at making the user free from secifying any constraints including suort constraints. It rooses an aroach to calculate the minimum suort threshold dynamically by scanning the records in the database so that all frequent, interesting and meaningful rules would be generated. This minimizes the generation of large number of rules and reduces the need for rule runing techniques. Exeriment results on synthetic data show that the roosed technique is effective. III. ASSOCIATION RULE MINING FRAMEWORK Consider a given transaction database T = {r 1, r 2,, rn}, where each record ri, 1 i n is a set of items from a set I of items, i.e., ri I. The basic association rule model as follows: Let I = {i 1, i 2, i m } be a set of items. Let R be a set of records in the database, where each record r is a set of items such that r I. An association rule is an exression of the form, X Y, where X I, Y I, and X Y = φ. The rule X Y holds in the set of records R with confidence c if c% of records in R that suorts X also suorts Y. The rule has suort s in R if s% of the records in R contains X U Y. Given a set of records R, the roblem of mining association rules is to discover all association rules that have suort and confidence greater than the user-secified minimum suort (MINSUP) and minimum confidence (MINCON). An association rule mining algorithm works in two stes [3, 4]: 1. Generate frequent itemsets that satisfy MINSUP. 2. Generate interesting association rules that satisfy MINCON using the frequent itemsets. A. Proosed Model In this model, we roose two minimum suort counts namely Dynamic Minimum Suort Count (DMS) and Collective Minimum Suort Count (CMS) for the itemset generation at each level. Initially, DMS is calculated while scanning the items in the database. CMS is calculated during the itemset generation. DMS reflects the of items in the database. CMS reflects the intrinsic nature of items in the database by carrying over the existing suort to the next level. This model is based on multile minimum suorts model. In each ass, a different minimum suort value is used (i.e) the DMS and CMS values are calculated in each ass. Initially, the DMS is used for itemset generation and in the subsequent asses CMS values are used to find the frequent itemsets. Let there are n items in the database and su be the suort of each item in the database and reresents the current ass. MAXS and MINS denote the Maximum Suort and Minimum Suort resectively. The total suort of items considered in each ass is TOTOCC. TOTOCC DMS = n i= 1 su TOTOCC MINS + MAXS = + 2 n 2 1 [ DMS + DMS ] 1 CMS = 1 4 The calculation for the DMS values has been the same at all level. Here DMS reresents the value at the current level whereas the DMS -1 reresents the value at the revious level. B. Frequent Itemset Generation The roosed algorithm exts the Ariori algorithm for finding large itemsets. We call the new algorithm, DCS (Dynamic Collective Suort) Ariori. The new algorithm is also based on level wise search. It generates all large itemsets by making multile asses over the data. In each ass, it
3 counts the suorts of itemsets and finds the MINS and MAXS values, TOTOCC and thus the DMS value. Initially, the itemsets that satisfy the DMS value are retrieved. The DMS value is calculated based on the candidates generated in the revious ass. Let L k denote the list of k-itemsets. Each itemset c is of the following form, {c[1], c[2],, c[k]}, which consists of items, c[1], c[2],, c[k]. The algorithm is given below: Algorithm DCSAriori Inut : Set of records R in Transaction database T Outut : Frequent itemsets L 1 = find frequent 1 itemsets(r) for (k = 2; L k-1 Ø, k++) do C k =candidate-gen (L k-1 ) for each Record r in Є R do C t = subset (C k, r); for each candidate c Є Ct do c.count++; CMS = su_calc(c k ) L k = {c Є C k c.count ( CMS )} return U k L k ; Procedure Candidate-gen(L k-1) for each itemset l 1 Є L k-1 for each itemset l 2 Є L k-1 erform join oeration l1 l 2 if has_infrequent_subset(c, L k-1) rune c; else add c to C k; if return C k; Procedure has_infrequent_subset(c, L k-1) for each (k-1) subset s of c if s is in L k-1 return false; else return true; if Procedure su_calc(c k ) MINS = c.count c.count is minimum for all C k and c.count > 1 MAXS = c.count c.count is maximum for all C k TOTOCC = sum(c.count) for all C k DMS = Average( (Average (MAXS, MINS )+Average(TOTOCC )) If (= 1) then CMS = DMS Else CMS = (DMS -1 + DMS ) / 4 End if return CMS Examle: Consider the following dataset. TABLE I: TRANSACTION DATABASE R1 : I 1,I 2,I 5 R2 : I 1,I 3 R3 : I 3,I 4 R4 : I 1, I 2, I 3 R5: I 1,I 2,I 3,I 5, I 6 R6: I 2,I 5,I 6 R7 : I 2,I 5,I 6,I 7 Initially the database is scanned and the suort counts of itemsets are found. If there are n items and the minimum suort and maximum suort are MINS and MAXS resectively. The sum of suort of all items is known as TOTOCC. In the first ass, the algorithm retrieves the itemsets whose suort count is greater than the DMS value as frequent itemsets. In the subsequent asses, the algorithm uses the CMS value to rune the uninteresting items. Using the formula 1 and 2, the roosed algorithm finds out the frequent itemsets in the first ass as: I 1, I 2, I 3, I 5. Similarly during the second ass the DMS and the CMS values are calculated and the itemsets {I 1, I 2 },{I 1, I 3 }, {I 1, I 5 },{{I 2, I 3 }, {I 2,I 6 },{I 3, I 6 } are generated. During the third ass the itemsets {I 2, I 3, I 5 }, {I 1, I 2, I 3 }, {I 1, I 2, I 5 }, {I 1, I 3, I 5 } are generated as frequent itemsets. C. Rule Generation The roosed algorithm adots the confidence based rule generation model of ariori [1] for rule generation. The confidence threshold can be used to find out the interesting rule set. The confidence of a rule is its suort divided by the suort of its antecedent. For examle, the following rule { I 2, I 3 } { I 5 } has confidence equivalent to suort for { I 2, I 3, I 5 } / suort for { I 2, I 3 }. Association rules are generated as follows. For each frequent itemset fl, all non emty subsets of fl are generated. For every non emty subset s of fl, the rule s (fl-s) is formed, if suort-count (fl) / suort-count of (s) Minimum Confidence. After the generation of frequent items, the algorithm checks if it satisfies the MINCON threshold. If their confidence is larger than MINCON then they will be generated as interesting association rules. Otherwise, the rules will be discarded. The algorithm for rule generation is given below: Algorithm: Rule_gen(L k, MINCON) for each frequent itemset fl L k do for each nonemty subset s of fl do if c.count(fl)/c.count(s) MINCON then outut the rule s (fl - s); IV. EXPERIMENTAL RESULTS In this section, the roosed DCSAriori algorithm obtains the frequent itemsets by finding various Collective Minimum Suort (CMS) at each level. Then a minimum confidence is set by the user to find the interesting association rules. We use different minimum confidence thresholds at each run. The Ariori algorithm with the standard uniform suort (US, set to 0.5 %,1%,2%,3% and 4)
4 is also evaluated. The evaluation is examined based on the number of rules found. All exeriments are erformed on an Intel Pentium-IV with 256 MB RAM, running Windows 98. We use two synthetic data sets generated from IBM synthetic data generator [1]: T10.I4.D100K and T40.I10.D100K. Characteristics of these two data sets are shown in Table II. Fig 2 Ariori Vs DCSAriori with Suort=1.0% for dataset T10I4D100K TABLE II PROPERTIES OF DATASETS T10I4D100K T40I10D100K Number of 100, ,000 Transactions Number of items Minimum item Maximum item Average item 0.001% 0.005% 7.8% % 0.11% 0.10% As shown in Figs 1,2,3,4 and 5 DCSAriori generates not less rules or not more rules while varying the confidence levels. It retains the medium level where as ariori roduces number of less confidence rules and at high confidence it roduces fewer rules. This means that the ratio of surious frequent itemsets increases when the MINCON becomes higher, and more uninteresting rules will be generated. The advantage of DCSAriori is that it sets the aroriate minsu at each level based on the and association of items. It exlores more hidden but effective frequent itemsets and avoids the generation of surious frequent itemsets and thus the low confidence rules. Fig 3 Ariori Vs DCSAriori with Suort=1.0% for dataset T40I10D100K Fig 4 Ariori Vs DCSAriori with Suort=3.0% for dataset T40I10D100K Fig 5 Ariori Vs DCSAriori with Suort=4.0% for dataset T40I10D100K Fig 1. Ariori Vs DCSAriori with Suort=0.5% for dataset T10I4D100K V. CONCLUSION In the roosed method the minimum suort is calculated dynamically. Instead of using the user secified minimum suort we use the calculated minimum suort for each itemset generation and for rule generation. Thus it generates more relevant and meaningful rules. In fact the running time of the roosed method is not taken into account, it still leaves the user free from secifying minimum suort and guarantees the generation of interesting association rules
5 REFERENCES [1] R. Agrawal, T. Imielinski, and A. Swami, Mining association rules between sets of items in large databases, Proceedings of 1993 ACM-SIGMOD International Conference on Management of Data, 1993, [2] S. Brin, R. Motwani, J.D. Ullman, and S. Tsur, Dynamic itemset counting and imlication rules for marketbasket data, Proceedings of 1997 ACM-SIGMOD International Conference on Management of Data, 1997, [3] J. Han and Y. Fu, Discovery of multile-level association rules from large databases, Proceedings of the21st International Conference on Very Large Data Base 1995, [4] B. Liu, W. Hsu, and Y. Ma, Mining association rules with multile minimum suorts, Proceedings of 1999 ACM-SIGKDD International Conference on Knowledge Discovery and Data Mining 1999, [5] M.C. Tseng and W.Y. Lin, Mining generalized association rules with multile minimum suorts, Proceedings of International Conference on Data Warehousing and Knowledge Discovery 2001, [6] K. Wang, Y. He, and J. Han, Mining frequent itemsets using suort constraints, Proceedings of the 26th International Conference on Very Large Data Bases 2000, [7] M. Seno and G. Karyis, LPMiner: An algorithm for finding frequent itemsets using length-decreasing suort constraint, Proceedings of the 1st IEEE International Conference on Data Mining [8] J. Li and X. Zhang, Efficient mining of high confidence association rules without suort thresholds, Proceedings of the 3rd Euroean Conference on Princiles and Practice of Knowledge Discovery in Databases [9] C.C. Aggarwal and P.S. Yu, A new framework for itemset generation, Proceedings of the 17th ACM Symosium on Princiles of Database Systems 1998, [10] S. Brin, R. Motwani, and C. Silverstein, Beyond market baskets: generalizing association rules to correlations, Proceedings of 1997 ACM-SIGMOD International Conference on Management of Data 1997, [11] E. Cohen, M. Datar, S. Fujiwara, A. Gionis, P. Indyk, R. Motwani, J.D. Ullman, and C. Yang, Finding interesting associations without suort runing, Proceedings of IEEE International Conference on Data Engineering 2000, [12] R. Agrawal and R. Srikant, Fast algorithms for mining association rules, Proceedings of the 20th International Conference on Very Large Data Bases 1994, [13] J. Han, J. Pei, and Y. Yin., Mining Frequent Patterns without Candidate Generation, Proc ACM-SIGMOD Int. Conf. on Management of Data (SIGMOD'00),Dallas, TX, May [14] Ngan, S-C., Lam, T.,Wong, R.C-W. and Fu, A.W-C. (2005), Mining N-most interesting itemsets without suort threshold by the COFI-tree, Vol. 1, No. 1, Int. J. Business Intelligence and Data Mining, [15] Y. Cheung and A. Fu. Mining frequent itemsets without suort threshold: with and without item constraints, IEEE Trans. on Knowledge and Data Engineering, 16(9): , [16] Wen-Yang Lin and Ming-Cheng Tseng,, Automated suort secification for efficient mining of interesting association rules, Journal of Information Science,Volume 32, Issue 3 (June 2006) : , 2006,ISSN:
Mining Frequent Itemsets Along with Rare Itemsets Based on Categorical Multiple Minimum Support
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 6, Ver. IV (Nov.-Dec. 2016), PP 109-114 www.iosrjournals.org Mining Frequent Itemsets Along with Rare
More informationETP-Mine: An Efficient Method for Mining Transitional Patterns
ETP-Mine: An Efficient Method for Mining Transitional Patterns B. Kiran Kumar 1 and A. Bhaskar 2 1 Department of M.C.A., Kakatiya Institute of Technology & Science, A.P. INDIA. kirankumar.bejjanki@gmail.com
More informationAn Efficient Coding Method for Coding Region-of-Interest Locations in AVS2
An Efficient Coding Method for Coding Region-of-Interest Locations in AVS2 Mingliang Chen 1, Weiyao Lin 1*, Xiaozhen Zheng 2 1 Deartment of Electronic Engineering, Shanghai Jiao Tong University, China
More informationEfficient Remining of Generalized Multi-supported Association Rules under Support Update
Efficient Remining of Generalized Multi-supported Association Rules under Support Update WEN-YANG LIN 1 and MING-CHENG TSENG 1 Dept. of Information Management, Institute of Information Engineering I-Shou
More informationDMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE
DMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE Saravanan.Suba Assistant Professor of Computer Science Kamarajar Government Art & Science College Surandai, TN, India-627859 Email:saravanansuba@rediffmail.com
More informationDiscovery of Multi-level Association Rules from Primitive Level Frequent Patterns Tree
Discovery of Multi-level Association Rules from Primitive Level Frequent Patterns Tree Virendra Kumar Shrivastava 1, Parveen Kumar 2, K. R. Pardasani 3 1 Department of Computer Science & Engineering, Singhania
More informationMining of Web Server Logs using Extended Apriori Algorithm
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
More informationImproved Frequent Pattern Mining Algorithm with Indexing
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VII (Nov Dec. 2014), PP 73-78 Improved Frequent Pattern Mining Algorithm with Indexing Prof.
More informationAn Improved Algorithm for Mining Association Rules Using Multiple Support Values
An Improved Algorithm for Mining Association Rules Using Multiple Support Values Ioannis N. Kouris, Christos H. Makris, Athanasios K. Tsakalidis University of Patras, School of Engineering Department of
More informationCHAPTER 3 ASSOCIATION RULE MINING WITH LEVELWISE AUTOMATIC SUPPORT THRESHOLDS
23 CHAPTER 3 ASSOCIATION RULE MINING WITH LEVELWISE AUTOMATIC SUPPORT THRESHOLDS This chapter introduces the concepts of association rule mining. It also proposes two algorithms based on, to calculate
More informationWIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity
WIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity Unil Yun and John J. Leggett Department of Computer Science Texas A&M University College Station, Texas 7783, USA
More informationTo Enhance Projection Scalability of Item Transactions by Parallel and Partition Projection using Dynamic Data Set
To Enhance Scalability of Item Transactions by Parallel and Partition using Dynamic Data Set Priyanka Soni, Research Scholar (CSE), MTRI, Bhopal, priyanka.soni379@gmail.com Dhirendra Kumar Jha, MTRI, Bhopal,
More informationA Technical Analysis of Market Basket by using Association Rule Mining and Apriori Algorithm
A Technical Analysis of Market Basket by using Association Rule Mining and Apriori Algorithm S.Pradeepkumar*, Mrs.C.Grace Padma** M.Phil Research Scholar, Department of Computer Science, RVS College of
More informationMining Negative Rules using GRD
Mining Negative Rules using GRD D. R. Thiruvady and G. I. Webb School of Computer Science and Software Engineering, Monash University, Wellington Road, Clayton, Victoria 3800 Australia, Dhananjay Thiruvady@hotmail.com,
More informationAn Evolutionary Algorithm for Mining Association Rules Using Boolean Approach
An Evolutionary Algorithm for Mining Association Rules Using Boolean Approach ABSTRACT G.Ravi Kumar 1 Dr.G.A. Ramachandra 2 G.Sunitha 3 1. Research Scholar, Department of Computer Science &Technology,
More informationAn Efficient Algorithm for finding high utility itemsets from online sell
An Efficient Algorithm for finding high utility itemsets from online sell Sarode Nutan S, Kothavle Suhas R 1 Department of Computer Engineering, ICOER, Maharashtra, India 2 Department of Computer Engineering,
More informationMS-FP-Growth: A multi-support Vrsion of FP-Growth Agorithm
, pp.55-66 http://dx.doi.org/0.457/ijhit.04.7..6 MS-FP-Growth: A multi-support Vrsion of FP-Growth Agorithm Wiem Taktak and Yahya Slimani Computer Sc. Dept, Higher Institute of Arts MultiMedia (ISAMM),
More informationRHUIET : Discovery of Rare High Utility Itemsets using Enumeration Tree
International Journal for Research in Engineering Application & Management (IJREAM) ISSN : 2454-915 Vol-4, Issue-3, June 218 RHUIET : Discovery of Rare High Utility Itemsets using Enumeration Tree Mrs.
More informationMining N-most Interesting Itemsets. Ada Wai-chee Fu Renfrew Wang-wai Kwong Jian Tang. fadafu,
Mining N-most Interesting Itemsets Ada Wai-chee Fu Renfrew Wang-wai Kwong Jian Tang Department of Computer Science and Engineering The Chinese University of Hong Kong, Hong Kong fadafu, wwkwongg@cse.cuhk.edu.hk
More informationApplying the fuzzy preference relation to the software selection
Proceedings of the 007 WSEAS International Conference on Comuter Engineering and Alications, Gold Coast, Australia, January 17-19, 007 83 Alying the fuzzy reference relation to the software selection TIEN-CHIN
More informationA Parallel Algorithm for Constructing Obstacle-Avoiding Rectilinear Steiner Minimal Trees on Multi-Core Systems
A Parallel Algorithm for Constructing Obstacle-Avoiding Rectilinear Steiner Minimal Trees on Multi-Core Systems Cheng-Yuan Chang and I-Lun Tseng Deartment of Comuter Science and Engineering Yuan Ze University,
More informationAn Improved Apriori Algorithm for Association Rules
Research article An Improved Apriori Algorithm for Association Rules Hassan M. Najadat 1, Mohammed Al-Maolegi 2, Bassam Arkok 3 Computer Science, Jordan University of Science and Technology, Irbid, Jordan
More informationResearch Article Apriori Association Rule Algorithms using VMware Environment
Research Journal of Applied Sciences, Engineering and Technology 8(2): 16-166, 214 DOI:1.1926/rjaset.8.955 ISSN: 24-7459; e-issn: 24-7467 214 Maxwell Scientific Publication Corp. Submitted: January 2,
More informationMining Frequent Patterns with Counting Inference at Multiple Levels
International Journal of Computer Applications (097 7) Volume 3 No.10, July 010 Mining Frequent Patterns with Counting Inference at Multiple Levels Mittar Vishav Deptt. Of IT M.M.University, Mullana Ruchika
More informationA Quantified Approach for large Dataset Compression in Association Mining
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 15, Issue 3 (Nov. - Dec. 2013), PP 79-84 A Quantified Approach for large Dataset Compression in Association Mining
More informationTemporal Weighted Association Rule Mining for Classification
Temporal Weighted Association Rule Mining for Classification Purushottam Sharma and Kanak Saxena Abstract There are so many important techniques towards finding the association rules. But, when we consider
More informationMaintenance of the Prelarge Trees for Record Deletion
12th WSEAS Int. Conf. on APPLIED MATHEMATICS, Cairo, Egypt, December 29-31, 2007 105 Maintenance of the Prelarge Trees for Record Deletion Chun-Wei Lin, Tzung-Pei Hong, and Wen-Hsiang Lu Department of
More informationPrivacy Preserving Moving KNN Queries
Privacy Preserving Moving KNN Queries arxiv:4.76v [cs.db] 4 Ar Tanzima Hashem Lars Kulik Rui Zhang National ICT Australia, Deartment of Comuter Science and Software Engineering University of Melbourne,
More informationEfficiently Mining Positive Correlation Rules
Applied Mathematics & Information Sciences An International Journal 2011 NSP 5 (2) (2011), 39S-44S Efficiently Mining Positive Correlation Rules Zhongmei Zhou Department of Computer Science & Engineering,
More informationMining High Average-Utility Itemsets
Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Mining High Itemsets Tzung-Pei Hong Dept of Computer Science and Information Engineering
More informationAssociation Rule Mining from XML Data
144 Conference on Data Mining DMIN'06 Association Rule Mining from XML Data Qin Ding and Gnanasekaran Sundarraj Computer Science Program The Pennsylvania State University at Harrisburg Middletown, PA 17057,
More informationPerformance Based Study of Association Rule Algorithms On Voter DB
Performance Based Study of Association Rule Algorithms On Voter DB K.Padmavathi 1, R.Aruna Kirithika 2 1 Department of BCA, St.Joseph s College, Thiruvalluvar University, Cuddalore, Tamil Nadu, India,
More informationMining Frequent Itemsets for data streams over Weighted Sliding Windows
Mining Frequent Itemsets for data streams over Weighted Sliding Windows Pauray S.M. Tsai Yao-Ming Chen Department of Computer Science and Information Engineering Minghsin University of Science and Technology
More informationThe Research on Curling Track Empty Value Fill Algorithm Based on Similar Forecast
Research Journal of Alied Sciences, Engineering and Technology 6(8): 1472-1478, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2013 Submitted: October 31, 2012 Acceted: January
More informationAn Efficient Video Program Delivery algorithm in Tree Networks*
3rd International Symosium on Parallel Architectures, Algorithms and Programming An Efficient Video Program Delivery algorithm in Tree Networks* Fenghang Yin 1 Hong Shen 1,2,** 1 Deartment of Comuter Science,
More informationAn Efficient Algorithm for Finding the Support Count of Frequent 1-Itemsets in Frequent Pattern Mining
An Efficient Algorithm for Finding the Support Count of Frequent 1-Itemsets in Frequent Pattern Mining P.Subhashini 1, Dr.G.Gunasekaran 2 Research Scholar, Dept. of Information Technology, St.Peter s University,
More informationAN EFFICIENT GRADUAL PRUNING TECHNIQUE FOR UTILITY MINING. Received April 2011; revised October 2011
International Journal of Innovative Computing, Information and Control ICIC International c 2012 ISSN 1349-4198 Volume 8, Number 7(B), July 2012 pp. 5165 5178 AN EFFICIENT GRADUAL PRUNING TECHNIQUE FOR
More informationAssociation Rule Mining. Entscheidungsunterstützungssysteme
Association Rule Mining Entscheidungsunterstützungssysteme Frequent Pattern Analysis Frequent pattern: a pattern (a set of items, subsequences, substructures, etc.) that occurs frequently in a data set
More informationEfficient Sequence Generator Mining and its Application in Classification
Efficient Sequence Generator Mining and its Alication in Classification Chuancong Gao, Jianyong Wang 2, Yukai He 3 and Lizhu Zhou 4 Tsinghua University, Beijing 0084, China {gaocc07, heyk05 3 }@mails.tsinghua.edu.cn,
More informationGeneration of Potential High Utility Itemsets from Transactional Databases
Generation of Potential High Utility Itemsets from Transactional Databases Rajmohan.C Priya.G Niveditha.C Pragathi.R Asst.Prof/IT, Dept of IT Dept of IT Dept of IT SREC, Coimbatore,INDIA,SREC,Coimbatore,.INDIA
More informationAvailable online at ScienceDirect. Procedia Computer Science 45 (2015 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 45 (2015 ) 101 110 International Conference on Advanced Computing Technologies and Applications (ICACTA- 2015) An optimized
More informationPerformance Analysis of Apriori Algorithm with Progressive Approach for Mining Data
Performance Analysis of Apriori Algorithm with Progressive Approach for Mining Data Shilpa Department of Computer Science & Engineering Haryana College of Technology & Management, Kaithal, Haryana, India
More informationASSOCIATION RULE MINING: MARKET BASKET ANALYSIS OF A GROCERY STORE
ASSOCIATION RULE MINING: MARKET BASKET ANALYSIS OF A GROCERY STORE Mustapha Muhammad Abubakar Dept. of computer Science & Engineering, Sharda University,Greater Noida, UP, (India) ABSTRACT Apriori algorithm
More informationFace Recognition Based on Wavelet Transform and Adaptive Local Binary Pattern
Face Recognition Based on Wavelet Transform and Adative Local Binary Pattern Abdallah Mohamed 1,2, and Roman Yamolskiy 1 1 Comuter Engineering and Comuter Science, University of Louisville, Louisville,
More informationEfficient Processing of Top-k Dominating Queries on Multi-Dimensional Data
Efficient Processing of To-k Dominating Queries on Multi-Dimensional Data Man Lung Yiu Deartment of Comuter Science Aalborg University DK-922 Aalborg, Denmark mly@cs.aau.dk Nikos Mamoulis Deartment of
More informationWeighted Page Rank Algorithm based on In-Out Weight of Webpages
Indian Journal of Science and Technology, Vol 8(34), DOI: 10.17485/ijst/2015/v8i34/86120, December 2015 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 eighted Page Rank Algorithm based on In-Out eight
More information620 HUANG Liusheng, CHEN Huaping et al. Vol.15 this itemset. Itemsets that have minimum support (minsup) are called large itemsets, and all the others
Vol.15 No.6 J. Comput. Sci. & Technol. Nov. 2000 A Fast Algorithm for Mining Association Rules HUANG Liusheng (ΛΠ ), CHEN Huaping ( ±), WANG Xun (Φ Ψ) and CHEN Guoliang ( Ξ) National High Performance Computing
More informationInfrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset
Infrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset M.Hamsathvani 1, D.Rajeswari 2 M.E, R.Kalaiselvi 3 1 PG Scholar(M.E), Angel College of Engineering and Technology, Tiruppur,
More informationAn Automated Support Threshold Based on Apriori Algorithm for Frequent Itemsets
An Automated Support Threshold Based on Apriori Algorithm for sets Jigisha Trivedi #, Brijesh Patel * # Assistant Professor in Computer Engineering Department, S.B. Polytechnic, Savli, Gujarat, India.
More informationParallel Mining of Maximal Frequent Itemsets in PC Clusters
Proceedings of the International MultiConference of Engineers and Computer Scientists 28 Vol I IMECS 28, 19-21 March, 28, Hong Kong Parallel Mining of Maximal Frequent Itemsets in PC Clusters Vong Chan
More informationRandomized algorithms: Two examples and Yao s Minimax Principle
Randomized algorithms: Two examles and Yao s Minimax Princile Maximum Satisfiability Consider the roblem Maximum Satisfiability (MAX-SAT). Bring your knowledge u-to-date on the Satisfiability roblem. Maximum
More informationA Decremental Algorithm for Maintaining Frequent Itemsets in Dynamic Databases *
A Decremental Algorithm for Maintaining Frequent Itemsets in Dynamic Databases * Shichao Zhang 1, Xindong Wu 2, Jilian Zhang 3, and Chengqi Zhang 1 1 Faculty of Information Technology, University of Technology
More informationA Novel Iris Segmentation Method for Hand-Held Capture Device
A Novel Iris Segmentation Method for Hand-Held Cature Device XiaoFu He and PengFei Shi Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200030, China {xfhe,
More informationImproved Algorithm for Frequent Item sets Mining Based on Apriori and FP-Tree
Global Journal of Computer Science and Technology Software & Data Engineering Volume 13 Issue 2 Version 1.0 Year 2013 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals
More informationFinding Sporadic Rules Using Apriori-Inverse
Finding Sporadic Rules Using Apriori-Inverse Yun Sing Koh and Nathan Rountree Department of Computer Science, University of Otago, New Zealand {ykoh, rountree}@cs.otago.ac.nz Abstract. We define sporadic
More informationTo appear in IEEE TKDE Title: Efficient Skyline and Top-k Retrieval in Subspaces Keywords: Skyline, Top-k, Subspace, B-tree
To aear in IEEE TKDE Title: Efficient Skyline and To-k Retrieval in Subsaces Keywords: Skyline, To-k, Subsace, B-tree Contact Author: Yufei Tao (taoyf@cse.cuhk.edu.hk) Deartment of Comuter Science and
More informationCS570 Introduction to Data Mining
CS570 Introduction to Data Mining Frequent Pattern Mining and Association Analysis Cengiz Gunay Partial slide credits: Li Xiong, Jiawei Han and Micheline Kamber George Kollios 1 Mining Frequent Patterns,
More informationSensitive Rule Hiding and InFrequent Filtration through Binary Search Method
International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 5 (2017), pp. 833-840 Research India Publications http://www.ripublication.com Sensitive Rule Hiding and InFrequent
More informationA DEA-bases Approach for Multi-objective Design of Attribute Acceptance Sampling Plans
Available online at htt://ijdea.srbiau.ac.ir Int. J. Data Enveloment Analysis (ISSN 2345-458X) Vol.5, No.2, Year 2017 Article ID IJDEA-00422, 12 ages Research Article International Journal of Data Enveloment
More informationMining Top-K Strongly Correlated Item Pairs Without Minimum Correlation Threshold
Mining Top-K Strongly Correlated Item Pairs Without Minimum Correlation Threshold Zengyou He, Xiaofei Xu, Shengchun Deng Department of Computer Science and Engineering, Harbin Institute of Technology,
More informationMining Quantitative Association Rules on Overlapped Intervals
Mining Quantitative Association Rules on Overlapped Intervals Qiang Tong 1,3, Baoping Yan 2, and Yuanchun Zhou 1,3 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China {tongqiang,
More informationOptimized Frequent Pattern Mining for Classified Data Sets
Optimized Frequent Pattern Mining for Classified Data Sets A Raghunathan Deputy General Manager-IT, Bharat Heavy Electricals Ltd, Tiruchirappalli, India K Murugesan Assistant Professor of Mathematics,
More informationAn Approach for Privacy Preserving in Association Rule Mining Using Data Restriction
International Journal of Engineering Science Invention Volume 2 Issue 1 January. 2013 An Approach for Privacy Preserving in Association Rule Mining Using Data Restriction Janakiramaiah Bonam 1, Dr.RamaMohan
More informationFIT: A Fast Algorithm for Discovering Frequent Itemsets in Large Databases Sanguthevar Rajasekaran
FIT: A Fast Algorithm for Discovering Frequent Itemsets in Large Databases Jun Luo Sanguthevar Rajasekaran Dept. of Computer Science Ohio Northern University Ada, OH 4581 Email: j-luo@onu.edu Dept. of
More informationA Modern Search Technique for Frequent Itemset using FP Tree
A Modern Search Technique for Frequent Itemset using FP Tree Megha Garg Research Scholar, Department of Computer Science & Engineering J.C.D.I.T.M, Sirsa, Haryana, India Krishan Kumar Department of Computer
More informationData Structure for Association Rule Mining: T-Trees and P-Trees
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 16, NO. 6, JUNE 2004 1 Data Structure for Association Rule Mining: T-Trees and P-Trees Frans Coenen, Paul Leng, and Shakil Ahmed Abstract Two new
More informationAUTOMATIC GENERATION OF HIGH THROUGHPUT ENERGY EFFICIENT STREAMING ARCHITECTURES FOR ARBITRARY FIXED PERMUTATIONS. Ren Chen and Viktor K.
inuts er clock cycle Streaming ermutation oututs er clock cycle AUTOMATIC GENERATION OF HIGH THROUGHPUT ENERGY EFFICIENT STREAMING ARCHITECTURES FOR ARBITRARY FIXED PERMUTATIONS Ren Chen and Viktor K.
More informationIMS Network Deployment Cost Optimization Based on Flow-Based Traffic Model
IMS Network Deloyment Cost Otimization Based on Flow-Based Traffic Model Jie Xiao, Changcheng Huang and James Yan Deartment of Systems and Comuter Engineering, Carleton University, Ottawa, Canada {jiexiao,
More informationResults and Discussions on Transaction Splitting Technique for Mining Differential Private Frequent Itemsets
Results and Discussions on Transaction Splitting Technique for Mining Differential Private Frequent Itemsets Sheetal K. Labade Computer Engineering Dept., JSCOE, Hadapsar Pune, India Srinivasa Narasimha
More informationItem Set Extraction of Mining Association Rule
Item Set Extraction of Mining Association Rule Shabana Yasmeen, Prof. P.Pradeep Kumar, A.Ranjith Kumar Department CSE, Vivekananda Institute of Technology and Science, Karimnagar, A.P, India Abstract:
More informationStereo Disparity Estimation in Moment Space
Stereo Disarity Estimation in oment Sace Angeline Pang Faculty of Information Technology, ultimedia University, 63 Cyberjaya, alaysia. angeline.ang@mmu.edu.my R. ukundan Deartment of Comuter Science, University
More informationDiscovering interesting rules from financial data
Discovering interesting rules from financial data Przemysław Sołdacki Institute of Computer Science Warsaw University of Technology Ul. Andersa 13, 00-159 Warszawa Tel: +48 609129896 email: psoldack@ii.pw.edu.pl
More informationInternational Journal of Scientific Research and Reviews
Research article Available online www.ijsrr.org ISSN: 2279 0543 International Journal of Scientific Research and Reviews A Survey of Sequential Rule Mining Algorithms Sachdev Neetu and Tapaswi Namrata
More informationAn Algorithm for Mining Frequent Itemsets from Library Big Data
JOURNAL OF SOFTWARE, VOL. 9, NO. 9, SEPTEMBER 2014 2361 An Algorithm for Mining Frequent Itemsets from Library Big Data Xingjian Li lixingjianny@163.com Library, Nanyang Institute of Technology, Nanyang,
More informationOptimization using Ant Colony Algorithm
Optimization using Ant Colony Algorithm Er. Priya Batta 1, Er. Geetika Sharmai 2, Er. Deepshikha 3 1Faculty, Department of Computer Science, Chandigarh University,Gharaun,Mohali,Punjab 2Faculty, Department
More informationUtility Mining: An Enhanced UP Growth Algorithm for Finding Maximal High Utility Itemsets
Utility Mining: An Enhanced UP Growth Algorithm for Finding Maximal High Utility Itemsets C. Sivamathi 1, Dr. S. Vijayarani 2 1 Ph.D Research Scholar, 2 Assistant Professor, Department of CSE, Bharathiar
More informationRaunak Rathi 1, Prof. A.V.Deorankar 2 1,2 Department of Computer Science and Engineering, Government College of Engineering Amravati
Analytical Representation on Secure Mining in Horizontally Distributed Database Raunak Rathi 1, Prof. A.V.Deorankar 2 1,2 Department of Computer Science and Engineering, Government College of Engineering
More informationAC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery
: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Hong Cheng Philip S. Yu Jiawei Han University of Illinois at Urbana-Champaign IBM T. J. Watson Research Center {hcheng3, hanj}@cs.uiuc.edu,
More informationMining Rare Periodic-Frequent Patterns Using Multiple Minimum Supports
Mining Rare Periodic-Frequent Patterns Using Multiple Minimum Supports R. Uday Kiran P. Krishna Reddy Center for Data Engineering International Institute of Information Technology-Hyderabad Hyderabad,
More informationFace Recognition Using Legendre Moments
Face Recognition Using Legendre Moments Dr.S.Annadurai 1 A.Saradha Professor & Head of CSE & IT Research scholar in CSE Government College of Technology, Government College of Technology, Coimbatore, Tamilnadu,
More informationUsing Association Rules for Better Treatment of Missing Values
Using Association Rules for Better Treatment of Missing Values SHARIQ BASHIR, SAAD RAZZAQ, UMER MAQBOOL, SONYA TAHIR, A. RAUF BAIG Department of Computer Science (Machine Intelligence Group) National University
More informationMining Frequent Patterns Based on Data Characteristics
Mining Frequent Patterns Based on Data Characteristics Lan Vu, Gita Alaghband, Senior Member, IEEE Department of Computer Science and Engineering, University of Colorado Denver, Denver, CO, USA {lan.vu,
More informationSynthesis of FSMs on the Basis of Reusable Hardware Templates
Proceedings of the 6th WSEAS Int. Conf. on Systems Theory & Scientific Comutation, Elounda, Greece, August -3, 006 (09-4) Synthesis of FSMs on the Basis of Reusable Hardware Temlates VALERY SKLYAROV, IOULIIA
More informationInfrequent Weighted Itemset Mining Using Frequent Pattern Growth
Infrequent Weighted Itemset Mining Using Frequent Pattern Growth Namita Dilip Ganjewar Namita Dilip Ganjewar, Department of Computer Engineering, Pune Institute of Computer Technology, India.. ABSTRACT
More informationPurna Prasad Mutyala et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 2 (5), 2011,
Weighted Association Rule Mining Without Pre-assigned Weights PURNA PRASAD MUTYALA, KUMAR VASANTHA Department of CSE, Avanthi Institute of Engg & Tech, Tamaram, Visakhapatnam, A.P., India. Abstract Association
More informationA Survey on Moving Towards Frequent Pattern Growth for Infrequent Weighted Itemset Mining
A Survey on Moving Towards Frequent Pattern Growth for Infrequent Weighted Itemset Mining Miss. Rituja M. Zagade Computer Engineering Department,JSPM,NTC RSSOER,Savitribai Phule Pune University Pune,India
More informationAn Efficient Reduced Pattern Count Tree Method for Discovering Most Accurate Set of Frequent itemsets
IJCSNS International Journal of Computer Science and Network Security, VOL.8 No.8, August 2008 121 An Efficient Reduced Pattern Count Tree Method for Discovering Most Accurate Set of Frequent itemsets
More informationA Further Study in the Data Partitioning Approach for Frequent Itemsets Mining
A Further Study in the Data Partitioning Approach for Frequent Itemsets Mining Son N. Nguyen, Maria E. Orlowska School of Information Technology and Electrical Engineering The University of Queensland,
More informationAN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE
AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE Vandit Agarwal 1, Mandhani Kushal 2 and Preetham Kumar 3
More informationAnju Singh Information Technology,Deptt. BUIT, Bhopal, India. Keywords- Data mining, Apriori algorithm, minimum support threshold, multiple scan.
Volume 3, Issue 7, July 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Survey on Association
More informationThe Spatial Skyline Queries
The Satial Skyline Queries Mehdi Sharifzadeh Comuter Science Deartment University of Southern California Los Angeles, CA 90089-078 sharifza@usc.edu Cyrus Shahabi Comuter Science Deartment University of
More informationAn Approximate Approach for Mining Recently Frequent Itemsets from Data Streams *
An Approximate Approach for Mining Recently Frequent Itemsets from Data Streams * Jia-Ling Koh and Shu-Ning Shin Department of Computer Science and Information Engineering National Taiwan Normal University
More informationA Conflict-Based Confidence Measure for Associative Classification
A Conflict-Based Confidence Measure for Associative Classification Peerapon Vateekul and Mei-Ling Shyu Department of Electrical and Computer Engineering University of Miami Coral Gables, FL 33124, USA
More informationImproved Image Super-Resolution by Support Vector Regression
Proceedings of International Joint Conference on Neural Networks, San Jose, California, USA, July 3 August 5, 0 Imroved Image Suer-Resolution by Suort Vector Regression Le An and Bir Bhanu Abstract Suort
More informationModel for Load Balancing on Processors in Parallel Mining of Frequent Itemsets
American Journal of Applied Sciences 2 (5): 926-931, 2005 ISSN 1546-9239 Science Publications, 2005 Model for Load Balancing on Processors in Parallel Mining of Frequent Itemsets 1 Ravindra Patel, 2 S.S.
More informationImproving Efficiency of Apriori Algorithm using Cache Database
Improving Efficiency of Apriori Algorithm using Cache Database Priyanka Asthana VIth Sem, BUIT, Bhopal Computer Science Deptt. Divakar Singh Computer Science Deptt. BUIT, Bhopal ABSTRACT One of the most
More informationISSN: ISO 9001:2008 Certified International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 2, Issue 2, March 2013
A Novel Approach to Mine Frequent Item sets Of Process Models for Cloud Computing Using Association Rule Mining Roshani Parate M.TECH. Computer Science. NRI Institute of Technology, Bhopal (M.P.) Sitendra
More informationConcurrent Processing of Frequent Itemset Queries Using FP-Growth Algorithm
Concurrent Processing of Frequent Itemset Queries Using FP-Growth Algorithm Marek Wojciechowski, Krzysztof Galecki, Krzysztof Gawronek Poznan University of Technology Institute of Computing Science ul.
More informationShuigeng Zhou. May 18, 2016 School of Computer Science Fudan University
Query Processing Shuigeng Zhou May 18, 2016 School of Comuter Science Fudan University Overview Outline Measures of Query Cost Selection Oeration Sorting Join Oeration Other Oerations Evaluation of Exressions
More informationAn Efficient Algorithm for Mining Association Rules using Confident Frequent Itemsets
0 0 Third International Conference on Advanced Computing & Communication Technologies An Efficient Algorithm for Mining Association Rules using Confident Frequent Itemsets Basheer Mohamad Al-Maqaleh Faculty
More information