Granular Computing: Models and Applications

Jianchao Han (1), Tsau Young Lin (2)
(1) Department of Computer Science, California State University, Dominguez Hills, Carson, CA
(2) Department of Computer Science, San Jose State University, CA
Author to whom all correspondence should be addressed: jhan@csudh.edu. tylin@cs.sjsu.edu.
International Journal of Intelligent Systems, Vol. 25 (2010). © 2009 Wiley Periodicals, Inc. Published online in Wiley InterScience.

1. INTRODUCTION

Granular computing (GrC) is a general computing paradigm that deals effectively with elements and granules, where granules are loosely defined generalized subsets. The objective of granular computing research is to build an efficient computational model for handling huge amounts of data, information, and knowledge. The term granular computing was first proposed by Professor T. Y. Lin as a label for a family of theories, methodologies, and techniques that make use of granules, although its basic ideas and principles have been studied in various application domains for a long time. In the form of partitions in particular, the theory has accumulated over thousands of years in mathematics, so the focus of GrC is on nonpartition models.

Let us first recall some results and ideas from the pre-GrC era, that is, before the term was coined. The explicit study of granular computing dates back to the late 1970s. In 1979, Zadeh [2] introduced the notion of information granulation and suggested that fuzzy set theory might find applications in this respect. Although we address the nonpartition theories, the partition case was the main source of inspiration. In 1982, Pawlak [3] proposed rough set theory to deal with inexact information. It is an uncertainty theory that uses a special form of granules, called equivalence classes. It was primarily rough set theory (a partition theory) that led researchers to realize the importance of a systematic study of the generalized notion, GrC.

In 1985, Hobbs [4] presented a theory of granularity as the basis of knowledge representation, abstraction, heuristic search, and reasoning. In his theory, the problem world is represented as various grains, and only the interesting ones are abstracted to learn concepts. The world can be conceptualized at different granularities, and one can switch between them. Even though his discussion mainly focused on the partition case, his model is more general than rough sets: it includes reflexive and symmetric binary relations.

Motivated by approximate retrieval, Lin [5] introduced the notion of neighborhood systems as models of uncertainty; a neighborhood is a unit of uncertainty. The underlying mathematics originates in topology, where such structures are called topological neighborhood systems: every point p is assigned a collection of subsets that satisfy a set of axioms (the axioms of topology). Each such subset is called a neighborhood of p, and the point p is called a center of the neighborhood. Lin removed the axioms and extended the theory. The extended notion is quite general; for example:

1. The family of α-cuts of a fuzzy set is a neighborhood system of a real set. A real set consists of those points whose memberships are exactly one.
2. A partition (equivalence relation) is a topological neighborhood system and forms a very special case of topological spaces, called a clopen space (or Pawlak topological space).
3. A binary relation is a special form of neighborhood system, in which each point has at most one neighborhood. In 1989, Lin [7] used the neighborhood as a unit of basic knowledge (a list of foes) and applied it to computer security; see also Ref.
4. A covering {X_i | i = 1, 2, ...} is an open neighborhood system. [9] Each X_i is a neighborhood of every point in it. In other words, every point in the neighborhood is a center point.

In 1992, Giunchiglia and Walsh [10] presented a theory of abstraction to improve the conceptualization of granularities. In 1996, Lin, building on Zadeh's granular mathematics (GrM), proposed the term granular computing and formed a special interest group. In Refs. 11 and 12, Zadeh outlined his views of GrC/GrM.
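The examples of neighborhood systems listed above can be made concrete with a small sketch. The representation below (a dictionary from each point to a list of neighborhoods) and all function names are assumptions chosen for illustration; they are not taken from the cited papers.

```python
from typing import Dict, FrozenSet, Hashable, Iterable, List, Tuple

Point = Hashable
# Assumed representation: each point maps to a (possibly empty) list of neighborhoods.
NeighborhoodSystem = Dict[Point, List[FrozenSet[Point]]]


def from_binary_relation(universe: Iterable[Point],
                         relation: Iterable[Tuple[Point, Point]]) -> NeighborhoodSystem:
    """Binary relation: each point p gets at most one neighborhood {q : (p, q) in R}."""
    pairs = list(relation)
    system: NeighborhoodSystem = {}
    for p in universe:
        neighborhood = frozenset(q for (x, q) in pairs if x == p)
        system[p] = [neighborhood] if neighborhood else []
    return system


def from_partition(blocks: Iterable[Iterable[Point]]) -> NeighborhoodSystem:
    """Partition: the only neighborhood of a point is its equivalence class."""
    system: NeighborhoodSystem = {}
    for block in blocks:
        frozen = frozenset(block)
        for p in frozen:
            system[p] = [frozen]
    return system


def from_covering(universe: Iterable[Point],
                  cover: Iterable[Iterable[Point]]) -> NeighborhoodSystem:
    """Covering: every covering set is a neighborhood of each point it contains."""
    sets = [frozenset(c) for c in cover]
    return {p: [c for c in sets if p in c] for p in universe}


def alpha_cuts(membership: Dict[Point, float],
               alphas: Iterable[float]) -> List[FrozenSet[Point]]:
    """Fuzzy set: the alpha-cuts {x : membership(x) >= alpha}, one per alpha."""
    return [frozenset(x for x, m in membership.items() if m >= a) for a in alphas]


universe = {1, 2, 3, 4}
by_relation = from_binary_relation(universe, {(1, 2), (1, 3), (2, 2)})
by_partition = from_partition([{1, 2}, {3, 4}])
by_covering = from_covering(universe, [{1, 2}, {2, 3}, {3, 4}])
cuts = alpha_cuts({"a": 1.0, "b": 0.6, "c": 0.2}, [0.5, 1.0])
```

In this toy data, point 2 of the covering has two neighborhoods, whereas every point of the binary relation has at most one, which is the distinction drawn in items 3 and 4.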
Since then, granular computing has received increasing attention. Much research has been conducted on various aspects of the area, and GrC has begun to play important roles in fields such as approximate retrieval, machine learning, data mining, bioinformatics, e-business, computer security, control, high-performance computing, and wireless mobile computing. [5,7,13–23]

In this special issue, we collect six articles that reflect the developments and applications of some special models/views of granular computing, including the lattice model, rough set model, association analysis model, and classification model.

2. MODELS AND APPLICATIONS

Granular computing has been a shifting paradigm, and various views have been proposed. Informally, computing theories and models that deal with granules may be called granular computing, or a softer version of granular computing, where granules are generalized subsets that are regarded as given basic knowledge.

In the 1980s, although many AI and database/knowledge researchers, including the Japanese Fifth Generation computing project, had proposed many complex knowledge models, a simpler and effectively computable view of knowledge was still needed. Rough set theory took a courageous step: it assumes that partitions (classifications) are the essence of human knowledge and focuses on developing knowledge engineering and uncertainty management at this level. These developments have been rewarded with great success.

Intuitively, elements are the data, and granules are the (units of) basic knowledge or lack of knowledge (uncertainty). So granular computing provides the infrastructure for data and knowledge engineering and uncertainty management, or more generally AI engineering.

Recently, Lin proposed a set of examples, including Zadeh's intuitive view of GrC, to define GrC implicitly, and a category-theory-based GrC model to define GrC formally. [24,25] In his model, granules take two forms, commutative and noncommutative. For example, the family of ordered keyword sets (a text is a linearly ordered sequence of words), a collection of committees in social networks (each member has a distinct role), and a collection of tuples (in relations) are collections of noncommutative granules. Neighborhoods are commutative granules.

Hierarchy theory [26] and the lattice model have been applied to construct and formalize concept hierarchies. In an article in this issue, The Design and Application of Structured Types in Ptolemy II, Zhao, Xiong, Lee, Liu, and Zhong organize all base data types, such as integer, double, Boolean, and char, into a lattice that models the subtyping relations among them. Type constraints within and across components are described over the lattice as inequalities that can be solved efficiently. Structured types, including arrays, records, and unions, can be added to the base type lattice. The authors expose and discuss some technical challenges of adding structured types, especially the infiniteness of the lattice, recursive structured types, and inequality constraints on structured types. They propose solutions to these challenges and apply them to design and model a simplified charity organization and wireless protocols based on IEEE media access control and physical specifications.
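To illustrate how type constraints can be written as inequalities over a lattice and solved, here is a minimal sketch. The six-element lattice, the type names, and the function names are hypothetical and chosen for illustration; this is not Ptolemy II's actual type lattice or solver. The solver uses plain fixed-point iteration: every variable starts at the bottom element and is raised to the least upper bound demanded by each constraint until nothing changes.

```python
# Hypothetical toy lattice: each type maps to the types directly above it.
PARENTS = {
    "unknown": ["boolean", "int"],   # bottom element
    "boolean": ["string"],
    "int": ["double"],
    "double": ["string"],
    "string": ["general"],
    "general": [],                   # top element
}


def up_set(t):
    """All types greater than or equal to t in the lattice order."""
    seen, stack = {t}, [t]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def leq(a, b):
    """True when a is below or equal to b."""
    return b in up_set(a)


def lub(a, b):
    """Least upper bound: the common upper bound that lies below all the others."""
    common = up_set(a) & up_set(b)
    return next(c for c in common if all(leq(c, d) for d in common))


def solve(constraints, variables):
    """Least solution of inequalities (lhs <= var); lhs is a type constant or a variable."""
    value = {v: "unknown" for v in variables}
    changed = True
    while changed:
        changed = False
        for lhs, var in constraints:
            lhs_type = value.get(lhs, lhs)   # look up a variable, else treat as a constant
            raised = lub(value[var], lhs_type)
            if raised != value[var]:
                value[var] = raised
                changed = True
    return value


# x must accept int; y must accept x and double; z must accept boolean and y.
constraints = [("int", "x"), ("x", "y"), ("double", "y"), ("boolean", "z"), ("y", "z")]
solution = solve(constraints, ["x", "y", "z"])
```

Because the lattice is finite and each update only moves a variable upward, the loop terminates. An infinite lattice, which the article identifies as a challenge once structured types are added, would need additional care that this sketch does not attempt.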
According to Agrawal et al., [27] mining association rules from a large database of customer transactions, each consisting of the items purchased by a customer, means finding significant associations between items, that is, which items are always or often bought together with which other items. These significant associations are described as a set of associations and measured quantitatively with support and confidence. Various algorithms for mining associations have been developed. The problem of mining association rules can be reduced to GrC. [5,15,16]

Qiu, Chen, Liu, and Huang present a Granular Computing Approach to Finding Association Rules in Relational Database in this special issue. Elementary granules are defined in two aspects, intension and extension: the intension is an attribute-value pair, and the extension is the collection of objects (records in a database table) that satisfy the intension. All elementary granules are generated by scanning a relational database table and are stored in an in-memory elementary granule table. This table holds each elementary granule as a 3-tuple consisting of the number of objects, the intension, and a pointer to the linked list of objects (the extension). Frequent 1-itemsets are selected from the elementary granule table to form a frequent 1-itemset granule table. By keeping attributes in order, frequent 2-itemsets are generated from frequent 1-itemsets and stored in frequent 2-itemset granule tables, frequent 3-itemsets are generated from frequent 2-itemsets and stored in frequent 3-itemset granule tables, and so on. In general, frequent k-itemsets are generated by combining two frequent (k − 1)-itemsets from different nodes of the same linked list if the combination satisfies the support threshold. The corresponding algorithms are described and illustrated with a simplified example and a test data set. The authors claim that the algorithms can reduce the number of candidate itemsets and save computing time.
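The granule-based procedure described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: Python dictionaries and sets stand in for the granule tables and linked lists, support is an absolute row count, and the names `elementary_granules` and `frequent_itemsets` are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def elementary_granules(rows):
    """Map each intension (attribute, value) to its extension (set of row indices)."""
    granules = defaultdict(set)
    for index, row in enumerate(rows):
        for attribute, value in row.items():
            granules[(attribute, value)].add(index)
    return dict(granules)


def frequent_itemsets(rows, min_support):
    """Level-wise search: join frequent (k-1)-itemsets and intersect their extensions."""
    granules = elementary_granules(rows)
    # Frequent 1-itemsets, keyed by a 1-tuple of (attribute, value) pairs.
    level = {(item,): ext for item, ext in granules.items() if len(ext) >= min_support}
    result = dict(level)
    while level:
        next_level = {}
        ordered = sorted(level.items(), key=lambda entry: entry[0])
        for (a, ext_a), (b, ext_b) in combinations(ordered, 2):
            # Join itemsets that share a prefix; attributes stay ordered and distinct.
            if a[:-1] == b[:-1] and a[-1][0] < b[-1][0]:
                extension = ext_a & ext_b
                if len(extension) >= min_support:
                    next_level[a + (b[-1],)] = extension
        result.update(next_level)
        level = next_level
    return result


rows = [
    {"outlook": "sunny", "windy": "no", "play": "yes"},
    {"outlook": "sunny", "windy": "no", "play": "yes"},
    {"outlook": "rainy", "windy": "yes", "play": "no"},
    {"outlook": "sunny", "windy": "yes", "play": "no"},
]
itemsets = frequent_itemsets(rows, min_support=2)
```

Because each itemset carries its extension, the support of a combined itemset is obtained by intersecting two sets of row indices, so the table is scanned only once, when the elementary granules are built.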

In a sense, Qiu et al. take advantage of the rough set computing model by defining the elementary granules as partitions based on attribute-value pairs, which are special cases of the indiscernibility equivalence relations used extensively in rough set theory.

In another article, A Method of Discovering Important Rules Using Rules as Attributes, Li and Cercone apply rough set theory to find significant and important association rules. First, they use an existing reduct generation algorithm based on rough set theory to find attribute reducts of the original data set, and then they generate a set of association rules for each reduct using the classic Apriori algorithm. In the resulting association rules, the antecedent of a rule is drawn from the values of the condition attributes in a reduct, and the consequent is drawn from the values of the decision attributes of the original data set. Since reducts contain the most representative and important condition attributes of a decision table, the authors assume that rules extracted from these reducts are representative of the original decision table and are therefore more important than rules generated without using reducts. Following this intuition, the rules generated from the reducts are used to construct a new decision table in which each individual rule is a condition attribute and the decision attributes are kept unchanged. A reduct of such a decision table contains its representative and important attributes, which here are association rules. The reduct generation algorithm is therefore applied to this newly constructed decision table, and the result is a reduct consisting of the set of rules that are most important.
In their article, Mining Hidden Connections among Biomedical Concepts from Disjoint Biomedical Literature Sets through Semantic-Based Association Rule, Hu, Zhang, Li, Yoo, Zhou, Xu, and Wu adapt the association rule mining approach to automatically identify implicit, novel connections among biomedical concepts from disjoint biomedical literature sets. The adapted rules are called semantic-based association rules. The underlying idea is as follows. Assume that three biomedical concepts A, B, and C occur in the biomedical literature, where A and C occur in two disjoint sets of documents, but B co-occurs with A and with C in some documents. If A is associated with B and B is associated with C, there is reason to expect that A and C may have some association or correlation. To find such associations, the authors first apply an association rule mining algorithm to the two disjoint sets of biomedical literature separately, generating association rules of the form A → B from the first set of documents and B → C from the second, and then apply the transitive law to conclude the novel implication A → C.

Because association rules are not actually logical implications, this transitive step may be too weak to guarantee an association between A and C in the sense of association rules under the support-confidence framework. Moreover, this simple application of transitivity generates a huge number of possible connections among the millions of biomedical concepts, and many of these hypothetical connections are spurious, useless, and/or biologically meaningless. To get around this problem, Hu et al. develop a new approach, called the biomedical semantic-based association rule system, or Bio-SARS for short, to generate novel and biologically relevant connections among biomedical concepts that are highly likely to hold. With semantic filtering, Bio-SARS can significantly reduce the number of spurious, useless, and biologically irrelevant connections, where the semantics are based on biomedical ontologies such as MeSH and the Unified Medical Language System (UMLS).

Given a concept C, the algorithm searches online biomedical databases for a set of biomedical documents in which C occurs. It then extracts a set of concepts B from the retrieved documents and uses the semantic knowledge in UMLS to compare the semantic type of each concept in B with that of C. All concepts in B whose semantic types differ from that of C are filtered out, and the remaining concepts in B that have sufficiently strong associations with C are selected. These are used in turn as given concepts to find, in the same way, another set of concepts A such that every concept in A co-occurs with concepts in B in some documents but never co-occurs with C in any document. After the concepts in A that have weak associations with concepts in B are removed, the remaining concepts in A are verified to have strong associations with the concept C. The experiments demonstrate that the discovered associations among biomedical concepts are novel and can be useful to domain experts in conducting new experiments, trying new treatments, and so forth.

Some authors regard classification and clustering analysis as the same model of granular computing, but views differ. Clustering in data mining partitions a data set by similarity. In other words, it transforms a granulation (granulated by similarity) into a partition; see Ref. 9 for the notion of induced partition. The classification problem is, given a training set of examples with class labels, to construct a classifier that can assign a class label to a new, unlabeled example that is not in the training set. All examples are described with a given set of features (attributes). It is one form of partitioning.
In both models, the granules are groups of the given set of examples, and the granulation partitions the set of examples into groups.

In this special issue, an article Ranking and Selecting Terms for Text Categorization via SVM Discriminate Boundary is presented by Kuo and Yajima. They use the discriminant boundary between classes found by support vector machines (SVMs) to rank and select terms for text categorization in document classification. To classify documents into predefined categories, documents are converted into vectors composed of the words that occur in them, and document classification is then performed on these document vectors. However, since documents usually contain many redundant words, it is mandatory to select the most significant, discriminant, and representative words to represent the corresponding documents. The currently dominant approaches, based on LSI (latent semantic indexing) [9] and χ² statistics, are commonly used in document classification and retrieval research but have some weaknesses. The authors exploit several properties of SVMs with RBF (radial basis function) kernels to calculate the data points that lie on the nonlinear discriminant boundary, and they show that these data points, as well as their gradient vectors, can be calculated efficiently using only elementary matrix and vector operations. Gradient vectors on the boundary are defined, and the contribution of each word to the document is given by the diagonal of the associated decision boundary feature matrix (DBFM) generated from the collection of these gradient vectors. The discriminant and significant words can then be ranked and selected according to their contributions. The authors also demonstrate with experiments on real-world data sets that the proposed method performs much better than existing approaches.

Finally, we come to the paper A Novel Clustering Algorithm Using Hypergraph Based Granular Computing. The authors, Liu, Liao, Yang, and He, apply granular computing and hypergraphs [9] to clustering a set of documents. Conceptually, this article presents a very interesting view on transforming a granulated space into a partition. A hypergraph (or simplicial complex) is itself a granular model: the vertices form the underlying universe, and the hyperedges (simplexes) are the granules. Basically, the approach extracts the frequent itemsets with an association rule mining algorithm, and these frequent itemsets form the hyperedges (simplexes) of a hypergraph (simplicial complex). A multilevel hypergraph partitioning algorithm is then used to partition the hypergraph into k parts. Two criteria, fitness and connectivity, are defined to prune bad clusters and bad vertices. Experiments with this method have been conducted on various data sets, and the results are compared across data sets in terms of cluster entropy and algorithm response time.

References

1. Lin TY. Granular computing. Announcement of the BISC Special Interest Group on Granular Computing.
2. Zadeh LA. Fuzzy sets and information granularity. In: Gupta M, Ragade R, Yager R, editors. Advances in fuzzy set theory and applications. Amsterdam: North-Holland.
3. Pawlak Z. Rough sets. Int J Comput Inf Sci 1982;11.
4. Hobbs JR. Granularity. In: Proc of the 9th Int Joint Conf on Artificial Intelligence.
5. Lin TY. Neighborhood systems and approximation in database and knowledge base systems. In: Proc of the Fourth Int Symp on Methodologies of Intelligent Systems, Poster Session, October 12–15.
6. Lin TY. Topological and fuzzy rough sets. In: Slowinski R, editor. Decision support by experience application of the rough sets theory. Boston, MA: Kluwer.
7. Lin TY. Chinese wall security policy an aggressive model. In: Proc of the Fifth Aerospace Computer Security Application Conf, December 4–8.
8. Lin TY. Chinese wall security policy models: information flows and confining trojan horses. In: Proc 17th IFIP11.3 Working Conf on Database and Application Security, Estes Park, CO, August 4–6, 2003.
9. Lin TY. Granular computing on binary relations I: data mining and neighborhood systems and II: rough set representations and belief functions. In: Skowron A, Polkowski L, editors. Rough sets in knowledge discovery. Heidelberg, Germany: Physica-Verlag.
10. Giunchiglia F, Walsh T. A theory of abstraction. Artif Intell 1992;56.
11. Zadeh LA. Towards a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzzy Sets Syst 1997;19.
12. Zadeh LA. Some reflections on soft computing, granular computing and their roles in the conception, design and utilization of information/intelligent systems. Soft Comput 1998;2.
13. Bargiela A, Pedrycz W. Granular computing: an introduction. Boston, MA: Kluwer.
14. Lin TY. A rough logic formalism for fuzzy controllers: a hard and soft computing view. Int J Approx Reason 1996;15(4).
15. Lin TY. Data mining: granular computing approach. In: Proc the Pacific-Asia Conference on Knowledge Discovery and Data Mining, Beijing, China, April 26–28, 1999.
16. Lin TY. Data mining and machine oriented modeling: a granular computing approach. J Appl Intell 2000;13(2).
17. Lin TY, Chiang I-J. A simplicial complex, a hypergraph, structure in the latent semantic space of document clustering. Int J Approx Reason 2005;40(1–2).
18. Pawlak Z. Granularity of knowledge, indiscernibility and rough sets. In: Proc IEEE Int Conf on Fuzzy Systems.
19. Peters JF, Pawlak Z, Skowron A. A rough set approach to measuring information granules. In: Proc COMPSAC 2002, Oxford, England.
20. Polkowski L, Skowron A. Towards adaptive calculus of granules. In: Proc IEEE Int Conf on Fuzzy Systems.
21. Skowron A, Stepaniuk J. Information granules: towards foundations of granular computing. Int J Intell Syst 2001;16.
22. Yao YY. Perspectives of granular computing. In: Proc of IEEE Int Conf on Granular Computing 1, Beijing, China, 2005.
23. Zhang L, Zhang B. The quotient space theory of problem solving. LNCS 2003;2639.
24. Lin TY. Granular computing, practices, theories and future directions. In: Meyers RA, editor. Encyclopedia on complexity and systems. Berlin: Springer.
25. Lin TY. Granular computing I: the concept of granulation and its formal model. Int J Granular Comput Rough Sets Intell Syst 2009;1(1).
26. Ahl V, Allen TFH. Hierarchy theory, a vision, vocabulary and epistemology. Irvington, NY: Columbia University Press.
27. Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases. In: Proc ACM SIGMOD Conf, Washington DC.
28. Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proc of the 20th Int Conf Very Large Data Bases, Santiago, Chile.
29. Houtsma M, Swami A. Set-oriented mining for association rules in relational databases. In: Proc of the IEEE Int Conf on Data Engineering.
30. Klemettinen M, Mannila H, Ronkainen P, Toivonen H, Verkamo A. Finding interesting rules from large sets of discovered association rules. In: 3rd Int Conf on Information and Knowledge Management.
31. Savasere A, Omiecinski E, Navathe S. An efficient algorithm for mining association rules. In: Proc of 21st VLDB Conf, Switzerland; 1995.


More information

Data Analysis and Mining in Ordered Information Tables

Data Analysis and Mining in Ordered Information Tables Data Analysis and Mining in Ordered Information Tables Ying Sai, Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Ning Zhong

More information

Discovery of Multi Dimensional Quantitative Closed Association Rules by Attributes Range Method

Discovery of Multi Dimensional Quantitative Closed Association Rules by Attributes Range Method Discovery of Multi Dimensional Quantitative Closed Association Rules by Attributes Range Method Preetham Kumar, Ananthanarayana V S Abstract In this paper we propose a novel algorithm for discovering multi

More information

Hierarchical Online Mining for Associative Rules

Hierarchical Online Mining for Associative Rules Hierarchical Online Mining for Associative Rules Naresh Jotwani Dhirubhai Ambani Institute of Information & Communication Technology Gandhinagar 382009 INDIA naresh_jotwani@da-iict.org Abstract Mining

More information

A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set

A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set Renu Vashist School of Computer Science and Engineering Shri Mata Vaishno Devi University, Katra,

More information

To Enhance Projection Scalability of Item Transactions by Parallel and Partition Projection using Dynamic Data Set

To Enhance Projection Scalability of Item Transactions by Parallel and Partition Projection using Dynamic Data Set To Enhance Scalability of Item Transactions by Parallel and Partition using Dynamic Data Set Priyanka Soni, Research Scholar (CSE), MTRI, Bhopal, priyanka.soni379@gmail.com Dhirendra Kumar Jha, MTRI, Bhopal,

More information

Generalized Infinitive Rough Sets Based on Reflexive Relations

Generalized Infinitive Rough Sets Based on Reflexive Relations 2012 IEEE International Conference on Granular Computing Generalized Infinitive Rough Sets Based on Reflexive Relations Yu-Ru Syau Department of Information Management National Formosa University Huwei

More information

Structure of Association Rule Classifiers: a Review

Structure of Association Rule Classifiers: a Review Structure of Association Rule Classifiers: a Review Koen Vanhoof Benoît Depaire Transportation Research Institute (IMOB), University Hasselt 3590 Diepenbeek, Belgium koen.vanhoof@uhasselt.be benoit.depaire@uhasselt.be

More information

Using Association Rules for Better Treatment of Missing Values

Using Association Rules for Better Treatment of Missing Values Using Association Rules for Better Treatment of Missing Values SHARIQ BASHIR, SAAD RAZZAQ, UMER MAQBOOL, SONYA TAHIR, A. RAUF BAIG Department of Computer Science (Machine Intelligence Group) National University

More information

An Algorithm for Frequent Pattern Mining Based On Apriori

An Algorithm for Frequent Pattern Mining Based On Apriori An Algorithm for Frequent Pattern Mining Based On Goswami D.N.*, Chaturvedi Anshu. ** Raghuvanshi C.S.*** *SOS In Computer Science Jiwaji University Gwalior ** Computer Application Department MITS Gwalior

More information

A Granular Computing Approach. T.Y. Lin 1;2. Abstract. From the processing point of view, data mining is machine

A Granular Computing Approach. T.Y. Lin 1;2. Abstract. From the processing point of view, data mining is machine Data Mining and Machine Oriented Modeling: A Granular Computing Approach T.Y. Lin 1;2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192 tylin@cs.sjsu.edu

More information

A study on lower interval probability function based decision theoretic rough set models

A study on lower interval probability function based decision theoretic rough set models Annals of Fuzzy Mathematics and Informatics Volume 12, No. 3, (September 2016), pp. 373 386 ISSN: 2093 9310 (print version) ISSN: 2287 6235 (electronic version) http://www.afmi.or.kr @FMI c Kyung Moon

More information

Combined Intra-Inter transaction based approach for mining Association among the Sectors in Indian Stock Market

Combined Intra-Inter transaction based approach for mining Association among the Sectors in Indian Stock Market Ranjeetsingh BParihar et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol 3 (3), 01,3895-3899 Combined Intra-Inter transaction based approach for mining Association

More information

Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems

Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems Nancy Mehta,Neera Bawa Lect. In CSE, JCDV college of Engineering. (mehta_nancy@rediffmail.com,

More information

Appropriate Item Partition for Improving the Mining Performance

Appropriate Item Partition for Improving the Mining Performance Appropriate Item Partition for Improving the Mining Performance Tzung-Pei Hong 1,2, Jheng-Nan Huang 1, Kawuu W. Lin 3 and Wen-Yang Lin 1 1 Department of Computer Science and Information Engineering National

More information

ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM

ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM Pavel Jirava Institute of System Engineering and Informatics Faculty of Economics and Administration, University of Pardubice Abstract: This article

More information

Materialized Data Mining Views *

Materialized Data Mining Views * Materialized Data Mining Views * Tadeusz Morzy, Marek Wojciechowski, Maciej Zakrzewicz Poznan University of Technology Institute of Computing Science ul. Piotrowo 3a, 60-965 Poznan, Poland tel. +48 61

More information

Mining of Web Server Logs using Extended Apriori Algorithm

Mining of Web Server Logs using Extended Apriori Algorithm International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational

More information

A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values

A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values Patrick G. Clark Department of Electrical Eng. and Computer Sci. University of Kansas Lawrence,

More information

Mining Temporal Association Rules in Network Traffic Data

Mining Temporal Association Rules in Network Traffic Data Mining Temporal Association Rules in Network Traffic Data Guojun Mao Abstract Mining association rules is one of the most important and popular task in data mining. Current researches focus on discovering

More information

DIVERSITY-BASED INTERESTINGNESS MEASURES FOR ASSOCIATION RULE MINING

DIVERSITY-BASED INTERESTINGNESS MEASURES FOR ASSOCIATION RULE MINING DIVERSITY-BASED INTERESTINGNESS MEASURES FOR ASSOCIATION RULE MINING Huebner, Richard A. Norwich University rhuebner@norwich.edu ABSTRACT Association rule interestingness measures are used to help select

More information

A Conflict-Based Confidence Measure for Associative Classification

A Conflict-Based Confidence Measure for Associative Classification A Conflict-Based Confidence Measure for Associative Classification Peerapon Vateekul and Mei-Ling Shyu Department of Electrical and Computer Engineering University of Miami Coral Gables, FL 33124, USA

More information

ASIAN JOURNAL OF MANAGEMENT RESEARCH Online Open Access publishing platform for Management Research

ASIAN JOURNAL OF MANAGEMENT RESEARCH Online Open Access publishing platform for Management Research ASIAN JOURNAL OF MANAGEMENT RESEARCH Online Open Access publishing platform for Management Research Copyright 2010 All rights reserved Integrated Publishing association Review Article ISSN 2229 3795 The

More information

Available online at ScienceDirect. Procedia Computer Science 96 (2016 )

Available online at   ScienceDirect. Procedia Computer Science 96 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 96 (2016 ) 179 186 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems,

More information

Graph Based Approach for Finding Frequent Itemsets to Discover Association Rules

Graph Based Approach for Finding Frequent Itemsets to Discover Association Rules Graph Based Approach for Finding Frequent Itemsets to Discover Association Rules Manju Department of Computer Engg. CDL Govt. Polytechnic Education Society Nathusari Chopta, Sirsa Abstract The discovery

More information

Web page recommendation using a stochastic process model

Web page recommendation using a stochastic process model Data Mining VII: Data, Text and Web Mining and their Business Applications 233 Web page recommendation using a stochastic process model B. J. Park 1, W. Choi 1 & S. H. Noh 2 1 Computer Science Department,

More information

Rough Approximations under Level Fuzzy Sets

Rough Approximations under Level Fuzzy Sets Rough Approximations under Level Fuzzy Sets W.-N. Liu J.T. Yao Y.Y.Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: [liuwe200, jtyao, yyao]@cs.uregina.ca

More information

A Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study

A Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study A Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study Mirzaei.Afshin 1, Sheikh.Reza 2 1 Department of Industrial Engineering and

More information

Data Access Paths for Frequent Itemsets Discovery

Data Access Paths for Frequent Itemsets Discovery Data Access Paths for Frequent Itemsets Discovery Marek Wojciechowski, Maciej Zakrzewicz Poznan University of Technology Institute of Computing Science {marekw, mzakrz}@cs.put.poznan.pl Abstract. A number

More information

Content Based Image Retrieval system with a combination of Rough Set and Support Vector Machine

Content Based Image Retrieval system with a combination of Rough Set and Support Vector Machine Shahabi Lotfabadi, M., Shiratuddin, M.F. and Wong, K.W. (2013) Content Based Image Retrieval system with a combination of rough set and support vector machine. In: 9th Annual International Joint Conferences

More information

Fast Discovery of Sequential Patterns Using Materialized Data Mining Views

Fast Discovery of Sequential Patterns Using Materialized Data Mining Views Fast Discovery of Sequential Patterns Using Materialized Data Mining Views Tadeusz Morzy, Marek Wojciechowski, Maciej Zakrzewicz Poznan University of Technology Institute of Computing Science ul. Piotrowo

More information

Granular association rules for multi-valued data

Granular association rules for multi-valued data Granular association rules for multi-valued data Fan Min and William Zhu Lab of Granular Computing, Zhangzhou Normal University, Zhangzhou 363, China. Email: minfanphd@163.com, williamfengzhu@gmail.com

More information

Approximation Theories: Granular Computing vs Rough Sets

Approximation Theories: Granular Computing vs Rough Sets Approximation Theories: Granular Computing vs Rough Sets Tsau Young ( T. Y. ) Lin Department of Computer Science, San Jose State University San Jose, CA 95192-0249 tylin@cs.sjsu.edu Abstract. The goal

More information

Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction

Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction Jerzy W. Grzymala-Busse 1,2 1 Department of Electrical Engineering and Computer Science, University of

More information

CS570 Introduction to Data Mining

CS570 Introduction to Data Mining CS570 Introduction to Data Mining Frequent Pattern Mining and Association Analysis Cengiz Gunay Partial slide credits: Li Xiong, Jiawei Han and Micheline Kamber George Kollios 1 Mining Frequent Patterns,

More information

An Improved Apriori Algorithm for Association Rules

An Improved Apriori Algorithm for Association Rules Research article An Improved Apriori Algorithm for Association Rules Hassan M. Najadat 1, Mohammed Al-Maolegi 2, Bassam Arkok 3 Computer Science, Jordan University of Science and Technology, Irbid, Jordan

More information

Temporal Weighted Association Rule Mining for Classification

Temporal Weighted Association Rule Mining for Classification Temporal Weighted Association Rule Mining for Classification Purushottam Sharma and Kanak Saxena Abstract There are so many important techniques towards finding the association rules. But, when we consider

More information

AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE

AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE Vandit Agarwal 1, Mandhani Kushal 2 and Preetham Kumar 3

More information

A Model of Machine Learning Based on User Preference of Attributes

A Model of Machine Learning Based on User Preference of Attributes 1 A Model of Machine Learning Based on User Preference of Attributes Yiyu Yao 1, Yan Zhao 1, Jue Wang 2 and Suqing Han 2 1 Department of Computer Science, University of Regina, Regina, Saskatchewan, Canada

More information

An Efficient Tree-based Fuzzy Data Mining Approach

An Efficient Tree-based Fuzzy Data Mining Approach 150 International Journal of Fuzzy Systems, Vol. 12, No. 2, June 2010 An Efficient Tree-based Fuzzy Data Mining Approach Chun-Wei Lin, Tzung-Pei Hong, and Wen-Hsiang Lu Abstract 1 In the past, many algorithms

More information

COMBINATION OF ROUGH AND FUZZY SETS

COMBINATION OF ROUGH AND FUZZY SETS 1 COMBINATION OF ROUGH AND FUZZY SETS BASED ON α-level SETS Y.Y. Yao Department of Computer Science, Lakehead University Thunder Bay, Ontario, Canada P7B 5E1 E-mail: yyao@flash.lakeheadu.ca 1 ABSTRACT

More information

Concurrent Processing of Frequent Itemset Queries Using FP-Growth Algorithm

Concurrent Processing of Frequent Itemset Queries Using FP-Growth Algorithm Concurrent Processing of Frequent Itemset Queries Using FP-Growth Algorithm Marek Wojciechowski, Krzysztof Galecki, Krzysztof Gawronek Poznan University of Technology Institute of Computing Science ul.

More information

An Efficient Algorithm for finding high utility itemsets from online sell

An Efficient Algorithm for finding high utility itemsets from online sell An Efficient Algorithm for finding high utility itemsets from online sell Sarode Nutan S, Kothavle Suhas R 1 Department of Computer Engineering, ICOER, Maharashtra, India 2 Department of Computer Engineering,

More information

Applying Objective Interestingness Measures. in Data Mining Systems. Robert J. Hilderman and Howard J. Hamilton. Department of Computer Science

Applying Objective Interestingness Measures. in Data Mining Systems. Robert J. Hilderman and Howard J. Hamilton. Department of Computer Science Applying Objective Interestingness Measures in Data Mining Systems Robert J. Hilderman and Howard J. Hamilton Department of Computer Science University of Regina Regina, Saskatchewan, Canada SS 0A fhilder,hamiltong@cs.uregina.ca

More information

Improved Frequent Pattern Mining Algorithm with Indexing

Improved Frequent Pattern Mining Algorithm with Indexing IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VII (Nov Dec. 2014), PP 73-78 Improved Frequent Pattern Mining Algorithm with Indexing Prof.

More information

AN IMPROVED GRAPH BASED METHOD FOR EXTRACTING ASSOCIATION RULES

AN IMPROVED GRAPH BASED METHOD FOR EXTRACTING ASSOCIATION RULES AN IMPROVED GRAPH BASED METHOD FOR EXTRACTING ASSOCIATION RULES ABSTRACT Wael AlZoubi Ajloun University College, Balqa Applied University PO Box: Al-Salt 19117, Jordan This paper proposes an improved approach

More information

Mining High Average-Utility Itemsets

Mining High Average-Utility Itemsets Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Mining High Itemsets Tzung-Pei Hong Dept of Computer Science and Information Engineering

More information

A Mining Algorithm to Generate the Candidate Pattern for Authorship Attribution for Filtering Spam Mail

A Mining Algorithm to Generate the Candidate Pattern for Authorship Attribution for Filtering Spam Mail A Mining Algorithm to Generate the Candidate Pattern for Authorship Attribution for Filtering Spam Mail Khongbantabam Susila Devi #1, Dr. R. Ravi *2 1 Research Scholar, Department of Information & Communication

More information

Texture Image Segmentation using FCM

Texture Image Segmentation using FCM Proceedings of 2012 4th International Conference on Machine Learning and Computing IPCSIT vol. 25 (2012) (2012) IACSIT Press, Singapore Texture Image Segmentation using FCM Kanchan S. Deshmukh + M.G.M

More information

A New Approach To Graph Based Object Classification On Images

A New Approach To Graph Based Object Classification On Images A New Approach To Graph Based Object Classification On Images Sandhya S Krishnan,Kavitha V K P.G Scholar, Dept of CSE, BMCE, Kollam, Kerala, India Sandhya4parvathy@gmail.com Abstract: The main idea of

More information

Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing Environments

Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing Environments Send Orders for Reprints to reprints@benthamscience.ae 368 The Open Automation and Control Systems Journal, 2014, 6, 368-373 Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing

More information

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN: IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T

More information

SEQUENTIAL PATTERN MINING FROM WEB LOG DATA

SEQUENTIAL PATTERN MINING FROM WEB LOG DATA SEQUENTIAL PATTERN MINING FROM WEB LOG DATA Rajashree Shettar 1 1 Associate Professor, Department of Computer Science, R. V College of Engineering, Karnataka, India, rajashreeshettar@rvce.edu.in Abstract

More information

Pattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42

Pattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42 Pattern Mining Knowledge Discovery and Data Mining 1 Roman Kern KTI, TU Graz 2016-01-14 Roman Kern (KTI, TU Graz) Pattern Mining 2016-01-14 1 / 42 Outline 1 Introduction 2 Apriori Algorithm 3 FP-Growth

More information

An Improved Algorithm for Mining Association Rules Using Multiple Support Values

An Improved Algorithm for Mining Association Rules Using Multiple Support Values An Improved Algorithm for Mining Association Rules Using Multiple Support Values Ioannis N. Kouris, Christos H. Makris, Athanasios K. Tsakalidis University of Patras, School of Engineering Department of

More information

Data Mining: Concepts and Techniques. Chapter 5. SS Chung. April 5, 2013 Data Mining: Concepts and Techniques 1

Data Mining: Concepts and Techniques. Chapter 5. SS Chung. April 5, 2013 Data Mining: Concepts and Techniques 1 Data Mining: Concepts and Techniques Chapter 5 SS Chung April 5, 2013 Data Mining: Concepts and Techniques 1 Chapter 5: Mining Frequent Patterns, Association and Correlations Basic concepts and a road

More information

Chapter 4: Mining Frequent Patterns, Associations and Correlations

Chapter 4: Mining Frequent Patterns, Associations and Correlations Chapter 4: Mining Frequent Patterns, Associations and Correlations 4.1 Basic Concepts 4.2 Frequent Itemset Mining Methods 4.3 Which Patterns Are Interesting? Pattern Evaluation Methods 4.4 Summary Frequent

More information

Infrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset

Infrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset Infrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset M.Hamsathvani 1, D.Rajeswari 2 M.E, R.Kalaiselvi 3 1 PG Scholar(M.E), Angel College of Engineering and Technology, Tiruppur,

More information

Model for Load Balancing on Processors in Parallel Mining of Frequent Itemsets

Model for Load Balancing on Processors in Parallel Mining of Frequent Itemsets American Journal of Applied Sciences 2 (5): 926-931, 2005 ISSN 1546-9239 Science Publications, 2005 Model for Load Balancing on Processors in Parallel Mining of Frequent Itemsets 1 Ravindra Patel, 2 S.S.

More information

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Improving the Efficiency of Fast Using Semantic Similarity Algorithm International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year

More information

Maintenance of Generalized Association Rules for Record Deletion Based on the Pre-Large Concept

Maintenance of Generalized Association Rules for Record Deletion Based on the Pre-Large Concept Proceedings of the 6th WSEAS Int. Conf. on Artificial Intelligence, Knowledge Engineering and ata Bases, Corfu Island, Greece, February 16-19, 2007 142 Maintenance of Generalized Association Rules for

More information

620 HUANG Liusheng, CHEN Huaping et al. Vol.15 this itemset. Itemsets that have minimum support (minsup) are called large itemsets, and all the others

620 HUANG Liusheng, CHEN Huaping et al. Vol.15 this itemset. Itemsets that have minimum support (minsup) are called large itemsets, and all the others Vol.15 No.6 J. Comput. Sci. & Technol. Nov. 2000 A Fast Algorithm for Mining Association Rules HUANG Liusheng (ΛΠ ), CHEN Huaping ( ±), WANG Xun (Φ Ψ) and CHEN Guoliang ( Ξ) National High Performance Computing

More information

Medical Data Mining Based on Association Rules

Medical Data Mining Based on Association Rules Medical Data Mining Based on Association Rules Ruijuan Hu Dep of Foundation, PLA University of Foreign Languages, Luoyang 471003, China E-mail: huruijuan01@126.com Abstract Detailed elaborations are presented

More information

MINING ASSOCIATION RULES WITH UNCERTAIN ITEM RELATIONSHIPS

MINING ASSOCIATION RULES WITH UNCERTAIN ITEM RELATIONSHIPS MINING ASSOCIATION RULES WITH UNCERTAIN ITEM RELATIONSHIPS Mei-Ling Shyu 1, Choochart Haruechaiyasak 1, Shu-Ching Chen, and Kamal Premaratne 1 1 Department of Electrical and Computer Engineering University

More information

Lecture 2 Wednesday, August 22, 2007

Lecture 2 Wednesday, August 22, 2007 CS 6604: Data Mining Fall 2007 Lecture 2 Wednesday, August 22, 2007 Lecture: Naren Ramakrishnan Scribe: Clifford Owens 1 Searching for Sets The canonical data mining problem is to search for frequent subsets

More information

Survey on Rough Set Feature Selection Using Evolutionary Algorithm

Survey on Rough Set Feature Selection Using Evolutionary Algorithm Survey on Rough Set Feature Selection Using Evolutionary Algorithm M.Gayathri 1, Dr.C.Yamini 2 Research Scholar 1, Department of Computer Science, Sri Ramakrishna College of Arts and Science for Women,

More information

Novel Materialized View Selection in a Multidimensional Database

Novel Materialized View Selection in a Multidimensional Database Graphic Era University From the SelectedWorks of vijay singh Winter February 10, 2009 Novel Materialized View Selection in a Multidimensional Database vijay singh Available at: https://works.bepress.com/vijaysingh/5/

More information

Optimization using Ant Colony Algorithm

Optimization using Ant Colony Algorithm Optimization using Ant Colony Algorithm Er. Priya Batta 1, Er. Geetika Sharmai 2, Er. Deepshikha 3 1Faculty, Department of Computer Science, Chandigarh University,Gharaun,Mohali,Punjab 2Faculty, Department

More information