Granular Computing: Models and Applications
|
|
- Suzanna Haynes
- 5 years ago
- Views:
Transcription
1 Granular Computing: Models and Applications Jianchao Han, 1, Tsau Young Lin 2, 1 Department of Computer Science, California State University, Dominguez Hills, Carson, CA Department of Computer Science, San Jose State University, CA INTRODUCTION Granular computing (GrC) is a general computing paradigm that effectively deals with elements and granules, vaguely generalized subsets. The objective of granular computing research is to build an efficient computational model for handling huge amounts of data, information, and knowledge. The terminology of granular computing was first proposed by Professor T. Y. Lin in as a label of family of theories, methodologies, and techniques that make use of granules, although its basic ideas and principles have been studied in various application domains for a long time. Especially in the form of partitions, the theory has been accumulated for thousands of years in mathematics. So the focus of GrC is on the nonpartition models. Let us first recall some results and thoughts in the pre-grc era, namely before the terms was invented. The explicit study of granular computing can be dated back to the late 1970s. In 1979, Zadeh 2 introduced the notion of information granulation and suggested that fuzzy set theory might find potential applications in this respect. Although we address the nonpartition theories, nevertheless, the partition case was the main source of inspiration. In 1982, Pawlak 3 proposed rough set theory to deal with inexact information. It is an uncertainty theory using a special form of granules, called equivalence classes. It is primarily the rough set theory (partition theory) that causes researchers to realize the importance of the systematic study of the generalized notion, GrC. In 1985, Hobbes 4 presented a theory of granularity as the base of knowledge representation, abstraction, heuristic search, and reasoning. In his theory the problem world is represented as various grains and only interesting ones are abstracted to learn concepts. The conceptualization of the world can be performed at different granularities and switched between them. Even though his discussion mainly focused on the partition cases, his model is more general than rough sets. It includes reflexive and symmetric binary relations. Author to whom all correspondence should be addressed: jhan@csudh.edu. tylin@cs.sjsu.edu. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, VOL. 25, (2010) C 2009 Wiley Periodicals, Inc. Published online in Wiley InterScience (
2 112 HAN AND LIN In , from the approximation retrieval, Lin 5 introduced the notion of neighborhood systems as models of uncertainty; a neighborhood is a unit of uncertainty. Its mathematics originates from topology, called topological neighborhood systems. It attaches to every point p a collection of subsets that satisfy a set of axioms (the axioms of topology). Each such subset is called a neighborhood of p, and the point p is called a center of the neighborhood. Lin removed the axioms and extended the theory. This extended notion is quite general, for example, 1. The family of α-cuts of a fuzzy set is a neighborhood system of a real set. A real set consists of those points whose memberships are exactly one A partition (equivalence relation) is a topological neighborhood system and forms a very special case of topological spaces, called Clopen space (or Pawlak topological space) A binary relation is a special form of neighborhood systems, where each point contains at most one neighborhood. In 1989, Lin 7 used the neighborhood as a unit of basic knowledge (a list of foes) and applied it to computer security; see also Ref A covering ={X i i = 1, 2,...} is an open neighborhood system. 9 Each X i is a neighborhood of every point in it. In other words, every point in the neighborhood is a center point. In 1992, Giunchigalia and Walsh 10 presented a theory of abstraction to improve the conceptualization of granularities. In 1996, Lin, based on Zadeh s granular mathematics (GrM), proposed the term granular computing and formed a special interesting group. In Refs. 11, 12, Zadeh outlined his views of GrC/GrM. Since then, granular computing has received more and more attentions and much research has been conducted in various aspects of this area and has begun to play important roles in various fields, such as approximate retrieval, machine learning, data mining, bioinformatics, e-business, computer security, control, highperformance computing, and wireless mobile computing. 5,7,13 23 In this special issue, we collect six articles to reflect the developments and applications of some special models/views of granular computing, including lattice model, rough set model, association analysis model, and classification model. 2. MODELS AND APPLICATIONS Granular computing has been a shifting paradigm; various views have been proposed. Informally, some computing theories and models that deal with granules may be called granular computing or softer version of granular computing, where granules are generalized subsets, which are regarded as given basic knowledge. In 1980s, although many AI and database/knowledge researchers, including Japanese Fifth generation computing project, had proposed many complex knowledge models, a simpler and effectively computable view of knowledge is in need. Rough set theory takes a courageous step and assumes that partitions (classifications) are essence of human knowledge and focuses on the development of knowledge engineering and uncertainty managements, at this level. The developments have been rewarded with great success. Intuitively, elements are the data, and granules are the (units of) basic knowledge or lack of knowledge (uncertainty). So granular computing
3 GRANULAR COMPUTING: MODELS AND APPLICATIONS 113 provides the infrastructure for data and knowledge engineering and uncertainty management, or more generally AI-engineering. Recently, Lin proposed a set of examples, including Zadeh s intuitive view of GrC, to define GrC implicitly and a category theory based GrC model to define GrC formally. 24,25 In his model, there are two forms, commutative and noncommutative, of granules. For example, the family of ordered keyword sets (text is a linearly ordered words), a collection of committees in social networks (each member has distinct roles), and a collection of tuples (in relations) are collections of noncommutative granules. The neighborhoods are commutative granules. Hierarchy theory 26 and lattice model have been applied to construct and formalize concept hierarchies. In an article of this issue, The Design and Application of Structured Types in Ptolemy II, Zhao, Xiong, Lee, Liu, and Zhong organize all base data types such as integer, double, Boolean, char into a lattice to model subtyping relations among them, and type constraints in components and across components are described over the lattice as inequalities that can be solved efficiently. Structured types including arrays, records, and unions can be added to the base type lattice. Authors expose and discuss some technique challenges of adding structured types, especially the infiniteness of the lattice, recursive structured types, and the inequality constraints on structured types. They suggest answers to these challenging questions and propose corresponding solutions, which are applied to design and model a simplified charity organization and wireless protocols based on IEEE media access control and physical specifications. According to Agrawal et al., 27 association rules mining from a large database of customer transactions that consist of items purchased by customers is to find significant associations between items such that which items are always or often bought together with which items. These significant associations are described as a set of associations and quantitatively measured with support and confidence. Various algorithms to mining associations have been developed The problem of mining association rules can be reduced to GrC. 5,15,16 Qiu, Chen, Liu, and Huang present a Granular Computing Approach to Finding Association Rules in Relational Database in this special issue. Elementary granules are defined in two aspects: intension and extension, where the intension is an attribute-value pair, while the extension is the collection of objects (records in a database table) that satisfy the intension. All elementary granules are generated by scanning a relational database table and stored in the elementary granule table in memory. The elementary granule table contains all elementary granules as 3-tuples consisting of number of objects, intension, and a pointer to the linked list of objects (extension) contained. Frequent 1-itemsets can be checked out from the elementary granule table to form a frequent 1-itemset granule table. By keeping attributes in order, frequent 2-itemsets are generated from frequent 1-itemsets and stored in frequent 2-itemset granule tables, frequent 3-itemsets from frequent 2-itemsets and in frequent 3-itemset granule tables, and so on. Generally, frequent k-itemsets are generated by combining two frequent (k 1)-itemsets from the different nodes of the same linked list if the combination satisfies the support threshold. Corresponding algorithms are described and illustrated with a simplified example and a testing data set. They claim that the algorithms can reduce the number of candidate itemsets and save the computing time.
4 114 HAN AND LIN From some sense, Qiu et al. take advantages of rough set computing model to define the elementary granules as the partitions based on attribute-value pairs, which are the special cases of indiscernibility equivalent relations extensively used in rough set theory. In another article, A Method of Discovering Important Rules Using Rules as Attributes, Li and Cercone apply rough set theory to find significant and important association rules. First, they use existing reduct generation algorithm based on rough set theory to find attribute reducts from the original data set and then generate a set of association rules in terms of each reduct using the classic Apripri algorithm. The resulting association rules are in the form such that the antecedents of a rule is from the value of condition attributes in a reduct, and the consequents of a rule are from the value of decision attributes from the original data set. Since reducts contain the most representative and important condition attributes of a decision table, they assume that rules extracted based on these reducts are representative of the original decision table and therefore are considered more important than the rules generated without using reducts. With this intuition, the rules generated from these reducts are used to construct a decision table, with each of the individual rules being a condition attribute and decision attributes being kept the same. The reduct extracted from such a decision table would contain representative and important attributes, which are the association rules. The reduct generation algorithm is, in turn, applied to this newly constructed decision table, and the result is the reduct consisting of a set of rules that are most important. In their article, Mining Hidden Connections among Biomedical Concepts from Disjoint Biomedical Literature Sets through Semantic-Based Association Rule, Hu, Zhang, Li, Yoo, Zhou, Xu, and Wu adapt the association rule mining approach to automatically identify implicit novel connections among biomedical concepts from disjoint biomedical literature sets. The adapted rules are called semantic-based association rules. The idea behind is described as follows: Assume that three biomedical concepts A, B, and C occur in biomedical literature, where concepts A and C occur in two disjoint sets of documents, but B co-occurs with A and C in some documents. If both A is associated with B and B is associated with C, we have rationales to expect that A and C may have some association or correlation. To find such kind of associations, the authors first apply association rule mining algorithm to the two disjoint sets of biomedical literature separately to generate two sets of association rules in the implication form of A B from the first set of documents and B C from the second set of documents, and then apply the transitive law to conclude the novel implication A C. Owing to the characteristics of association rules which are not actually logical implications, this transitive operation may be too weak to guarantee the association between A and C in the sense of association rules based on the support-confidence framework. On the other hand, this simple application of transition generates a huge number of possible connections among the millions of biomedical concepts and many of these hypothetical connections are spurious, useless, and/or biologically meaningless. To get the problem around, Hu et al. develop a new approach, called biomedical semantic-based association rule system, or Bio-SARS for short, to generate highly likely novel and biologically relevant connections among the biomedical concepts. With semantic filtering,
5 GRANULAR COMPUTING: MODELS AND APPLICATIONS 115 Bio-SARS can significantly reduce the number of spurious, useless, and biologically irrelevant connections, where semantic is based on biomedical anthologies such as MeSH and unified medical language system (UMLS). With a given concept C, the algorithm searches for a set of biomedical documents in which C occurs from online biomedical databases. The algorithm then extracts a set of concepts B from these documents searched and utilizes the semantic knowledge in UMLS to check the semantic type of each concept in B with that of C. All concepts in B that have different semantic types from C are filtered out, and those remaining concepts in B that have strong enough associations with C are selected and used as given concepts to find another set of concepts A in the same way such that all concepts in A co-occur with concepts in B in some documents but never co-occur in any documents with C. After concepts in A that have week associations with concepts in B are removed, the remaining concepts in A are verified to have strong associations with the concept C. Their experiments demonstrate that the discovered associations among biomedical concepts are novel and can be useful for domain expert to conduct new experiment, try new treatment, and so forth. Some authors think classification and clustering analysis as the same model of granular computing, but views are different. Clustering in data mining is partitioning a data set by the similarity. In other words, it is to transform a granulation (granulated by similarity) into partition: see Ref. 9 for the notion of induced partition. Classification problem is, given a training set of examples with class labels, to construct a classifier that is able to assign a class label to a new example without class label that is not in the training set. All examples are described with a given set of features (attributes). It is one form of partitioning. In both models, the granules are groups of the given set of examples, and the granulation is to partition the set of examples into groups. In this special issue, an article Ranking and Selecting Terms for Text Categorization via SVM Discriminate Boundary is presented by Kuo and Yajima. They utilize support vector machines (SVM) to discriminate boundary between classes and rank and select terms for text categorization in document classification. To classify documents into predefined categories, documents are converted into vectors that are composed of a list of words that occur in documents, and then document classification is performed on these document vectors. However, since documents usually contain too many words that are redundant, it is mandate to select most significant, discriminant, and representative words to represent corresponding documents. Currently dominating approaches based on LSI (latent semantic indexing) 9 and χ 2 statistics values are commonly used in document classification and retrieval research, but bear some weakness. The authors exploit several properties of the SVM with RBF (radial basis function) kernel functions to calculate the data points that lie on the nonlinear discriminant boundary and show that these data points, as well as their gradient vectors, can be calculated efficiently only by the elementary matrix and vector calculation. Gradient vectors on the boundary are defined and the contribution of each word to the document is given by the diagonal of the associated decision boundary feature matrix (DBFM) generated from the collection of these gradient vectors. The discriminant and significant words can be ranked and selected according to the words contributions. The authors also demonstrate with
6 116 HAN AND LIN experiments of real-world data sets that the method proposed in the paper has much better performance than existing approaches. Finally, we come to the paper A Novel Clustering Algorithm Using Hypergraph Based Granular Computing. The authors, Liu, Liao, Yang, and He, apply granular computing and hypergraph 9 in clustering a set of documents. Conceptwise, this article presents a very interesting view on partitioning a granulated space into a partition. Hypergraph (or simplicial complex) itself is a granular model; vertices are the underlying universe, and hyperedges (simplexes) are the granules. Basically, the approach extracts the frequent item sets via association rule mining algorithm, and these frequent item sets form hyperedges (simplexes) in a hypergraph (simplicial complex). Then the multilevel hypergraph partitioning algorithm is used to partition the hypergraph into k parts. Two criteria fitness and connectivity to prune the bad clusters and bad vertices are defined. Experiments of this method have been conducted with various data sets, and the results are compared between different data sets in terms of the clusters entropy and the algorithm response time. References 1. Lin TY. Granular computing. Announcement of the BISC Special Interest Group on Granular Computing, Zadeh LA. Fuzzy sets and information granularity. In: Gupta M, Ragade R and Yager R, editors. Advances in fuzzy set theory and applications. Amsterdam: North-Holland; pp Pawlak Z. Rough sets. Int J Comput Inf Sci 1982;11: Hobbs JR. Granularity. In: Proc of the 9th Int Joint Conf on Artificial Intelligence, pp Lin TY. Neighborhood systems and approximation in database and knowledge base systems. In: Proc of the Fourth Int Symp on Methodologies of Intelligent Systems, Poster Session, October 12 15, pp Lin TY. Topological and fuzzy rough sets. In: Slowinski R, editor. Decision support by experience application of the rough sets theory. Boston, MA: Kluwer; pp Lin TY. Chinese wall security policy an aggressive model. In: Proc of the Fifth Aerospace Computer Security Application Conf, December 4 8, pp Lin TY. Chinese wall security policy models: information flows and confining trojan horses. Proc 17th IFIP11.3 Working Conf on Database and Application Security, Estes Park, CO, August 4 6, 2003, pp Lin TY. Granular computing on binary relations I: data mining and neighborhood systems and II: rough set representations and belief functions. In:. Skowron A, Polkowski L. editors. Rough sets in knowledge discovery. Heidelberg, Germany: Physica-Verlag; pp , Giunchglia F, Walsh T. A theory of abstraction. Artif Intell 1992;56: Zadeh LA. Towards a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzzy Sets Syst 1997;19: Zadeh LA. Some reflections on soft computing, granular computing and their roles in the conception, design and utilization of information/ intelligent systems. Soft Comput 1998;2: Bargiela A, Pedrycz W. Granular computing: An introduction. Boston, MA: Kluwer; Lin TY. A rough logic formalism for fuzzy controllers: A hard and soft computing view. Int J Approx Reason 1996;15(4):
7 GRANULAR COMPUTING: MODELS AND APPLICATIONS Lin TY. Data mining: granular computing approach. Proc the Pacific-Asia Conference on Knowledge Discovery and Data Mining, Beijing, China, April 26 28, 1999, pp Lin TY. Data mining and machine oriented modeling: a granular computing approach. J Appl Intell 2000;13(2): Lin TY, Chiang I-J. A simplicial complex, a hypergraph, structure in the latent semantic space of document clustering. Int J Approxi Reason 2005;40(1 2): Pawlak Z. Granularity of knowledge, indiscernibility and rough sets. In: Proc IEEE Int Conf on Fuzzy Systems, pp Peters JF, Pawlak Z, Skowron A. A rough set approach to measuring information granules. In: Proc COMPSAC 2002, Oxford, England. pp Polkowski L, Skowron A. Towards adaptive calculus of granules, In: Proc IEEE Int Conf on Fuzzy Systems, pp Skowron A, Stepaniuk J. Information granules: towards foundations of granular computing. Int J Intell Syst 2001;16: Yao YY. Perspectives of Granular Computing, Proc. Of IEEE Int Conf on Granular Computing 1: Beijing, China 2005, pp Zhang L, Zhang B. The quotient space theory of problem solving. LNCS 2003;2639: Lin TY. Granular computing, practices, theories and future directions. In: Meyers RA, editor. Encyclopedia on complexity and systems. Berlin: Springer; pp Lin TY. Granular computing I: the concept of granulation and its formal model. Int J Granular Comput Rough Sets Intell Syst 2009;1(1): Ahl V, Allen TFH. Hierarchy theory, a vision, vocabulary and epistemology. Irvington, NY: Columbia University Press; Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases. In: Proc ACM SIGMOD Conf, Washington DC; Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proc of the 20th Int Conf Very Large Data Bases, Santiago, Chile; pp Houtsma M, Swami A. Set-oriented mining for association rules in relational databases. In: Proc of the IEEE Int Conf on Data Engineering; pp Klemettinen M, Mannila H, Ronkainen P, Toivonen H, Verkamo A. Finding interesting rules from large sets of discovered association rules. In: 3rd Int Conf on Information and Knowledge Management; pp Savasere A, Omiecinski E, Navathe S. An efficient algorithm for mining association rules. In: Proc of 21th VLDB Conf, Switzerland; 1995.
Granular Computing. Y. Y. Yao
Granular Computing Y. Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca, http://www.cs.uregina.ca/~yyao Abstract The basic ideas
More informationA Generalized Decision Logic Language for Granular Computing
A Generalized Decision Logic Language for Granular Computing Y.Y. Yao Department of Computer Science, University of Regina, Regina Saskatchewan, Canada S4S 0A2, E-mail: yyao@cs.uregina.ca Churn-Jung Liau
More informationRough Sets, Neighborhood Systems, and Granular Computing
Rough Sets, Neighborhood Systems, and Granular Computing Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract Granulation
More informationModeling the Real World for Data Mining: Granular Computing Approach
Modeling the Real World for Data Mining: Granular Computing Approach T. Y. Lin Department of Mathematics and Computer Science San Jose State University San Jose California 95192-0103 and Berkeley Initiative
More informationA Logic Language of Granular Computing
A Logic Language of Granular Computing Yiyu Yao and Bing Zhou Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {yyao, zhou200b}@cs.uregina.ca Abstract Granular
More informationMining High Order Decision Rules
Mining High Order Decision Rules Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 e-mail: yyao@cs.uregina.ca Abstract. We introduce the notion of high
More informationOn Generalizing Rough Set Theory
On Generalizing Rough Set Theory Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract. This paper summarizes various formulations
More informationOn Reduct Construction Algorithms
1 On Reduct Construction Algorithms Yiyu Yao 1, Yan Zhao 1 and Jue Wang 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 {yyao, yanzhao}@cs.uregina.ca 2 Laboratory
More informationGranular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions
Granular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions Yiyu (Y.Y.) Yao 1, Churn-Jung Liau 2, Ning Zhong 3 1 Department of Computer Science, University of Regina Regina, Saskatchewan,
More informationGranular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata
Granular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata Granular computing (GrC): Outline Introduction Definitions
More informationEFFICIENT ATTRIBUTE REDUCTION ALGORITHM
EFFICIENT ATTRIBUTE REDUCTION ALGORITHM Zhongzhi Shi, Shaohui Liu, Zheng Zheng Institute Of Computing Technology,Chinese Academy of Sciences, Beijing, China Abstract: Key words: Efficiency of algorithms
More informationAn Evolutionary Algorithm for Mining Association Rules Using Boolean Approach
An Evolutionary Algorithm for Mining Association Rules Using Boolean Approach ABSTRACT G.Ravi Kumar 1 Dr.G.A. Ramachandra 2 G.Sunitha 3 1. Research Scholar, Department of Computer Science &Technology,
More informationDiscovering interesting rules from financial data
Discovering interesting rules from financial data Przemysław Sołdacki Institute of Computer Science Warsaw University of Technology Ul. Andersa 13, 00-159 Warszawa Tel: +48 609129896 email: psoldack@ii.pw.edu.pl
More informationAssociation Rules with Additional Semantics Modeled by Binary Relations
Association Rules with Additional Semantics Modeled by Binary Relations T. Y. Lin 1 and Eric Louie 2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192-0103
More informationDMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE
DMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE Saravanan.Suba Assistant Professor of Computer Science Kamarajar Government Art & Science College Surandai, TN, India-627859 Email:saravanansuba@rediffmail.com
More informationValue Added Association Rules
Value Added Association Rules T.Y. Lin San Jose State University drlin@sjsu.edu Glossary Association Rule Mining A Association Rule Mining is an exploratory learning task to discover some hidden, dependency
More informationPSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets
2011 Fourth International Symposium on Parallel Architectures, Algorithms and Programming PSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets Tao Xiao Chunfeng Yuan Yihua Huang Department
More informationMinimal Test Cost Feature Selection with Positive Region Constraint
Minimal Test Cost Feature Selection with Positive Region Constraint Jiabin Liu 1,2,FanMin 2,, Shujiao Liao 2, and William Zhu 2 1 Department of Computer Science, Sichuan University for Nationalities, Kangding
More informationFeature Selection Based on Relative Attribute Dependency: An Experimental Study
Feature Selection Based on Relative Attribute Dependency: An Experimental Study Jianchao Han, Ricardo Sanchez, Xiaohua Hu, T.Y. Lin Department of Computer Science, California State University Dominguez
More informationRough Set Approaches to Rule Induction from Incomplete Data
Proceedings of the IPMU'2004, the 10th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Perugia, Italy, July 4 9, 2004, vol. 2, 923 930 Rough
More informationA mining method for tracking changes in temporal association rules from an encoded database
A mining method for tracking changes in temporal association rules from an encoded database Chelliah Balasubramanian *, Karuppaswamy Duraiswamy ** K.S.Rangasamy College of Technology, Tiruchengode, Tamil
More informationTransforming Quantitative Transactional Databases into Binary Tables for Association Rule Mining Using the Apriori Algorithm
Transforming Quantitative Transactional Databases into Binary Tables for Association Rule Mining Using the Apriori Algorithm Expert Systems: Final (Research Paper) Project Daniel Josiah-Akintonde December
More informationInformation Granulation and Approximation in a Decision-theoretic Model of Rough Sets
Information Granulation and Approximation in a Decision-theoretic Model of Rough Sets Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S 0A2 E-mail: yyao@cs.uregina.ca
More informationMining Quantitative Association Rules on Overlapped Intervals
Mining Quantitative Association Rules on Overlapped Intervals Qiang Tong 1,3, Baoping Yan 2, and Yuanchun Zhou 1,3 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China {tongqiang,
More informationFormal Concept Analysis and Hierarchical Classes Analysis
Formal Concept Analysis and Hierarchical Classes Analysis Yaohua Chen, Yiyu Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {chen115y, yyao}@cs.uregina.ca
More informationA rule-extraction framework under multigranulation rough sets
DOI 10.1007/s13042-013-0194-0 ORIGINAL ARTICLE A rule-extraction framework under multigranulation rough sets Xin Liu Yuhua Qian Jiye Liang Received: 25 January 2013 / Accepted: 10 August 2013 Ó Springer-Verlag
More informationEfficient SQL-Querying Method for Data Mining in Large Data Bases
Efficient SQL-Querying Method for Data Mining in Large Data Bases Nguyen Hung Son Institute of Mathematics Warsaw University Banacha 2, 02095, Warsaw, Poland Abstract Data mining can be understood as a
More informationKnowledge Engineering in Search Engines
San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Spring 2012 Knowledge Engineering in Search Engines Yun-Chieh Lin Follow this and additional works at:
More informationClassification with Diffuse or Incomplete Information
Classification with Diffuse or Incomplete Information AMAURY CABALLERO, KANG YEN Florida International University Abstract. In many different fields like finance, business, pattern recognition, communication
More informationEfficient Mining of Generalized Negative Association Rules
2010 IEEE International Conference on Granular Computing Efficient Mining of Generalized egative Association Rules Li-Min Tsai, Shu-Jing Lin, and Don-Lin Yang Dept. of Information Engineering and Computer
More informationA Decision-Theoretic Rough Set Model
A Decision-Theoretic Rough Set Model Yiyu Yao and Jingtao Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 {yyao,jtyao}@cs.uregina.ca Special Thanks to Professor
More informationMathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities
Mathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities Tsau Young ( T. Y. ) Lin Department of Computer Science San Jose State University San Jose, CA
More informationDiscovery of Multi-level Association Rules from Primitive Level Frequent Patterns Tree
Discovery of Multi-level Association Rules from Primitive Level Frequent Patterns Tree Virendra Kumar Shrivastava 1, Parveen Kumar 2, K. R. Pardasani 3 1 Department of Computer Science & Engineering, Singhania
More informationData Analysis and Mining in Ordered Information Tables
Data Analysis and Mining in Ordered Information Tables Ying Sai, Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Ning Zhong
More informationDiscovery of Multi Dimensional Quantitative Closed Association Rules by Attributes Range Method
Discovery of Multi Dimensional Quantitative Closed Association Rules by Attributes Range Method Preetham Kumar, Ananthanarayana V S Abstract In this paper we propose a novel algorithm for discovering multi
More informationHierarchical Online Mining for Associative Rules
Hierarchical Online Mining for Associative Rules Naresh Jotwani Dhirubhai Ambani Institute of Information & Communication Technology Gandhinagar 382009 INDIA naresh_jotwani@da-iict.org Abstract Mining
More informationA Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set
A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set Renu Vashist School of Computer Science and Engineering Shri Mata Vaishno Devi University, Katra,
More informationTo Enhance Projection Scalability of Item Transactions by Parallel and Partition Projection using Dynamic Data Set
To Enhance Scalability of Item Transactions by Parallel and Partition using Dynamic Data Set Priyanka Soni, Research Scholar (CSE), MTRI, Bhopal, priyanka.soni379@gmail.com Dhirendra Kumar Jha, MTRI, Bhopal,
More informationGeneralized Infinitive Rough Sets Based on Reflexive Relations
2012 IEEE International Conference on Granular Computing Generalized Infinitive Rough Sets Based on Reflexive Relations Yu-Ru Syau Department of Information Management National Formosa University Huwei
More informationStructure of Association Rule Classifiers: a Review
Structure of Association Rule Classifiers: a Review Koen Vanhoof Benoît Depaire Transportation Research Institute (IMOB), University Hasselt 3590 Diepenbeek, Belgium koen.vanhoof@uhasselt.be benoit.depaire@uhasselt.be
More informationUsing Association Rules for Better Treatment of Missing Values
Using Association Rules for Better Treatment of Missing Values SHARIQ BASHIR, SAAD RAZZAQ, UMER MAQBOOL, SONYA TAHIR, A. RAUF BAIG Department of Computer Science (Machine Intelligence Group) National University
More informationAn Algorithm for Frequent Pattern Mining Based On Apriori
An Algorithm for Frequent Pattern Mining Based On Goswami D.N.*, Chaturvedi Anshu. ** Raghuvanshi C.S.*** *SOS In Computer Science Jiwaji University Gwalior ** Computer Application Department MITS Gwalior
More informationA Granular Computing Approach. T.Y. Lin 1;2. Abstract. From the processing point of view, data mining is machine
Data Mining and Machine Oriented Modeling: A Granular Computing Approach T.Y. Lin 1;2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192 tylin@cs.sjsu.edu
More informationA study on lower interval probability function based decision theoretic rough set models
Annals of Fuzzy Mathematics and Informatics Volume 12, No. 3, (September 2016), pp. 373 386 ISSN: 2093 9310 (print version) ISSN: 2287 6235 (electronic version) http://www.afmi.or.kr @FMI c Kyung Moon
More informationCombined Intra-Inter transaction based approach for mining Association among the Sectors in Indian Stock Market
Ranjeetsingh BParihar et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol 3 (3), 01,3895-3899 Combined Intra-Inter transaction based approach for mining Association
More informationApplying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems
Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems Nancy Mehta,Neera Bawa Lect. In CSE, JCDV college of Engineering. (mehta_nancy@rediffmail.com,
More informationAppropriate Item Partition for Improving the Mining Performance
Appropriate Item Partition for Improving the Mining Performance Tzung-Pei Hong 1,2, Jheng-Nan Huang 1, Kawuu W. Lin 3 and Wen-Yang Lin 1 1 Department of Computer Science and Information Engineering National
More informationROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM
ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM Pavel Jirava Institute of System Engineering and Informatics Faculty of Economics and Administration, University of Pardubice Abstract: This article
More informationMaterialized Data Mining Views *
Materialized Data Mining Views * Tadeusz Morzy, Marek Wojciechowski, Maciej Zakrzewicz Poznan University of Technology Institute of Computing Science ul. Piotrowo 3a, 60-965 Poznan, Poland tel. +48 61
More informationMining of Web Server Logs using Extended Apriori Algorithm
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
More informationA Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values
A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values Patrick G. Clark Department of Electrical Eng. and Computer Sci. University of Kansas Lawrence,
More informationMining Temporal Association Rules in Network Traffic Data
Mining Temporal Association Rules in Network Traffic Data Guojun Mao Abstract Mining association rules is one of the most important and popular task in data mining. Current researches focus on discovering
More informationDIVERSITY-BASED INTERESTINGNESS MEASURES FOR ASSOCIATION RULE MINING
DIVERSITY-BASED INTERESTINGNESS MEASURES FOR ASSOCIATION RULE MINING Huebner, Richard A. Norwich University rhuebner@norwich.edu ABSTRACT Association rule interestingness measures are used to help select
More informationA Conflict-Based Confidence Measure for Associative Classification
A Conflict-Based Confidence Measure for Associative Classification Peerapon Vateekul and Mei-Ling Shyu Department of Electrical and Computer Engineering University of Miami Coral Gables, FL 33124, USA
More informationASIAN JOURNAL OF MANAGEMENT RESEARCH Online Open Access publishing platform for Management Research
ASIAN JOURNAL OF MANAGEMENT RESEARCH Online Open Access publishing platform for Management Research Copyright 2010 All rights reserved Integrated Publishing association Review Article ISSN 2229 3795 The
More informationAvailable online at ScienceDirect. Procedia Computer Science 96 (2016 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 96 (2016 ) 179 186 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems,
More informationGraph Based Approach for Finding Frequent Itemsets to Discover Association Rules
Graph Based Approach for Finding Frequent Itemsets to Discover Association Rules Manju Department of Computer Engg. CDL Govt. Polytechnic Education Society Nathusari Chopta, Sirsa Abstract The discovery
More informationWeb page recommendation using a stochastic process model
Data Mining VII: Data, Text and Web Mining and their Business Applications 233 Web page recommendation using a stochastic process model B. J. Park 1, W. Choi 1 & S. H. Noh 2 1 Computer Science Department,
More informationRough Approximations under Level Fuzzy Sets
Rough Approximations under Level Fuzzy Sets W.-N. Liu J.T. Yao Y.Y.Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: [liuwe200, jtyao, yyao]@cs.uregina.ca
More informationA Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study
A Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study Mirzaei.Afshin 1, Sheikh.Reza 2 1 Department of Industrial Engineering and
More informationData Access Paths for Frequent Itemsets Discovery
Data Access Paths for Frequent Itemsets Discovery Marek Wojciechowski, Maciej Zakrzewicz Poznan University of Technology Institute of Computing Science {marekw, mzakrz}@cs.put.poznan.pl Abstract. A number
More informationContent Based Image Retrieval system with a combination of Rough Set and Support Vector Machine
Shahabi Lotfabadi, M., Shiratuddin, M.F. and Wong, K.W. (2013) Content Based Image Retrieval system with a combination of rough set and support vector machine. In: 9th Annual International Joint Conferences
More informationFast Discovery of Sequential Patterns Using Materialized Data Mining Views
Fast Discovery of Sequential Patterns Using Materialized Data Mining Views Tadeusz Morzy, Marek Wojciechowski, Maciej Zakrzewicz Poznan University of Technology Institute of Computing Science ul. Piotrowo
More informationGranular association rules for multi-valued data
Granular association rules for multi-valued data Fan Min and William Zhu Lab of Granular Computing, Zhangzhou Normal University, Zhangzhou 363, China. Email: minfanphd@163.com, williamfengzhu@gmail.com
More informationApproximation Theories: Granular Computing vs Rough Sets
Approximation Theories: Granular Computing vs Rough Sets Tsau Young ( T. Y. ) Lin Department of Computer Science, San Jose State University San Jose, CA 95192-0249 tylin@cs.sjsu.edu Abstract. The goal
More informationData with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction
Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction Jerzy W. Grzymala-Busse 1,2 1 Department of Electrical Engineering and Computer Science, University of
More informationCS570 Introduction to Data Mining
CS570 Introduction to Data Mining Frequent Pattern Mining and Association Analysis Cengiz Gunay Partial slide credits: Li Xiong, Jiawei Han and Micheline Kamber George Kollios 1 Mining Frequent Patterns,
More informationAn Improved Apriori Algorithm for Association Rules
Research article An Improved Apriori Algorithm for Association Rules Hassan M. Najadat 1, Mohammed Al-Maolegi 2, Bassam Arkok 3 Computer Science, Jordan University of Science and Technology, Irbid, Jordan
More informationTemporal Weighted Association Rule Mining for Classification
Temporal Weighted Association Rule Mining for Classification Purushottam Sharma and Kanak Saxena Abstract There are so many important techniques towards finding the association rules. But, when we consider
More informationAN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE
AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE Vandit Agarwal 1, Mandhani Kushal 2 and Preetham Kumar 3
More informationA Model of Machine Learning Based on User Preference of Attributes
1 A Model of Machine Learning Based on User Preference of Attributes Yiyu Yao 1, Yan Zhao 1, Jue Wang 2 and Suqing Han 2 1 Department of Computer Science, University of Regina, Regina, Saskatchewan, Canada
More informationAn Efficient Tree-based Fuzzy Data Mining Approach
150 International Journal of Fuzzy Systems, Vol. 12, No. 2, June 2010 An Efficient Tree-based Fuzzy Data Mining Approach Chun-Wei Lin, Tzung-Pei Hong, and Wen-Hsiang Lu Abstract 1 In the past, many algorithms
More informationCOMBINATION OF ROUGH AND FUZZY SETS
1 COMBINATION OF ROUGH AND FUZZY SETS BASED ON α-level SETS Y.Y. Yao Department of Computer Science, Lakehead University Thunder Bay, Ontario, Canada P7B 5E1 E-mail: yyao@flash.lakeheadu.ca 1 ABSTRACT
More informationConcurrent Processing of Frequent Itemset Queries Using FP-Growth Algorithm
Concurrent Processing of Frequent Itemset Queries Using FP-Growth Algorithm Marek Wojciechowski, Krzysztof Galecki, Krzysztof Gawronek Poznan University of Technology Institute of Computing Science ul.
More informationAn Efficient Algorithm for finding high utility itemsets from online sell
An Efficient Algorithm for finding high utility itemsets from online sell Sarode Nutan S, Kothavle Suhas R 1 Department of Computer Engineering, ICOER, Maharashtra, India 2 Department of Computer Engineering,
More informationApplying Objective Interestingness Measures. in Data Mining Systems. Robert J. Hilderman and Howard J. Hamilton. Department of Computer Science
Applying Objective Interestingness Measures in Data Mining Systems Robert J. Hilderman and Howard J. Hamilton Department of Computer Science University of Regina Regina, Saskatchewan, Canada SS 0A fhilder,hamiltong@cs.uregina.ca
More informationImproved Frequent Pattern Mining Algorithm with Indexing
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VII (Nov Dec. 2014), PP 73-78 Improved Frequent Pattern Mining Algorithm with Indexing Prof.
More informationAN IMPROVED GRAPH BASED METHOD FOR EXTRACTING ASSOCIATION RULES
AN IMPROVED GRAPH BASED METHOD FOR EXTRACTING ASSOCIATION RULES ABSTRACT Wael AlZoubi Ajloun University College, Balqa Applied University PO Box: Al-Salt 19117, Jordan This paper proposes an improved approach
More informationMining High Average-Utility Itemsets
Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Mining High Itemsets Tzung-Pei Hong Dept of Computer Science and Information Engineering
More informationA Mining Algorithm to Generate the Candidate Pattern for Authorship Attribution for Filtering Spam Mail
A Mining Algorithm to Generate the Candidate Pattern for Authorship Attribution for Filtering Spam Mail Khongbantabam Susila Devi #1, Dr. R. Ravi *2 1 Research Scholar, Department of Information & Communication
More informationTexture Image Segmentation using FCM
Proceedings of 2012 4th International Conference on Machine Learning and Computing IPCSIT vol. 25 (2012) (2012) IACSIT Press, Singapore Texture Image Segmentation using FCM Kanchan S. Deshmukh + M.G.M
More informationA New Approach To Graph Based Object Classification On Images
A New Approach To Graph Based Object Classification On Images Sandhya S Krishnan,Kavitha V K P.G Scholar, Dept of CSE, BMCE, Kollam, Kerala, India Sandhya4parvathy@gmail.com Abstract: The main idea of
More informationOpen Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing Environments
Send Orders for Reprints to reprints@benthamscience.ae 368 The Open Automation and Control Systems Journal, 2014, 6, 368-373 Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:
IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T
More informationSEQUENTIAL PATTERN MINING FROM WEB LOG DATA
SEQUENTIAL PATTERN MINING FROM WEB LOG DATA Rajashree Shettar 1 1 Associate Professor, Department of Computer Science, R. V College of Engineering, Karnataka, India, rajashreeshettar@rvce.edu.in Abstract
More informationPattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42
Pattern Mining Knowledge Discovery and Data Mining 1 Roman Kern KTI, TU Graz 2016-01-14 Roman Kern (KTI, TU Graz) Pattern Mining 2016-01-14 1 / 42 Outline 1 Introduction 2 Apriori Algorithm 3 FP-Growth
More informationAn Improved Algorithm for Mining Association Rules Using Multiple Support Values
An Improved Algorithm for Mining Association Rules Using Multiple Support Values Ioannis N. Kouris, Christos H. Makris, Athanasios K. Tsakalidis University of Patras, School of Engineering Department of
More informationData Mining: Concepts and Techniques. Chapter 5. SS Chung. April 5, 2013 Data Mining: Concepts and Techniques 1
Data Mining: Concepts and Techniques Chapter 5 SS Chung April 5, 2013 Data Mining: Concepts and Techniques 1 Chapter 5: Mining Frequent Patterns, Association and Correlations Basic concepts and a road
More informationChapter 4: Mining Frequent Patterns, Associations and Correlations
Chapter 4: Mining Frequent Patterns, Associations and Correlations 4.1 Basic Concepts 4.2 Frequent Itemset Mining Methods 4.3 Which Patterns Are Interesting? Pattern Evaluation Methods 4.4 Summary Frequent
More informationInfrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset
Infrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset M.Hamsathvani 1, D.Rajeswari 2 M.E, R.Kalaiselvi 3 1 PG Scholar(M.E), Angel College of Engineering and Technology, Tiruppur,
More informationModel for Load Balancing on Processors in Parallel Mining of Frequent Itemsets
American Journal of Applied Sciences 2 (5): 926-931, 2005 ISSN 1546-9239 Science Publications, 2005 Model for Load Balancing on Processors in Parallel Mining of Frequent Itemsets 1 Ravindra Patel, 2 S.S.
More informationImproving the Efficiency of Fast Using Semantic Similarity Algorithm
International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year
More informationMaintenance of Generalized Association Rules for Record Deletion Based on the Pre-Large Concept
Proceedings of the 6th WSEAS Int. Conf. on Artificial Intelligence, Knowledge Engineering and ata Bases, Corfu Island, Greece, February 16-19, 2007 142 Maintenance of Generalized Association Rules for
More information620 HUANG Liusheng, CHEN Huaping et al. Vol.15 this itemset. Itemsets that have minimum support (minsup) are called large itemsets, and all the others
Vol.15 No.6 J. Comput. Sci. & Technol. Nov. 2000 A Fast Algorithm for Mining Association Rules HUANG Liusheng (ΛΠ ), CHEN Huaping ( ±), WANG Xun (Φ Ψ) and CHEN Guoliang ( Ξ) National High Performance Computing
More informationMedical Data Mining Based on Association Rules
Medical Data Mining Based on Association Rules Ruijuan Hu Dep of Foundation, PLA University of Foreign Languages, Luoyang 471003, China E-mail: huruijuan01@126.com Abstract Detailed elaborations are presented
More informationMINING ASSOCIATION RULES WITH UNCERTAIN ITEM RELATIONSHIPS
MINING ASSOCIATION RULES WITH UNCERTAIN ITEM RELATIONSHIPS Mei-Ling Shyu 1, Choochart Haruechaiyasak 1, Shu-Ching Chen, and Kamal Premaratne 1 1 Department of Electrical and Computer Engineering University
More informationLecture 2 Wednesday, August 22, 2007
CS 6604: Data Mining Fall 2007 Lecture 2 Wednesday, August 22, 2007 Lecture: Naren Ramakrishnan Scribe: Clifford Owens 1 Searching for Sets The canonical data mining problem is to search for frequent subsets
More informationSurvey on Rough Set Feature Selection Using Evolutionary Algorithm
Survey on Rough Set Feature Selection Using Evolutionary Algorithm M.Gayathri 1, Dr.C.Yamini 2 Research Scholar 1, Department of Computer Science, Sri Ramakrishna College of Arts and Science for Women,
More informationNovel Materialized View Selection in a Multidimensional Database
Graphic Era University From the SelectedWorks of vijay singh Winter February 10, 2009 Novel Materialized View Selection in a Multidimensional Database vijay singh Available at: https://works.bepress.com/vijaysingh/5/
More informationOptimization using Ant Colony Algorithm
Optimization using Ant Colony Algorithm Er. Priya Batta 1, Er. Geetika Sharmai 2, Er. Deepshikha 3 1Faculty, Department of Computer Science, Chandigarh University,Gharaun,Mohali,Punjab 2Faculty, Department
More information