Association Rules with Additional Semantics Modeled by Binary Relations
|
|
- Asher Roberts
- 5 years ago
- Views:
Transcription
1 Association Rules with Additional Semantics Modeled by Binary Relations T. Y. Lin 1 and Eric Louie 2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California tylin@cs.sjsu.edu 2 IBM Almaden Research Center 650 Harry Road, San Jose, CA ewlouie@almaden.ibm.com Abstract. This paper continues the study of mining patterns from the real world data. Association rules that respects the semantics modeled by binary relations are called binary semantic association rules. By experiments we find that semantic computation is necessary, efficient and fruitful. It is necessary, because we find the supports of length 2 candidate is quite high in randomly generated data. It is efficient, because the checking of semantics constraints occurs only at length 2. It is fruitful the additional cost is well compensated by the saving in pruning away (non-semantic) association rules. Keywords: Binary relation, clustered(semantics)association rules 1 Introduction Relational theory assumes everything is a classical Cantor set. In other words, the interactions among real world objects are "forgotten" in the relational modeling. However, in practical database processing, some additional semantics in the attribute domains are often employed. For example, in numerical attributes, the order of numbers is often used in SQL statements. In geographical attributes some relationships, such as "near," "in the same area" are often used in data processing by human operators. Therefore these additional semantics implicitly exist in the stored database. The natural question is: Can such semantics be modeled mathematically? Fortunately the model theory of first order logic provides some answers. Model theory uses relational structure to model the real world. By taking different kind of relational structure, we can capture different level of semantics. 1. Classical relational theory: Each attribute domain is discrete, that is, no interactions among entities or attribute values are modeled. The relational structure consists of identity equivalence relations. M. Inuiguchi et al. (eds.), Rough Set Theory and Granular Computing Springer-Verlag Berlin Heidelberg 2003
2 148 T. Y. Lin 2. Binary granular relational theory: Attribute values in each domain are interacted, related or granulated by a binary relation, for example, the order in a numerical attribute. (a) A binary relation(br), B ~ U x U defines a map p E U -> B p, where Bp = {u I (p,u) E B}, This map or the collection {Bp} is called a binary neighborhood system (BNS). Bp is called the elementary neighborhood or granule of p. (b) Conversely, a BNS defines a binary relation B = Up p x Bp. (c) If the binary relation B is an equivalence relation E, the elementary neighborhood Bp is the elementary set [PJE (the equivalence class containing p) [16J. In this paper, we focus on binary granulated data. In other words, the term "additional semantics" means each attribute domain is clustered or granulated by a binary relation. 2 Databases with Additional Semantics 2.1 Machine Oriented Relational Models and Rough Structures A relation is a knowledge representation that maps each entity to a tuple of attribute values. Table 1 illustrates a knowledge representation of the universe V = {Vl,V2,V3,V4,V5}. In this view, an attribute can be regarded as a projection that maps entities to attribute values, for example in Table 1, the CITY attribute is the map, I: V -> Dom(CITY), which assigns, at every tuple, the element in the first column to the element in the last column. The family of complete inverse image 1-1 (y) forms a partition (equivalence relation). So each column (attribute) defines an equivalence relation. Table 1 gives rise to 4 named equivalence relations. Pawlak called the pair V and a finite family of equivalence relations a knowledge base. Since knowledge bases often have different meaning, so we, after naming all granules, have called it rough granular structure, or rough structure, which is a special form of binary granular structure [5J. Formally A binary granular structure consists of 4-tuple (V, U,B,C) where V is called the object space, U the data space (V and U could be the same set), B = {Bi, i = 1, 2,... n} is a finite set of crisp/fuzzy binary relation, and C is a finite set of elementary concept spaces.
3 Association Rules with Additional Semantics 149 An elementary concept space consists of meaningful name of the elementary neighborhood (subset) B~ = {u I (u,p) E Bi} V P E U. Using traditional language, an elementary concept space is an attribute domain; The name of an elementary neighborhood represents an elementary concept which traditionally is referred to as an attribute value. When V and U are identical, i.e., V = U, and the binary relations are equivalence relations, i.e., B = E, then the triple (U, E, C) = (U, U, E, C) is called a rough granular structure, or simply rough structure. Proposition 1. A relation instance is equivalent to a rough granular structure. Attribute names and attribute values are the meaningful names of equivalence relations and elementary granules ( equivalence classes) respectively. Such a view of a relation instance have been called machine oriented modeling; see Table 1, 2. v (8# SNAME Status City) Vi --> (81 Smith TWENTY Cd V2 --> (82 Jones TEN ) V3 --> (83 Blake TEN ) V4 --> (84 Clark TWENTY Cd Vs --> (8s Adams THIRTY C3) Table 1. Information Table of Suppliers; arrows and parentheses will be suppresed Equiv. Elementary Attribute Class Granule value encoded label meaningful name * 8#(*) * * 8NAME(*) * Vl,V4 8T ATU 8(10010) TWENTY V2,V3 8T ATU8(01l00) TEN Vs 8TATU8(00001) THIRTY Vi, V4 CITY(1001O) C1 V2, V3 CITY(OllOO) Vs CITY(OOOOI) C3 Table 2. Rough granular structure and machine oriented model
4 150 T. Y. Lin 2.2 Relations with Additional Semantics using Binary Relations Let BCITY be a binary relation displayed in Table 3. Then we have the elementary neighborhoods, B6[TY = {G I, G2 }, B6;-TY = {GI, G2, G3 }, Bg1'TY = {G2, G3 }. CITY CITY C1 C1 C3 C3 C1 C1 C3 C3 Table 3. "near"-binary Relation Table 1 with additional semantics defined in Table 3 defines 4 named binary relations on U. The first 3 are equivalence relations, and the last one is a binary relation Bind uced from the" near" -binary relation on the domain of CITY-attribute. The binary relation B can easily be expressed by BNS: B vt = {VI, V2, V3, V4}. BV2 = {Vl,V2,V3,V4,V5}. BV3 = {VI, V2, V3, V4, V5}. BV4 = {Vl,V2,V3,V4}. BV5 = {V2, V3, V5}. 3 Re-formulating Data Mining What is data mining? The common answer is essentially" to find the pattern in data." This is not entirely accurate. For example, we will not be interested in a rule, say, "all data are represented by 5 characters." Because this is a pattern of knowledge representation, not Real World. To show that a discovered pattern in a knowledge representation is, indeed, a pattern of Real World is a difficult problem; we need to show that the equivalent pattern also exits in other knowledge representations. So we will take the following alternative: Find the patterns in the mathematical model of Real World.
5 Association Rules with Additional Semantics 151 For relational databases, the mathematical model of Real World is the rough structure; see Table 2. If we conduct the data mining in such a structure, it is automatically a pattern of real world. In this paper, we extend this approach of relational theory to databases with additional semantics. 4 Mining Semantically In databases with additional semantics, attribute values are semantically related, so in processing any logical formula, e.g., decision or association rules, it is important that one checks the semantics. Since we use the notion of neighborhood systems (generalization of topological space), rules or patterns that respect the semantics will be termed continuous rules or patterns. We collect some standard continuous patterns [4]: Let A and B be two attributes of a relation-with-additional-semantics. Let c, d be two values of A and B respectively. Let NEIGH(c), NEIGH(d) be the respective elementary granules. It is clear that c = NAME(NEIGH(c)) and d= NAME( NEIGH(d)). Let Card(~)be the cardinal number of a set~. 1. A formula c -> d is a continuous(semantics)decision rule, if N EIGH(c) ~ N EIGH(d) continuously. 2. A formula A -> B is a continuous(semantics)universal decision rule (extensional function dependence), iff 'V c E A :3 deb such that NEIGH(c) ~ NEIGH(d) 3. A formula c -> d is a robust continuous(semantics)decision rule, if N EIGH(c) ~ N EIGH(d) and Card(Pc ) 2': threshold [9]. 4. A formula c -> d is a soft continuous(semantics)decision rule (strong rule), if NEIGH(c) is softly included in NEIGH(d), NEIGH(c)c;;'N EIGH(d) [15]. 5. Continuous(semantics)association rule: see next section 6. Weak association rule: A pair (c, d) is an association rules, if Card (NEIGH(c) n NEIGH(d)) 2': threshhold. We will illustrate the continuous(semantics)decision rules only; see next section for association rules. c -> d is a continuous(semantics)decision rule if an attribute value in NEIGH(c) appears in a tuple, it must imply that an attribute value in NEIGH(d) also appears. So to check If STATUS = TEN, then CITY =." One needs to scan through the two columns in Table 1 and check if "TEN" ("TEN = NEIGH("TEN)) is continuously associated with NEIGH()." In machine oriented model, the same fact can be checked by the inclusion of two elementary granules, namely, "TEN" nn EIGH( )= ST ATUS(010000) ncity(11111)= (010000) n(11111) = (01100)= ST ATUS(010000) = "TEN"
6 152 T. Y. Lin 5 Semantic Association Rules We will call any pattern or rule that respects the semantics a continuous (semantic) pattern or continuous (semantic) rule. Let c and d be two attribute values in a relation Definition Continuous (semantic) Association rules 1. A pair (c, d) is said to be in a relation or database, if it is a sub-tuple of a tuple that belongs to a relation or database. 2. A pair (c, d) in a given relation is one-way (c d) continuous (semantic) if every x E Be there is at least one y E Bd such that (x,y) is in the given relation. 3. A pair (c, d) in a given relation is a two way continuous (or semantic) if (c d) and (d c) are both continuous. 4. Continuous (semantic) association rule: A pair (c, d) is an association rule iff the pair is an association rule and two way continuous. 5. Two continuous pairs, (Cl,) and (,C3) compose into a continuous pair (Cl, C3)' In particular, composition of continuous association rules is continuous association rule. 6. Soft association rule: A pair (c, d) is a soft association rule, if Card (NEIGH(c) n NEIGH(d))?:: threshhold. [4], [5] Here is some of our experimental results: see Table 4, 5, 6. Some comments on algorithms and data: The table has rows and 16 columns; we require the support to be items. The algorithm is restricted to use 10 mega bytes of main memory, so it is reasonably scalable. It checks one way continuous. The first column represent the length of combinations (of candidates). A q-combination (combination of length q) exits in the database is called q itemset. The second column is the all possible combinations. q-combination is a join of two (q - I)-association rules. The third column is the support count; the fourth is the number of association rules. An q-association rule is a q-combination that meets the support requirement. The fifth column is the time needed to generate the results of next row. 5.1 Randomly generated data In this experiment, the data is generated randomly. So each individual data is totally independent. Yet, we still find some highly supported item set of length 2. The computation implies that frequency itself is not an "adequate criterion" for meaningful patterns. In other words, digging into deeper semantics seems necessary.
7 Length Cand Supp Rules 8-time Association Rules with Additional Semantics 153 Comment 0 0 O.OOOs Start s # of candidates: 99 I-combinations O.OOOs # of association rules: 62 I-combinations O.OlOs # of candidates: combinations s # of supported: 44 2-combinations 0 O.OOOs # of association rules: 0 2-combinations 0 O.OOOs # of association rules: 0 2-combinations O.OOOs Complete s Totals Table 4. Even randomly generated data still has length 2 rules 5.2 Semantically generated data In this experiment, an association rule of length 16 is embedded in the randomly generated data. Then the data is generated as follows: In generating first column, the algorithm randomly generate a data, if this data is the selected one, then based on the assigned probability, the algorithm choose the selected one in next column or randomly generated another data. The selected rules is randomly embedded in the data. Instead of traditional apriori algorithm, we use granular computing [13J to find the association rules. The result is in Table Semantic/continuous association rules - Data with neighborhoods In this experiment, an interval (neighborhood) is selected systematically for each element. The interval slides up and down like sign curve. The detail is not important, but the selection reflect some semantic is imposed on the random data. Note that compositions of continuous association rules of length 2 are continuous. So the continuity checking is unnecessarily once passes length 2. The pruning occurs only at length 2; this is reflected in the experiments. Look at the last rows of Tabel 5 and Tabel 6 to see the time saved. 6 Conclusion Here are our observations 1. Classical relation is a knowledge representation, while granular structure is the mathematical model of real world. 2. Granular computing, then, mines the patterns from real world, not its representation.
8 154 T. Y. Lin Length Cand Supp Rules 6-time Comment 0 0 O.OOOs Start s # of candidates: 152 l-combinatiuns O.OOOs # of assoc. rules: 16 I-combinations O.OOOs # of candidates: combinations s # of supports: combinations 120 O.OOOs # of assoc. rules: combinations O.OOOs # of candidates: combinations s # of supports: combinations 560 O.OOOs # of assoc. rules: combinations s # of candidates: combinations s # of supports: combinations 1820 O.OOOs # of assoc. rules: combinations s # of candidates: combinations s # of supports: combinations 4368 O.OOOs # of assoc. rules: combinations s # of candidates: combinations s 8008 O.OOOs # of supports: combinations # of assoc. rules: combinations s # of candidates: combinations s # of supports: combinations O.OOOs # of assoc. rules: combinations s # of candidates: combinations s # of supports: combinations O.OOOs # of assoc. rules: combinations s # of candidates: combinations s # of supports: combinations O.OOOs # of assoc. rules: combinations s # of candidates: 8008 lo-combinations s # of supports: 8008 lo-combinations 8008 O.OOOs # of assoc. rules: 8008 lo-combinations s # of candidates: 4368 II-combinations s # of supports: 4368 ll-combinations 4368 O.OOOs # of assoc. rules: 4368 II-combinations s # of candidates: combinations s # of supports: combinations 1820 O.OOOs # of assoc. rules: combinations s # of candidates: combinations s # of supports: combinations 560 O.OOOs # of assoc. rules: combinations s # of candidates: combinations s # of supports: combinations 120 O.OOOs # of assoc. rules: combinations O.OOOs # of candidates: combinations s # of supports: combinations 16 O.OOOs # of assoc. rules: combinations 16 1 O.OOOs # of candidates: 1 16-combinations Os # of supports: 1 16-combinations 1 O.OOOs # of assoc. rules: 1 16-combinations O.OOOs Complete s Totals Table 5. "Semantically" generated data-association rule is expensive
9 Length Cand Supp Rules 8-time Association Rules with Additional Semantics 155 Comment 0 0 O.OOOs Start s # of candidates: 152 I-combinations O.OOOs # of assoc. rules: 16 I-combinations s # of candidates: combinations s # of supports: combinations O.OOOs # of assoc. rules: 54 2-combinations O.OOOs # of candidates: combinations s # of supports: combinations 118 O.OOOs # of assoc. rules: combinations s # of candidates: combinations s # of supports: combinations 171 O.OOOs # of assoc. rules: combinations O.OlOs # of candidates: combinations s # of supports: combinations 166 O.OOOs # of assoc. rules: combinations O.OlOs # of candidates: combinations s # of supports: combinations 106 O.OOOs # of assoc. rules: combinations 7 43 O.OOOs # of candidates: 43 7-combinations s # of supports: 43 7-combinations 43 O.OOOs # of assoc. rules: 43 7-combinations 8 10 O.OOOs # of candidates: 10 8-combinations. 10 O.OlOs # of supports: 10 8-combinations 10 O.OOOs # of assoc. rules: 10 8-combinations 9 1 O.OOOs 1 O.OOOs # of candidates: 1 9-combinations. # of supports: 1 9-combinations 1 O.OOOs # of assoc. rules: 1 9-combinations 10 0 O.OOOs # of candidates: 0 lo-combinations. 0 0 O.OOOs # of assoc. rules: 0 10-combinations O.OOOs Complete s Totals Table 6. Data with neighborhoods- semantic rule cost is inexpensive 3. Granular computing was shown to be faster than traditional data mining [13],[14]. Now, we apply to databases with additional semantic. 4. In granular computing the cost of checking additional semantics is well compensated by pruning away non-semantic rules. References 1. Agrawal, R., R. Srikant, "Fast Algorithms for Mining Association Rules," in Proceeding of 20th VLDB Conference San Tiago, Chile, 1994.
10 156 T. Y. Lin 2. \V. Chu and Q. Chen, "Neighborhood and associative query answering," Journal of Intelligent Information Systems, vol 1, , K. Engesser, Some connections between topological and Modal Logic, Mathematical Logic Quarterly, 41, 49-64, T. Y. Lin, "Data Mining and Machine Oriented Modeling: A Granular Computing Approach," Journal of Applied Intelligence, Kluwer, Vol. 13,No 2, September/October,2000, pp T. Y. Lin, "Data Mining: Granular Computing Approach." In: Methodologies for Knowledge Discovery and Data Mining, Lecture Notes in Artificial Intelligence 1574, Third Pacific-Asia Conference, Beijing, April 26-28, 1999, T. Y. Lin," Granular Computing: Fuzzy Logic and Rough Sets. " In: Computing with words in information/intelligent systems, L.A. Zadeh and J. Kacprzyk (eds), Springer-Verlag, , T. Y. Lin, "Granular Computing on Binary Relations I: Data Mining and Neighborhood Systems." In: Rough Sets In Knowledge Discovery, A. Skoworn and L. Polkowski (eds), Springer-Verlag, 1998, T. Y. Lin, "Granular Computing on Binary Relations II: Rough Set Representations and Belief Functions." In: Rough Sets In Knowledge Discovery, A. Skoworn and L. Polkowski (eds), Springer-Verlag, 1998, T. Y. Lin, "Rough Set Theory in Very Large Databases," Symposium on Modeling, Analysis and Simulation, CESA'96 IMACS Multi Conference (Computational Engineering in Systems Applications), Lille, France, July 9-12, 1996, Vol. 2 of 2, T. Y. Lin, " Neighborhood Systems and Approximation in Database and Knowledge Base Systems," Proceedings of the Fourth International Symposium on Methodologies of Intelligent Systems, Poster Session, October 12-15, pp , T. Y. Lin, "Topological Data Models and Approximate Retrieval and Reasoning," in: Proceedings of 1989 ACM Seventeenth Annual Computer Science Conference, February 21-23, Louisville, Kentucky, 1989, T. Y. Lin,"Neighborhood Systems and Relational Database". Abstract, Proceedings of CSC '88, February, 1988, pp Eric Louie and T.Y. Lin, "Finding Association Rules using Fast Bit Computation: Machine-Oriented Modeling." In: Proceeding of 12th International Symposium ISMIS2000, Charlotte, North Carolina, Oct 11-14, Lecture Notes in AI T. Y. Lin and E. Louie, "A Data Mining Approach using Machine Oriented Modeling: Finding Association Rules using Canonical Names.". In: Proceeding of 14th Annual International Symposium Aerospace/Defense Sensing, Simulation, and Controls, SPIE Vol 4057, Orlando, April 24-28, 2000, pp T. Y. Lin, and Y.Y. Yao "Mining Soft Rules Using Rough Sets and Neighborhoods." In: Symposium on Modeling, Analysis and Simulation, IMACS Multiconference (Computational Engineering in Systems Applications), Lille, France, July 9-12, 1996, Vol. 2 of 2, Z. Pawlak, Rough sets. Theoretical Aspects of Reasoning about Data, Kluwer Academic Publishers, W. Sierpenski and C. Krieger, General Topology, University of Toronto Press 1952.
Modeling the Real World for Data Mining: Granular Computing Approach
Modeling the Real World for Data Mining: Granular Computing Approach T. Y. Lin Department of Mathematics and Computer Science San Jose State University San Jose California 95192-0103 and Berkeley Initiative
More informationSemantics Oriented Association Rules
Semantics Oriented Association Rules Eric Louie BM Almaden Research Center 650 Harry Road, San Jose, CA 95 120 ewlouie@almaden.ibm.com Abstract - t is well known that relational theory carries very little
More informationA Granular Computing Approach. T.Y. Lin 1;2. Abstract. From the processing point of view, data mining is machine
Data Mining and Machine Oriented Modeling: A Granular Computing Approach T.Y. Lin 1;2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192 tylin@cs.sjsu.edu
More informationMathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities
Mathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities Tsau Young ( T. Y. ) Lin Department of Computer Science San Jose State University San Jose, CA
More informationRough Sets, Neighborhood Systems, and Granular Computing
Rough Sets, Neighborhood Systems, and Granular Computing Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract Granulation
More informationApproximation Theories: Granular Computing vs Rough Sets
Approximation Theories: Granular Computing vs Rough Sets Tsau Young ( T. Y. ) Lin Department of Computer Science, San Jose State University San Jose, CA 95192-0249 tylin@cs.sjsu.edu Abstract. The goal
More informationQualitative Fuzzy Sets and Granularity
Qualitative Fuzzy Sets and Granularity T. Y. Lin Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192-0103 E-mail: tylin@cs.sjsu.edu and Shusaku Tsumoto
More informationA Generalized Decision Logic Language for Granular Computing
A Generalized Decision Logic Language for Granular Computing Y.Y. Yao Department of Computer Science, University of Regina, Regina Saskatchewan, Canada S4S 0A2, E-mail: yyao@cs.uregina.ca Churn-Jung Liau
More informationMining High Order Decision Rules
Mining High Order Decision Rules Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 e-mail: yyao@cs.uregina.ca Abstract. We introduce the notion of high
More informationOn Generalizing Rough Set Theory
On Generalizing Rough Set Theory Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract. This paper summarizes various formulations
More informationA Logic Language of Granular Computing
A Logic Language of Granular Computing Yiyu Yao and Bing Zhou Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {yyao, zhou200b}@cs.uregina.ca Abstract Granular
More informationValue Added Association Rules
Value Added Association Rules T.Y. Lin San Jose State University drlin@sjsu.edu Glossary Association Rule Mining A Association Rule Mining is an exploratory learning task to discover some hidden, dependency
More informationEfficient SQL-Querying Method for Data Mining in Large Data Bases
Efficient SQL-Querying Method for Data Mining in Large Data Bases Nguyen Hung Son Institute of Mathematics Warsaw University Banacha 2, 02095, Warsaw, Poland Abstract Data mining can be understood as a
More informationFormal Concept Analysis and Hierarchical Classes Analysis
Formal Concept Analysis and Hierarchical Classes Analysis Yaohua Chen, Yiyu Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {chen115y, yyao}@cs.uregina.ca
More informationAttribute (Feature) Completion The Theory of Attributes from Data Mining Prospect
Attribute (Feature) Completion The Theory of Attributes from Data Mining Prospect Tsay Young ( T. Y. ) Lin Department of Computer Science San Jose State University San Jose, CA 95192, USA tylin@cs.sjsu.edu
More informationA Set Theory For Soft Computing A Unified View of Fuzzy Sets via Neighbrohoods
A Set Theory For Soft Computing A Unified View of Fuzzy Sets via Neighbrohoods T. Y. Lin Department of Mathematics and Computer Science, San Jose State University, San Jose, California 95192-0103, and
More informationGranular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions
Granular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions Yiyu (Y.Y.) Yao 1, Churn-Jung Liau 2, Ning Zhong 3 1 Department of Computer Science, University of Regina Regina, Saskatchewan,
More informationRough Set Approaches to Rule Induction from Incomplete Data
Proceedings of the IPMU'2004, the 10th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Perugia, Italy, July 4 9, 2004, vol. 2, 923 930 Rough
More informationGranular Computing. Y. Y. Yao
Granular Computing Y. Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca, http://www.cs.uregina.ca/~yyao Abstract The basic ideas
More informationGranular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata
Granular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata Granular computing (GrC): Outline Introduction Definitions
More informationA Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values
A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values Patrick G. Clark Department of Electrical Eng. and Computer Sci. University of Kansas Lawrence,
More informationGeneralized Infinitive Rough Sets Based on Reflexive Relations
2012 IEEE International Conference on Granular Computing Generalized Infinitive Rough Sets Based on Reflexive Relations Yu-Ru Syau Department of Information Management National Formosa University Huwei
More informationMining High Average-Utility Itemsets
Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Mining High Itemsets Tzung-Pei Hong Dept of Computer Science and Information Engineering
More informationGranular Computing on Binary Relations In Data Mining and Neighborhood Systems
Granular Computing on Binary Relations In Data Mining and Neighborhood Systems T. Y. Lin Department of Mathematics and Computer Science San Jose State University San Jose, California 95192-0103 And Department
More informationKnowledge Engineering in Search Engines
San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Spring 2012 Knowledge Engineering in Search Engines Yun-Chieh Lin Follow this and additional works at:
More informationA Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set
A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set Renu Vashist School of Computer Science and Engineering Shri Mata Vaishno Devi University, Katra,
More informationWeb page recommendation using a stochastic process model
Data Mining VII: Data, Text and Web Mining and their Business Applications 233 Web page recommendation using a stochastic process model B. J. Park 1, W. Choi 1 & S. H. Noh 2 1 Computer Science Department,
More informationMolodtsov's Soft Set Theory and its Applications in Decision Making
International Journal of Engineering Science Invention ISSN (Online): 239 6734, ISSN (Print): 239 6726 Volume 6 Issue 2 February 27 PP. 86-9 Molodtsov's Soft Set Theory and its Applications in Decision
More informationData with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction
Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction Jerzy W. Grzymala-Busse 1,2 1 Department of Electrical Engineering and Computer Science, University of
More informationInduction of Strong Feature Subsets
Induction of Strong Feature Subsets Mohamed Quafafou and Moussa Boussouf IRIN, University of Nantes, 2 rue de la Houssiniere, BP 92208-44322, Nantes Cedex 03, France. quafafou9 Abstract The problem of
More informationGranular association rules for multi-valued data
Granular association rules for multi-valued data Fan Min and William Zhu Lab of Granular Computing, Zhangzhou Normal University, Zhangzhou 363, China. Email: minfanphd@163.com, williamfengzhu@gmail.com
More informationRough Approximations under Level Fuzzy Sets
Rough Approximations under Level Fuzzy Sets W.-N. Liu J.T. Yao Y.Y.Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: [liuwe200, jtyao, yyao]@cs.uregina.ca
More informationA mining method for tracking changes in temporal association rules from an encoded database
A mining method for tracking changes in temporal association rules from an encoded database Chelliah Balasubramanian *, Karuppaswamy Duraiswamy ** K.S.Rangasamy College of Technology, Tiruchengode, Tamil
More informationGranular Computing II:
Granular Computing II: Infrastructures for AI-Engineering Tsau Young (T. Y.) Lin, Member, IEEE, Abstract What is granular computing? There are no well accepted formal definitions yet. Informally, any computing
More informationData Analysis and Mining in Ordered Information Tables
Data Analysis and Mining in Ordered Information Tables Ying Sai, Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Ning Zhong
More informationA Rough Set Approach to Data with Missing Attribute Values
A Rough Set Approach to Data with Missing Attribute Values Jerzy W. Grzymala-Busse Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045, USA and Institute
More informationGranular Computing: Models and Applications
Granular Computing: Models and Applications Jianchao Han, 1, Tsau Young Lin 2, 1 Department of Computer Science, California State University, Dominguez Hills, Carson, CA 90747 2 Department of Computer
More informationGenerating Topology on Graphs by. Operations on Graphs
Applied Mathematical Sciences, Vol. 9, 2015, no. 57, 2843-2857 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2015.5154 Generating Topology on Graphs by Operations on Graphs M. Shokry Physics
More informationEnumerating Pseudo-Intents in a Partial Order
Enumerating Pseudo-Intents in a Partial Order Alexandre Bazin and Jean-Gabriel Ganascia Université Pierre et Marie Curie, Laboratoire d Informatique de Paris 6 Paris, France Alexandre.Bazin@lip6.fr Jean-Gabriel@Ganascia.name
More informationROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM
ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM Pavel Jirava Institute of System Engineering and Informatics Faculty of Economics and Administration, University of Pardubice Abstract: This article
More informationON DECOMPOSITION OF FUZZY BԐ OPEN SETS
ON DECOMPOSITION OF FUZZY BԐ OPEN SETS 1 B. Amudhambigai, 2 K. Saranya 1,2 Department of Mathematics, Sri Sarada College for Women, Salem-636016, Tamilnadu,India email: 1 rbamudha@yahoo.co.in, 2 saranyamath88@gmail.com
More informationGranular Computing: Examples, Intuitions and Modeling
Granular Computing: Examples, Intuitions and Modeling Tsau Young (T. Y.) Lin, Member; IEEE, Abstract- The notion of granular computing is examined. Obvious examples, such as fuzzy numbers, infinitesimal
More informationOn Reduct Construction Algorithms
1 On Reduct Construction Algorithms Yiyu Yao 1, Yan Zhao 1 and Jue Wang 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 {yyao, yanzhao}@cs.uregina.ca 2 Laboratory
More informationSets with Partial Memberships A Rough Set View of Fuzzy Sets
Sets with Partial Memberships A Rough Set View of Fuzzy Sets T. Y. Lin Department of Mathematics and Computer Science San Jose State University, San Jose, California 9592-3 E-mail: tylin @ cs.sj st.l.edu
More informationCollaborative Rough Clustering
Collaborative Rough Clustering Sushmita Mitra, Haider Banka, and Witold Pedrycz Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India {sushmita, hbanka r}@isical.ac.in Dept. of Electrical
More informationAvailable online at ScienceDirect. Procedia Computer Science 96 (2016 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 96 (2016 ) 179 186 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems,
More informationROUGH MEMBERSHIP FUNCTIONS: A TOOL FOR REASONING WITH UNCERTAINTY
ALGEBRAIC METHODS IN LOGIC AND IN COMPUTER SCIENCE BANACH CENTER PUBLICATIONS, VOLUME 28 INSTITUTE OF MATHEMATICS POLISH ACADEMY OF SCIENCES WARSZAWA 1993 ROUGH MEMBERSHIP FUNCTIONS: A TOOL FOR REASONING
More informationIMPLEMENTATION AND COMPARATIVE STUDY OF IMPROVED APRIORI ALGORITHM FOR ASSOCIATION PATTERN MINING
IMPLEMENTATION AND COMPARATIVE STUDY OF IMPROVED APRIORI ALGORITHM FOR ASSOCIATION PATTERN MINING 1 SONALI SONKUSARE, 2 JAYESH SURANA 1,2 Information Technology, R.G.P.V., Bhopal Shri Vaishnav Institute
More informationCombined Intra-Inter transaction based approach for mining Association among the Sectors in Indian Stock Market
Ranjeetsingh BParihar et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol 3 (3), 01,3895-3899 Combined Intra-Inter transaction based approach for mining Association
More informationApplying Objective Interestingness Measures. in Data Mining Systems. Robert J. Hilderman and Howard J. Hamilton. Department of Computer Science
Applying Objective Interestingness Measures in Data Mining Systems Robert J. Hilderman and Howard J. Hamilton Department of Computer Science University of Regina Regina, Saskatchewan, Canada SS 0A fhilder,hamiltong@cs.uregina.ca
More informationTemporal Weighted Association Rule Mining for Classification
Temporal Weighted Association Rule Mining for Classification Purushottam Sharma and Kanak Saxena Abstract There are so many important techniques towards finding the association rules. But, when we consider
More informationAppropriate Item Partition for Improving the Mining Performance
Appropriate Item Partition for Improving the Mining Performance Tzung-Pei Hong 1,2, Jheng-Nan Huang 1, Kawuu W. Lin 3 and Wen-Yang Lin 1 1 Department of Computer Science and Information Engineering National
More informationMining Local Association Rules from Temporal Data Set
Mining Local Association Rules from Temporal Data Set Fokrul Alom Mazarbhuiya 1, Muhammad Abulaish 2,, Anjana Kakoti Mahanta 3, and Tanvir Ahmad 4 1 College of Computer Science, King Khalid University,
More informationSongklanakarin Journal of Science and Technology SJST R1 Ghareeb SPATIAL OBJECT MODELING IN SOFT TOPOLOGY
Songklanakarin Journal of Science and Technology SJST-0-00.R Ghareeb SPATIAL OBJECT MODELING IN SOFT TOPOLOGY Journal: Songklanakarin Journal of Science and Technology Manuscript ID: SJST-0-00.R Manuscript
More informationEFFICIENT ATTRIBUTE REDUCTION ALGORITHM
EFFICIENT ATTRIBUTE REDUCTION ALGORITHM Zhongzhi Shi, Shaohui Liu, Zheng Zheng Institute Of Computing Technology,Chinese Academy of Sciences, Beijing, China Abstract: Key words: Efficiency of algorithms
More informationE-R Model. Hi! Here in this lecture we are going to discuss about the E-R Model.
E-R Model Hi! Here in this lecture we are going to discuss about the E-R Model. What is Entity-Relationship Model? The entity-relationship model is useful because, as we will soon see, it facilitates communication
More informationEfficient Mining of Generalized Negative Association Rules
2010 IEEE International Conference on Granular Computing Efficient Mining of Generalized egative Association Rules Li-Min Tsai, Shu-Jing Lin, and Don-Lin Yang Dept. of Information Engineering and Computer
More informationApplying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems
Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems Nancy Mehta,Neera Bawa Lect. In CSE, JCDV college of Engineering. (mehta_nancy@rediffmail.com,
More informationPredefined Object Reduction
ISSN: 8 Volume Issue December 0 Predefined Object Reduction Mohammed Adam TaheirMohammedWan Maseri Binti Wan Mohd Ruzaini Bin Abdullah Arshah LiuYao Abstract - Reduction techniques is still an open area
More informationA Graded Meaning of Formulas in Approximation Spaces
Fundamenta Informaticae 60 (2004) 159 172 159 IOS Press A Graded Meaning of Formulas in Approximation Spaces Anna Gomolińska Department of Mathematics University of Białystok ul. Akademicka 2, 15-267 Białystok,
More informationInformation Granulation and Approximation in a Decision-theoretic Model of Rough Sets
Information Granulation and Approximation in a Decision-theoretic Model of Rough Sets Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S 0A2 E-mail: yyao@cs.uregina.ca
More informationRelational Model. Rab Nawaz Jadoon DCS. Assistant Professor. Department of Computer Science. COMSATS IIT, Abbottabad Pakistan
Relational Model DCS COMSATS Institute of Information Technology Rab Nawaz Jadoon Assistant Professor COMSATS IIT, Abbottabad Pakistan Management Information Systems (MIS) Relational Model Relational Data
More informationAttribute Reduction using Forward Selection and Relative Reduct Algorithm
Attribute Reduction using Forward Selection and Relative Reduct Algorithm P.Kalyani Associate Professor in Computer Science, SNR Sons College, Coimbatore, India. ABSTRACT Attribute reduction of an information
More informationAction Rules. (*Corresponding author)
Action Rules Zbigniew W. Ras* Department of Computer Science University of North Carolina 9201 University City Blvd. Charlotte, NC 28223, USA voice: +1 704-687-4567 fax: +1 704-687-3516 email: ras@uncc.edu
More informationDefinition 2.3: [5] Let, and, be two simple graphs. Then the composition of graphs. and is denoted by,
International Journal of Pure Applied Mathematics Volume 119 No. 14 2018, 891-898 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu ON M-POLAR INTUITIONISTIC FUZZY GRAPHS K. Sankar 1,
More informationDMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE
DMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE Saravanan.Suba Assistant Professor of Computer Science Kamarajar Government Art & Science College Surandai, TN, India-627859 Email:saravanansuba@rediffmail.com
More informationMining Quantitative Association Rules on Overlapped Intervals
Mining Quantitative Association Rules on Overlapped Intervals Qiang Tong 1,3, Baoping Yan 2, and Yuanchun Zhou 1,3 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China {tongqiang,
More informationInterpreting Association Rules in Granular Data Model via Decision Logic
Interpreting Association Rules in Granular Data Model via Decision Logic Tsau. Young.("T. Y.") Lin Department of Computer Science San Jose State University San Jose, CA 95192-0462 tylin@cs.sjsu.edu Abstruct
More informationMinimal Test Cost Feature Selection with Positive Region Constraint
Minimal Test Cost Feature Selection with Positive Region Constraint Jiabin Liu 1,2,FanMin 2,, Shujiao Liao 2, and William Zhu 2 1 Department of Computer Science, Sichuan University for Nationalities, Kangding
More informationApproximation of Frequency Queries by Means of Free-Sets
Approximation of Frequency Queries by Means of Free-Sets Jean-François Boulicaut, Artur Bykowski, and Christophe Rigotti Laboratoire d Ingénierie des Systèmes d Information INSA Lyon, Bâtiment 501 F-69621
More informationDATA MINING II - 1DL460
Uppsala University Department of Information Technology Kjell Orsborn DATA MINING II - 1DL460 Assignment 2 - Implementation of algorithm for frequent itemset and association rule mining 1 Algorithms for
More informationCOMBINATION OF ROUGH AND FUZZY SETS
1 COMBINATION OF ROUGH AND FUZZY SETS BASED ON α-level SETS Y.Y. Yao Department of Computer Science, Lakehead University Thunder Bay, Ontario, Canada P7B 5E1 E-mail: yyao@flash.lakeheadu.ca 1 ABSTRACT
More informationThe Rough Set Database System: An Overview
The Rough Set Database System: An Overview Zbigniew Suraj 1,2 and Piotr Grochowalski 2 1 Chair of Computer Science Foundations University of Information Technology and Management, Rzeszow, Poland zsuraj@wenus.wsiz.rzeszow.pl
More informationMining Association Rules From Time Series Data Using Hybrid Approaches
International Journal Of Computational Engineering Research (ijceronline.com) Vol. Issue. ining Association Rules From Time Series Data Using ybrid Approaches ima Suresh 1, Dr. Kumudha Raimond 2 1 PG Scholar,
More informationLocal and Global Approximations for Incomplete Data
Local and Global Approximations for Incomplete Data Jerzy W. Grzyma la-busse 1,2 and Wojciech Rz asa 3 1 Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045,
More informationClassification with Diffuse or Incomplete Information
Classification with Diffuse or Incomplete Information AMAURY CABALLERO, KANG YEN Florida International University Abstract. In many different fields like finance, business, pattern recognition, communication
More informationEfficient integration of data mining techniques in DBMSs
Efficient integration of data mining techniques in DBMSs Fadila Bentayeb Jérôme Darmont Cédric Udréa ERIC, University of Lyon 2 5 avenue Pierre Mendès-France 69676 Bron Cedex, FRANCE {bentayeb jdarmont
More informationApproximation of Relations. Andrzej Skowron. Warsaw University. Banacha 2, Warsaw, Poland. Jaroslaw Stepaniuk
Approximation of Relations Andrzej Skowron Institute of Mathematics Warsaw University Banacha 2, 02-097 Warsaw, Poland e-mail: skowron@mimuw.edu.pl Jaroslaw Stepaniuk Institute of Computer Science Technical
More informationYiyu Yao University of Regina, Regina, Saskatchewan, Canada
ROUGH SET APPROXIMATIONS: A CONCEPT ANALYSIS POINT OF VIEW Yiyu Yao University of Regina, Regina, Saskatchewan, Canada Keywords: Concept analysis, data processing and analysis, description language, form
More informationself-organizing maps and symbolic data
self-organizing maps and symbolic data Aïcha El Golli, Brieuc Conan-Guez, Fabrice Rossi AxIS project, National Research Institute in Computer Science and Control (INRIA) Rocquencourt Research Unit Domaine
More informationUpper bound tighter Item caps for fast frequent itemsets mining for uncertain data Implemented using splay trees. Shashikiran V 1, Murali S 2
Volume 117 No. 7 2017, 39-46 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Upper bound tighter Item caps for fast frequent itemsets mining for uncertain
More informationAn Algorithm for Frequent Pattern Mining Based On Apriori
An Algorithm for Frequent Pattern Mining Based On Goswami D.N.*, Chaturvedi Anshu. ** Raghuvanshi C.S.*** *SOS In Computer Science Jiwaji University Gwalior ** Computer Application Department MITS Gwalior
More informationReichenbach Fuzzy Set of Transitivity
Available at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 9, Issue 1 (June 2014), pp. 295-310 Applications and Applied Mathematics: An International Journal (AAM) Reichenbach Fuzzy Set of
More informationRough Connected Topologized. Approximation Spaces
International Journal o Mathematical Analysis Vol. 8 04 no. 53 69-68 HIARI Ltd www.m-hikari.com http://dx.doi.org/0.988/ijma.04.4038 Rough Connected Topologized Approximation Spaces M. J. Iqelan Department
More informationDiscovery of Interesting Data Dependencies from a Workload of SQL Statements
Discovery of Interesting Data Dependencies from a Workload of SQL Statements S. Lopes J-M. Petit F. Toumani Université Blaise Pascal Laboratoire LIMOS Campus Universitaire des Cézeaux 24 avenue des Landais
More informationSensitive Rule Hiding and InFrequent Filtration through Binary Search Method
International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 5 (2017), pp. 833-840 Research India Publications http://www.ripublication.com Sensitive Rule Hiding and InFrequent
More informationPerformance Analysis of Apriori Algorithm with Progressive Approach for Mining Data
Performance Analysis of Apriori Algorithm with Progressive Approach for Mining Data Shilpa Department of Computer Science & Engineering Haryana College of Technology & Management, Kaithal, Haryana, India
More informationSTUDY ON FREQUENT PATTEREN GROWTH ALGORITHM WITHOUT CANDIDATE KEY GENERATION IN DATABASES
STUDY ON FREQUENT PATTEREN GROWTH ALGORITHM WITHOUT CANDIDATE KEY GENERATION IN DATABASES Prof. Ambarish S. Durani 1 and Mrs. Rashmi B. Sune 2 1 Assistant Professor, Datta Meghe Institute of Engineering,
More informationPSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets
2011 Fourth International Symposium on Parallel Architectures, Algorithms and Programming PSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets Tao Xiao Chunfeng Yuan Yihua Huang Department
More informationModeling with Uncertainty Interval Computations Using Fuzzy Sets
Modeling with Uncertainty Interval Computations Using Fuzzy Sets J. Honda, R. Tankelevich Department of Mathematical and Computer Sciences, Colorado School of Mines, Golden, CO, U.S.A. Abstract A new method
More informationA study on lower interval probability function based decision theoretic rough set models
Annals of Fuzzy Mathematics and Informatics Volume 12, No. 3, (September 2016), pp. 373 386 ISSN: 2093 9310 (print version) ISSN: 2287 6235 (electronic version) http://www.afmi.or.kr @FMI c Kyung Moon
More informationAssociation Rules Mining using BOINC based Enterprise Desktop Grid
Association Rules Mining using BOINC based Enterprise Desktop Grid Evgeny Ivashko and Alexander Golovin Institute of Applied Mathematical Research, Karelian Research Centre of Russian Academy of Sciences,
More informationBipolar Fuzzy Line Graph of a Bipolar Fuzzy Hypergraph
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 13, No 1 Sofia 2013 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0002 Bipolar Fuzzy Line Graph of a
More informationMining Distributed Frequent Itemset with Hadoop
Mining Distributed Frequent Itemset with Hadoop Ms. Poonam Modgi, PG student, Parul Institute of Technology, GTU. Prof. Dinesh Vaghela, Parul Institute of Technology, GTU. Abstract: In the current scenario
More informationThe Rough Set View on Bayes Theorem
The Rough Set View on Bayes Theorem Zdzis law Pawlak University of Information Technology and Management ul. Newelska 6, 01 447 Warsaw, Poland zpw@ii.pw.edu.pl MOTTO: It is a capital mistake to theorise
More informationBRACE: A Paradigm For the Discretization of Continuously Valued Data
Proceedings of the Seventh Florida Artificial Intelligence Research Symposium, pp. 7-2, 994 BRACE: A Paradigm For the Discretization of Continuously Valued Data Dan Ventura Tony R. Martinez Computer Science
More informationAn overview of interval encoded temporal mining involving prioritized mining, fuzzy mining, and positive and negative rule mining
An overview of interval encoded temporal mining involving prioritized mining, fuzzy mining, and positive and negative rule mining # C. Balasubramanian 1, K. Duraiswamy 2, V.Palanisamy 3 ISSN:0975-9646
More informationA database can be modeled as: + a collection of entities, + a set of relationships among entities.
The Relational Model Lecture 2 The Entity-Relationship Model and its Translation to the Relational Model Entity-Relationship (ER) Model + Entity Sets + Relationship Sets + Database Design Issues + Mapping
More informationThe Application of K-medoids and PAM to the Clustering of Rules
The Application of K-medoids and PAM to the Clustering of Rules A. P. Reynolds, G. Richards, and V. J. Rayward-Smith School of Computing Sciences, University of East Anglia, Norwich Abstract. Earlier research
More informationAn Evolutionary Algorithm for Mining Association Rules Using Boolean Approach
An Evolutionary Algorithm for Mining Association Rules Using Boolean Approach ABSTRACT G.Ravi Kumar 1 Dr.G.A. Ramachandra 2 G.Sunitha 3 1. Research Scholar, Department of Computer Science &Technology,
More information