Semantics Oriented Association Rules
|
|
- Emily Chandler
- 5 years ago
- Views:
Transcription
1 Semantics Oriented Association Rules Eric Louie BM Almaden Research Center 650 Harry Road, San Jose, CA Abstract - t is well known that relational theory carries very little semantic. To mine deeper semantics, additional modeling are necessary. n fact, some pure association rules are found exist even in a randomly generated data. n this paper, we consider the relational database in which every attribute value bas some additional information, such as price, fuzzy degree, neighborhood, or security compartment and levels. Two types of additions are considered: one is structure added, the other is valued-added. Somewhat a surprise, the additional cost in semantics checking is found very well compensated by the pruning of non-semantic rules. 1. NTRODUCTON n relation theory, attribute domains are Cantor sets; the interactions among members of real world objects are forgotten. For data mining, additional modeling of attribute domains are needed to organize deeper semantics of the data. We will term such addition to the existing data model; semantic added modeling. n this paper, we will consider two aspects; one is structure added, the other value added. 2. MOTVATON- ASSOCATON RULES N RANDOM DATA. n the experiment [4], a totally discrete data are randomly generated. Somewhat a surprise, we find some association with substantial supports of length 2 (though they did not meet our artificial high requirement of supports). This computation implies that frequency itself may not be an adequate criterion for meaningful pattems; see the Table 3 in Experiments Report, Section 7. So semantic modeling seems necessary for database mining. 3. SEMANTC ADDED MODELNG What would be the correct mathematical structure to capture the semantics of real world objects? This is a question that has many ad hoc answers; we decide to consult the history. Model theory of the first order logic uses a cantor set together with relational structures and functions to model the real world. We will follow it; attribute domains are assumed to have all such structures. Previously we have explored the simplest structure, namely, one binary relation is added to each attribute domains [5], [6], [9], [lo], [12], [ll], [7]; totally, they induce finitely many binary relations on the universe (of eneitites). n [3], we consider one real valued function for each domain. n this paper, we combine the two, T. Y. Lin Department of Mathematics and Computer Science San Jose State University, San Jose, CA tylin@cs.sj su.edu namely, on each domain we have one binary relation and one function. 4. STRUCTURE ADDED DATA MODEL BNARY RELATONS We will examine the case each attribute domain is assumed to have one binary relation. ts geometric corresponding concept is called a binary neighborhood system (BNS). n the case of equivalence relation, a BNS is a partition. The binary relation on each attribute domain in turns induced a binary relation on the universe. So on the universe, there are finitely many binary relations. We will examine the impact of such added structure on data mining. 4.1 Crisp/Fuzzy binary neighborhood systems A binary relation (BR) is a subset t defines a set B cvxu. called elementary (basic or binary) neighborhood at p E V A binary neighborhood system (BNS) denotes either the map B: p + Bp, or the family {BP 1 p E V}. The map B has also been called a binary granulation(bg). The set V together with BNS is called a BNS-space on U or simply BNS-space if V and U is the same. Proposition. BNS, BR and BG are equivalent to each other The induced equivalence relation: Note that BG, B:p E V +Bp 2u, induces a partition as follows: The collection of complete inverse images B-l(Bp) forms a partition on V, and hence an equivalence on V. We use EB to denote this equivalence relation. We may drop the subscript, if B is understood Fuzzifications: Binary relation and neighborhood system can be fkzified; in other words, instead of being a /02/$ EEE 956
2 subset of V x U, it could be a fuzzy subset (a membership hction FB: V x U +[O. 11 ) 4.2 Structure Added Data Models Which, in each tuple of Table 1, assigns the element in the first column to the element in the last column. The inverse map CTY-1 induces a BNS on U. So we have A traditional relation instance can be viewed as a knowledge representation that maps each entity to a tuple of attribute values. Table 1 illustrates the notion 0f.a relation instance on the universe U={ul, u2, u3, u4, u5 }. U K (S# i STATUS j CTY) U + 1 (S / TWENTY / C) ~3 j (S3 i TEN Q j (S4 i TWENTY i C1) US j (S5 j THRTY C3) Table 1. An nformation Table; arrows and parentheses will be suppressed n geographical attribute domain, one can use binary relation to capture the "near" semantics. So on CTY attribute, we assume a binary relation holds in the domain; see Table 2 CTY C1 CTY c1 c1 c2 c2 c1 Each binary relation, say B, induced an equivalence relation E. n this case the neighborhood is an equivalence class We should have similar results for "4'; we denote it by 0 (order binary relation). tc3 c3 c2 c3 Table 2 "near"-binary Relation L. J n numerical attribute, such as STUATS, we have the order binary relation ''5.'' Next, we express both B and "5. " in BNS format: Definition. The 3-tuple (U, Aj, Dom(Aj), j=1,2,..., n) is called structure added data model, where U is the universe, AJ is the attributes. For the example in Table 1, the structure added model is (U, B, {Cl, C2, C3}, 0, {TEN, TWENTY, THRTY}) 4.3. The mpact ofadded Structure to Data Mining Note that attributes can be regarded as projections. The CTY attribute is a map, denoted by CTY again. CTY: U + Dom(CTY), n mining such a data model, first concern is the cost in checking the added structure. So experiments have been conduct in [4]. Somewhat a surprise, the cost is well compensated by the saving. t does have cost in checking the continuity of association rules, however, the pruning of noncontinuous rules save the time in computing the long rules. One beauty of continuity is that the compositions of continuous rules are also continuous, so the only cost is at the length /02/$ JEEE 957
3 Table 4 is generated with some embedded semantics. That is, some associations are embedded in the algorithm of generating the test data. From the table it is clear finding pure association rules is expensive. n Table 5, some neighborhoods (one binary relation) are generated. Based on such a structure, the cost of finding continuous association rule is greatly reduced. From the artificial data, it proves that this BNS theory is promising. The next step is to test on real world applications. 5. VALUED ADDED DATA MODEL n this section we will consider the case the function valued will be part of the model 5.1. Valued Added Granular Data Model Definition. The 4-tuple (U, Aj, Xj, Dom(Aj), j=1,2,..., n) is a Value Added Granular Data Model or VA-Granular Data Model, where for each AJ, a value add function is defined on each domain Dom(AJ) : Xj : Dom(Aj) + M where M is either a Cantor set W, the security lattice SC, real numbers, or [0, 11. Proposition 1. f M is 2 then X j is a binary neighborhood system, which is equivalent to a binary relation. Proposition 2. f M is SC, the security lattice then Xj is a classification and the granular data model is a MLS data model. Proposition 3. f M is real number, then Xj is a random variable. Random variable is not a variable varies randomly, it is merely a function whose numerical values are determined by chance; please see [3] for connecting the mathematics to intuition. Proposition 4. f M is [0, 13, then X j could be a grade of fizziness. n this case the granular data model is a fizzy database The mpact of Added Structure Model to Data Mining The difference between this and last sections is that the values of the function do participate in computing. For example, the existing of real valued function implies the existing of a neighborhood system on an attribute domain D (a topological space).. However, the imposed constraints are imposed more than on the structure of D, we use the real values. Table 5 and 6 say the computing of VA-association is quite expensive, if we use values alone. We would like to comment that by assigning the nearest or smallest neighborhood at each point, we have a BNS. n next, project, we will use this BNS, called nearest neighborhood system, and values. 6. PATTERNS PRESERVNG STRUCTURES We collect some generalized standard patterns: [5]. Let A and B be two attributes of a relation-with-additionalsemantics. Let c, d be two values of A and B respectively. Let NEGH(c), NEGH(d) be the respective elementary granules. t is clear that c = NAME(NEGH(c)) and d = NAME( NEGH(d)). Let Card(.)be the cardinal number of a set 0. Structure added association rules 1 A formula c + d is a continuous or semantic decision rule, if the inclusion NEGH(c) E NEGH(d) is continuous. 2 A formula A + B is a continuous or semantic universal decision rule, iff V c E A 3 d E B such that NEGH(c) cnegh(d). This rule is equivalent to extension hctional dependence.. 3. A formula c + d is a robust continuouslsemantic decision rule, if NEGH(c) ENEGH(d) and Card(Pc) 2 threshold [S. 4 A formula c + d is a soft continuouslsemantic decision rule (strong rule), if NEGH(c) is softly included in NEGH(d), NEGH(c) 0 NEGH(d) [12]. 5. Weak association rule: A pair (c, d) is an association rules, if Card (NEGH(c) n NEGH(d)) 2 threshhold. 6. A pair (c, d) is said to be in a relation (or database), if it is a sub-tuple of a tuple that belongs to a relation (or database). 7. A pair (c, d) in a given relation is one-way (c +d) continuous (or semantic) if every x E Bc, there is at least one y E Bd such that (x, y) is in the given relation. 8. A pair (c, d) in a given relation is a two way continuous (or semantic) if (c +d) and (d +c) are both continuous /02/$ EEE 958
4 9. Semantic association rule: A pair (c, d) is an association rule iff the pair is an association rule and two way continuous. 10. Soft association rule: A pair (c, d) is a soft association rule, if Card (NEGH(c) n NEGH(d)) 2 threshhold. [5], [61 Valued added association rules 11. n-the-average-association rule: Two attributes Ai and Aj is associated in the average, if JE(Xi)- E(Xj)l where E(.) is the expected value, and is the absolute value. 12. Fuzzy decision rule: A formula c + d is a fuzzy decision rule, if E, G Ed and X(c) X(d), where E, and Ed are the equivalence classes of c and d. n other words, c and d are the names of the equivalence class E, and Ed 13. Security leak rule: A formula c + d is a security leak decision rule, if E, c Ed and X(c) X(d.). 14. Value added association rule (VA-association rule) [3] Sum-version: A granule (sub-tuple) b=(bl nb2 n... nb ) is a q-va-association rule q if Sum(b) 2 sq, where sum@) = ~j,j,*p(xjo) = ~ qj=1 P Cbj)*lbl/lU, where xjo = P (bj) Min-version: A granule (sub-tuple) b=(bl nb2 n... nb ) q is a q-va-association rule if Min(b) ) 2 sq, where Min(b) =Minj xjo*p(xjo) = Minq i=lh(bj)*lbl/lu 14.3 Max-version: A granule (sub-tuple) b=(bl nb2 n.. nb ) is a q-va-association rule 9 if Max(b) ) 2 sq, where Max(b) =Maxj xjo*p(xj0) = Maxqi=l(f(b-i)*lbl) Traditional: The Max and Min-versions are the traditional one iff the profit function is the constant=l. Recall that we are concerning only with the supports, so association rules have no directions. Since we are using granules, a q-association rule is equivalent to a q-large granule; the former q means the length of tuple, the latter q means the length of the intersections. The frequency of an itemset is the cardinal number of a granule (= a finite intersection of elementary granules) n general, there are no apriori criteria for value added case. However, if we require the thresholds increase with the lengths, that is, Then there are apriori criteria: q-large implies all sub-tuples are (q-+large, where i 2 0. Value added granular data model allows us to import many probability theory into data mining. We list a sample here and more work will be reported in the near future. 7. EXPERMENT REPORTS Here are the ACRONYMNs: q= length; c =candidate; a=association rules; s=support count; 6 = time need to generated next rows (in seconds) 7.1. Random Data Table 3 is the results of finding association rules on randomly generated data: 1. The relation has rows and 16 columns; we require the support to be items. 2. The distinct attribute is limited to 10; there are real world medical data meet this constraints. Table 3. Randomly generated data 7.2. Structure Added Computing Table 4 and 5 is the same as Table 3 except the distinct attribute is limited to s s s s l02/$ EEE 959
5 s s s s s s s s s s s s s s s s s s Table 4: Finding pure association rule is expensive s s s s c S a s a OlOS s s s s lo s lo semantic rules is inexpinsive 7.3. Valued Added Computing Table 6 is the results of finding association rules based on data with real valued function: 1. The relation has 500 rows and 8 columns; we require the weights greater than The distinct attribute is limited to s lo 0.460s 10 ( s s s s l o 2.824s s s s s s s s 0.060s 0.010s s /02/$ leee 960 o 1.312s s s s 1
6 8. CONCLUSONS The advantage of data mining by granular computing are: 1. it is fast in mining classical relations, granular computing is faster than Apriori [13], [14] because the "database scan" are replaced by bit operations. 2. the use of granular computing is extend to Yea1 world" databases (semantically richer relations); its cost is well compensated by pruning. Such extra semantics may be able to use for analyzing unexpected, peculiar rules [ Granular structure is the mathematical structure of the real world. So this method is mining directly on the real world, not on its representations REFERENCES [l] R. Agrawal, T. mielinski, and A. Swami, "Mining Association Rules Between Sets of tems in Large Databases," in Proceeding of ACM-SGMOD international Conference on Management of Data, pp , Washington, DC, June, 1993 [2] P. Halmos, Measure Theory, Van Nostrand, 1950 [3] T. Y. Lin, Y. Y. Yao, and E. Louie, "Value Added Association Rules, " 6~ Pacific-Asia Conference, Taipei, Taiwan, May 6-8,2002 [4] T. Y. Lin, Y. Y. Yao, and E. Louie, "Association Rules with Additional Semantics Modeled by Binary Relations," n: Rough Set Theory and Granular Computing" physica- Verlag, Shusaku Tsumoto, Masahiro nuiguchi and Shoji Hirano (Eds), to appear [5] T. Y. Lin, "Data Mining and Machine Oriented Modeling: A Granular Computing Approach," Journal of Applied ntelligence, Kluwer, Vol. 13,No 2, September/October,2000, pp [6] T. Y. Lin, "Data Mining: Granular Computing Approach.'' n: Methodologies for Knowledge Discovery and Data Mining, Lecture Notes in Artificial ntelligence 1574, Third Pacific-Asia Conference, Beijing, April 26-28, 1999, [7] "Granular Computing on Binary Relations : Data Mining and Neighborhood Systems." n: Rough Sets n Knowledge Discovery, A. Skowom and L. Polkowski (eds), Physica-Verlag, 1998, [8] T. Y. Lin, "Rough Set Theory in Very Large Databases," Symposium on Modeling, Analysis and Simulation, CESA'96 MACS Multi Conference (Computational Engineering in Systems Applications), Lille, France, July 9-12, 1996, Vol. 2 of 2, [9] T. Y. Lin, Neighborhood Systems and Approximation in Database and Knowledge Base Systems, Proceedings of the Fourth nternational Symposium on Methodologies of ntelligent Systems, Poster Session, October 12-15, pp , [lo] T. Y. Lin, "Neighborhood Systems and Relational Database". Abstract, Proceedings of CSC '88, February, 1988, pp [ll] T. Y. Lin and M. Hadjimichael "Non-classificatory Generalization in Data Mining," Proceedings of The Fourth Workshop on Rough Sets, Fuzzy Sets and Machine Discovety, Tokyo, Japan, November 8-10,1996, [12] T. Y. Lin, and Y.Y. Yao "Mining Soft Rules Using Rough Sets and Neighborhoods." n: Symposium on Modeling, Analysis and Simulation, MACS Multiconference (Computational Engineering in Systems Applications), Lille, France, July 9-12, 1996, Vol. 2 of 2, [13] Eric Louie and T.Y. Lin, "Finding Association Rules using Fast Bit Computation: Machine-Oriented Modeling." n: Proceeding of 12th ntemational Symposium SMS2000, Charlotte, North Carolina, Oct 11-14, Lecture Notes in A [14] E. Louie, T. Y. Lin and "A Data Mining Approach using Machine Oriented Modeling: Finding Association Rules using Canonical Names.". n: Proceeding of 14th Annual nternational Symposium Aerospace/Defense Sensing, Simulation, and Controls, SPE Vol 4057, Orlando, April 24-28,2000, pp [15] Balaji Padmanabhan and Alexander Tuzhilin "Finding Unexpected Patterns in Data." n: Data Mining and Granular Computing T. Y. Lin, Y.Y. Yao and L. Zadeh (eds), Physica-Verlag, to appear /02/$ EEE 961
Modeling the Real World for Data Mining: Granular Computing Approach
Modeling the Real World for Data Mining: Granular Computing Approach T. Y. Lin Department of Mathematics and Computer Science San Jose State University San Jose California 95192-0103 and Berkeley Initiative
More informationAssociation Rules with Additional Semantics Modeled by Binary Relations
Association Rules with Additional Semantics Modeled by Binary Relations T. Y. Lin 1 and Eric Louie 2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192-0103
More informationValue Added Association Rules
Value Added Association Rules T.Y. Lin San Jose State University drlin@sjsu.edu Glossary Association Rule Mining A Association Rule Mining is an exploratory learning task to discover some hidden, dependency
More informationMathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities
Mathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities Tsau Young ( T. Y. ) Lin Department of Computer Science San Jose State University San Jose, CA
More informationRough Sets, Neighborhood Systems, and Granular Computing
Rough Sets, Neighborhood Systems, and Granular Computing Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract Granulation
More informationMining High Order Decision Rules
Mining High Order Decision Rules Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 e-mail: yyao@cs.uregina.ca Abstract. We introduce the notion of high
More informationQualitative Fuzzy Sets and Granularity
Qualitative Fuzzy Sets and Granularity T. Y. Lin Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192-0103 E-mail: tylin@cs.sjsu.edu and Shusaku Tsumoto
More informationA Generalized Decision Logic Language for Granular Computing
A Generalized Decision Logic Language for Granular Computing Y.Y. Yao Department of Computer Science, University of Regina, Regina Saskatchewan, Canada S4S 0A2, E-mail: yyao@cs.uregina.ca Churn-Jung Liau
More informationA Granular Computing Approach. T.Y. Lin 1;2. Abstract. From the processing point of view, data mining is machine
Data Mining and Machine Oriented Modeling: A Granular Computing Approach T.Y. Lin 1;2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192 tylin@cs.sjsu.edu
More informationOn Generalizing Rough Set Theory
On Generalizing Rough Set Theory Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract. This paper summarizes various formulations
More informationAttribute (Feature) Completion The Theory of Attributes from Data Mining Prospect
Attribute (Feature) Completion The Theory of Attributes from Data Mining Prospect Tsay Young ( T. Y. ) Lin Department of Computer Science San Jose State University San Jose, CA 95192, USA tylin@cs.sjsu.edu
More informationRough Set Approaches to Rule Induction from Incomplete Data
Proceedings of the IPMU'2004, the 10th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Perugia, Italy, July 4 9, 2004, vol. 2, 923 930 Rough
More informationEfficient SQL-Querying Method for Data Mining in Large Data Bases
Efficient SQL-Querying Method for Data Mining in Large Data Bases Nguyen Hung Son Institute of Mathematics Warsaw University Banacha 2, 02095, Warsaw, Poland Abstract Data mining can be understood as a
More informationApproximation Theories: Granular Computing vs Rough Sets
Approximation Theories: Granular Computing vs Rough Sets Tsau Young ( T. Y. ) Lin Department of Computer Science, San Jose State University San Jose, CA 95192-0249 tylin@cs.sjsu.edu Abstract. The goal
More informationGeneralized Infinitive Rough Sets Based on Reflexive Relations
2012 IEEE International Conference on Granular Computing Generalized Infinitive Rough Sets Based on Reflexive Relations Yu-Ru Syau Department of Information Management National Formosa University Huwei
More informationGranular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions
Granular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions Yiyu (Y.Y.) Yao 1, Churn-Jung Liau 2, Ning Zhong 3 1 Department of Computer Science, University of Regina Regina, Saskatchewan,
More informationSets with Partial Memberships A Rough Set View of Fuzzy Sets
Sets with Partial Memberships A Rough Set View of Fuzzy Sets T. Y. Lin Department of Mathematics and Computer Science San Jose State University, San Jose, California 9592-3 E-mail: tylin @ cs.sj st.l.edu
More informationA Logic Language of Granular Computing
A Logic Language of Granular Computing Yiyu Yao and Bing Zhou Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {yyao, zhou200b}@cs.uregina.ca Abstract Granular
More informationFormal Concept Analysis and Hierarchical Classes Analysis
Formal Concept Analysis and Hierarchical Classes Analysis Yaohua Chen, Yiyu Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {chen115y, yyao}@cs.uregina.ca
More informationData Analysis and Mining in Ordered Information Tables
Data Analysis and Mining in Ordered Information Tables Ying Sai, Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Ning Zhong
More informationRough Approximations under Level Fuzzy Sets
Rough Approximations under Level Fuzzy Sets W.-N. Liu J.T. Yao Y.Y.Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: [liuwe200, jtyao, yyao]@cs.uregina.ca
More informationKnowledge Engineering in Search Engines
San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Spring 2012 Knowledge Engineering in Search Engines Yun-Chieh Lin Follow this and additional works at:
More informationA Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values
A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values Patrick G. Clark Department of Electrical Eng. and Computer Sci. University of Kansas Lawrence,
More informationInduction of Strong Feature Subsets
Induction of Strong Feature Subsets Mohamed Quafafou and Moussa Boussouf IRIN, University of Nantes, 2 rue de la Houssiniere, BP 92208-44322, Nantes Cedex 03, France. quafafou9 Abstract The problem of
More informationA Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set
A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set Renu Vashist School of Computer Science and Engineering Shri Mata Vaishno Devi University, Katra,
More informationInformation Granulation and Approximation in a Decision-theoretic Model of Rough Sets
Information Granulation and Approximation in a Decision-theoretic Model of Rough Sets Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S 0A2 E-mail: yyao@cs.uregina.ca
More informationMA651 Topology. Lecture 4. Topological spaces 2
MA651 Topology. Lecture 4. Topological spaces 2 This text is based on the following books: Linear Algebra and Analysis by Marc Zamansky Topology by James Dugundgji Fundamental concepts of topology by Peter
More informationA Rough Set Approach to Data with Missing Attribute Values
A Rough Set Approach to Data with Missing Attribute Values Jerzy W. Grzymala-Busse Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045, USA and Institute
More informationConsistency and Set Intersection
Consistency and Set Intersection Yuanlin Zhang and Roland H.C. Yap National University of Singapore 3 Science Drive 2, Singapore {zhangyl,ryap}@comp.nus.edu.sg Abstract We propose a new framework to study
More informationData with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction
Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction Jerzy W. Grzymala-Busse 1,2 1 Department of Electrical Engineering and Computer Science, University of
More informationA Set Theory For Soft Computing A Unified View of Fuzzy Sets via Neighbrohoods
A Set Theory For Soft Computing A Unified View of Fuzzy Sets via Neighbrohoods T. Y. Lin Department of Mathematics and Computer Science, San Jose State University, San Jose, California 95192-0103, and
More informationInterpreting Association Rules in Granular Data Model via Decision Logic
Interpreting Association Rules in Granular Data Model via Decision Logic Tsau. Young.("T. Y.") Lin Department of Computer Science San Jose State University San Jose, CA 95192-0462 tylin@cs.sjsu.edu Abstruct
More informationGranular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata
Granular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata Granular computing (GrC): Outline Introduction Definitions
More informationOn Reduct Construction Algorithms
1 On Reduct Construction Algorithms Yiyu Yao 1, Yan Zhao 1 and Jue Wang 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 {yyao, yanzhao}@cs.uregina.ca 2 Laboratory
More informationRough Connected Topologized. Approximation Spaces
International Journal o Mathematical Analysis Vol. 8 04 no. 53 69-68 HIARI Ltd www.m-hikari.com http://dx.doi.org/0.988/ijma.04.4038 Rough Connected Topologized Approximation Spaces M. J. Iqelan Department
More informationGranular Computing. Y. Y. Yao
Granular Computing Y. Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca, http://www.cs.uregina.ca/~yyao Abstract The basic ideas
More informationMining High Average-Utility Itemsets
Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Mining High Itemsets Tzung-Pei Hong Dept of Computer Science and Information Engineering
More informationXI International PhD Workshop OWD 2009, October Fuzzy Sets as Metasets
XI International PhD Workshop OWD 2009, 17 20 October 2009 Fuzzy Sets as Metasets Bartłomiej Starosta, Polsko-Japońska WyŜsza Szkoła Technik Komputerowych (24.01.2008, prof. Witold Kosiński, Polsko-Japońska
More informationROUGH MEMBERSHIP FUNCTIONS: A TOOL FOR REASONING WITH UNCERTAINTY
ALGEBRAIC METHODS IN LOGIC AND IN COMPUTER SCIENCE BANACH CENTER PUBLICATIONS, VOLUME 28 INSTITUTE OF MATHEMATICS POLISH ACADEMY OF SCIENCES WARSZAWA 1993 ROUGH MEMBERSHIP FUNCTIONS: A TOOL FOR REASONING
More informationThe strong chromatic number of a graph
The strong chromatic number of a graph Noga Alon Abstract It is shown that there is an absolute constant c with the following property: For any two graphs G 1 = (V, E 1 ) and G 2 = (V, E 2 ) on the same
More informationEFFICIENT ATTRIBUTE REDUCTION ALGORITHM
EFFICIENT ATTRIBUTE REDUCTION ALGORITHM Zhongzhi Shi, Shaohui Liu, Zheng Zheng Institute Of Computing Technology,Chinese Academy of Sciences, Beijing, China Abstract: Key words: Efficiency of algorithms
More informationGranular Computing on Binary Relations In Data Mining and Neighborhood Systems
Granular Computing on Binary Relations In Data Mining and Neighborhood Systems T. Y. Lin Department of Mathematics and Computer Science San Jose State University San Jose, California 95192-0103 And Department
More informationStrong Chromatic Number of Fuzzy Graphs
Annals of Pure and Applied Mathematics Vol. 7, No. 2, 2014, 52-60 ISSN: 2279-087X (P), 2279-0888(online) Published on 18 September 2014 www.researchmathsci.org Annals of Strong Chromatic Number of Fuzzy
More informationA Closest Fit Approach to Missing Attribute Values in Preterm Birth Data
A Closest Fit Approach to Missing Attribute Values in Preterm Birth Data Jerzy W. Grzymala-Busse 1, Witold J. Grzymala-Busse 2, and Linda K. Goodwin 3 1 Department of Electrical Engineering and Computer
More informationAvailable online at ScienceDirect. Procedia Computer Science 96 (2016 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 96 (2016 ) 179 186 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems,
More informationA Graph Theoretic Approach to Image Database Retrieval
A Graph Theoretic Approach to Image Database Retrieval Selim Aksoy and Robert M. Haralick Intelligent Systems Laboratory Department of Electrical Engineering University of Washington, Seattle, WA 98195-2500
More informationEnumerating Pseudo-Intents in a Partial Order
Enumerating Pseudo-Intents in a Partial Order Alexandre Bazin and Jean-Gabriel Ganascia Université Pierre et Marie Curie, Laboratoire d Informatique de Paris 6 Paris, France Alexandre.Bazin@lip6.fr Jean-Gabriel@Ganascia.name
More informationAn Efficient Tree-based Fuzzy Data Mining Approach
150 International Journal of Fuzzy Systems, Vol. 12, No. 2, June 2010 An Efficient Tree-based Fuzzy Data Mining Approach Chun-Wei Lin, Tzung-Pei Hong, and Wen-Hsiang Lu Abstract 1 In the past, many algorithms
More informationAdvanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs
Advanced Operations Research Techniques IE316 Quiz 1 Review Dr. Ted Ralphs IE316 Quiz 1 Review 1 Reading for The Quiz Material covered in detail in lecture. 1.1, 1.4, 2.1-2.6, 3.1-3.3, 3.5 Background material
More informationA GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY
A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY KARL L. STRATOS Abstract. The conventional method of describing a graph as a pair (V, E), where V and E repectively denote the sets of vertices and edges,
More informationAvoiding Fake Boundaries in Set Interval Computing
Journal of Uncertain Systems Vol.11, No.2, pp.137-148, 2017 Online at: www.jus.org.uk Avoiding Fake Boundaries in Set Interval Computing Anthony Welte 1, Luc Jaulin 1, Martine Ceberio 2, Vladik Kreinovich
More informationCardinality of Sets. Washington University Math Circle 10/30/2016
Cardinality of Sets Washington University Math Circle 0/0/06 The cardinality of a finite set A is just the number of elements of A, denoted by A. For example, A = {a, b, c, d}, B = {n Z : n } = {,,, 0,,,
More informationPerformance Analysis of Apriori Algorithm with Progressive Approach for Mining Data
Performance Analysis of Apriori Algorithm with Progressive Approach for Mining Data Shilpa Department of Computer Science & Engineering Haryana College of Technology & Management, Kaithal, Haryana, India
More informationTemporal Weighted Association Rule Mining for Classification
Temporal Weighted Association Rule Mining for Classification Purushottam Sharma and Kanak Saxena Abstract There are so many important techniques towards finding the association rules. But, when we consider
More informationApplying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems
Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems Nancy Mehta,Neera Bawa Lect. In CSE, JCDV college of Engineering. (mehta_nancy@rediffmail.com,
More informationOn Fuzzy Topological Spaces Involving Boolean Algebraic Structures
Journal of mathematics and computer Science 15 (2015) 252-260 On Fuzzy Topological Spaces Involving Boolean Algebraic Structures P.K. Sharma Post Graduate Department of Mathematics, D.A.V. College, Jalandhar
More informationDefinition 2.3: [5] Let, and, be two simple graphs. Then the composition of graphs. and is denoted by,
International Journal of Pure Applied Mathematics Volume 119 No. 14 2018, 891-898 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu ON M-POLAR INTUITIONISTIC FUZZY GRAPHS K. Sankar 1,
More informationA Graded Meaning of Formulas in Approximation Spaces
Fundamenta Informaticae 60 (2004) 159 172 159 IOS Press A Graded Meaning of Formulas in Approximation Spaces Anna Gomolińska Department of Mathematics University of Białystok ul. Akademicka 2, 15-267 Białystok,
More informationSemantics of Fuzzy Sets in Rough Set Theory
Semantics of Fuzzy Sets in Rough Set Theory Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S 0A2 E-mail: yyao@cs.uregina.ca URL: http://www.cs.uregina.ca/ yyao
More informationMining of Web Server Logs using Extended Apriori Algorithm
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
More information2. Discovery of Association Rules
2. Discovery of Association Rules Part I Motivation: market basket data Basic notions: association rule, frequency and confidence Problem of association rule mining (Sub)problem of frequent set mining
More informationClassification with Diffuse or Incomplete Information
Classification with Diffuse or Incomplete Information AMAURY CABALLERO, KANG YEN Florida International University Abstract. In many different fields like finance, business, pattern recognition, communication
More informationSongklanakarin Journal of Science and Technology SJST R1 Ghareeb SPATIAL OBJECT MODELING IN SOFT TOPOLOGY
Songklanakarin Journal of Science and Technology SJST-0-00.R Ghareeb SPATIAL OBJECT MODELING IN SOFT TOPOLOGY Journal: Songklanakarin Journal of Science and Technology Manuscript ID: SJST-0-00.R Manuscript
More informationCSC Discrete Math I, Spring Sets
CSC 125 - Discrete Math I, Spring 2017 Sets Sets A set is well-defined, unordered collection of objects The objects in a set are called the elements, or members, of the set A set is said to contain its
More information6. Concluding Remarks
[8] K. J. Supowit, The relative neighborhood graph with an application to minimum spanning trees, Tech. Rept., Department of Computer Science, University of Illinois, Urbana-Champaign, August 1980, also
More informationA New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering
A New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering Nghiem Van Tinh 1, Vu Viet Vu 1, Tran Thi Ngoc Linh 1 1 Thai Nguyen University of
More informationA Model of Machine Learning Based on User Preference of Attributes
1 A Model of Machine Learning Based on User Preference of Attributes Yiyu Yao 1, Yan Zhao 1, Jue Wang 2 and Suqing Han 2 1 Department of Computer Science, University of Regina, Regina, Saskatchewan, Canada
More informationUsing level-2 fuzzy sets to combine uncertainty and imprecision in fuzzy regions
Using level-2 fuzzy sets to combine uncertainty and imprecision in fuzzy regions Verstraete Jörg Abstract In many applications, spatial data need to be considered but are prone to uncertainty or imprecision.
More informationCOMBINATION OF ROUGH AND FUZZY SETS
1 COMBINATION OF ROUGH AND FUZZY SETS BASED ON α-level SETS Y.Y. Yao Department of Computer Science, Lakehead University Thunder Bay, Ontario, Canada P7B 5E1 E-mail: yyao@flash.lakeheadu.ca 1 ABSTRACT
More informationApproximation of Relations. Andrzej Skowron. Warsaw University. Banacha 2, Warsaw, Poland. Jaroslaw Stepaniuk
Approximation of Relations Andrzej Skowron Institute of Mathematics Warsaw University Banacha 2, 02-097 Warsaw, Poland e-mail: skowron@mimuw.edu.pl Jaroslaw Stepaniuk Institute of Computer Science Technical
More information- The Theory of Attributes from Data Mining Prospect
Attribute (Feature) Completion - The Theory of Attributes from Data Mining Prospect Tsay Young ( T. Y. ) Lin Department of Computer Science San Jose State University San Jose, CA 95192, USA tylin@cs.sjsu.edu
More informationGranular association rules for multi-valued data
Granular association rules for multi-valued data Fan Min and William Zhu Lab of Granular Computing, Zhangzhou Normal University, Zhangzhou 363, China. Email: minfanphd@163.com, williamfengzhu@gmail.com
More informationMining Distributed Frequent Itemset with Hadoop
Mining Distributed Frequent Itemset with Hadoop Ms. Poonam Modgi, PG student, Parul Institute of Technology, GTU. Prof. Dinesh Vaghela, Parul Institute of Technology, GTU. Abstract: In the current scenario
More informationThe Rough Set Database System: An Overview
The Rough Set Database System: An Overview Zbigniew Suraj 1,2 and Piotr Grochowalski 2 1 Chair of Computer Science Foundations University of Information Technology and Management, Rzeszow, Poland zsuraj@wenus.wsiz.rzeszow.pl
More information4 Basis, Subbasis, Subspace
4 Basis, Subbasis, Subspace Our main goal in this chapter is to develop some tools that make it easier to construct examples of topological spaces. By Definition 3.12 in order to define a topology on a
More informationNotes on Minimum Cuts and Modular Functions
Notes on Minimum Cuts and Modular Functions 1 Introduction The following are my notes on Cunningham s paper [1]. Given a submodular function f and a set S, submodular minimisation is the problem of finding
More informationGenerating Topology on Graphs by. Operations on Graphs
Applied Mathematical Sciences, Vol. 9, 2015, no. 57, 2843-2857 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2015.5154 Generating Topology on Graphs by Operations on Graphs M. Shokry Physics
More informationLecture notes for April 6, 2005
Lecture notes for April 6, 2005 Mining Association Rules The goal of association rule finding is to extract correlation relationships in the large datasets of items. Many businesses are interested in extracting
More informationA study on lower interval probability function based decision theoretic rough set models
Annals of Fuzzy Mathematics and Informatics Volume 12, No. 3, (September 2016), pp. 373 386 ISSN: 2093 9310 (print version) ISSN: 2287 6235 (electronic version) http://www.afmi.or.kr @FMI c Kyung Moon
More informationAn Algorithm for Frequent Pattern Mining Based On Apriori
An Algorithm for Frequent Pattern Mining Based On Goswami D.N.*, Chaturvedi Anshu. ** Raghuvanshi C.S.*** *SOS In Computer Science Jiwaji University Gwalior ** Computer Application Department MITS Gwalior
More informationBipolar Fuzzy Line Graph of a Bipolar Fuzzy Hypergraph
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 13, No 1 Sofia 2013 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0002 Bipolar Fuzzy Line Graph of a
More informationAction Rules. (*Corresponding author)
Action Rules Zbigniew W. Ras* Department of Computer Science University of North Carolina 9201 University City Blvd. Charlotte, NC 28223, USA voice: +1 704-687-4567 fax: +1 704-687-3516 email: ras@uncc.edu
More informationFuzzy Set-Theoretical Approach for Comparing Objects with Fuzzy Attributes
Fuzzy Set-Theoretical Approach for Comparing Objects with Fuzzy Attributes Y. Bashon, D. Neagu, M.J. Ridley Department of Computing University of Bradford Bradford, BD7 DP, UK e-mail: {Y.Bashon, D.Neagu,
More informationCHAPTER 4 K-MEANS AND UCAM CLUSTERING ALGORITHM
CHAPTER 4 K-MEANS AND UCAM CLUSTERING 4.1 Introduction ALGORITHM Clustering has been used in a number of applications such as engineering, biology, medicine and data mining. The most popular clustering
More informationA New Approach for Handling the Iris Data Classification Problem
International Journal of Applied Science and Engineering 2005. 3, : 37-49 A New Approach for Handling the Iris Data Classification Problem Shyi-Ming Chen a and Yao-De Fang b a Department of Computer Science
More informationROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM
ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM Pavel Jirava Institute of System Engineering and Informatics Faculty of Economics and Administration, University of Pardubice Abstract: This article
More informationIrregular Bipolar Fuzzy Graphs
Inernational Journal of pplications of Fuzzy Sets (ISSN 4-40) Vol ( 0), 9-0 Irregular ipolar Fuzzy Graphs Sovan Samanta ssamantavu@gmailcom Madhumangal Pal mmpalvu@gmailcom Department of pplied Mathematics
More informationWeb Service Usage Mining: Mining For Executable Sequences
7th WSEAS International Conference on APPLIED COMPUTER SCIENCE, Venice, Italy, November 21-23, 2007 266 Web Service Usage Mining: Mining For Executable Sequences MOHSEN JAFARI ASBAGH, HASSAN ABOLHASSANI
More informationGranular Computing: Models and Applications
Granular Computing: Models and Applications Jianchao Han, 1, Tsau Young Lin 2, 1 Department of Computer Science, California State University, Dominguez Hills, Carson, CA 90747 2 Department of Computer
More informationAttribute Reduction using Forward Selection and Relative Reduct Algorithm
Attribute Reduction using Forward Selection and Relative Reduct Algorithm P.Kalyani Associate Professor in Computer Science, SNR Sons College, Coimbatore, India. ABSTRACT Attribute reduction of an information
More information1. Fuzzy sets, fuzzy relational calculus, linguistic approximation
1. Fuzzy sets, fuzzy relational calculus, linguistic approximation 1.1. Fuzzy sets Let us consider a classical set U (Universum) and a real function : U --- L. As a fuzzy set A we understand a set of pairs
More informationLecture 17: Continuous Functions
Lecture 17: Continuous Functions 1 Continuous Functions Let (X, T X ) and (Y, T Y ) be topological spaces. Definition 1.1 (Continuous Function). A function f : X Y is said to be continuous if the inverse
More informationCHAPTER 4 FUZZY LOGIC, K-MEANS, FUZZY C-MEANS AND BAYESIAN METHODS
CHAPTER 4 FUZZY LOGIC, K-MEANS, FUZZY C-MEANS AND BAYESIAN METHODS 4.1. INTRODUCTION This chapter includes implementation and testing of the student s academic performance evaluation to achieve the objective(s)
More informationA mining method for tracking changes in temporal association rules from an encoded database
A mining method for tracking changes in temporal association rules from an encoded database Chelliah Balasubramanian *, Karuppaswamy Duraiswamy ** K.S.Rangasamy College of Technology, Tiruchengode, Tamil
More informationPincer-Search: An Efficient Algorithm. for Discovering the Maximum Frequent Set
Pincer-Search: An Efficient Algorithm for Discovering the Maximum Frequent Set Dao-I Lin Telcordia Technologies, Inc. Zvi M. Kedem New York University July 15, 1999 Abstract Discovering frequent itemsets
More informationJournal of Asian Scientific Research FEATURES COMPOSITION FOR PROFICIENT AND REAL TIME RETRIEVAL IN CBIR SYSTEM. Tohid Sedghi
Journal of Asian Scientific Research, 013, 3(1):68-74 Journal of Asian Scientific Research journal homepage: http://aessweb.com/journal-detail.php?id=5003 FEATURES COMPOSTON FOR PROFCENT AND REAL TME RETREVAL
More informationTHREE LECTURES ON BASIC TOPOLOGY. 1. Basic notions.
THREE LECTURES ON BASIC TOPOLOGY PHILIP FOTH 1. Basic notions. Let X be a set. To make a topological space out of X, one must specify a collection T of subsets of X, which are said to be open subsets of
More information8 Matroid Intersection
8 Matroid Intersection 8.1 Definition and examples 8.2 Matroid Intersection Algorithm 8.1 Definitions Given two matroids M 1 = (X, I 1 ) and M 2 = (X, I 2 ) on the same set X, their intersection is M 1
More informationCompactness in Countable Fuzzy Topological Space
Compactness in Countable Fuzzy Topological Space Apu Kumar Saha Assistant Professor, National Institute of Technology, Agartala, Email: apusaha_nita@yahoo.co.in Debasish Bhattacharya Associate Professor,
More informationself-organizing maps and symbolic data
self-organizing maps and symbolic data Aïcha El Golli, Brieuc Conan-Guez, Fabrice Rossi AxIS project, National Research Institute in Computer Science and Control (INRIA) Rocquencourt Research Unit Domaine
More information