Modeling the Real World for Data Mining: Granular Computing Approach

Size: px
Start display at page:

Download "Modeling the Real World for Data Mining: Granular Computing Approach"

Transcription

1 Modeling the Real World for Data Mining: Granular Computing Approach T. Y. Lin Department of Mathematics and Computer Science San Jose State University San Jose California and Berkeley Initiative in Soft Computing University of California Berkeley California Abstract To each object in an object space a (possibly empty) family of granules (crisp/fuzzy subsets) of the data space is assigned; we call it a granulation. It is a mild generalization of the neighborhood system of (pre- )topological spaces. If each family has at most one granule the granulation defines a binary relations. Interestingly if the granulation is defined by general relations the data space is the (world) models in first order logic. A knowledge representation of such a world model that is assigning a uniquely meaningful name (attribute value) to each neighborhood is called a granular data model. If the granulation are by equivalence relations the model is the classical relational model. Intuitively it is a real world data model; note that granules have overlapped so attribute values may not be independent. In other words attribute domains are more than Cantor sets; intuitively they are real world Models (sets). Depending on the structures and representations the model can be useful in fuzzy logic or data mining. The focus of this paper is on data mining in fact semantically rich rules are mined. Its performance are measured; it twenty some times faster than traditional Apriori. 1. Introduction Relation theory is designed to model the real world of a long duration. To accommodate all instances it assumes the universe and attribute domains are all Cantor sets. In other words the interactions among entities are forgotten in the relational modeling. To have a better approximation we need an appropriate model. In logic a real world is modeled by a Cantor set (of entities) with relational structure. As a first step we decide to consider binary relational structures. Interestingly it agrees with Zadeh s notion of information granulation [5]. In applications we reach seemingly unrelated topics data mining and fuzzy control. In this paper however we focus on data mining. Some impressive experimental results have achieved. In classical case we are 24 time faster than the traditional Apriori. 2 Granulations and Neighborhood Systems In [5]) Zadeh defines (rephrased) information granulation is a collection of granules with a granule being a clump of objects (points) which are drawn towards an object. In other words each object is associated with a family of clumps. This is essentially the notion of Frechet(V) space[?] or neighborhood systems [9]. In this paper a fuzzy set is uniquely defined by its membership function [15]. It is a w-sofset if we use the language of [7]. A crisp/fuzzy neighborhood system (F/NS) is: To each object we associate an (empty finite or infinite) family of crisp/fuzzy subsets called clumps. The mathematical system defined by these families is called crisp/fuzzy neighborhood system or simply neighborhood system and these clumps associated to are called fundamental neighborhoods of. Note that if there is at most one fundamental neighborhood at each point then the neighborhood system is defined by a binary relation; see Section 4. 1

2 3 Representations of neighborhood systems Weighted sum veristic constraints 3.1 Multiple valued representations We will illustrate the idea by examples. Let and be a family of fuzzy sets of that covers. is a fuzzy neighborhood system and each cover is a fuzzy neighborhood of any point in the cover. are the fuzzy neighborhood system at point The association to each object we associate a set of names is a multiple valued representation of the universe. 3.2 Fuzzy representations Since the neighborhood system is fuzzy we will take the weighted average of multiple values. Let us consider the following formal expressions: where are real numbers. Mathematically the collection of all such expressions is a vector space. Each vector is called a formal word. Let represents the grade of at i.e.. We will call the weight of in. Based on the weight we will form a formal word representation: defined by The expression is called the formal word representation of ; it is Zadeh s veristic constraint [?]. Table 1 consists of all such formal expressions; it is a vector-valued representation of the universe. Each expression represents a certain weighted sum of attribute values. 4 Binary granulation and Partitions A partition is a collection of pair-wise disjoint subsets whose union is. This is the simplest granulation. Its algebraic concept is an equivalence relation. So a natural generalization is a binary relation. We should like to comment that an obvious geometric generalization of a partition is a covering. Unfortunately a covering is not the geometric equivalence of a binary relation. The equivalent one is the more elaborate notion called the binary neighborhood system. This is the subject that will be covered next. Intuitively it is a cover with center more than a simple notion of cover. The notion of the center plays an essential role in this paper. 4.1 Binary granulation relations and neighborhood systems A Crisp/Fuzzy binary relation(br or FBR) is a crisp/fuzzy subset whose membership function is where M is the membership space that is M is either the unit interval [0 1] or the binary values. It defines a crisp/fuzzy set called binary (or elementary) neighborhood whose membership function is is defined by. The collection of all crisp/fuzzy sets on U is denoted by FZ(U). The map is called a crisp/fuzzy binary granulation and the set a crisp/fuzzy binary neighborhood system. 2

3 Proposition. and are equivalent to each other and will be used interchangeably; see [5]. A subset is a definable set if it is a union of equivalence classes. So a subset is called a definable neighborhood if is a union of elementary neighborhoods. If the definable neighborhood contains the elementary neighborhood of p it is a definable neighborhood of p. 4.2 Induced partitions The binary granulation is a map it induces a partition (or equivalence relation) denoted by on by the collection of complete inverse images. 5 Real world Model and Data Mining 5.1 Granular Data Model- Real world relational theory A crisp/fuzzy granular data model consists of 3-tuple where is called the object space is the data space ( and could be the same set) is a finite family of crisp/fuzzy binary granulations (neighborhood systems or binary relations). If and that will be denoted by is a finite family of equivalence relations then (U E) is called rough data model; it was called knowledge base in [10] [5] [6] [4]. We will not use it here since it confuses with standard usage. The notion of knowledge representation is essentially naming the granular data model that is assign meaningful names to the binary relations (attributes) and their binary neighborhoods (attribute values) [?]. Smith Jones Blake Clark Adams Peterson Ewing Johnson Pike Meyers We will illustrate the idea by examples. In the case of rough data model Table 6 its representation is an ordinary relation Table 2. If there are additional semantics conflict of interests among agents which is represented in the second column of Table??; it induces the equivalence relation in third column. Equiv. Elementary Attribute Class Granule value encoded label meaningful name S# TEN TWENTY THIRTY FORTY EIGHTY NINTY 1. In rough data model the universe is partitioned into equivalence classes. So we consider the following composition: where [p] is equivalence class. 2. In granular data model which maps each object to its unique binary neighborhood and then to a meaningful name. 5.2 Data mining = Granular Computing Granular data model uses granules as its attribute values so any logical formula is translated to set theoretical formula of granules. However we should note that attribute values are semantically related so elementary granules of a column (a binary relation) may overlap. So in processing any logical formula based on attribute values it is important that one checks the continuity 3

4 TEN TWENTY THIRTY FORTY EIGHTY NINTY Binary neighborhood Center Binary meaningful neighborhood name S# (namely see if it respects the semantics). Such checking is implicitly included in the computing of granules. We collect some generalized standard patterns: [2]. Let and be two attributes of a relation-withadditional-semantics. Let be two values of and respectively. Let be the respective elementary granules. It is clear that = NAME( ) and = NAME( ). Let Card( )be the cardinal number of a set. 1. Continuous decision rule: A formula is a continuous decision rule if continuously. Binary neighborhood on V Center (induced partition) 2. Continuous universal decision rule: A formula is a continuous universal decision rule (extensional function dependence) iff such that 3. Robust continuous decision rule : A formula is a robust continuous decision rule if and Card threshhold. 4. Soft continuous decision rule [8]: A formula is a soft continuous decision rule (strong rule) if is softly included in. 5. Continuous association rule: A pair is an association rules if Card ( threshhold. We will illustrate the continuous decision rules only; we skip the rest. is a continuous decision rule if an attribute value in NEIGH( ) appears in a tuple it must imply that an attribute value in NEIGH( ) also appears. So to check 4

5 If then. One needs to scan through the two columns in Table?? and check if ( = NEIGH( )) is continuously associated with NEIGH( ). In machine oriented model the same fact can be checked by the inclusion of two elementary granules namely 5.3 Some performance data = We collect some results on the performance of finding association rules. The relation consists of 128K rows = Columns the support requires 8192 and memory is 10 megabytes; see Tabele 7 [3]. The program for Apriori AporiTid and AprioriHybrid are our honest implementations of the algorithms in [?] [1]. In the implementation we use some buffer scheme to speedup read/write for all algorithms. 6 Conclusions In this conclusion we will reflect on our over all approach. In several of our papers we have literally taken Zadeh s intuitive description of clumps as a formal mathematical notion of granulation. It is essentially a mild generalization of binary relations and neighborhood systems in (pre-)topological spaces [12 9? 5]. By giving a meaningful name to each granule we have a representation theory. It extends the classical relational model based on Cantor sets to real world data model based on real world set theory (neighborhood system space). It is worthy to note here that in crisp world the representation is locally multi-valued in fuzzy world we can use weights to combine these names linearly (a weighted average) and form formal words; this tune it into a single-valued representation namely a formal word table; see Section 3.2. Using Zadeh s terminology such formal word representations are veristic constraints [?]. A formal word table is a generalization of information table. So by employing table processing techniques of rough set methodology to formal word tables we expect some useful applications to fuzzy logic control. Our study seems saying that granular computing is a reasonable notion. At this point its essential ingredients are (1) a representation theory of granular structure which will be useful in data mining (2) a formal = = word representation of input/output spaces and potentially useful to fuzzy logic control. In the over simplified terms the two applications are computing with words. Final we would like to say few words on the computational performance in classical data mining granular computing is faster than Apriori [3] because the database scan are replaced by bit operations. In this paper we extend the use of granular computing to semantically richer models. Such extra semantics can be used to analyze unexpected rules [11]. Granular computing is fast; it seems a promising approach to data mining. References [1] Agrawal R. R. Srikant Fast Algorithms for Mining Association Rules in Proceeding of 20th VLDB Conference San Tiago Chile [2] T. Y. Lin Data Mining and Machine Oriented Modeling: A Granular Computing Approach Journal of Applied Intelligence Kluwer Vol. 13No 2 September/October2000 pp [3] Eric Louie and T.Y. Lin Finding Association Rules using Fast Bit Computation: Machine- Oriented Modeling. In: Proceeding of 12th International Symposium ISMIS2000 Charlotte North Carolina Oct Lecture Notes in AI [4] T. Y. Lin Granular Computing: Fuzzy Logic and Rough Sets. In: Computing with words in information/intelligent systems L.A. Zadeh and J. Kacprzyk (eds) Springer-Verlag [5] T. Y. Lin Granular Computing on Binary Relations I: Data Mining and Neighborhood Systems. In: Rough Sets In Knowledge Discovery A. Skoworn and L. Polkowski (eds) Springer- Verlag [6] T. Y. Lin Granular Computing on Binary Relations II: Rough Set Representations and Belief Functions. In: Rough Sets In Knowledge Discovery A. Skoworn and L. Polkowski (eds) Springer- Verlag [7] T. Y Lin A Set Theory for Soft Computing. In: Proceedings of 1996 IEEE International Conference on Fuzzy Systems New Orleans Louisiana September [8] T. Y. Lin and Y.Y. Yao Mining Soft Rules Using Rough Sets and Neighborhoods. In: Symposium on Modeling Analysis and Simulation IMACS 5

6 Length of # of Association Granule(Full Granule Apriori Apriori Apriori combination Candidates rules Computation Partial Hybrid 199 Tid Multiconference (Computational Engineering in Systems Applications) Lille France July Vol. 2 of [9] T. Y. Lin Neighborhood Systems and Relational Database. In: Proceedings of 1988 ACM Sixteen Annual Computer Science Conference February [10] Z. Pawlak Rough sets. Theoretical Aspects of Reasoning about Data Kluwer Academic Publishers 1991 [11] Balaji Padmanabhan and Alexander Tuzhilin Finding Unexpected Patterns in Data. In: Data Mining and Granular Computing T. Y. Lin Y.Y. Yao and L. Zadeh (eds) Physica-Verlag to appear. [12] W. Sierpenski and C. Krieger General Topology University of Torranto Press [13] Lotfi Zadeh The Key Roles of Information Granulation and Fuzzy logic in Human Reasoning. In: 1996 IEEE International Conference on Fuzzy Systems September [14] W. Ziarko Variable Precision Rough Set Model. Journal of Computer and Systems Science Vol 46No1 February Academic Press 1993 pp [15] H. Zimmerman Fuzzy Set Theory and its Applications Second Ed. Kluwer Acdamic Publisher

Association Rules with Additional Semantics Modeled by Binary Relations

Association Rules with Additional Semantics Modeled by Binary Relations Association Rules with Additional Semantics Modeled by Binary Relations T. Y. Lin 1 and Eric Louie 2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192-0103

More information

Qualitative Fuzzy Sets and Granularity

Qualitative Fuzzy Sets and Granularity Qualitative Fuzzy Sets and Granularity T. Y. Lin Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192-0103 E-mail: tylin@cs.sjsu.edu and Shusaku Tsumoto

More information

Semantics Oriented Association Rules

Semantics Oriented Association Rules Semantics Oriented Association Rules Eric Louie BM Almaden Research Center 650 Harry Road, San Jose, CA 95 120 ewlouie@almaden.ibm.com Abstract - t is well known that relational theory carries very little

More information

Rough Sets, Neighborhood Systems, and Granular Computing

Rough Sets, Neighborhood Systems, and Granular Computing Rough Sets, Neighborhood Systems, and Granular Computing Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract Granulation

More information

A Granular Computing Approach. T.Y. Lin 1;2. Abstract. From the processing point of view, data mining is machine

A Granular Computing Approach. T.Y. Lin 1;2. Abstract. From the processing point of view, data mining is machine Data Mining and Machine Oriented Modeling: A Granular Computing Approach T.Y. Lin 1;2 1 Department of Mathematics and Computer Science San Jose State University, San Jose, California 95192 tylin@cs.sjsu.edu

More information

A Generalized Decision Logic Language for Granular Computing

A Generalized Decision Logic Language for Granular Computing A Generalized Decision Logic Language for Granular Computing Y.Y. Yao Department of Computer Science, University of Regina, Regina Saskatchewan, Canada S4S 0A2, E-mail: yyao@cs.uregina.ca Churn-Jung Liau

More information

On Generalizing Rough Set Theory

On Generalizing Rough Set Theory On Generalizing Rough Set Theory Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract. This paper summarizes various formulations

More information

Mathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities

Mathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities Mathematical Foundation of Association Rules - Mining Associations by Solving Integral Linear Inequalities Tsau Young ( T. Y. ) Lin Department of Computer Science San Jose State University San Jose, CA

More information

Approximation Theories: Granular Computing vs Rough Sets

Approximation Theories: Granular Computing vs Rough Sets Approximation Theories: Granular Computing vs Rough Sets Tsau Young ( T. Y. ) Lin Department of Computer Science, San Jose State University San Jose, CA 95192-0249 tylin@cs.sjsu.edu Abstract. The goal

More information

Granular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions

Granular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions Granular Computing based on Rough Sets, Quotient Space Theory, and Belief Functions Yiyu (Y.Y.) Yao 1, Churn-Jung Liau 2, Ning Zhong 3 1 Department of Computer Science, University of Regina Regina, Saskatchewan,

More information

Granular Computing on Binary Relations In Data Mining and Neighborhood Systems

Granular Computing on Binary Relations In Data Mining and Neighborhood Systems Granular Computing on Binary Relations In Data Mining and Neighborhood Systems T. Y. Lin Department of Mathematics and Computer Science San Jose State University San Jose, California 95192-0103 And Department

More information

A Set Theory For Soft Computing A Unified View of Fuzzy Sets via Neighbrohoods

A Set Theory For Soft Computing A Unified View of Fuzzy Sets via Neighbrohoods A Set Theory For Soft Computing A Unified View of Fuzzy Sets via Neighbrohoods T. Y. Lin Department of Mathematics and Computer Science, San Jose State University, San Jose, California 95192-0103, and

More information

A Logic Language of Granular Computing

A Logic Language of Granular Computing A Logic Language of Granular Computing Yiyu Yao and Bing Zhou Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {yyao, zhou200b}@cs.uregina.ca Abstract Granular

More information

Mining High Order Decision Rules

Mining High Order Decision Rules Mining High Order Decision Rules Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 e-mail: yyao@cs.uregina.ca Abstract. We introduce the notion of high

More information

Sets with Partial Memberships A Rough Set View of Fuzzy Sets

Sets with Partial Memberships A Rough Set View of Fuzzy Sets Sets with Partial Memberships A Rough Set View of Fuzzy Sets T. Y. Lin Department of Mathematics and Computer Science San Jose State University, San Jose, California 9592-3 E-mail: tylin @ cs.sj st.l.edu

More information

Rough Set Approaches to Rule Induction from Incomplete Data

Rough Set Approaches to Rule Induction from Incomplete Data Proceedings of the IPMU'2004, the 10th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Perugia, Italy, July 4 9, 2004, vol. 2, 923 930 Rough

More information

Formal Concept Analysis and Hierarchical Classes Analysis

Formal Concept Analysis and Hierarchical Classes Analysis Formal Concept Analysis and Hierarchical Classes Analysis Yaohua Chen, Yiyu Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {chen115y, yyao}@cs.uregina.ca

More information

Generalized Infinitive Rough Sets Based on Reflexive Relations

Generalized Infinitive Rough Sets Based on Reflexive Relations 2012 IEEE International Conference on Granular Computing Generalized Infinitive Rough Sets Based on Reflexive Relations Yu-Ru Syau Department of Information Management National Formosa University Huwei

More information

Granular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata

Granular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata Granular Computing: A Paradigm in Information Processing Saroj K. Meher Center for Soft Computing Research Indian Statistical Institute, Kolkata Granular computing (GrC): Outline Introduction Definitions

More information

Granular Computing. Y. Y. Yao

Granular Computing. Y. Y. Yao Granular Computing Y. Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca, http://www.cs.uregina.ca/~yyao Abstract The basic ideas

More information

Information Granulation and Approximation in a Decision-theoretic Model of Rough Sets

Information Granulation and Approximation in a Decision-theoretic Model of Rough Sets Information Granulation and Approximation in a Decision-theoretic Model of Rough Sets Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S 0A2 E-mail: yyao@cs.uregina.ca

More information

Value Added Association Rules

Value Added Association Rules Value Added Association Rules T.Y. Lin San Jose State University drlin@sjsu.edu Glossary Association Rule Mining A Association Rule Mining is an exploratory learning task to discover some hidden, dependency

More information

Attribute (Feature) Completion The Theory of Attributes from Data Mining Prospect

Attribute (Feature) Completion The Theory of Attributes from Data Mining Prospect Attribute (Feature) Completion The Theory of Attributes from Data Mining Prospect Tsay Young ( T. Y. ) Lin Department of Computer Science San Jose State University San Jose, CA 95192, USA tylin@cs.sjsu.edu

More information

A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values

A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values A Comparison of Global and Local Probabilistic Approximations in Mining Data with Many Missing Attribute Values Patrick G. Clark Department of Electrical Eng. and Computer Sci. University of Kansas Lawrence,

More information

Granular Computing: Examples, Intuitions and Modeling

Granular Computing: Examples, Intuitions and Modeling Granular Computing: Examples, Intuitions and Modeling Tsau Young (T. Y.) Lin, Member; IEEE, Abstract- The notion of granular computing is examined. Obvious examples, such as fuzzy numbers, infinitesimal

More information

Rough Approximations under Level Fuzzy Sets

Rough Approximations under Level Fuzzy Sets Rough Approximations under Level Fuzzy Sets W.-N. Liu J.T. Yao Y.Y.Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: [liuwe200, jtyao, yyao]@cs.uregina.ca

More information

XI International PhD Workshop OWD 2009, October Fuzzy Sets as Metasets

XI International PhD Workshop OWD 2009, October Fuzzy Sets as Metasets XI International PhD Workshop OWD 2009, 17 20 October 2009 Fuzzy Sets as Metasets Bartłomiej Starosta, Polsko-Japońska WyŜsza Szkoła Technik Komputerowych (24.01.2008, prof. Witold Kosiński, Polsko-Japońska

More information

Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems

Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems Applying Fuzzy Sets and Rough Sets as Metric for Vagueness and Uncertainty in Information Retrieval Systems Nancy Mehta,Neera Bawa Lect. In CSE, JCDV college of Engineering. (mehta_nancy@rediffmail.com,

More information

Using level-2 fuzzy sets to combine uncertainty and imprecision in fuzzy regions

Using level-2 fuzzy sets to combine uncertainty and imprecision in fuzzy regions Using level-2 fuzzy sets to combine uncertainty and imprecision in fuzzy regions Verstraete Jörg Abstract In many applications, spatial data need to be considered but are prone to uncertainty or imprecision.

More information

Semantics of Fuzzy Sets in Rough Set Theory

Semantics of Fuzzy Sets in Rough Set Theory Semantics of Fuzzy Sets in Rough Set Theory Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan Canada S4S 0A2 E-mail: yyao@cs.uregina.ca URL: http://www.cs.uregina.ca/ yyao

More information

COMBINATION OF ROUGH AND FUZZY SETS

COMBINATION OF ROUGH AND FUZZY SETS 1 COMBINATION OF ROUGH AND FUZZY SETS BASED ON α-level SETS Y.Y. Yao Department of Computer Science, Lakehead University Thunder Bay, Ontario, Canada P7B 5E1 E-mail: yyao@flash.lakeheadu.ca 1 ABSTRACT

More information

Knowledge Engineering in Search Engines

Knowledge Engineering in Search Engines San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Spring 2012 Knowledge Engineering in Search Engines Yun-Chieh Lin Follow this and additional works at:

More information

Efficient SQL-Querying Method for Data Mining in Large Data Bases

Efficient SQL-Querying Method for Data Mining in Large Data Bases Efficient SQL-Querying Method for Data Mining in Large Data Bases Nguyen Hung Son Institute of Mathematics Warsaw University Banacha 2, 02095, Warsaw, Poland Abstract Data mining can be understood as a

More information

A Graded Meaning of Formulas in Approximation Spaces

A Graded Meaning of Formulas in Approximation Spaces Fundamenta Informaticae 60 (2004) 159 172 159 IOS Press A Graded Meaning of Formulas in Approximation Spaces Anna Gomolińska Department of Mathematics University of Białystok ul. Akademicka 2, 15-267 Białystok,

More information

Molodtsov's Soft Set Theory and its Applications in Decision Making

Molodtsov's Soft Set Theory and its Applications in Decision Making International Journal of Engineering Science Invention ISSN (Online): 239 6734, ISSN (Print): 239 6726 Volume 6 Issue 2 February 27 PP. 86-9 Molodtsov's Soft Set Theory and its Applications in Decision

More information

A Rough Set Approach to Data with Missing Attribute Values

A Rough Set Approach to Data with Missing Attribute Values A Rough Set Approach to Data with Missing Attribute Values Jerzy W. Grzymala-Busse Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045, USA and Institute

More information

Induction of Strong Feature Subsets

Induction of Strong Feature Subsets Induction of Strong Feature Subsets Mohamed Quafafou and Moussa Boussouf IRIN, University of Nantes, 2 rue de la Houssiniere, BP 92208-44322, Nantes Cedex 03, France. quafafou9 Abstract The problem of

More information

Granular Computing II:

Granular Computing II: Granular Computing II: Infrastructures for AI-Engineering Tsau Young (T. Y.) Lin, Member, IEEE, Abstract What is granular computing? There are no well accepted formal definitions yet. Informally, any computing

More information

SOME OPERATIONS ON INTUITIONISTIC FUZZY SETS

SOME OPERATIONS ON INTUITIONISTIC FUZZY SETS IJMMS, Vol. 8, No. 1, (June 2012) : 103-107 Serials Publications ISSN: 0973-3329 SOME OPERTIONS ON INTUITIONISTIC FUZZY SETS Hakimuddin Khan bstract In This paper, uthor Discuss about some operations on

More information

Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction

Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction Jerzy W. Grzymala-Busse 1,2 1 Department of Electrical Engineering and Computer Science, University of

More information

EFFICIENT ATTRIBUTE REDUCTION ALGORITHM

EFFICIENT ATTRIBUTE REDUCTION ALGORITHM EFFICIENT ATTRIBUTE REDUCTION ALGORITHM Zhongzhi Shi, Shaohui Liu, Zheng Zheng Institute Of Computing Technology,Chinese Academy of Sciences, Beijing, China Abstract: Key words: Efficiency of algorithms

More information

A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set

A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set A Rough Set Approach for Generation and Validation of Rules for Missing Attribute Values of a Data Set Renu Vashist School of Computer Science and Engineering Shri Mata Vaishno Devi University, Katra,

More information

Granular association rules for multi-valued data

Granular association rules for multi-valued data Granular association rules for multi-valued data Fan Min and William Zhu Lab of Granular Computing, Zhangzhou Normal University, Zhangzhou 363, China. Email: minfanphd@163.com, williamfengzhu@gmail.com

More information

On Reduct Construction Algorithms

On Reduct Construction Algorithms 1 On Reduct Construction Algorithms Yiyu Yao 1, Yan Zhao 1 and Jue Wang 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 {yyao, yanzhao}@cs.uregina.ca 2 Laboratory

More information

A Decision-Theoretic Rough Set Model

A Decision-Theoretic Rough Set Model A Decision-Theoretic Rough Set Model Yiyu Yao and Jingtao Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 {yyao,jtyao}@cs.uregina.ca Special Thanks to Professor

More information

[Ch 6] Set Theory. 1. Basic Concepts and Definitions. 400 lecture note #4. 1) Basics

[Ch 6] Set Theory. 1. Basic Concepts and Definitions. 400 lecture note #4. 1) Basics 400 lecture note #4 [Ch 6] Set Theory 1. Basic Concepts and Definitions 1) Basics Element: ; A is a set consisting of elements x which is in a/another set S such that P(x) is true. Empty set: notated {

More information

Yiyu Yao University of Regina, Regina, Saskatchewan, Canada

Yiyu Yao University of Regina, Regina, Saskatchewan, Canada ROUGH SET APPROXIMATIONS: A CONCEPT ANALYSIS POINT OF VIEW Yiyu Yao University of Regina, Regina, Saskatchewan, Canada Keywords: Concept analysis, data processing and analysis, description language, form

More information

Granular Computing: Models and Applications

Granular Computing: Models and Applications Granular Computing: Models and Applications Jianchao Han, 1, Tsau Young Lin 2, 1 Department of Computer Science, California State University, Dominguez Hills, Carson, CA 90747 2 Department of Computer

More information

ROUGH MEMBERSHIP FUNCTIONS: A TOOL FOR REASONING WITH UNCERTAINTY

ROUGH MEMBERSHIP FUNCTIONS: A TOOL FOR REASONING WITH UNCERTAINTY ALGEBRAIC METHODS IN LOGIC AND IN COMPUTER SCIENCE BANACH CENTER PUBLICATIONS, VOLUME 28 INSTITUTE OF MATHEMATICS POLISH ACADEMY OF SCIENCES WARSZAWA 1993 ROUGH MEMBERSHIP FUNCTIONS: A TOOL FOR REASONING

More information

Data Analysis and Mining in Ordered Information Tables

Data Analysis and Mining in Ordered Information Tables Data Analysis and Mining in Ordered Information Tables Ying Sai, Y.Y. Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Ning Zhong

More information

Approximation of Relations. Andrzej Skowron. Warsaw University. Banacha 2, Warsaw, Poland. Jaroslaw Stepaniuk

Approximation of Relations. Andrzej Skowron. Warsaw University. Banacha 2, Warsaw, Poland.   Jaroslaw Stepaniuk Approximation of Relations Andrzej Skowron Institute of Mathematics Warsaw University Banacha 2, 02-097 Warsaw, Poland e-mail: skowron@mimuw.edu.pl Jaroslaw Stepaniuk Institute of Computer Science Technical

More information

Available online at ScienceDirect. Procedia Computer Science 96 (2016 )

Available online at   ScienceDirect. Procedia Computer Science 96 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 96 (2016 ) 179 186 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems,

More information

ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM

ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM ROUGH SETS THEORY AND UNCERTAINTY INTO INFORMATION SYSTEM Pavel Jirava Institute of System Engineering and Informatics Faculty of Economics and Administration, University of Pardubice Abstract: This article

More information

Collaborative Rough Clustering

Collaborative Rough Clustering Collaborative Rough Clustering Sushmita Mitra, Haider Banka, and Witold Pedrycz Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India {sushmita, hbanka r}@isical.ac.in Dept. of Electrical

More information

FUNDAMENTALS OF FUZZY SETS

FUNDAMENTALS OF FUZZY SETS FUNDAMENTALS OF FUZZY SETS edited by Didier Dubois and Henri Prade IRIT, CNRS & University of Toulouse III Foreword by LotfiA. Zadeh 14 Kluwer Academic Publishers Boston//London/Dordrecht Contents Foreword

More information

NEIGHBORHOOD SYSTEMS: A Qualitative Theory for Fuzzy and Rough Sets

NEIGHBORHOOD SYSTEMS: A Qualitative Theory for Fuzzy and Rough Sets NEIGHBORHOOD SYSTEMS: A Qualitative Theory for Fuzzy and Rough Sets T.Y. Lin tylin@cs.sjsu.edu Berkeley Initiative in Soft Computing, Department of Electrical Engineering and Computer Science, University

More information

A fuzzy soft set theoretic approach to decision making problems

A fuzzy soft set theoretic approach to decision making problems Journal of Computational and Applied Mathematics 203 (2007) 412 418 www.elsevier.com/locate/cam A fuzzy soft set theoretic approach to decision making problems A.R. Roy, P.K. Maji Department of Mathematics,

More information

Interpreting Association Rules in Granular Data Model via Decision Logic

Interpreting Association Rules in Granular Data Model via Decision Logic Interpreting Association Rules in Granular Data Model via Decision Logic Tsau. Young.("T. Y.") Lin Department of Computer Science San Jose State University San Jose, CA 95192-0462 tylin@cs.sjsu.edu Abstruct

More information

REDUNDANCY OF MULTISET TOPOLOGICAL SPACES

REDUNDANCY OF MULTISET TOPOLOGICAL SPACES Iranian Journal of Fuzzy Systems Vol. 14, No. 4, (2017) pp. 163-168 163 REDUNDANCY OF MULTISET TOPOLOGICAL SPACES A. GHAREEB Abstract. In this paper, we show the redundancies of multiset topological spaces.

More information

Generating Topology on Graphs by. Operations on Graphs

Generating Topology on Graphs by. Operations on Graphs Applied Mathematical Sciences, Vol. 9, 2015, no. 57, 2843-2857 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2015.5154 Generating Topology on Graphs by Operations on Graphs M. Shokry Physics

More information

MA651 Topology. Lecture 4. Topological spaces 2

MA651 Topology. Lecture 4. Topological spaces 2 MA651 Topology. Lecture 4. Topological spaces 2 This text is based on the following books: Linear Algebra and Analysis by Marc Zamansky Topology by James Dugundgji Fundamental concepts of topology by Peter

More information

Action Rules. (*Corresponding author)

Action Rules. (*Corresponding author) Action Rules Zbigniew W. Ras* Department of Computer Science University of North Carolina 9201 University City Blvd. Charlotte, NC 28223, USA voice: +1 704-687-4567 fax: +1 704-687-3516 email: ras@uncc.edu

More information

Union and intersection of Level-2 fuzzy regions

Union and intersection of Level-2 fuzzy regions Union and intersection of Level- fuzzy regions Verstraete Jörg Systems esearch Institute, Polish Academy of Sciences ul. Newelska 6; 0-447 Warszawa; Polska Email: jorg.verstraete@ibspan.waw.pl Department

More information

Brian Hamrick. October 26, 2009

Brian Hamrick. October 26, 2009 Efficient Computation of Homology Groups of Simplicial Complexes Embedded in Euclidean Space TJHSST Senior Research Project Computer Systems Lab 2009-2010 Brian Hamrick October 26, 2009 1 Abstract Homology

More information

Bipolar Fuzzy Line Graph of a Bipolar Fuzzy Hypergraph

Bipolar Fuzzy Line Graph of a Bipolar Fuzzy Hypergraph BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 13, No 1 Sofia 2013 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0002 Bipolar Fuzzy Line Graph of a

More information

Assessment of Human Skills Using Trapezoidal Fuzzy Numbers

Assessment of Human Skills Using Trapezoidal Fuzzy Numbers American Journal of Computational and Applied Mathematics 2015, 5(4): 111-116 DOI: 10.5923/j.ajcam.20150504.03 Assessment of Human Skills Using Trapezoidal Fuzzy Numbers Michael Gr. Voskoglou Department

More information

Introduction to Sets and Logic (MATH 1190)

Introduction to Sets and Logic (MATH 1190) Introduction to Sets and Logic () Instructor: Email: shenlili@yorku.ca Department of Mathematics and Statistics York University Dec 4, 2014 Outline 1 2 3 4 Definition A relation R from a set A to a set

More information

Granular Computing: The Concept of Granulation and Its Formal Theory I

Granular Computing: The Concept of Granulation and Its Formal Theory I Granular Computing: The Concept of Granulation and Its Formal Theory I Tsau Young (T. Y.) Lin Department of Computer Science, San Jose State University San Jose, California 95192, USA tylin@cs.sjsu.edu

More information

A mining method for tracking changes in temporal association rules from an encoded database

A mining method for tracking changes in temporal association rules from an encoded database A mining method for tracking changes in temporal association rules from an encoded database Chelliah Balasubramanian *, Karuppaswamy Duraiswamy ** K.S.Rangasamy College of Technology, Tiruchengode, Tamil

More information

Fuzzy Set-Theoretical Approach for Comparing Objects with Fuzzy Attributes

Fuzzy Set-Theoretical Approach for Comparing Objects with Fuzzy Attributes Fuzzy Set-Theoretical Approach for Comparing Objects with Fuzzy Attributes Y. Bashon, D. Neagu, M.J. Ridley Department of Computing University of Bradford Bradford, BD7 DP, UK e-mail: {Y.Bashon, D.Neagu,

More information

A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY

A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY KARL L. STRATOS Abstract. The conventional method of describing a graph as a pair (V, E), where V and E repectively denote the sets of vertices and edges,

More information

Fuzzy Sets and Systems. Lecture 1 (Introduction) Bu- Ali Sina University Computer Engineering Dep. Spring 2010

Fuzzy Sets and Systems. Lecture 1 (Introduction) Bu- Ali Sina University Computer Engineering Dep. Spring 2010 Fuzzy Sets and Systems Lecture 1 (Introduction) Bu- Ali Sina University Computer Engineering Dep. Spring 2010 Fuzzy sets and system Introduction and syllabus References Grading Fuzzy sets and system Syllabus

More information

Attribute Reduction using Forward Selection and Relative Reduct Algorithm

Attribute Reduction using Forward Selection and Relative Reduct Algorithm Attribute Reduction using Forward Selection and Relative Reduct Algorithm P.Kalyani Associate Professor in Computer Science, SNR Sons College, Coimbatore, India. ABSTRACT Attribute reduction of an information

More information

CSC Discrete Math I, Spring Sets

CSC Discrete Math I, Spring Sets CSC 125 - Discrete Math I, Spring 2017 Sets Sets A set is well-defined, unordered collection of objects The objects in a set are called the elements, or members, of the set A set is said to contain its

More information

A Closest Fit Approach to Missing Attribute Values in Preterm Birth Data

A Closest Fit Approach to Missing Attribute Values in Preterm Birth Data A Closest Fit Approach to Missing Attribute Values in Preterm Birth Data Jerzy W. Grzymala-Busse 1, Witold J. Grzymala-Busse 2, and Linda K. Goodwin 3 1 Department of Electrical Engineering and Computer

More information

Mining High Average-Utility Itemsets

Mining High Average-Utility Itemsets Proceedings of the 2009 IEEE International Conference on Systems, Man, and Cybernetics San Antonio, TX, USA - October 2009 Mining High Itemsets Tzung-Pei Hong Dept of Computer Science and Information Engineering

More information

Finite-Resolution Simplicial Complexes

Finite-Resolution Simplicial Complexes 1 Finite-Resolution Simplicial Complexes Werner Hölbling, Werner Kuhn, Andrew U. Frank Department of Geoinformation Technical University Vienna Gusshausstrasse 27-29, A-1040 Vienna (Austria) frank@geoinfo.tuwien.ac.at

More information

Feature Selection Based on Relative Attribute Dependency: An Experimental Study

Feature Selection Based on Relative Attribute Dependency: An Experimental Study Feature Selection Based on Relative Attribute Dependency: An Experimental Study Jianchao Han, Ricardo Sanchez, Xiaohua Hu, T.Y. Lin Department of Computer Science, California State University Dominguez

More information

The Application of K-medoids and PAM to the Clustering of Rules

The Application of K-medoids and PAM to the Clustering of Rules The Application of K-medoids and PAM to the Clustering of Rules A. P. Reynolds, G. Richards, and V. J. Rayward-Smith School of Computing Sciences, University of East Anglia, Norwich Abstract. Earlier research

More information

Formal Model. Figure 1: The target concept T is a subset of the concept S = [0, 1]. The search agent needs to search S for a point in T.

Formal Model. Figure 1: The target concept T is a subset of the concept S = [0, 1]. The search agent needs to search S for a point in T. Although this paper analyzes shaping with respect to its benefits on search problems, the reader should recognize that shaping is often intimately related to reinforcement learning. The objective in reinforcement

More information

Songklanakarin Journal of Science and Technology SJST R1 Ghareeb SPATIAL OBJECT MODELING IN SOFT TOPOLOGY

Songklanakarin Journal of Science and Technology SJST R1 Ghareeb SPATIAL OBJECT MODELING IN SOFT TOPOLOGY Songklanakarin Journal of Science and Technology SJST-0-00.R Ghareeb SPATIAL OBJECT MODELING IN SOFT TOPOLOGY Journal: Songklanakarin Journal of Science and Technology Manuscript ID: SJST-0-00.R Manuscript

More information

Introducing fuzzy quantification in OWL 2 ontologies

Introducing fuzzy quantification in OWL 2 ontologies Introducing fuzzy quantification in OWL 2 ontologies Francesca Alessandra Lisi and Corrado Mencar Dipartimento di Informatica, Centro Interdipartimentale di Logica e Applicazioni Università degli Studi

More information

Local and Global Approximations for Incomplete Data

Local and Global Approximations for Incomplete Data Local and Global Approximations for Incomplete Data Jerzy W. Grzyma la-busse 1,2 and Wojciech Rz asa 3 1 Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045,

More information

Minimal Test Cost Feature Selection with Positive Region Constraint

Minimal Test Cost Feature Selection with Positive Region Constraint Minimal Test Cost Feature Selection with Positive Region Constraint Jiabin Liu 1,2,FanMin 2,, Shujiao Liao 2, and William Zhu 2 1 Department of Computer Science, Sichuan University for Nationalities, Kangding

More information

A New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering

A New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering A New Method For Forecasting Enrolments Combining Time-Variant Fuzzy Logical Relationship Groups And K-Means Clustering Nghiem Van Tinh 1, Vu Viet Vu 1, Tran Thi Ngoc Linh 1 1 Thai Nguyen University of

More information

Disjunctive and Conjunctive Normal Forms in Fuzzy Logic

Disjunctive and Conjunctive Normal Forms in Fuzzy Logic Disjunctive and Conjunctive Normal Forms in Fuzzy Logic K. Maes, B. De Baets and J. Fodor 2 Department of Applied Mathematics, Biometrics and Process Control Ghent University, Coupure links 653, B-9 Gent,

More information

Modeling with Uncertainty Interval Computations Using Fuzzy Sets

Modeling with Uncertainty Interval Computations Using Fuzzy Sets Modeling with Uncertainty Interval Computations Using Fuzzy Sets J. Honda, R. Tankelevich Department of Mathematical and Computer Sciences, Colorado School of Mines, Golden, CO, U.S.A. Abstract A new method

More information

Reichenbach Fuzzy Set of Transitivity

Reichenbach Fuzzy Set of Transitivity Available at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 9, Issue 1 (June 2014), pp. 295-310 Applications and Applied Mathematics: An International Journal (AAM) Reichenbach Fuzzy Set of

More information

The Rough Set View on Bayes Theorem

The Rough Set View on Bayes Theorem The Rough Set View on Bayes Theorem Zdzis law Pawlak University of Information Technology and Management ul. Newelska 6, 01 447 Warsaw, Poland zpw@ii.pw.edu.pl MOTTO: It is a capital mistake to theorise

More information

Sets MAT231. Fall Transition to Higher Mathematics. MAT231 (Transition to Higher Math) Sets Fall / 31

Sets MAT231. Fall Transition to Higher Mathematics. MAT231 (Transition to Higher Math) Sets Fall / 31 Sets MAT231 Transition to Higher Mathematics Fall 2014 MAT231 (Transition to Higher Math) Sets Fall 2014 1 / 31 Outline 1 Sets Introduction Cartesian Products Subsets Power Sets Union, Intersection, Difference

More information

The Rough Set Database System: An Overview

The Rough Set Database System: An Overview The Rough Set Database System: An Overview Zbigniew Suraj 1,2 and Piotr Grochowalski 2 1 Chair of Computer Science Foundations University of Information Technology and Management, Rzeszow, Poland zsuraj@wenus.wsiz.rzeszow.pl

More information

DISCRETE DOMAIN REPRESENTATION FOR SHAPE CONCEPTUALIZATION

DISCRETE DOMAIN REPRESENTATION FOR SHAPE CONCEPTUALIZATION DISCRETE DOMAIN REPRESENTATION FOR SHAPE CONCEPTUALIZATION Zoltán Rusák, Imre Horváth, György Kuczogi, Joris S.M. Vergeest, Johan Jansson Department of Design Engineering Delft University of Technology

More information

Classification with Diffuse or Incomplete Information

Classification with Diffuse or Incomplete Information Classification with Diffuse or Incomplete Information AMAURY CABALLERO, KANG YEN Florida International University Abstract. In many different fields like finance, business, pattern recognition, communication

More information

From Topology to Anti-reflexive Topology

From Topology to Anti-reflexive Topology From Topology to Anti-reflexive Topology Tsau Young ( T. Y. ) Lin, Guilong Liu, Mihir K. Chakraborty and Dominik Ślȩzak Department of Computer Science, San Jose State University San Jose, CA 95192-0249,

More information

An Architecture Model of Distributed Simulation System Based on Quotient Space

An Architecture Model of Distributed Simulation System Based on Quotient Space Appl. Math. Inf. Sci. 6 No. S pp. 603S-609S (01) Applied Mathematics & Information Sciences An International Journal @ 01 NSP Natural Sciences Publishing Cor. An Architecture Model of Distributed Simulation

More information

This is an author produced version of Ordered Information Systems and Graph Granulation.

This is an author produced version of Ordered Information Systems and Graph Granulation. This is an author produced version of Ordered Information Systems and Graph Granulation. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/103999/ Proceedings Paper: Stell,

More information

Discrete Mathematics Lecture 4. Harper Langston New York University

Discrete Mathematics Lecture 4. Harper Langston New York University Discrete Mathematics Lecture 4 Harper Langston New York University Sequences Sequence is a set of (usually infinite number of) ordered elements: a 1, a 2,, a n, Each individual element a k is called a

More information

A study on lower interval probability function based decision theoretic rough set models

A study on lower interval probability function based decision theoretic rough set models Annals of Fuzzy Mathematics and Informatics Volume 12, No. 3, (September 2016), pp. 373 386 ISSN: 2093 9310 (print version) ISSN: 2287 6235 (electronic version) http://www.afmi.or.kr @FMI c Kyung Moon

More information

Rough Connected Topologized. Approximation Spaces

Rough Connected Topologized. Approximation Spaces International Journal o Mathematical Analysis Vol. 8 04 no. 53 69-68 HIARI Ltd www.m-hikari.com http://dx.doi.org/0.988/ijma.04.4038 Rough Connected Topologized Approximation Spaces M. J. Iqelan Department

More information

A database can be modeled as: + a collection of entities, + a set of relationships among entities.

A database can be modeled as: + a collection of entities, + a set of relationships among entities. The Relational Model Lecture 2 The Entity-Relationship Model and its Translation to the Relational Model Entity-Relationship (ER) Model + Entity Sets + Relationship Sets + Database Design Issues + Mapping

More information