Cooperative Information Retrieval enhanced. by Formal Concept Analysis

Size: px
Start display at page:

Download "Cooperative Information Retrieval enhanced. by Formal Concept Analysis"

Transcription

1 Cooperative Information Retrieval enhanced by Formal Concept Analysis IBTISSEM NAFKHA 1, ALI JAOUA 2 1 Department of Computer Science University of Tunis Campus Universitaire, Le Belvédère, 1060 Tunis. TUNISIA 2 Department of Computer Science University of Qatar Faculty of Science, Doha. QATAR Abstract: - Along with the growth of the World Wide Web, information retrieval systems gain importance since they are often the only way to find the few documents actually relevant to a specific question in the vast quantities of text available. Moreover, with the advent of the Web along with the unprecedented amount of information available in electronic format and its distributed structure, Formal Concept Analysis (FCA) is more useful and practical than ever, because this technology addresses important limitations of the systems that currently support users in their quest for information. Over the last few years, the range of functionality has been expanded to include new tasks such as data reduction and collaborative (or cooperative) information retrieval. In fact, due to the huge quantity of available information and its distributed structure, it is necessary to abstract it and eliminate the redundancy data. In this context, a method for data reduction based on the formal concept analysis is proposed in [16,17]. At the same time, new IR domains have been investigated including different types of information ( messages, web documents,..). Thus, there is nowadays a much better awareness of the strengths and limitations of this technique for organising and searching distributed information. We are interested by searching in distributed information. Key-Words: - Information Retrieval - Formal concept Analysis Cooperation Data reduction Conceptual Information retrieval. 1 Introduction As The quick increase of the accessible data quantity, under electronic form, is changing our world. In fact, the development of the Web allows the users the access, the division and the disclosure of the news of an easier manner and in big quantities. Consequently, this quantity of information does not stop increasing. The nature of this information can be various: literal, picture, video, etc. The classical technologies of data treatment prove themselves insufficient for their exploitation. In fact, the catalogs and the research engines in line (Google, Lycos, etc.) allow carrying out queries by keyword and to refine them to the assistance of Boolean operators. Nevertheless, the task more difficult comes back to the user that must search in this mass of information to select the documents that will be for him the pertinent. The obtained results researches from these engines present limits. In fact, the rediscovered documents can be redundant or incomplete. The arrival of the «World Wide Web» and the big internet usage open again perspectives, such as the integration of the cooperation for diffusion and the information division. The information systems evolve while becoming more and more complexes and equally the quantity

2 of information treated is more and more voluminous. Leaving simple data management systems describing the documents «paper», they became accesses systems to the «electronic» documents stocked in different documents collections. The multiplication of the documents as well as the interconnection of the waiters of documents induced the introduction of the cooperation concept between a systems set. The documents collections multiply themselves and the waiters of documents cooperate. Besides, the development of the web, example the more known collections of divided documents, constitutes equally a non negligible factor of the growth of the volume of available information. Thus, it is more and more difficult for each of knowledge where and how to attain quickly to information more pertinent. The meta-engines are presentation techniques of the results coming from several tools. They operate while questioning for every query that is submitted for them different research tools in order to furnish the more exhaustive answer. For their functioning, some index contained information in different research engines, of others questions them in a manner dynamic simultaneously. The problem is not simple for research every tool has its characteristics. Of more, they generate, on the other hand, too much noise (non pertinent answers), and they do not take advantage of the characteristics furnished by every tool. The treatment of the results coming from different research tools is very variable. Certain meta-engines limit the number of obtained answers of each different questioned tool (the 10 to 20 first answers furnished by each tool). This should allow obtaining the answers more pertinent. Of other meta-engine do not synthesize the results and they post them in independent windows such as the metaengine All4One. Certain meta-engines, after to have recovered the different results, merge them with elimination of duplications or the classes by relevance such as the meta-engine Profusion. The answer times of the engines used researches being able to be very variable, certain meta-engines use an expiration value or time out such as the meta-engine Debriefing. For others, the user has a total check on the length of the research. Our work is very interested in to furnish means that pull the best possible party of the cooperative oriented approach system in the information research while benefiting from the provisions furnish by the formal concept analysis. Indeed, several works [4,5,6,7,8,10,13,14,15,18,19,20] were very interested in the exploitation of the formal concept analysis as research tool and of navigation. These works use the terminologies of the formal concept analysis such as the usage of thesaurus in the research process and of navigation [4,5,6,8,10,13,14,15] or in the expansion of queries [7,9]. Nevertheless, they always treat an alone information collection. The put problem is enough complicated and important for that we could treat it on the different types of information. Thus, we particularly were very interested in the specific problems of the information research in of multiples documents collections. So, this paper is organized as follows. In section 2, we introduce some basic definitions on formal concept analysis. Then in section 3, we present the data reduction. The section 4 is devoted to the presentation of several methods for searching in distributed information. In section 5, we present the complexity of those methods. The section 6 is devoted for the evaluation prototype. 2 Mathematical Foundations Among the mathematical theories found recently with important applications in computer science, lattice theory has a specific place for data organization, information engineering, data mining and for reasoning. It may be considered as the mathematical tool that unifies data and knowledge or information retrieval [1,2,3,4,5,6,7,8,9,10,13,14,15,18,19,20]. In this section, we define formal context, formal concept, Galois connection and the lattice of concepts associated to the formal context. 2.1 Formal Context Definition 1: A formal context is a triple k = <O,P,R>, where O is a finite set of elements called objects, P a finite set of elements called properties and R is a binary relation defined between O and P. The notations (g,m), or R(g,m)=1, means that "formal object g verifies property m in relation R" [12]. Example 1: Let O be a set of some animals and P be a set of the properties such as: a= needs water, b= lives in water, c= lives on land, d= can move around. Let O= {Leech, Bream, Frog, Dog} and P={a, b, c, d}. The following context may be defined by the table 1. a b c d 1 Leech Bream Frog Dog Table 1. An example of a formal context.

3 2.2 Galois Connection Definition 2: Let A Ο and B P two finite sets, R a relation on O x P. For both sets A and B, operators f(a) and h (B) are defined as: f (A) = {m g, g A (g,m) R} h (B) = {g m, m B (g,m) R} Operator f defines the properties shared by all elements of A. Operator h defines objects sharing the same properties included in set B. Operators f and h define a Galois Connection between sets O and P [12]. (x)=closure(a)={x} A, where x A. Example 2: Let R be the following relation, in the table, with 5 objects {O1,O2,O3,O4,O5} and three attributes {A,B,C}. A B C O O O O O Table 2. Example of a relation R Proposition 1: Operators f and h define a Galois connection between O and P, such that if A 1, A 2 are subsets of O, and B 1, B 2 are two subsets of P, then f and h verify the following properties [12]: A 1 A 2 f (A 1 ) f (A 2 ) (1) B 1 B 2 h (B 1 ) h (B 2 ) (2) A 1 h o f (A 1 ) and B 1 f o h (B 1 ) (3) A h (B) B f (A) (4) f = f o h o f and h = h o f o h (5) 2.3 Formal Concept Definition 3: A formal concept of the context <O,P,R> is a pair (A,B), where A Ο, B P, such f (A) = B and h (B) = A. Sets A and B are called respectively the domain (extent) and range (intent) of the formal concept [3,12]. 2.4 Concept Lattice Definition 4: From a formal context <O,P,R>, we can extract all possible concepts. In [3,12], we prove that the set of all concepts may be organized as a lattice, when we define the following partial order relation << between two concepts, (A1,B1) << (A2,B2) (A1 A2) and (B2 B1). The concepts (A1,B1) and (A2,B2) are called nodes in the lattice. The information searching using FCA takes as input 2.5 Equivalence between an object and a subset of other objects Definition 5: Let x O be an object, A Ο be a finite set and R a relation on O P. We define closure(x) = g(f(x)) and closure(a) = g(f(a)) [16,17]. Definition 6: Let x O be an object, A Ο be a finite set and R be a relation on O P. We say that an object x is equivalent to the objects A, relatively to a relation R, if and only if, {x} A is a domain of a concept of R, and that the closure O5 is equivalent to {O1,O2,O4}, the reason is that the concept containing O5 is CP={O1,O2,O4,O5} {C} ; and inversely the concept containing {O1,O2,O4} is also CP. 3 Current Search Using FCA can complement the existing search systems to address some of their main limitations. Basically, FCA exploits the similarity between documents in order to offer an automatically support structure (i.e., the document lattice) in which to place the information retrieval process. The document lattice can be used to improve basic individual search strategies [1,2,4,5,6,8,10,13,14,15]. Moreover, query refinement is one of the most natural applications of concept lattices. Its main objective is to recover from the null-output or the information overload problem. The concept lattice may be used to make a transformation between the representation of a query and the representation of each document [7,9]. The query is merged into the document lattice and each document is ranked according to the length of the shortest path linking the query to the document concept. In the other hand, on the set of terms describing the document, there exist hierarchies in the form of thesaurus [4,10,13,14]. a query that will be forwarded to a selected search engine [6,7,8]. The first pages retrieved by the search engine in answer to the query are collected and parsed. At this point, a set of index units that describe each returned document is generated; such indices are next used to build the concept lattice corresponding to the retrieved results. The last step consists of showing the lattice to the user and managing the subsequent interaction between the user and the system. In spite of such limitations such as for larger information collection, generally we get a huge number of references, we are interest to build a FCA-based system for distributed information,

4 which may affect both the efficiency and the effectiveness of the overall system. These limitations can be overcome by using data reduction that will be described in the next section. 4 conceptual Reduction Method The fundamental question for data reduction is the following: How can we reduce data without losing any knowledge? The advantage of reduced data is that it can be used directly as a prototype for making decision, for supervised learning or reasoning [16,17]. The proposed algorithm is based on the elimination of any object that may be replaced by a subset of the equivalent objects [17]. Firstly, we use relational join operator in order to obtain a total relation. Secondly, we apply reduction algorithm on the final table to minimize the number of rows. The major problem is that the treatment of an important size databases needs a considerable storage space and makes the reduction process sultry. Our idea is to propose an algorithm which makes simultaneously the join and the reduction of the tables set in only one reduced database. 5 Search enhanced by FCA In this section we will present some search that may be improved through a FCA for distributed information. 5.1 Cooperative conceptual information retrieval The majority of the proposed approaches based on the formal concept analysis construct a documents lattice. These different approaches handle an alone bases documents from which is constructed the documents lattice. Nevertheless, the data distribution on several bases becomes a solution to several problems met. Indeed, our system handles several divided up databases to which ones apply us the foundations of the formal concept analysis for a conceptual information retrieval [18]. As it is showed in the figure 1, the system C2IRS is composed from two phases: - In the first phase, C2IRS uses the different databases for the research process in order to furnish local results. Every local result is a concept that we stock in an Answer vector. If we have different N databases, we obtain thus to the maximum N concepts in the Answer vector. - In the second phase, we are very interested in the final answer formulation composite of two steps. First, we apply algorithm Merge1 that we propose on the Answer vector [18]. Next, we apply algorithm Merge2 on the modified vector in the objective to obtain the final answer. Cooperative IRS Conceptual IRS 1 Concept 1 User Query Fig. 1 C2IRS Architecture. Concept N We were very interested in the retrieval information systems based on the concept formal analysis. Indeed, the concept formal analysis is applied in a retrieval information system handling an alone information database. We presented our system integrating the cooperation between several systems for a research of conceptual information. The research process used by our system, that handles a local databases set, is based on the formal concept analysis. For every database we look for the local concept. We use a cooperative approach to formulate the final answer while basing us on the concepts extract of different local databases. If we merge the local databases in an alone bases on which apply us the conceptual research, we obtain the same result for a same query. The problem that remains put is linked to the magnitude of the treated database volume. In fact, a research in a voluminous database takes more time than a research in a smaller database. Thus, we need to minimize the every database size. This can be assured by the integration of the conceptual data reduction in our retrieval system that will be the object of the next section Cooperative Conceptual Information Retrieval System based on Data Reduction The research in the voluminous databases takes more time than the small databases. This raises consequently a problem to treat. Of more, the treatment of the databases having an important size Final Answer Formulation Final Answer Conceptual IRS N

5 necessitates a considerable storage space. Thus, we need to minimize the every database size. This can be assured by the integration of the conceptual data reduction in our research system. We propose a system for the conceptual cooperative information research based on the conceptual data reduction of the local data C2IRS_DR [19]. To improve upon cooperative conceptual information retrieval system, we propose a system for cooperative conceptual information retrieval basing on data reduction C2IRS_DR. As it is shown in figure 2, our proposed system is composed of two parts: - In the first party, the system that we propose uses a conceptual approach of reduction of the different local databases. Every local database is reduced in the objective to minimize his size and to obtain a local reduced database and an equivalent objects set. - In the second party, we propose a research process. Basing on the local reduced databases set and on the equivalent objects sets, we apply a cooperative approach in order to treat a query. Part 2 User Query Cooperative Conceptual Information Retrieval Final Answer Part 1 DB 1 Conceptual Data Reduction RDB 1 Equivalent objects 1 Fig. 2 Cooperative Conceptual Information retrieval based on data reduction The C2IRS_DR system solicits reduced databases of small size after to have applied the conceptual data reduction on initial databases having an important size. Thus, this system uses a storage space less important that the one used by the research system of cooperative information C2IRS describes in the DB N RDB N Equivalent objects N section 5.1. Of more, the research while using C2IRS_DR takes less than the time than the research while using C2IRS. Nevertheless, the research in several local databases, reduced or non-reduced, necessitates a merger step of the local results. This step has an impact on the answer time. Of more, the reduction of voluminous databases is costly in term times. Our idea is to propose an algorithm allowing reducing a set of local databases in an alone reduced databases on which the research process will be executed. We propose a cooperative data reduction algorithm applied in an information research. We propose a conceptual information retrieval system based on the conceptual cooperative global data reduction CIRS_CDR. Our conceptual cooperative data reduction algorithm is applied on the set of the local databases to have an alone reduced database and an equivalent objects set. We apply the research process conceptual on the reduced basis and we obtain a concept containing the answer to the query that can be enriched while soliciting the equivalent objects set Conceptual information retrieval system based on cooperative conceptual data reduction We propose a conceptual information retrieval system based on the conceptual cooperative global data reduction CIRS_CDR [20]. Our conceptual cooperative data reduction algorithm is applied on the set of the local databases to have an alone reduced database and an equivalent objects set. We apply the conceptual research process on the reduced database and we obtain a concept containing the answer to the query that can be enriched while soliciting the equivalent objects set. We present an approach for information retrieval that is composed of two parts as it is shows in the following figure: - Part 1: From a set of formal contexts representing the databases, we apply the proposed algorithm for cooperative conceptual data reduction in order to obtain the reduced database and the set of equivalent objects. - Part 2: The research is based on this reduced database. To respond to a query that is a set of terms, we apply the Galois connection to retrieve the documents satisfying the query. The final answer can be enriched by the set of equivalent objects.

6 Part 1 DB 1 DB N C T (n,m) = 2nm + n. Space complexity. The system C2IRS uses k matrix with n lines and m columns. So, it uses (k*n*m) memory cells. So, the space complexity of this method is C S = knm. User Query Equivalent objects Cooperative Conceptual Data Reduction Part Conceptual Information Retrieval RDB 6.2 Cooperative conceptual information retrieval information based on Data reduction We suppose that the database has n documents and m keywords and the reduced database has n documents and m keywords. We remind first all steps of this system: - Phase 1: conceptual data reduction applied on several databases. - Phase 2: cooperative conceptual information retrieval applied on the reduced databases. Answer Fig. 3 Architecture of the conceptual information retrieval system based on cooperative conceptual data reduction 6 Complexity Analysis The complexity analysis of a treatment is ensured by calculating: i) The time complexity, which is the number of operations of the treatment, and ii) The space complexity, which is the number of memories cells used by the treatment. We compute the complexity analysis of cooperative conceptual information retrieval system, conceptual information retrieval system based on data reduction and conceptual information retrieval system based on cooperative data reduction. 6.1 Cooperative Conceptual Information Retrieval System We present the complexity of the cooperative conceptual information retrieval system C2IRS. We suppose that the database has n documents and m keywords. We remind first of all the steps of this system: - Phase 1: the search of concepts from different databases. - Phase 2: the formulation of the final answer from the found concepts. Time Complexity. We suppose that each database has n documents and m keywords. The time complexity of the system C2IRS is 2(n*m) + n. So, Time complexity. It is easy to find the complexity C T (n,m) for this system which is equal in: C T (n,m)=c TPhase 1 (n,m)+c TPhase 2 (n,m ). The complexity of phase 1 is O(n*m). The phase 2 realize (2(n *m )+n ) operations [17]. So, the system realizes n*m+[2(n *m )+n ] operations. So, C T (n,m) =nm+2n m +n. Since the size of the reduced database is lesser than that of the initial database (n <<n). So, the complexity of our proposed system is lesser than that of the C2IRS. C T (n,m)=nm+2n m +n <<C T (n,m)=2nm+ n. Space Complexity. Our retrieval system handles a reduced database. So, we can reserve only (k*n *m ) memory cells. So, the space complexity of our system is C S =kn m. We note that our algorithm uses a less number of memory cells that the C2IRS: C S <<C S. This can be explained by the integration of the conceptual data reduction in our proposed system. 6.3 Conceptual information retrieval system based on Cooperative data reduction Time complexity. If we have K databases, each one has n documents and m keywords, we realize N iterations where N = n n. In each loop, we cover the concepts graph G and we do then G operations where G represents the graph cardinality. So, we do G update operations on the graph (adding or modification operation). So, C T O(N* G 2 ). Then, our algorithm have a complexity less then that of the conceptual data reduction considering the

7 cardinality of the graph is too small compared to the number of keywords ( G << M) Space Complexity. Our algorithm treats in each loop one document. So, we can reserve only M memory cases. For the treatment of each line, the algorithm uses a graph that is constructed in incremental manner. This graph necessities 8* G memory cells since for each graph node, we reserve 8 memory cells. So, the space complexity of our algorithm is C S =M+(8* G ). We note that our algorithm uses a less number of memory cells that the conceptual data reduction: C S <<C S. This can be to explain by the fact why the conceptual data reduction treats a matrix of N lines and M columns on the other hand our algorithm treats only one line. 7 Evaluation prototype The experimentations is done on two test collections MED and CRAN. The CRAN collection (Cranfield collection) includes a textual corpus that has a size greater than 1.6Mo. This collection contains 1400 documents and 4612 different and it is tested on 225 queries. The MED collection MED includes a textual corpus that has a size 1.1Mo. It contains 1033 scientific articles extracted from databases in the medicine domain and 5831 terms different and it is tested on 30 queries. The experimentations realised on the CRAN and MED test collections show that the cooperative conceptual reduction applied on the fragmented collection and the conceptual reduction applied on the initial database generates the same reduced database. The trump of the cooperative conceptual reduction is to reduce a set of databases set in order to obtain a reduced database while generating a concepts graph. So, the research process can be executed either on the reduced database or on the obtained graph. Executing the different queries of the two test collections, we obtain the same answers that the other presented systems. Later, CIRS_RCD has the same recall and precision graph that the systems CIRS, C2IRS and C2IRS_DR (see figure 4 and figure 5). This shows that our proposed approaches assure the non-loss information. In fact, our research approaches integrated the data reduction generates the reduced database and the equivalent objects set. The research in the reduced databases can be enriched by the consultation of equivalent objects. The experimentations show that the treatment of 30 queries of the MED collection with the reduced databases also as the equivalents objects set offers the same result that with the initial MED database or that fragmented on portions. 1 0,9 0,8 0,7 0,6 0,5 0,4 0,3 0,2 0, Recall CIRS - C2IRS - C2IRS_DR - CIRS_CDR -IRS_Graph Fig. 4 The Recall and precision graph of the MED fragmented collection for CIRS, CIRS2, C2IRS_DR and CIRS_RCD. 0,9 0,8 0,7 0,6 0,5 0,4 0,3 0,2 0, Recall CIRS - C2IRS - C2IRS_DR - CIRS_CDR - IRS_Graph Fig. 5 The recall and precision graph of the CRAN fragmented collection for CIRS, CIRS2, C2IRS_DR and CIRS_RCD. As we are presented already, the research process can be executed either on the reduced database or on the concepts graph. We remark that research the final answer in the graph is simple and faster than research in the reduced database. Moreover, the system permit to accelerate the answer time par rapport aux three first systems as well as for the MED collection MED as for CRAN collection. (see figure 6 and figure 7)

8 Queries CIRS C2IRS C2IRS_DR CIRS_CDR IRS_Graph Fig. 6 Answer time takes by each system CIRS, CIRS2, C2IRS_DR, CIRS_CDR and IRS_Graph for the MED collection. The systems C2IRS_DR and CIRS_CDR treat the MED collection queries in a reasonable answer time in comparison with that of CIRS. This can be explained by the data reduction integration. In fact, the objective of this later is to reduce the size of a database on that is done the research tout en offer an equivalent object set. The answer time of research executed on the CRAN collection with the five retrieval systems: CIRS, C2IRS, C2IRS_DR and CIRS_CDR (research in the reduced database and in the graph) is presented in the figure 7. We consider only the 30 first queries of a total of 250 queries, because the MED collection treats 30 queries Queries CIRS C2IRS C2IRS_DR CIRS_CDR IRS_Graph Fig. 7 Answer time takes by each system CIRS, C2IRS, C2IRS_DR, CIRS_CDR and IRS_Graph for the CRAN collection. 8 Conclusion In this paper, we have presented three conceptual information retrieval systems. The cooperative conceptual information retrieval system combines the results of several information retrieval systems. Each one has its local formal context representing a database on which we apply a conceptual search to obtain a first set of concepts. From this set of concepts, we formulate the final answer by applying a proposed algorithm merging and combining concepts with a suitable way. The volume of the available information is increasing exponentially. So, generally we get a huge number of references. It is necessary to abstract it and eliminate the redundancy data. However, our main idea is to integrate the data reduction in the search process. The information retrieval is based on the data reduction. From a set of formal contexts presenting a databases set, we apply the relational join operator in order to fusion the tables. Basing on the join resulting table, we apply the data reduction algorithm to obtain the reduced table on which the search process is executed. The major problem is that the treatment of an important size databases needs a considerable storage space and makes the reduction and the research process sultry. We are presented an information retrieval based on a cooperative conceptual data reduction. We have developed a fast algorithm that combines the join and the reduction steps for minimizing formal contexts and we have applied it in an information retrieval system. From a set of databases, we obtain a reduced database on which we apply a conceptual research for responding to a query. This application was experimented on many databases and the obtained results are promising. This study has several perspectives for real and fuzzy data. References: [1] Aboud M., Chrisment C., Razouk R., Florence S., Soulé-Dupuy, Query a Hypertext Information Retrieval System by use of Classification, Information Processing and Management, 29(3), (1993) [2] Amati G., Carpineto C., and Romano G., FUB at TREC-10 Web Track: A Probabilistic Framework for Topic Relevance Term Weighting. In Proceedings of the 10 th Text REtrieval Conference (TREC-10), NIST Special Publication , Gaithersburg, MD, USA (2001) [3] Bordat J.P., Calcul pratique du treillis de Galois d'une correspondance, Math. Sci. Hum., 96, (1986) [4] Carpineto C. and Romano G., Using Concept Lattices for Text Retrieval and Mining. In the first International Conference on Formal Concept Analysis, Darmstadt, Germany, (2003).

9 [5] Carpineto C. and Romano G., Information retrieval through hybrid navigation of lattice representations, International Journal of Human-Computer Studies, 45(5), (1996) [6] Carpineto C. and Romano G., A lattice conceptual clustering system and its application to browsing retrieval, Machine Learning, 24(2), (1996) [7] Carpineto C. and Romano G., Effective reformulation of Boolean queries with concept lattices, In Proceedings of the 3rd International Conference on Flexible Query-Answering Systems, pages 83-94, Roskilde, Denmark, [8] Cole R. and Eklund P., Browsing semistructured web texts using formal concept analysis, In Proceedings of the 9th International Conference on Conceptual Structures, Stanford, CA, USA, (2001) [9] Efthimiadis E., Query expansion, In M. E. Williams, editor, Annual Review of Information Systems and Technology, v31, American Society for Information Science, Silver Spring, Maryland, USA, (1996) [10] Ferrfie S. and Ridoux O., A file system based on concept analysis, In Proceedings of the 1st International Conference on Computational Logic, London, UK, (2000) [11] Gammoudi M. M., and Nafkha I., A formal method for inheritance graph hierarchy construction, Information Sciences 140(3-4), (2002) [12] Ganter B. and Wille R., Formal Concept Analysis, Mathematical Foundations, Springer, [13] Godin R. and Mili. H., Building and Maintaining Analysis Level Class Hierarchies Using Galois Lattices, In Proceedings of the 8th Annual Conference on Object Oriented Programming Systems Languages and Applications, Washington, D.C., USA, (1993) [14] Godin R., Missaoui R., and April A., Experimental comparison of navigation in a Galois lattice with conventional information retrieval methods, International Journal of Man-Machine Studies, 38: (1993) [15] Godin R., Saunders E., and Jecsei J., Lattice model of browsable data spaces, Journal of Information Sciences, 40: (1986) [16] Jaoua A., Bsaies Kh., and Consmtini W., May reasoning be reduced to an Information Retrieval problem, Relational Methods in Computer Science, Quebec, Canada, (1999). [17] Jaoua A., Al-Rashdi A., AL-Muraikhi H., Al- Subaiey M., Al-Ghanim N., and Al-Misaifri S., Conceptual Data Reduction, Application for Reasoning and Learning. The 4 th Workshop on Information and Computer Science, KFUPM, Dhahran, Saudi Arabia, (2002). [18] Nafkha I., Elloumi S. and Jaoua A, Conceptual Cooperative Information Retrieval System, In International Arab Conference on Information Technology, Doha December 16-19, Qatar, (2002) [19] Nafkha I., Elloumi S. and Jaoua A., Conceptual Information Retrieval System based on cooperative conceptual data reduction, 1 St International Conference on Information & Communication Technologies: from Theory to Applications, Syria, (2004). [20] Nafkha I., Elloumi S., Jaoua A., Using Concept Formal Analysis for Cooperative Information Retrieval. Concept Lattices and their applications Workshop (CLA 04), VSB-TU Ostrava, September 23th-24th, [21] Salton G., Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison Wesley, 1989.

Using Using Formal Concept Analysis for for Heterogeneous Information Retrieval

Using Using Formal Concept Analysis for for Heterogeneous Information Retrieval Using Using Formal Concept Analysis for for Heterogeneous Information Retrieval Ibtissem Nafkha Ibtissem Nafkha 1 1, and Ali Jaoua and Ali Jaoua 2 2 1 1 University of Tunis, Department of Computer Science,

More information

Improving web search with FCA

Improving web search with FCA Improving web search with FCA Radim BELOHLAVEK Jan OUTRATA Dept. Systems Science and Industrial Engineering Watson School of Engineering and Applied Science Binghamton University SUNY, NY, USA Dept. Computer

More information

Pseudo-conceptual text and web Structuring

Pseudo-conceptual text and web Structuring Pseudo-conceptual text and web Structuring Ali Jaoua Computer Science and Engineering Department College of Engineering, Qatar University jaoua@qu.edu.qa Abstract: Structuring huge documents with a high

More information

The Further Mathematics Support Programme

The Further Mathematics Support Programme Degree Topics in Mathematics Groups A group is a mathematical structure that satisfies certain rules, which are known as axioms. Before we look at the axioms, we will consider some terminology. Elements

More information

Joining Collaborative and Content-based Filtering

Joining Collaborative and Content-based Filtering Joining Collaborative and Content-based Filtering 1 Patrick Baudisch Integrated Publication and Information Systems Institute IPSI German National Research Center for Information Technology GMD 64293 Darmstadt,

More information

Operational Specification for FCA using Z

Operational Specification for FCA using Z Operational Specification for FCA using Z Simon Andrews and Simon Polovina Faculty of Arts, Computing, Engineering and Sciences Sheffield Hallam University, Sheffield, UK {s.andrews, s.polovina}@shu.ac.uk

More information

Intelligent flexible query answering Using Fuzzy Ontologies

Intelligent flexible query answering Using Fuzzy Ontologies International Conference on Control, Engineering & Information Technology (CEIT 14) Proceedings - Copyright IPCO-2014, pp. 262-277 ISSN 2356-5608 Intelligent flexible query answering Using Fuzzy Ontologies

More information

Enumerating Pseudo-Intents in a Partial Order

Enumerating Pseudo-Intents in a Partial Order Enumerating Pseudo-Intents in a Partial Order Alexandre Bazin and Jean-Gabriel Ganascia Université Pierre et Marie Curie, Laboratoire d Informatique de Paris 6 Paris, France Alexandre.Bazin@lip6.fr Jean-Gabriel@Ganascia.name

More information

Open Research Online The Open University s repository of research publications and other research outputs

Open Research Online The Open University s repository of research publications and other research outputs Open Research Online The Open University s repository of research publications and other research outputs Bottom-Up Ontology Construction with Contento Conference or Workshop Item How to cite: Daga, Enrico;

More information

IMPROVING THE RELEVANCY OF DOCUMENT SEARCH USING THE MULTI-TERM ADJACENCY KEYWORD-ORDER MODEL

IMPROVING THE RELEVANCY OF DOCUMENT SEARCH USING THE MULTI-TERM ADJACENCY KEYWORD-ORDER MODEL IMPROVING THE RELEVANCY OF DOCUMENT SEARCH USING THE MULTI-TERM ADJACENCY KEYWORD-ORDER MODEL Lim Bee Huang 1, Vimala Balakrishnan 2, Ram Gopal Raj 3 1,2 Department of Information System, 3 Department

More information

A NEW PERFORMANCE EVALUATION TECHNIQUE FOR WEB INFORMATION RETRIEVAL SYSTEMS

A NEW PERFORMANCE EVALUATION TECHNIQUE FOR WEB INFORMATION RETRIEVAL SYSTEMS A NEW PERFORMANCE EVALUATION TECHNIQUE FOR WEB INFORMATION RETRIEVAL SYSTEMS Fidel Cacheda, Francisco Puentes, Victor Carneiro Department of Information and Communications Technologies, University of A

More information

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm K.Parimala, Assistant Professor, MCA Department, NMS.S.Vellaichamy Nadar College, Madurai, Dr.V.Palanisamy,

More information

XRay Views: Understanding the Internals of Classes

XRay Views: Understanding the Internals of Classes XRay Views: Understanding the Internals of Classes Gabriela Arévalo, Stéphane Ducasse, Oscar Nierstrasz Software Composition Group University of Bern (Switzerland) {arevalo, ducasse, oscar}@iam.unibe.ch

More information

Formal Concept Analysis and Hierarchical Classes Analysis

Formal Concept Analysis and Hierarchical Classes Analysis Formal Concept Analysis and Hierarchical Classes Analysis Yaohua Chen, Yiyu Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: {chen115y, yyao}@cs.uregina.ca

More information

Information mining and information retrieval : methods and applications

Information mining and information retrieval : methods and applications Information mining and information retrieval : methods and applications J. Mothe, C. Chrisment Institut de Recherche en Informatique de Toulouse Université Paul Sabatier, 118 Route de Narbonne, 31062 Toulouse

More information

Comparing Performance of Algorithms for Generating Concept Lattices

Comparing Performance of Algorithms for Generating Concept Lattices Comparing Performance of Algorithms for Generating Concept Lattices Sergei O. Kuznetsov 1 and Sergei A. Ob edkov 2 1 All-Russia Institute for Scientific and Technical Information (VINITI), Moscow, Russia

More information

FCA-Map Results for OAEI 2016

FCA-Map Results for OAEI 2016 FCA-Map Results for OAEI 2016 Mengyi Zhao 1 and Songmao Zhang 2 1,2 Institute of Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, P. R. China 1 myzhao@amss.ac.cn,

More information

Triadic Formal Concept Analysis within Multi Agent Systems

Triadic Formal Concept Analysis within Multi Agent Systems Triadic Formal Concept Analysis within Multi Agent Systems Petr Gajdoš, Pavel Děrgel Department of Computer Science, VŠB - Technical University of Ostrava, tř. 17. listopadu 15, 708 33 Ostrava-Poruba Czech

More information

Lattice-Based Information Retrieval Application

Lattice-Based Information Retrieval Application Karen S.K. Cheung a and Douglas Vogel b a Department of Information Systems, City University of Hong Kong 83 Tat Chee Avenue, Kowloon, Hong Kong iskaren@is.cityu.edu.hk b Department of Information Systems,

More information

Digital Archives: Extending the 5S model through NESTOR

Digital Archives: Extending the 5S model through NESTOR Digital Archives: Extending the 5S model through NESTOR Nicola Ferro and Gianmaria Silvello Department of Information Engineering, University of Padua, Italy {ferro, silvello}@dei.unipd.it Abstract. Archives

More information

Distributed minimum spanning tree problem

Distributed minimum spanning tree problem Distributed minimum spanning tree problem Juho-Kustaa Kangas 24th November 2012 Abstract Given a connected weighted undirected graph, the minimum spanning tree problem asks for a spanning subtree with

More information

Improving the Performance of Search Engine With Respect To Content Mining Kr.Jansi, L.Radha

Improving the Performance of Search Engine With Respect To Content Mining Kr.Jansi, L.Radha Improving the Performance of Search Engine With Respect To Content Mining Kr.Jansi, L.Radha 1 Asst. Professor, Srm University, Chennai 2 Mtech, Srm University, Chennai Abstract R- Google is a dedicated

More information

PROOF OF THE COLLATZ CONJECTURE KURMET SULTAN. Almaty, Kazakhstan. ORCID ACKNOWLEDGMENTS

PROOF OF THE COLLATZ CONJECTURE KURMET SULTAN. Almaty, Kazakhstan.   ORCID ACKNOWLEDGMENTS PROOF OF THE COLLATZ CONJECTURE KURMET SULTAN Almaty, Kazakhstan E-mail: kurmet.sultan@gmail.com ORCID 0000-0002-7852-8994 ACKNOWLEDGMENTS 2 ABSTRACT This article contains a proof of the Collatz conjecture.

More information

Exploring the Query Expansion Methods for Concept Based Representation

Exploring the Query Expansion Methods for Concept Based Representation Exploring the Query Expansion Methods for Concept Based Representation Yue Wang and Hui Fang Department of Electrical and Computer Engineering University of Delaware 140 Evans Hall, Newark, Delaware, 19716,

More information

Graph Based Workflow Validation

Graph Based Workflow Validation Graph Based Workflow Validation Anastasios Giouris and Manolis Wallace Department of Computer Science, University of Indianapolis Athens, Ipitou 9, Syntagma, 557, GREECE http://cs.uindy.gr cs@uindy.gr

More information

Improving Adaptive Hypermedia by Adding Semantics

Improving Adaptive Hypermedia by Adding Semantics Improving Adaptive Hypermedia by Adding Semantics Anton ANDREJKO Slovak University of Technology Faculty of Informatics and Information Technologies Ilkovičova 3, 842 16 Bratislava, Slovak republic andrejko@fiit.stuba.sk

More information

Information Retrieval (Part 1)

Information Retrieval (Part 1) Information Retrieval (Part 1) Fabio Aiolli http://www.math.unipd.it/~aiolli Dipartimento di Matematica Università di Padova Anno Accademico 2008/2009 1 Bibliographic References Copies of slides Selected

More information

RAQUEL s Relational Operators

RAQUEL s Relational Operators Contents RAQUEL s Relational Operators Introduction 2 General Principles 2 Operator Parameters 3 Ordinary & High-Level Operators 3 Operator Valency 4 Default Tuples 5 The Relational Algebra Operators in

More information

Improving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking

Improving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking Improving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking Emanuele Di Buccio, Ivano Masiero, and Massimo Melucci Department of Information Engineering, University

More information

ENTITIES IN THE OBJECT-ORIENTED DESIGN PROCESS MODEL

ENTITIES IN THE OBJECT-ORIENTED DESIGN PROCESS MODEL INTERNATIONAL DESIGN CONFERENCE - DESIGN 2000 Dubrovnik, May 23-26, 2000. ENTITIES IN THE OBJECT-ORIENTED DESIGN PROCESS MODEL N. Pavković, D. Marjanović Keywords: object oriented methodology, design process

More information

Integrating SysML and OWL

Integrating SysML and OWL Integrating SysML and OWL Henson Graves Lockheed Martin Aeronautics Company Fort Worth Texas, USA henson.graves@lmco.com Abstract. To use OWL2 for modeling a system design one must be able to construct

More information

Using Formal Concept Analysis for discovering knowledge patterns

Using Formal Concept Analysis for discovering knowledge patterns Using Formal Concept Analysis for discovering knowledge patterns Mohamed Rouane-Hacene 1, Marianne Huchard 2, Amedeo Napoli 3, and Petko Valtchev 1 1 Dépt. informatique, UQAM, C.P. 8888, Succ. CV, Montréal,

More information

In the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google,

In the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google, 1 1.1 Introduction In the recent past, the World Wide Web has been witnessing an explosive growth. All the leading web search engines, namely, Google, Yahoo, Askjeeves, etc. are vying with each other to

More information

Core Membership Computation for Succinct Representations of Coalitional Games

Core Membership Computation for Succinct Representations of Coalitional Games Core Membership Computation for Succinct Representations of Coalitional Games Xi Alice Gao May 11, 2009 Abstract In this paper, I compare and contrast two formal results on the computational complexity

More information

Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data

Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data American Journal of Applied Sciences (): -, ISSN -99 Science Publications Designing and Building an Automatic Information Retrieval System for Handling the Arabic Data Ibrahiem M.M. El Emary and Ja'far

More information

Modeling Systems Using Design Patterns

Modeling Systems Using Design Patterns Modeling Systems Using Design Patterns Jaroslav JAKUBÍK Slovak University of Technology Faculty of Informatics and Information Technologies Ilkovičova 3, 842 16 Bratislava, Slovakia jakubik@fiit.stuba.sk

More information

Information Retrieval and Web Search

Information Retrieval and Web Search Information Retrieval and Web Search Introduction to IR models and methods Rada Mihalcea (Some of the slides in this slide set come from IR courses taught at UT Austin and Stanford) Information Retrieval

More information

Bézier Surface Modeling for Neutrosophic Data Problems

Bézier Surface Modeling for Neutrosophic Data Problems Neutrosophic Sets and Systems, Vol. 19, 2018 19 University of New Mexico Bézier Surface Modeling for Neutrosophic Data Problems Selçuk Topal 1 * and Ferhat Taş 2 1 Department of Mathematics, Faculty of

More information

FCA-based Search for Duplicate objects in Ontologies

FCA-based Search for Duplicate objects in Ontologies FCA-based Search for Duplicate objects in Ontologies Dmitry A. Ilvovsky and Mikhail A. Klimushkin National Research University Higher School of Economics, School of Applied Mathematics and Informational

More information

BayesTH-MCRDR Algorithm for Automatic Classification of Web Document

BayesTH-MCRDR Algorithm for Automatic Classification of Web Document BayesTH-MCRDR Algorithm for Automatic Classification of Web Document Woo-Chul Cho and Debbie Richards Department of Computing, Macquarie University, Sydney, NSW 2109, Australia {wccho, richards}@ics.mq.edu.au

More information

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European

More information

XI International PhD Workshop OWD 2009, October Fuzzy Sets as Metasets

XI International PhD Workshop OWD 2009, October Fuzzy Sets as Metasets XI International PhD Workshop OWD 2009, 17 20 October 2009 Fuzzy Sets as Metasets Bartłomiej Starosta, Polsko-Japońska WyŜsza Szkoła Technik Komputerowych (24.01.2008, prof. Witold Kosiński, Polsko-Japońska

More information

A fast algorithm for building lattices

A fast algorithm for building lattices Information Processing Letters 71 (1999) 199 204 A fast algorithm for building lattices Lhouari Nourine, Olivier Raynaud 1 Département d Informatique Fondamentale, LIRMM, Université Montpellier II, CNRS

More information

A Partial-Closure Canonicity Test to Increase the Efficiency of CbO-Type Algorithms

A Partial-Closure Canonicity Test to Increase the Efficiency of CbO-Type Algorithms A Partial-Closure Canonicity Test to Increase the Efficiency of CbO-Type Algorithms ANDREWS, Simon Available from Sheffield Hallam University Research Archive (SHURA)

More information

A Content Based Image Retrieval System Based on Color Features

A Content Based Image Retrieval System Based on Color Features A Content Based Image Retrieval System Based on Features Irena Valova, University of Rousse Angel Kanchev, Department of Computer Systems and Technologies, Rousse, Bulgaria, Irena@ecs.ru.acad.bg Boris

More information

Word Disambiguation in Web Search

Word Disambiguation in Web Search Word Disambiguation in Web Search Rekha Jain Computer Science, Banasthali University, Rajasthan, India Email: rekha_leo2003@rediffmail.com G.N. Purohit Computer Science, Banasthali University, Rajasthan,

More information

FOGA: A Fuzzy Ontology Generation Framework for Scholarly Semantic Web

FOGA: A Fuzzy Ontology Generation Framework for Scholarly Semantic Web FOGA: A Fuzzy Ontology Generation Framework for Scholarly Semantic Web Thanh Tho Quan 1, Siu Cheung Hui 1 and Tru Hoang Cao 2 1 School of Computer Engineering, Nanyang Technological University, Singapore

More information

Ontology Extraction from Heterogeneous Documents

Ontology Extraction from Heterogeneous Documents Vol.3, Issue.2, March-April. 2013 pp-985-989 ISSN: 2249-6645 Ontology Extraction from Heterogeneous Documents Kirankumar Kataraki, 1 Sumana M 2 1 IV sem M.Tech/ Department of Information Science & Engg

More information

Information Retrieval and Web Search Engines

Information Retrieval and Web Search Engines Information Retrieval and Web Search Engines Lecture 7: Document Clustering May 25, 2011 Wolf-Tilo Balke and Joachim Selke Institut für Informationssysteme Technische Universität Braunschweig Homework

More information

The Information Retrieval Series. Series Editor W. Bruce Croft

The Information Retrieval Series. Series Editor W. Bruce Croft The Information Retrieval Series Series Editor W. Bruce Croft Sándor Dominich The Modern Algebra of Information Retrieval 123 Sándor Dominich Computer Science Department University of Pannonia Egyetem

More information

CS506/606 - Topics in Information Retrieval

CS506/606 - Topics in Information Retrieval CS506/606 - Topics in Information Retrieval Instructors: Class time: Steven Bedrick, Brian Roark, Emily Prud hommeaux Tu/Th 11:00 a.m. - 12:30 p.m. September 25 - December 6, 2012 Class location: WCC 403

More information

A Hybrid Recommender System for Dynamic Web Users

A Hybrid Recommender System for Dynamic Web Users A Hybrid Recommender System for Dynamic Web Users Shiva Nadi Department of Computer Engineering, Islamic Azad University of Najafabad Isfahan, Iran Mohammad Hossein Saraee Department of Electrical and

More information

A model of information searching behaviour to facilitate end-user support in KOS-enhanced systems

A model of information searching behaviour to facilitate end-user support in KOS-enhanced systems A model of information searching behaviour to facilitate end-user support in KOS-enhanced systems Dorothee Blocks Hypermedia Research Unit School of Computing University of Glamorgan, UK NKOS workshop

More information

Information Retrieval Tools for Efficient Data Searching using Big Data

Information Retrieval Tools for Efficient Data Searching using Big Data ISSN: 2393-8528 Contents lists available at www.ijicse.in International Journal of Innovative Computer Science & Engineering Volume 4 Issue 4; July-August-2017; Page No. 06-12 Information Retrieval Tools

More information

On Generalizing Rough Set Theory

On Generalizing Rough Set Theory On Generalizing Rough Set Theory Y.Y. Yao Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca Abstract. This paper summarizes various formulations

More information

ESANN'2001 proceedings - European Symposium on Artificial Neural Networks Bruges (Belgium), April 2001, D-Facto public., ISBN ,

ESANN'2001 proceedings - European Symposium on Artificial Neural Networks Bruges (Belgium), April 2001, D-Facto public., ISBN , An Integrated Neural IR System. Victoria J. Hodge Dept. of Computer Science, University ofyork, UK vicky@cs.york.ac.uk Jim Austin Dept. of Computer Science, University ofyork, UK austin@cs.york.ac.uk Abstract.

More information

VIDEO SEARCHING AND BROWSING USING VIEWFINDER

VIDEO SEARCHING AND BROWSING USING VIEWFINDER VIDEO SEARCHING AND BROWSING USING VIEWFINDER By Dan E. Albertson Dr. Javed Mostafa John Fieber Ph. D. Student Associate Professor Ph. D. Candidate Information Science Information Science Information Science

More information

Conceptual Analysis of Intrusion Alarms

Conceptual Analysis of Intrusion Alarms Conceptual Analysis of Intrusion Alarms Benjamin Morin and Hervé Debar France Télécom R&D, Caen, France {benjamin.morin, herve.debar}@rd.francetelecom.com Abstract. Security information about information

More information

Text Mining. Representation of Text Documents

Text Mining. Representation of Text Documents Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,

More information

INTRODUCTION. Chapter GENERAL

INTRODUCTION. Chapter GENERAL Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which

More information

Spemmet - A Tool for Modeling Software Processes with SPEM

Spemmet - A Tool for Modeling Software Processes with SPEM Spemmet - A Tool for Modeling Software Processes with SPEM Tuomas Mäkilä tuomas.makila@it.utu.fi Antero Järvi antero.jarvi@it.utu.fi Abstract: The software development process has many unique attributes

More information

Using Concept Lattices for Text Retrieval and. Mining. Claudio Carpineto and Giovanni Romano. Fondazione Ugo Bordoni.

Using Concept Lattices for Text Retrieval and. Mining. Claudio Carpineto and Giovanni Romano. Fondazione Ugo Bordoni. Using Concept Lattices for Text Retrieval and Mining Claudio Carpineto and Giovanni Romano Fondazione Ugo Bordoni Via Baldassarre Castiglione 59, 00142, Rome, Italy carpinet,romano@fub.it Abstract. The

More information

A METHOD FOR MINING FUNCTIONAL DEPENDENCIES IN RELATIONAL DATABASE DESIGN USING FCA

A METHOD FOR MINING FUNCTIONAL DEPENDENCIES IN RELATIONAL DATABASE DESIGN USING FCA STUDIA UNIV. BABEŞ BOLYAI, INFORMATICA, Volume LIII, Number 1, 2008 A METHOD FOR MINING FUNCTIONAL DEPENDENCIES IN RELATIONAL DATABASE DESIGN USING FCA KATALIN TUNDE JANOSI RANCZ AND VIORICA VARGA Abstract.

More information

Equivalence Detection Using Parse-tree Normalization for Math Search

Equivalence Detection Using Parse-tree Normalization for Math Search Equivalence Detection Using Parse-tree Normalization for Math Search Mohammed Shatnawi Department of Computer Info. Systems Jordan University of Science and Tech. Jordan-Irbid (22110)-P.O.Box (3030) mshatnawi@just.edu.jo

More information

Information Retrieval. Information Retrieval and Web Search

Information Retrieval. Information Retrieval and Web Search Information Retrieval and Web Search Introduction to IR models and methods Information Retrieval The indexing and retrieval of textual documents. Searching for pages on the World Wide Web is the most recent

More information

Models for Metadata or Metamodels for Data?

Models for Metadata or Metamodels for Data? Published in Proceedings of 2nd IEEE Metadata Conference, 1997 Models for Metadata or Metamodels for Data? Brigitte Kerhervé Université du Québec à Montréal Département Informatique CP 8888, succursale

More information

QUERY EXPANSION USING WORDNET WITH A LOGICAL MODEL OF INFORMATION RETRIEVAL

QUERY EXPANSION USING WORDNET WITH A LOGICAL MODEL OF INFORMATION RETRIEVAL QUERY EXPANSION USING WORDNET WITH A LOGICAL MODEL OF INFORMATION RETRIEVAL David Parapar, Álvaro Barreiro AILab, Department of Computer Science, University of A Coruña, Spain dparapar@udc.es, barreiro@udc.es

More information

I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING. Andrii Donchenko

I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING. Andrii Donchenko International Journal "Information Technologies and Knowledge" Vol.1 / 2007 293 I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING Andrii Donchenko Abstract: This article considers

More information

RETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2

RETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2 Send Orders for Reprints to reprints@benthamscience.ae The Open Automation and Control Systems Journal, 2014, 6, 1907-1911 1907 Web-Based Data Mining in System Design and Implementation Open Access Jianhu

More information

SYSTEMS FOR NON STRUCTURED INFORMATION MANAGEMENT

SYSTEMS FOR NON STRUCTURED INFORMATION MANAGEMENT SYSTEMS FOR NON STRUCTURED INFORMATION MANAGEMENT Prof. Dipartimento di Elettronica e Informazione Politecnico di Milano INFORMATION SEARCH AND RETRIEVAL Inf. retrieval 1 PRESENTATION SCHEMA GOALS AND

More information

Probabilistic Abstraction Lattices: A Computationally Efficient Model for Conditional Probability Estimation

Probabilistic Abstraction Lattices: A Computationally Efficient Model for Conditional Probability Estimation Probabilistic Abstraction Lattices: A Computationally Efficient Model for Conditional Probability Estimation Daniel Lowd January 14, 2004 1 Introduction Probabilistic models have shown increasing popularity

More information

CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL

CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL STUDIA UNIV. BABEŞ BOLYAI, INFORMATICA, Volume LVII, Number 4, 2012 CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL IOAN BADARINZA AND ADRIAN STERCA Abstract. In this paper

More information

A MAS Based ETL Approach for Complex Data

A MAS Based ETL Approach for Complex Data A MAS Based ETL Approach for Complex Data O. Boussaid, F. Bentayeb, J. Darmont Abstract : In a data warehousing process, the phase of data integration is crucial. Many methods for data integration have

More information

Information management - Topic Maps visualization

Information management - Topic Maps visualization Information management - Topic Maps visualization Benedicte Le Grand Laboratoire d Informatique de Paris 6, Universite Pierre et Marie Curie, Paris, France Benedicte.Le-Grand@lip6.fr http://www-rp.lip6.fr/~blegrand

More information

Relevancy Measurement of Retrieved Webpages Using Ruzicka Similarity Measure

Relevancy Measurement of Retrieved Webpages Using Ruzicka Similarity Measure Relevancy Measurement of Retrieved Webpages Using Ruzicka Similarity Measure Manjeet*, Jaswinder Singh** *Master of Technology (Dept. of Computer Science and Engineering) GJUS&T, Hisar, Haryana, India

More information

GRAPHS WITH 1-FACTORS

GRAPHS WITH 1-FACTORS proceedings of the american mathematical society Volume 42, Number 1, January 1974 GRAPHS WITH 1-FACTORS DAVID P. SUMNER Abstract. In this paper it is shown that if G is a connected graph of order 2n (n>

More information

Towards Automatic Recognition of Fonts using Genetic Approach

Towards Automatic Recognition of Fonts using Genetic Approach Towards Automatic Recognition of Fonts using Genetic Approach M. SARFRAZ Department of Information and Computer Science King Fahd University of Petroleum and Minerals KFUPM # 1510, Dhahran 31261, Saudi

More information

Theme Identification in RDF Graphs

Theme Identification in RDF Graphs Theme Identification in RDF Graphs Hanane Ouksili PRiSM, Univ. Versailles St Quentin, UMR CNRS 8144, Versailles France hanane.ouksili@prism.uvsq.fr Abstract. An increasing number of RDF datasets is published

More information

Configuration Management for Component-based Systems

Configuration Management for Component-based Systems Configuration Management for Component-based Systems Magnus Larsson Ivica Crnkovic Development and Research Department of Computer Science ABB Automation Products AB Mälardalen University 721 59 Västerås,

More information

A Constraint Programming Based Approach to Detect Ontology Inconsistencies

A Constraint Programming Based Approach to Detect Ontology Inconsistencies The International Arab Journal of Information Technology, Vol. 8, No. 1, January 2011 1 A Constraint Programming Based Approach to Detect Ontology Inconsistencies Moussa Benaissa and Yahia Lebbah Faculté

More information

Information Management (IM)

Information Management (IM) 1 2 3 4 5 6 7 8 9 Information Management (IM) Information Management (IM) is primarily concerned with the capture, digitization, representation, organization, transformation, and presentation of information;

More information

Retrieval of Web Documents Using a Fuzzy Hierarchical Clustering

Retrieval of Web Documents Using a Fuzzy Hierarchical Clustering International Journal of Computer Applications (97 8887) Volume No., August 2 Retrieval of Documents Using a Fuzzy Hierarchical Clustering Deepti Gupta Lecturer School of Computer Science and Information

More information

Relevance Feedback and Query Reformulation. Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price. Outline

Relevance Feedback and Query Reformulation. Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price. Outline Relevance Feedback and Query Reformulation Lecture 10 CS 510 Information Retrieval on the Internet Thanks to Susan Price IR on the Internet, Spring 2010 1 Outline Query reformulation Sources of relevance

More information

Introduction to Relational Databases. Introduction to Relational Databases cont: Introduction to Relational Databases cont: Relational Data structure

Introduction to Relational Databases. Introduction to Relational Databases cont: Introduction to Relational Databases cont: Relational Data structure Databases databases Terminology of relational model Properties of database relations. Relational Keys. Meaning of entity integrity and referential integrity. Purpose and advantages of views. The relational

More information

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 1 Student, M.E., (Computer science and Engineering) in M.G University, India, 2 Associate Professor

More information

An Integrated Framework to Enhance the Web Content Mining and Knowledge Discovery

An Integrated Framework to Enhance the Web Content Mining and Knowledge Discovery An Integrated Framework to Enhance the Web Content Mining and Knowledge Discovery Simon Pelletier Université de Moncton, Campus of Shippagan, BGI New Brunswick, Canada and Sid-Ahmed Selouani Université

More information

INFORMATION RETRIEVAL SYSTEM USING FUZZY SET THEORY - THE BASIC CONCEPT

INFORMATION RETRIEVAL SYSTEM USING FUZZY SET THEORY - THE BASIC CONCEPT ABSTRACT INFORMATION RETRIEVAL SYSTEM USING FUZZY SET THEORY - THE BASIC CONCEPT BHASKAR KARN Assistant Professor Department of MIS Birla Institute of Technology Mesra, Ranchi The paper presents the basic

More information

CASE BASED REASONING A SHORT OVERVIEW

CASE BASED REASONING A SHORT OVERVIEW CASE BASED REASONING A SHORT OVERVIEW Z. Budimac, V. Kurbalija Institute of Mathematics and Computer Science, Fac. of Science, Univ. of Novi Sad Trg D. Obradovića 4, 21000 Novi Sad, Yugoslavia zjb@im.ns.ac.yu,

More information

SAP. Modeling Guide for PPF

SAP. Modeling Guide for PPF Modeling Guide for PPF Contents 1 Document Organization... 3 1.1 Authors... 3 1.2 Intended Group of Readers... 3 1.3 References... 3 1.4 Glossary... 4 2 Modeling Guidelines - Application Analysis... 6

More information

Graceful Labeling for Some Star Related Graphs

Graceful Labeling for Some Star Related Graphs International Mathematical Forum, Vol. 9, 2014, no. 26, 1289-1293 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/imf.2014.4477 Graceful Labeling for Some Star Related Graphs V. J. Kaneria, M.

More information

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS 1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,

More information

Custom IDF weights for boosting the relevancy of retrieved documents in textual retrieval

Custom IDF weights for boosting the relevancy of retrieved documents in textual retrieval Annals of the University of Craiova, Mathematics and Computer Science Series Volume 44(2), 2017, Pages 238 248 ISSN: 1223-6934 Custom IDF weights for boosting the relevancy of retrieved documents in textual

More information

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Thaer Samar 1, Alejandro Bellogín 2, and Arjen P. de Vries 1 1 Centrum Wiskunde & Informatica, {samar,arjen}@cwi.nl

More information

Software re-use assessment for quality M. Ramachandran School of Computing and Mathematical Sciences, Jo/m Moores C/mrerszZ?/,

Software re-use assessment for quality M. Ramachandran School of Computing and Mathematical Sciences, Jo/m Moores C/mrerszZ?/, Software re-use assessment for quality M. Ramachandran School of Computing and Mathematical Sciences, Jo/m Moores C/mrerszZ?/, ABSTRACT Reuse of software components can improve software quality and productivity

More information

A METHOD TO MODELIZE THE OVERALL STIFFNESS OF A BUILDING IN A STICK MODEL FITTED TO A 3D MODEL

A METHOD TO MODELIZE THE OVERALL STIFFNESS OF A BUILDING IN A STICK MODEL FITTED TO A 3D MODEL A METHOD TO MODELIE THE OVERALL STIFFNESS OF A BUILDING IN A STICK MODEL FITTED TO A 3D MODEL Marc LEBELLE 1 SUMMARY The aseismic design of a building using the spectral analysis of a stick model presents

More information

Implementing a customised meta-search interface for user query personalisation

Implementing a customised meta-search interface for user query personalisation Implementing a customised meta-search interface for user query personalisation I. Anagnostopoulos, I. Psoroulas, V. Loumos and E. Kayafas Electrical and Computer Engineering Department, National Technical

More information

IDECSE: A Semantic Integrated Development Environment for Composite Services Engineering

IDECSE: A Semantic Integrated Development Environment for Composite Services Engineering IDECSE: A Semantic Integrated Development Environment for Composite Services Engineering Ahmed Abid 1, Nizar Messai 1, Mohsen Rouached 2, Thomas Devogele 1 and Mohamed Abid 3 1 LI, University Francois

More information

Development of Contents Management System Based on Light-Weight Ontology

Development of Contents Management System Based on Light-Weight Ontology Development of Contents Management System Based on Light-Weight Ontology Kouji Kozaki, Yoshinobu Kitamura, and Riichiro Mizoguchi Abstract In the Structuring Nanotechnology Knowledge project, a material-independent

More information

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost

More information

Query Processing and Interlinking of Fuzzy Object-Oriented Database

Query Processing and Interlinking of Fuzzy Object-Oriented Database American Journal of Engineering Research (AJER) e-issn: 2320-0847 p-issn : 2320-0936 Volume-6, Issue-2, pp-36-41 Research Paper www.ajer.org Open Access Query Processing and Interlinking of Fuzzy Object-Oriented

More information