A Framework on Ontology Based Classification and Clustering for Grouping Research Proposals

Similar documents
Research Domain Selection using Naive Bayes Classification

Ontology Based Search Engine

INTERNATIONAL JOURNAL OF SCIENTIFIC & ENGINEERING RESEARCH, VOLUME 6, ISSUE 5, MAY-2015 ISSN

International Journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online

A Survey On Different Text Clustering Techniques For Patent Analysis

Information Retrieval System Based on Context-aware in Internet of Things. Ma Junhong 1, a *

A Semantic Model for Concept Based Clustering

Keywords Data alignment, Data annotation, Web database, Search Result Record

Chapter 27 Introduction to Information Retrieval and Web Search

Annotating Multiple Web Databases Using Svm

Survey on Recommendation of Personalized Travel Sequence

Reading group on Ontologies and NLP:

Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach

Ontology Extraction from Heterogeneous Documents

MAINTAIN TOP-K RESULTS USING SIMILARITY CLUSTERING IN RELATIONAL DATABASE

ISSN: (Online) Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies

Frequent Pattern Mining On Un-rooted Unordered Tree Using FRESTM

SHORTEST PATH ALGORITHM FOR QUERY PROCESSING IN PEER TO PEER NETWORKS

IMAGE RETRIEVAL SYSTEM: BASED ON USER REQUIREMENT AND INFERRING ANALYSIS TROUGH FEEDBACK

Personalized Information Retrieval by Using Adaptive User Profiling and Collaborative Filtering

R. R. Badre Associate Professor Department of Computer Engineering MIT Academy of Engineering, Pune, Maharashtra, India

ADAPTIVE HANDLING OF 3V S OF BIG DATA TO IMPROVE EFFICIENCY USING HETEROGENEOUS CLUSTERS

ISSN (Online) ISSN (Print)

Classifying Twitter Data in Multiple Classes Based On Sentiment Class Labels

What you have learned so far. Interoperability. Ontology heterogeneity. Being serious about the semantic web

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language

Automated Online News Classification with Personalization

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2

International Journal of Research in Computer and Communication Technology, Vol 3, Issue 11, November

On Veracious Search In Unsystematic Networks

CANDIDATE LINK GENERATION USING SEMANTIC PHEROMONE SWARM

Hybrid Approach for Query Expansion using Query Log

AN APPROACH OF CONTEXT ONTOLOGY FOR ROBUST FACE RECOGNITION AGAINST ILLUMINATION VARIATIONS

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur

INDEXED SEARCH USING SEMANTIC ASSOCIATION GRAPH

A. Krishna Mohan *1, Harika Yelisala #1, MHM Krishna Prasad #2

IRCE at the NTCIR-12 IMine-2 Task

Information Retrieval

A Hierarchical Document Clustering Approach with Frequent Itemsets

UNCOVERING OF ANONYMOUS ATTACKS BY DISCOVERING VALID PATTERNS OF NETWORK

International Journal of Advanced Research in Computer Science and Software Engineering

KNOWLEDGE MANAGEMENT VIA DEVELOPMENT IN ACCOUNTING: THE CASE OF THE PROFIT AND LOSS ACCOUNT

Ontology-Based Web Query Classification for Research Paper Searching

A Technique for Design Patterns Detection

Semantic Web. Ontology Engineering and Evaluation. Morteza Amini. Sharif University of Technology Fall 95-96

AN EFFECTIVE SEARCH ON WEB LOG FROM MOST POPULAR DOWNLOADED CONTENT

SCHEME OF COURSE WORK. Data Warehousing and Data mining

The Effect of Diversity Implementation on Precision in Multicriteria Collaborative Filtering

Domain-specific Concept-based Information Retrieval System

MULTI - KEYWORD RANKED SEARCH OVER ENCRYPTED DATA SUPPORTING SYNONYM QUERY

INTELLIGENT SYSTEMS OVER THE INTERNET

Keyword Extraction by KNN considering Similarity among Features

Research and implementation of search engine based on Lucene Wan Pu, Wang Lisha

Knowledge Engineering in Search Engines

Collaboration System using Agent based on MRA in Cloud

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

Ranking Algorithms For Digital Forensic String Search Hits

Relevance Feature Discovery for Text Mining

Automatic New Topic Identification in Search Engine Transaction Log Using Goal Programming

Research on Heterogeneous Communication Network for Power Distribution Automation

Enhanced Performance of Search Engine with Multitype Feature Co-Selection of Db-scan Clustering Algorithm

Saudi Journal of Engineering and Technology (SJEAT)

International Journal of Software and Web Sciences (IJSWS)

Supervised classification of law area in the legal domain

AN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH

VALLIAMMAI ENGINEERING COLLEGE SRM Nagar, Kattankulathur DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING QUESTION BANK VII SEMESTER

Text Data Pre-processing and Dimensionality Reduction Techniques for Document Clustering

Multimodal Information Spaces for Content-based Image Retrieval

Prior Art Retrieval Using Various Patent Document Fields Contents

Lily: Ontology Alignment Results for OAEI 2009

A Content Based Image Retrieval System Based on Color Features

Image Classification Using Text Mining and Feature Clustering (Text Document and Image Categorization Using Fuzzy Similarity Based Feature Clustering)

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

ESSENTIALLY, system modeling is the task of building

Data Mining Technology Based on Bayesian Network Structure Applied in Learning

Ontology for Exploring Knowledge in C++ Language

Mining Web Data. Lijun Zhang

Fausto Giunchiglia and Mattia Fumagalli

Development of an Ontology-Based Portal for Digital Archive Services

An Industry Definition of Business Architecture

EFFECTIVE INTRUSION DETECTION AND REDUCING SECURITY RISKS IN VIRTUAL NETWORKS (EDSV)

Chapter 6: Information Retrieval and Web Search. An introduction

RSDC 09: Tag Recommendation Using Keywords and Association Rules

DESIGN OF PARAMETER EXTRACTOR IN LOW POWER PRECOMPUTATION BASED CONTENT ADDRESSABLE MEMORY

Remotely Sensed Image Processing Service Automatic Composition

ONTOLOGY is an important foundation of knowledge

Extraction of Web Image Information: Semantic or Visual Cues?

Development of Contents Management System Based on Light-Weight Ontology

A New Technique to Optimize User s Browsing Session using Data Mining

The HMatch 2.0 Suite for Ontology Matchmaking

K-MEANS BASED CONSENSUS CLUSTERING (KCC) A FRAMEWORK FOR DATASETS

Development of Prediction Model for Linked Data based on the Decision Tree for Track A, Task A1

Prediction of Student Performance using MTSD algorithm

ijade Reporter An Intelligent Multi-agent Based Context Aware News Reporting System

In the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google,

Category Theory in Ontology Research: Concrete Gain from an Abstract Approach

is easing the creation of new ontologies by promoting the reuse of existing ones and automating, as much as possible, the entire ontology

Transcription:

A Framework on Ontology Based Classification and Clustering for Grouping Research Proposals KODAM ANUSHA* 1 PG Scholar, Dept of CSE, Kakatiya Institute of Technology and Science, Warangal. Abstract- The Research project proposals selection is an challenging task in organizations and companies. When large numbers of research proposals are collected then research Project Selection is a remarkable in all Research funding and Government agencies. The process of Assignment and Selection are unproblematic. The number of proposal submissions are very less. But with mounting in numbers both assignment and Selection processes will become complicated. The traditional methodologies, Proposal Grouping and Experts grouping experts are done keywords or manually. which yield inefficient experiment results. In present days methodologies are Ontology based Text Mining Method has been used to cluster Proposals based on their likeness in research area by means of Research Ontology method. Where as Reviewers grouping and proposal assignment to experts is still not precise and it is done manually. In this proposed paper proposes Ontology based Text Mining Method to cluster reviewers based on their specialism and overcome the Ontology Mapping Problem to allocate proposals to relevant reviewers analytically. This methodology an effective way for research project selection with escalating proposals and reviewers. Keywords Ontology, Clustering, Conceptbased analysis, Research Project Selection, Genetic Algorithm, Grouping process, Ontology Matching. 1. INTRODUCTION: The Research project proposals selection is an important and challenging task by the private funding agencies and government. When large numbers of research proposals are collected. Optimal allocation of research project proposals is a challenging multi process starts with research proposals by a funding agencies. The proposal invoking is IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 1

distributed to relevant communities such as research institutions. The Submission of the research proposals by many institutes organizations and companies are assigned to experts for peer view based on their similarity. The review results are examined and the review proposals are then based on ranking and the aggregation of the experts review results. Present methodology, Experts grouping, Proposal grouping and Proposal Assignments to experts are done keywords or manually. In the process of manual grouping the reviewers may not have applicants subject view so by misinterpreting the concept of proposals. The reviewers may group proposals wrongly. In the process of keyword based grouping the proposals are grouped together according to keywords similarities. The grouped proposals are very poorly will be assigned to poorly grouped experts. To overcome all these problems Ontology based Text Mining Method has been introduced. The Ontology is a technique that describing knowledge. The set of concepts and the relationship between these concepts for specialized domains. The Domain experts are helping us to construct ontology by providing all domain concepts and their relationships. This Ontology based grouping process has been done semantically. 2. RELATED WORK: The Research Project Selection task is mandatory as well as complicated process in Research funding agencies. if number of proposal submissions are increases before using OTMM technique more techniques are used for Research Project Selection. Q. Tian, J. Ma [8] and O.Liu [9] proposed Knowledge rules system decision models. The Decision Models use mathematical formulae which gives the optimized results. But the decision models can used only for structured documents and knowledge rules will give only feasible result. T.H. Cheng and C.P. Wei [5] gave clustering based category hierarchy integration techniques to overcome the technique clustering. which is used to category documents hieratically. But there is no intermediate category while clustering and it does not consider the semantic method. this method is also not efficient in heterogeneous scenarios next Ontology based systems are introduced. A. J. C. Trappey et al. Suggested a novel hierarchal clustering approaches for knowledge document self organizing using fuzzy ontology method. This method has been used for automatic interpretation and knowledge documents clustering using ontology schema method. But this method is pertinent only for patent documents. IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 2

Fig 1: Research Selection Process O. Liu and J. Ma[8] established multilingual ontology framework for R&D project management systems which supports languages and facilitates R&D information sharing among users with different culture backgrounds. In this only three languages are taken for Implementation there is no using of context sensitive information. M. Nagy and M. Vargas Vera presented multi agent ontology mapping skeleton which is prerequisite for attaining heterogeneous data integration on web. Using this mature web applications were developed and is used to interpret and align heterogeneous and distributed ontology on semantic web.jian Ma, Wei Xu[5] and et al. proposed OTMM to cluster and to balance research project proposals. Then these proposals have been assigned to reviewers by hand. This Paper has been extended the work of Jian Ma, Wei Xu and et al. and Pave Shako because of these extention proposals were assigned to reviewers systematically. This methodology is an efficient and effective means for the selection of research project proposals with the escalating quantity of research proposals. 3. METHODOLOHY- The Research Project Selection process initiates with Call for Proposals then Proposal grouping, Proposal submission, Proposal assignment to experts, review and ranking proposals. This paper presents the methodology to overcome clustering problems. This project has two phases: 1) The Clustering research project proposals. 2) Clustering the Reviewers and Assigning Proposals to appropriate Reviewers. 3.1 Research Ontology for Proposals: The Research ontology describes various research areas that announced by authority. Step 1: Collecting the research area topics and its related area topics. It is built by adding semantics to topics of the research areas that is grouping. These topics are collected from past proposals the research area topics. e.g. Data Mining where as sub area topic. It means sub area example Web IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 3

mining, Text mining, Image mining, video mining etc. Step 2: In Research ontology creation the Ontology constructions can be done either manually or using some tools. Jain Ma has been created ontology manually. Step 3: the revising the research ontology should be done per annum from proposals that are submitted each year. Fig 2: creation of Ontology Fig 3: overall Architecture 3.2 Research Proposals under its Sub- Area Using Text Mining: After classifying new ontology proposals are clustered under its research sub areas. This is done based on their similarities using text mining. Following steps Step1: Clustered documents Collection. Present research Proposals were collected after classifying each proposal under the discipline areas. All these proposals were collected for preprocessing. Step 2: Preprocessing Collected Documents contents were generally non structural. The total documents are analyzed and extracted. The keywords were identified by using Research Ontology that has been constructed previously. Finally vocabulary size has been abridged by eliminating or deleting words that came into sight only a small amount of times in the final document. Step 3: The Encoding term Frequency Inverse Document Frequency has been used for encoding of keywords. After preprocessing all documents are Segmented and converted to trait vector representation. Encoding of TF-IDF used to produce a trait by using weighted techniques. These techniques based on IDF. The Trait vectors were put up as a consequence of process of Encoding. Trait ( i ) tfi * log( N / di) Step 4: Vector dimension reduction is obligatory to diminish the Dimension of the generated trait vectors. it is too outsized. Latent Semantic Indexing (LSI) has been used to trim down the dimension of the trait vector to generate the semantic relations IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 4

among keywords. Step 5: Clustering of Text Vector. The Self Organized Mapping algorithm has been used to group the trait vector based on their similarities. This result of Text Vector Clustering, documents was grouped in research area. 3.3 Grouping Reviewers by using Domain Ontology: The Domain ontology describes various domains. Research and Domain Ontology are almost referring the same but Research ontology has been created for grouping of proposal and Domain ontology has been created for grouping of reviewers. Step 1: The Collecting information about reviewers. The Reviewers who were eligible can register by using this registering details reviewers were clustered Step 2: The Constructing the domain ontology construction can be done either manually or using some tools. Jain Ma et al. has been created research ontology s manually. Where as here PROTEGE tool is used for ontology construction of research and domain ontology. Step 3: The Revising the domain ontology. Domain ontology Revising should be done annually through topics collected from reviewers details that are registered every year. 3.4 Assign Proposals to Experts by Ontology Mapping: The Research and Domain Ontology were given as input to this method. Ontology mapping will do between them. As a result of this process the balanced proposals were assigned to grouped reviewers for future Review process. The process Ontology matching is done by using the tools like SAMBO Falcon Poznan, Ri MOM U. of Science and Anchor Flood ASMOV Agreement Maker In this SAMBO tool has been used for ontology matching Fig 4: Genetic Algorithm IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 5

3.5. Information Retrieval: For information retrieval knowledge based agent I used. Which comprises a knowledge base and inference system? Systematic allocation of retrieved grouped proposals to grouped external reviewers is done by knowledge base agent E. the Proposals Assign to Reviewers The Final step of this approach is to assign the Research Proposals group to the External Research Reviewers group systematically. The Proposals of the particular Discipline area is assign to the Reviewers having the same research area. they can examine the proposals efficiently for the peer review. 4. CONCLUSIONS AND FUTURE WORK: This paper has presented a framework on ontology based classification and clustering for grouping research proposals as well as reviewers. which will be used by research funding Agencies for grouping and assigning the grouped proposal to reviewers group systematically OTMM has been proposed for grouping Proposals and Reviewers. The Genetic Algorithm has been used for balancing and Similarity measure has been calculated for assigning appropriate reviewers for given proposals. And F Measure values, these values illustrate that OTMM grouping was better than existing technologies. The experimental results showed that OTMM clusters proposals efficient manner and yield better results than previous methods. In Future this work can be extended to lots of languages by creating multilingual ontology. Finally with this effort improved. The end results have been achieved while balancing proposals. Fig 5 : F-Measure in TMM Vs OTMM REFERENCES: [1].Cai.M, Zhang.Y.W, and Zhang.K, ManuHub: A semantic web system for ontology- based service management in distributed manufacturing environments, IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 41, no. 3, pp. 574 582, May 2011. [2].Cheng.T.H and Wei.C.P, A clustering-based approach for integrating document-category hierarchies, IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 38, no. 2, pp. 410 424, Mar. IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 6

2008. [3].Chiang.D.A, Keh.H.C, Huang.H.H, and Chyr.D,The Chinese text categorization system with association rule and category priority, Expert Syst. Appl., vol. 35, no.1/2, pp. 102 110, Jul./Aug. 2008. [4].Fan.Z.P, Chen.Y, Ma.J, and Zhu.Y, Decision support for proposal grouping: A hybrid approach using knowledge rule and genetic algorithm, Expert Syst. Appl., vol. 36, no. 2, pp. 1004 1013, Mar. 2009. [5].VJian Ma, Wei Xu, Yong-hong Sun, Efraim Turban, Shouyang Wang, and Ou Liu, An Ontology-Based Text-Mining Method to Cluster Proposals for Research Project Selection, IEEE Trans. Syst., Man, Cybernetics. A, Syst., Humans, vol. 42, no. 3, pp. 784 790, May. 2012. [6].Liang.Q, Wu.X, Park.E.K, Khoshg of taar.t.m, and Chi.C.H, Ontology based business process customization for composite web services, IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 41, no. 4, pp. 717 729, Jul. 2011. [7].Lim.G.H, Suh.I.H, and Suh.H, Ontology-based unified robot knowledge for service robots in indoor environments, IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 41, no. 3, pp. 492 509, May 2011. [8].Liu. O and Ma.J, A multilingual ontology framework for R&D project management systems, Expert Syst. Appl., vol. 37, no. 6, pp. 4626 4631, Jun. 2010. [9].Liu.Y, Jiang.Y, and Huang.L, Modeling complex architectures based on granular computing on ontology, IEEE Trans. Fuzzy Syst., vol. 18, no. 3, pp. 585 598, Jun. 2010. [10].Liu.Y, Wang.X, and Wu.C, ConSOM: A conceptional self-organizing map model for text clustering, Neurocomputing, vol. 71, no. 4 6, pp. 857 862, Jan. 2008. [11].Lu.C, Hu.X, and Park.J.R, Exploiting the social tagging network for web clustering, IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 41, no. 5, pp. 840 852, Sep. 2011. [12].Nagy.M and Vargas-Vera.M, Multiagent ontology mapping framework for the semantic web, IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 41, no. 4, pp. 693 704, Jul. 2011. [13].Razmerita.L, An ontology-based framework for modeling user behavior. ABOUT THE AUTHORS: KODAM ANUSHA is currently pursuing her M.Tech Software Engineering Department in Kakatiya Institute of Technology and Science, Warangal. She IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 7

received her B.Tech in Information Technology from SR institutes of technological sciences, WARANGAL. Her area of interests includes Data mining and Network security. IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 8