An Intelligent Retrieval Platform for Distributional Agriculture Science and Technology Data

Similar documents
An Agricultural Tri-dimensional Pollution Data Management Platform Based on DNDC Model

An Agricultural Tri-dimensional Pollution Data Management Platform based on DNDC Model

Design and Realization of Agricultural Information Intelligent Processing and Application Platform

Research on 3G Terminal-Based Agricultural Information Service

Remote Control System Based on Compressed Image

APPLICATION OF JAVA TECHNOLOGY IN THE REGIONAL COMPARATIVE ADVANTAGE ANALYSIS SYSTEM OF MAIN GRAIN IN CHINA

A Network-Based Management Information System for Animal Husbandry in Farms

The Study and Implementation of Text-to-Speech System for Agricultural Information

A REASONING COMPONENT S CONSTRUCTION FOR PLANNING REGIONAL AGRICULTURAL ADVANTAGEOUS INDUSTRY DEVELOPMENT

Ontology Molecule Theory-based Information Integrated Service for Agricultural Risk Management

Information Retrieval System Based on Context-aware in Internet of Things. Ma Junhong 1, a *

Grid Resources Search Engine based on Ontology

A Growth Measuring Approach for Maize Based on Computer Vision

Remotely Sensed Image Processing Service Automatic Composition

[Type text] [Type text] [Type text]

The Research and Design of the Android-Based Facilities Environment Multifunction Remote Monitoring System*

Design and Development of Water Quality Monitoring System Based on Wireless Sensor Network in Aquaculture

Design and Implementation of Henan Huinong Client System Based on ios

Application of Individualized Service System for Scientific and Technical Literature In Colleges and Universities

Design and Implementation of Aquarium Remote Automation Monitoring and Control System

Study on XML-based Heterogeneous Agriculture Database Sharing Platform

Application and Research of Man-Machine Interface and Communication Technique of Mobile Information Acquisition Terminal in Facility Production

Research and Application of Unstructured Data Acquisition and Retrieval Technology

The Research and Design of the Application Domain Building Based on GridGIS

Research on Approach of Equipment Status and Operation Information Acquisition Based on Equipment Control Bus

Construction of Knowledge Base for Automatic Indexing and Classification Based. on Chinese Library Classification

Personalized Search for TV Programs Based on Software Man

LDAP-based IOT Object Information Management Scheme

Study on Jabber Be Applied to Video Diagnosis for Plant Diseases and Insect Pests

Research on Remote Sensing Image Template Processing Based on Global Subdivision Theory

A metadata based agricultural universal scientific and technical information fusion and service framework

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2

Extending Enterprise Services Descriptive Metadata with Semantic Aspect Based on RDF

Developing ArXivSI to Help Scientists to Explore the Research Papers in ArXiv

Information Push Service of University Library in Network and Information Age

Yunfeng Zhang 1, Huan Wang 2, Jie Zhu 1 1 Computer Science & Engineering Department, North China Institute of Aerospace

Title Grid for Multimedia Communication Ne. The original publication is availabl. Press

Research on Full-text Retrieval based on Lucene in Enterprise Content Management System Lixin Xu 1, a, XiaoLin Fu 2, b, Chunhua Zhang 1, c

Mapping UML Models to Colored Petri Nets Models based on Edged Graph Grammar

The Application Research of Neural Network in Embedded Intelligent Detection

Design of an Intelligent PH Sensor for Aquaculture Industry

Agriculture Wireless Temperature and Humidity Sensor Network Based on ZigBee Technology

Assisting Trustworthiness Based Web Services Selection Using the Fidelity of Websites *

A Design of Greenhouse Remote Monitoring System Based on WSN and WEB

Research on Design Information Management System for Leather Goods

Zigbee Wireless Sensor Network Nodes Deployment Strategy for Digital Agricultural Data Acquisition

Research on the Interoperability Architecture of the Digital Library Grid

THE STUDY AND IMPLEMENTATION OF TEXT-TO-SPEECH SYSTEM FOR AGRICULTURAL INFORMATION

A New Model of Search Engine based on Cloud Computing

The Design and Application of GIS Mathematical Model Database System with Meta-algorithm Li-Zhijiang

On Transformation from The Thesaurus into Domain Ontology

Design and Implementation of Agro-technical Extension Information System Based on Cloud Storage

The Design of Model for Tibetan Language Search System

Design of Labour Agency Platform Based on Agent Technology of JADE *

CONSTRUCTION OF AGRICULTURAL PRODUCTS LOGISTICS INFORMATION SYSTEM BASED ON.NET AND WAP

Analysis on the technology improvement of the library network information retrieval efficiency

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

Development and Application of Database System for Rubber Material

Design and Implementation of Remote Push System of Resources Based on Internet

An Inspection Method of Rice Milling Degree Based on Machine Vision and Gray-Gradient Co-occurrence Matrix

The Extended Model Design of Non-Metallic Mineral Material Information Resource System Based on Semantic Web Service

SkyEyes: A Semantic Browser For the KB-Grid

A Novel Approach for Restructuring Web Search Results by Feedback Sessions Using Fuzzy clustering

Mubug: a mobile service for rapid bug tracking

An Solution of Network Service Oriented Operator Network Intrusion Prevention

Design and Implementation of Search Engine Using Vector Space Model for Personalized Search

Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task. Junjun Wang 2013/4/22

Research and Application of Mobile Geographic Information Service Technology Based on JSP Chengtong GUO1, a, Yan YAO1,b

An Indian Journal FULL PAPER. Trade Science Inc.

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language

Log System Based on Software Testing System Design And Implementation

A New Distance Independent Localization Algorithm in Wireless Sensor Network

Design of Coal Mine Power Supply Monitoring System

Semantic Web Search Model for Information Retrieval of the Semantic Data *

A New Technique to Optimize User s Browsing Session using Data Mining

An Improved Method of Vehicle Driving Cycle Construction: A Case Study of Beijing

Design and Implementation of Computer Room Management System in University

THE EXPLOITATION OF WEBGIS BASED ON ARCGIS SERVER AND AJAX

Data Mining Technology Based on Bayesian Network Structure Applied in Learning

IMPROVING INFORMATION RETRIEVAL BASED ON QUERY CLASSIFICATION ALGORITHM

The Design of Flower Ecological Environment Monitoring System Based on ZigBee Technology

Research on QR Code Image Pre-processing Algorithm under Complex Background

Research on the Knowledge Based Parameterized CAD System of Wheat and Rice Combine Chassis

Real-time Data Process Software for POAC Space Mission Management System

On Reduct Construction Algorithms

International Conference on Automation, Mechanical Control and Computational Engineering (AMCCE 2015)

Study on Ontology-based Multi-technologies Supported Service-Oriented Architecture

XETA: extensible metadata System

An Abnormal Data Detection Method Based on the Temporal-spatial Correlation in Wireless Sensor Networks

Domain-specific Concept-based Information Retrieval System

Research and Design of Key Technology of Vertical Search Engine for Educational Resources

Research on the Shape of Wheat Kernels Based on Fourier Describer

The Application of Wireless Sensor in Aquaculture Water Quality Monitoring

A Fast and High Throughput SQL Query System for Big Data

Enhanced retrieval using semantic technologies:

Study on Personalized Recommendation Model of Internet Advertisement

WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS

Study on Pear Diseases Query System Based on Ontology and SWRL

Preliminary Research on Distributed Cluster Monitoring of G/S Model

Transcription:

An Intelligent Retrieval Platform for Distributional Agriculture Science and Technology Data Xiaorong Yang 1,2, Wensheng Wang 1,2, Qingtian Zeng 3, and Nengfu Xie 1,2 1 Agriculture Information Institute, Chinese Academy of Agriculture sciences, Beijing, P.R. China 2 Key Laboratory of Digital Agricultural Early-warning Technology (2006-2010), Ministry of Agriculture, The People s Republic of China 3 Shandong Science and Technology University, Shandong Province, P.R. China Abstract. In the agricultural domain, the variety of data used by organizations is increasing rapidly. Also, there is an increasing demand for accessing these data. Now, the problem of the digital divide causes serious problems in manipulating the distributed information. Based on this condition, this paper presents the intelligent retrieval architecture of distributional agriculture science and technology data which focuses on research of the integration support technology, the concept extending retrial technology based on agricultural ontology and the personalization retrieval technique based on the user model. In the experiment, the intelligent data application platform provided by the paper proves that the architecture is effective. Keywords: Agriculture science and technology data, Data integration, Agriculture ontology, Intelligent retrieval. 1 Introduction With the development of computer and network technology, the amount of data which are collected, saved, processed and transmitted has grown rapidly. Many sharing and serving platforms of agriculture science and technology information are constructed by different departments throughout the country. But these platforms lack a unified plan and management in the important implementation techniques and storage technology. The heterogeneity and dynamic distribution become basic features of these systems at present. Particularly the heterogeneity in semantics results in data sharing difficulty. An intelligent data application platform should be constructed to make full use of different distributed heterogeneous data resources. The platform can provide a public and unified data access interface of different distributed data sources for users. Users needn t consider the problem of data extracting and data combining. So the unified and high-efficiency access of data can be achieved. D. Li, Y. Liu, and Y. Chen (Eds.): CCTA 2010, Part I, IFIP AICT 344, pp. 203 209, 2011. IFIP International Federation for Information Processing 2011

204 X. Yang et al. 2 The Architecture of the Intelligent Retrieval Platform of Distributional Agriculture Science and Technology Data 2.1 The Logical Architecture of the Intelligent Retrieval Platform of Distributional Agriculture Science and Technology Data A traditional retrieval system of distributional agriculture science and technology data includes distributional data integration module, data category module and data retrieval module. To improve retrieval intelligence and satisfy users individualized need, the intelligent retrieval module and personalization service module are designed based on the traditional retrieval architecture of distributional agriculture science and technology data. The intelligent retrieval module uses domain ontology to support the information retrieval based on different languages, synonyms and related information resources. The personalization service module can record and mine users historical data to discovery users interest. It can recommend information according to users interest. Fig 1 shows the logical architecture of the intelligent retrieval platform of distributional agriculture science and technology data. Fig. 1. Intelligent Retrieval Logical Architecture of Distributional Agriculture Science and Technology Data

An Intelligent Retrieval Platform for Distributional Agriculture Science 205 2.2 The Function Architecture of the Intelligent Retrieval Platform of Distributional Agriculture Science and Technology Data According to the logical architecture the detailed function architecture of the intelligent retrieval platform of distributional agriculture science and technology data (Fig 2) is designed. The function architecture includes the management and retrieval of data sources layer, the central metadata mapping and intelligent retrieval layer and the system interface layer. The management and retrieval of data sources layer consists of the node metadata management module and the web retrieval module. The node metadata management module manages bottom database sources and the web retrieval module can accept query parameters from upper layer, access database and return retrieval results. The central metadata mapping and intelligent retrieval layer consists of the intelligent retrieval module, the central metadata mapping database and the central metadata manager. The intelligent retrieval module can accept query parameters from system interface. According to the central metadata mapping table it can find corresponding data source and submit this query parameters to corresponding web retrieval module. The central metadata manager can manage metadata and mapping relation between metadata and data sources. The system interface layer provides classification retrieval and keywords retrieval for users. The platform completes semantic extension for query condition which a user inputs and submits them to the bottom web retrieval module. Fig. 2. Detailed Function Architecture of the Intelligent Retrieval Platform of Distributional Agriculture Science and Technology Data

206 X. Yang et al. 3 The Key Technology about Intelligent Retrieval of Distributional Agriculture Science and Technology Data 3.1 The Integration Supporting Technology of Distributional Agriculture Science and Technology Data Integration and category are main function of the integration of distributional agriculture science and technology data. This study adopts middleware technology to solve the integration of distributed heterogeneous data. A middle layer is developed between users and distributional agriculture science and technology data sources. It can provide a unified data access interface for distributed heterogeneous data sources. It also defines classification standards for data resources. Then the information is classified and displayed to users (Song Lan et al. 2010). Fig 3 shows the logical architecture of Integration and category of distributional agriculture science and technology data. Because node administrators know more about node database, this study adopts metadata technique to descript resources. Metadata of bottom resources are described by node administrators. The middle layer uses the metadata to manage different node data sources. It administers collectively the metadata of different node database and sets up a unified metadata mapping table. So all heterogeneous database can be operated as a Fig. 3. Logical Architecture of Integration and Category of Distributional Agriculture Science and Technology Data

An Intelligent Retrieval Platform for Distributional Agriculture Science 207 simple database. The unified metadata mapping table can organize and access heterogeneous network information resources (Li Jianhui. 2007) (Song Xiaoyu et al. 2008). User layer establishes query performance according to the information classification to submit it to the data integration layer. And the data integration layer searches the classification mapping table and metadata mapping table and locate the corresponding data source. This study presents own metadata standard according to database structure based on Dublin metadata standard. 3.2 The Concept Expansion Retrieval Technology Based on Domain Ontology Domain ontology technology is adopted to standardize retrieval keywords in order to reach united comprehension to information between human and human or Machine. The concept-based retrieval technology can improve retrieval efficiency and speed. This study adopts description logic to establish domain ontology model and analyses the reasons of the heterogeneity among information systems from the angle of ontology. The semantic integration framework of the heterogeneous systems is established based on ontology mapping. Domain ontology can be set up in two methods (Cao Yukun et al. 2010). Semantic extension keywords set and some metadata set of node data sources inputted by node administrators are main domain ontology keywords. And the domain keywords extracted from the specialized websites are ancillary sources. This study constructs ontology by using the graphical interfaces of the ontology edit tool Protégé. According to ontology rules, node administrators extend the keywords in the semantic dictionary from four properties of synonym, abbreviation, English language and Chinese pinyin. Thus a unified semantic dictionary is set up in the center database and becomes more and more abundant (Song Lan et al. 2009). It can improve the recall ratio of information retrieval. Then the center administrators delete repetitive and ambiguous words. The precision ratio of the platform can be implemented (Chen Lihua. 2010). By semantic analysis of ontology concept and retrieval keywords, retrieval association and expansion are completed step by step. This technology supports synonym retrieval, information retrieval of different languages and recommendation of related information resources. For example, if a user inputs the keyword crop, Chinese and English information about crop can be searched and information about rice and wheat can be searched. 3.3 The Personalized Recommending Technology Based on User Model User interest model is set up according to user s explicit demand and implicit demand. It maintains uses history behavior information and personal information. It provides different comprehension of same keywords from different users in depth and scope. User feedback process based on users' opinions makes retrieval service more accurate and friendly. User interest model can analyse a user s behavior and record and mine users hidden interest. Because the user s interest changes, user interest model self-studies continuously to improve itself (Fei Hongxiao et al. 2009). The platform sets higher priority to the information which are often accessed by the user. user interest model can forecast a user s interest and demand to implement personal information

208 X. Yang et al. retrieval and recommending. Firstly, this study implements the dynamic sort of information resources according to a user s interest. When a user accesses some information resources, the system records his behavior and analyses his interest in classified information resources. When the user retrieves information again, the data resources which are often accessed by him will be displayed ahead. Secondly, the system can customize personal fields of database. The fields can be defined as the language and words needed by a user in order to satisfy his usage pattern. Finally, the system can record information accessed by a user. The user can operate the accessed records and define if they are useful to him. By calculating the probability of the accessed information, the user s interest in information resources of some sort can be gotten. 4 Application Case To evaluate the intelligent retrieval platform for distributional agriculture science and technology data, the platform is applied in the management of Tibet science and technology information resource. In Tibet, all kinds of information resources are saved in different database and websites. These systems don t communicate each other because of the independence in the design and deployment. By applying the intelligent retrieval technology, the platform integrates, maintains and shares the distributional agriculture science and technology data in Tibet. The platform provides a unified data access interface for distributed heterogeneous data sources. Through the interface users access the needed information conveniently and needn t consider the problem of data extracting and data combining. Not only the information which meets the inputted keywords can be searched, but also the information about the synonym, English language and related information of the inputted keywords can be found. And the information are displayed according to users interest priority. The retrieval intelligence and individualized service of the platform satisfy users demand. 5 Conclusion To integrate and share distributional heterogeneous agriculture science and technology data, this study designs and implements the intelligent retrieval platform for distributional agriculture science and technology data. This paper introduces the logic and function architecture of the platform and the integration supporting technology of distributional agriculture science and technology data, the concept expansion retrieval technology based on domain ontology and the personalized recommending technology based on user model. Finally, as an application case, the platform has been applied to manage Tibet science and technology information resource to verify the performance of the management platform. Acknowledgements The work is supported by the Academy of Science and Technology for Development fund project intelligent search-based Tibet science & technology information resource

An Intelligent Retrieval Platform for Distributional Agriculture Science 209 sharing technology, the National Science and Technology Major Project of the Ministry of Science and Technology of China (Grant No. 2009ZX03001-019-01), and the special fund project for Basic Science Research Business Fee, AII (No. 2010-J-07). References 1. Cao, Y., Ding, M.: Discovering Model of Semantic Web Service Based on Ontology. Computer Systems & Applications 19(4), 98 102 (2010) 2. Chen, L.: Comment on Latent Semantic Analysis of Retrieval Precision Rate Factors Based on the Impact of Natural Language. Journal of Modern Information 3(03), 26 31 (2010) 3. Song, L., Lei, L., Wang, H.: A Study of Intelligent Semantic Information Processing System Based on Ontology. Journal of East China Jiaotong University 26(05), 31 34 (2009) 4. Wang, X.: Semantic-based Query in Heterogeneous Information Integration Environment., Doctor Degree Dissertation of Huazhong University of Science & Technology (2006) 5. Li, J.: Key Problems Research on Metadata Oriented to Scientific Data Sharing. Doctor Degree Dissertation of the Chinese Academy of Science (2007) 6. Song, X., Wang, Y.: Data Integration and Integration Application. The Chinese Publishing Press of Water Conservancy and Hydroelectric Power (2008) 7. Hongxiao, F., Siming, T., Wenxing, L., Qinxiu, L., Xin, D.: Web User Clustering Based on Interest. Computer Systems & Applications 19(4), 62 65 (2010)