Design and Realization of Agricultural Information Intelligent Processing and Application Platform


Dan Wang 1,2

1 Institute of Agricultural Information, Chinese Academy of Agricultural Sciences, Beijing 100081, China
2 Key Laboratory of Digital Agricultural Early-warning Technology, Ministry of Agriculture, P.R. China
wd@mail.caas.net.cn

Abstract. This paper presents a practical software product for intelligently gathering, processing and utilizing agricultural information. It is intended for the leaders of agricultural departments, research institutions, library and information systems, information consultancy departments and similar organizations. The product is an information processing platform that integrates intelligent gathering, automatic indexing, and search and utilization of agricultural information. In testing, the software proved accurate, and it can substitute for people in processing information to some extent.

Keywords: agricultural information, automatic collection, automatic indexing, information search, information management.

D. Li and Y. Chen (Eds.): CCTA 2012, Part II, IFIP AICT 393, pp. 229-237, 2013. © IFIP International Federation for Information Processing 2013

1 Introduction

After 40 years of development of the Internet and 20 years of development of the World Wide Web, we have entered the era of digital information. A mass of valuable real-time information and electronic documents is scattered across various departments on the Internet. Currently, the effective use of this digital information is restricted by three problems. First, how to crawl the desired information from the mass of information on the Internet in real time, and how to collect the electronic documents scattered across various departments into a local database. Second, how to classify and process the digital information in the local database so that it becomes a standardized information resource suitable for retrieval and utilization. Third, how users can retrieve the information they need from the local database easily, accurately and quickly, so that information is exchanged, used and disseminated efficiently.

To solve these technical problems, we have developed the platform for Agricultural Information Intelligent Collection, Processing and Utilization, also called the information processing platform (IPP). The IPP is a general platform that provides intelligent information collection, ordered classification, subject indexing and information retrieval for leading departments, agricultural information and advisory organizations, and the units that construct agricultural information resources.

The Agricultural Information Institute of the Chinese Academy of Agricultural Sciences (hereinafter referred to as AII) has carried out a great deal of basic research on information organization and management in the agricultural domain since the 1990s. It has prepared the Agricultural Science Thesauri, the Chinese Library Classification - Agricultural Professional Classification and the Multifunctional Agricultural Vocabulary, and has developed a Chinese information automatic indexing system. The AII developed the IPP to build on these scientific and technological achievements as the information age advances.

Fig. 1. Agricultural Information Intelligent Processing and Application Platform

2 Research Objectives and Content

The R&D goal of the IPP is to develop an agricultural information processing platform, based on a B/S (browser/server) structure, for intelligent collection, automatic classification, subject indexing and retrieval of agricultural information. The functions of the IPP fall into the following parts:

1) the subsystem for managing the integrated classification vocabulary of agricultural information;
2) the subsystem for automatically grabbing information from the Internet and intelligently obtaining electronic documents;
3) the subsystem for automatic classification and subject indexing of agricultural information;
4) the subsystem for database definition and data storage;
5) the subsystem for information retrieval;
6) the subsystem for data import and export.

3 Architecture and Functional Design of the IPP

3.1 System Architecture

Information processing in the IPP proceeds in three stages. First, agricultural information is collected and stored in the temporary database. Second, the information is automatically processed and stored in the formal database. Third, the information is retrieved and used. Along this flow of information, the integrated classification vocabulary management subsystem, the database management subsystem and the system operating parameters work together to achieve the goal of the IPP.

3.2 System Functions

The main functions of the IPP can be summarized in the following six aspects.

3.2.1 Intelligent Real-Time Acquisition of the Agriculture Network Information

The IPP can intelligently collect personalized information from agricultural websites in real time. Using information acquisition parameters that the user sets in generic and professional templates, it collects the required information from the Internet and stores it in the local database.

Fig. 2. The IPP may intelligently collect personalized information real-time

3.2.2 Information Import Function

The data import function of the IPP collects other electronic documents in addition to network information. The main features are:

Fig. 3. A screen shot of information import function

3.2.2.1 XML Format Data

XML format data is imported into the related temporary database of the IPP, where the IPP reads and parses it.

3.2.2.2 TRS Format Data

TRS format is the import and export data format supported by the full-text retrieval system developed by TRS, a well-known domestic company. The IPP therefore provides a TRS format data import feature.

3.2.2.3 ISO2709 Format Data

ISO 2709 is the international standard data format widely used in the library and information sector for exchanging bibliographic information. The IPP first reads the ISO 2709 data, then converts it to XML, and finally stores it in the temporary database.

3.2.2.4 Electronic Text Format Data

The IPP can directly invoke word processing software to read and edit DOC, PDF and other documents. The data is then imported into a temporary database.

3.2.2.5 Information Based on Form

The IPP can automatically generate an input form for a database. After logging in, the user enters data directly into the form, and the data is stored in a temporary database.
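The ISO 2709 import path of Section 3.2.2.3 (read the record, convert it to XML, then store it) can be illustrated with a minimal Python sketch. It is not the IPP's Java implementation, which the paper does not show. The function names parse_iso2709, to_xml and build_record, the XML element names and the sample record are hypothetical. The sketch also assumes ASCII data, so that the field lengths in the directory can be counted in characters.

```python
import xml.etree.ElementTree as ET

FIELD_END = "\x1e"   # ISO 2709 field terminator
RECORD_END = "\x1d"  # ISO 2709 record terminator
SUBFIELD = "\x1f"    # ISO 2709 subfield delimiter


def parse_iso2709(record: str) -> list[tuple[str, str]]:
    """Return (tag, raw field data) pairs from one ISO 2709 record."""
    leader = record[:24]
    base = int(leader[12:17])   # base address of the data area
    len_w = int(leader[20])     # digits in "length of field"
    pos_w = int(leader[21])     # digits in "starting character position"
    impl_w = int(leader[22]) if leader[22].isdigit() else 0
    entry_w = 3 + len_w + pos_w + impl_w
    directory = record[24:base - 1]  # drop the directory's field terminator
    fields = []
    for i in range(0, len(directory), entry_w):
        entry = directory[i:i + entry_w]
        tag = entry[:3]
        length = int(entry[3:3 + len_w])
        start = int(entry[3 + len_w:3 + len_w + pos_w])
        data = record[base + start:base + start + length]
        fields.append((tag, data.rstrip(FIELD_END)))
    return fields


def to_xml(fields: list[tuple[str, str]]) -> str:
    """Serialize parsed fields to XML (element names are hypothetical)."""
    root = ET.Element("record")
    for tag, data in fields:
        el = ET.SubElement(root, "field", {"tag": tag})
        if SUBFIELD in data:
            indicators, *subs = data.split(SUBFIELD)
            el.set("ind", indicators)
            for sub in subs:
                ET.SubElement(el, "subfield", {"code": sub[:1]}).text = sub[1:]
        else:
            el.text = data
    return ET.tostring(root, encoding="unicode")


def build_record(fields: list[tuple[str, str]]) -> str:
    """Build a small sample record (4-digit lengths, 5-digit positions)."""
    directory, body = "", ""
    for tag, data in fields:
        chunk = data + FIELD_END
        directory += f"{tag}{len(chunk):04d}{len(body):05d}"
        body += chunk
    directory += FIELD_END
    base = 24 + len(directory)
    total = base + len(body) + 1
    leader = f"{total:05d}" + "nam  22" + f"{base:05d}" + "   " + "450 "
    return leader + directory + body + RECORD_END


if __name__ == "__main__":
    sample = build_record([
        ("001", "CAAS0001"),
        ("200", "1 " + SUBFIELD + "aAgricultural information"),
    ])
    print(to_xml(parse_iso2709(sample)))
```

In a pipeline like the one described, the resulting XML string would then be written to the temporary database. Real bibliographic data is usually not pure ASCII, so a production parser would slice the record as bytes before decoding.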

The data collected from websites and the data imported from other electronic documents are stored in a temporary database. After the IPP has automatically classified and subject-indexed the information, it is stored in a formal database of the IPP.

3.2.3 Information Processing Function

The IPP provides information processing in three steps. First, the data is stored in the temporary database. Then the IPP indexes the information in the temporary database, either as a selected batch, as the entire database, unattended, or with manual assistance. Finally, the indexing results can be edited in the IPP.

Specific indexing features:

• The IPP automatically produces classification codes (CLC category codes), category names, geographical names, geographical codes and keywords for the preprocessed information.
• The IPP supports a variety of indexing methods, including indexing of single articles, batch indexing, automatic indexing of the entire database and unattended indexing.
• The IPP supports modifying and editing the automatic indexing results.
• Manually assisted indexing is supported. When users index an item online, they may browse the classification directory tree at any time and click the required category name. The IPP then automatically enters the category name and the corresponding classification code into the database.
• Word frequency statistics and vocabulary updating are supported. When a user corrects the indexing keywords and a word is not yet recorded in the vocabulary, the IPP adds it automatically and then counts its frequency. When the frequency of the word exceeds the prescribed threshold, the vocabulary data is automatically updated with the word.
• Flexible configuration of the field range used for indexing, such as the title field, summary field and body field.
• Flexible configuration of the position weight of a keyword, depending on whether it comes from the title field, the summary field, the body field, the first or last paragraph of the text, or the first few words of a paragraph.
• Flexible configuration of the indexing depth, that is, the number of keywords with which a document can be marked. According to statistical analysis of manual indexing results, an indexing depth of 7-8 words is generally appropriate.

3.2.4 Information Retrieval Functions

The IPP retrieves information stored in the formal database. The user can choose one database or several databases at the same time; that is, the IPP supports multi-database searching.
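The configurable indexing of Section 3.2.3 (field range, position weights and indexing depth) can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the IPP's algorithm. The paper gives no weight values, so FIELD_WEIGHTS is hypothetical. Plain substring counting stands in for the thesaurus-based term matching a real indexer would use, and the function name index_document is invented for the example.

```python
from collections import Counter

# Hypothetical position weights: the paper says these are configurable
# but does not publish the values it uses.
FIELD_WEIGHTS = {"title": 3.0, "summary": 2.0, "body": 1.0}


def index_document(doc: dict[str, str], vocabulary: set[str],
                   weights: dict[str, float] = FIELD_WEIGHTS,
                   depth: int = 8) -> list[str]:
    """Return up to `depth` vocabulary terms ranked by weighted frequency.

    Only the fields named in `weights` are scanned (the field range);
    each occurrence of a term adds the weight of the field it occurs in.
    """
    scores: Counter = Counter()
    for field, weight in weights.items():
        text = doc.get(field, "").lower()
        for term in vocabulary:
            hits = text.count(term.lower())  # assumption: substring match
            if hits:
                scores[term] += weight * hits
    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:depth]]


if __name__ == "__main__":
    doc = {
        "title": "Maize pest control",
        "summary": "Pest control for maize fields.",
        "body": "Soil nutrients affect maize. Irrigation matters.",
    }
    vocab = {"maize", "pest control", "soil nutrients", "irrigation", "wheat"}
    print(index_document(doc, vocab, depth=3))
```

The default depth of 8 follows the 7-8 keyword range the paper reports as appropriate. Restricting the field range means passing a weights dictionary that contains only the fields to be indexed.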

Fig. 4. A screen shot of information processing function

• Keyword search, secondary search, and text search limited to a given field are supported.
• Boolean logic search expressions are supported. According to the user's search request, a search expression is composed with the operators "and", "or" and "not", and the expression is evaluated according to operator priority.
• SDI (selective dissemination of information) search is supported. The IPP saves the user's search expression so that the search can be repeated to build up a thematic database.

3.2.5 System Management Function

The system management function covers the configuration and management of the operating parameters for intelligent collection, processing and utilization of agricultural network information. It includes user management, configuration of custom database parameters, configuration of information collection and indexing parameters, maintenance of the information taxonomies, thesauri, regional tables and information area tables, and run log management.

User management: Supports adding, deleting and modifying users, and sets up and manages user names, passwords, roles and permissions.

Custom database: Users can flexibly define database structures for different types of databases, describe the properties of a database, determine the taxonomic relationship between different databases, and create a user database.
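The Boolean retrieval with operator priority described in Section 3.2.4 can be illustrated with a small recursive-descent evaluator over an inverted index. This is a hedged sketch rather than the IPP's search engine. It assumes the conventional priority NOT > AND > OR, which the paper does not spell out, and the function name search and the index layout (term mapped to a set of document ids) are hypothetical.

```python
import re


def search(expr: str, index: dict[str, set[int]],
           all_docs: set[int]) -> set[int]:
    """Evaluate a Boolean query such as 'maize and (pest or soil)'."""
    tokens = re.findall(r"\(|\)|[^\s()]+", expr.lower())
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def take():
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unexpected end of expression")
        tok = tokens[pos]
        pos += 1
        return tok

    def parse_or():           # lowest priority
        result = parse_and()
        while peek() == "or":
            take()
            result = result | parse_and()
        return result

    def parse_and():
        result = parse_not()
        while peek() == "and":
            take()
            result = result & parse_not()
        return result

    def parse_not():          # highest priority
        if peek() == "not":
            take()
            return all_docs - parse_not()
        return parse_atom()

    def parse_atom():
        tok = take()
        if tok == "(":
            result = parse_or()
            if take() != ")":
                raise ValueError("missing closing parenthesis")
            return result
        return set(index.get(tok, set()))

    result = parse_or()
    if pos != len(tokens):
        raise ValueError("unexpected token: " + tokens[pos])
    return result


if __name__ == "__main__":
    idx = {"maize": {1, 2, 3}, "pest": {2, 3, 4}, "soil": {3, 5}}
    print(search("maize and pest or soil", idx, {1, 2, 3, 4, 5}))
```

An SDI-style saved search then amounts to storing the expression string and calling search again whenever the index has been updated.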

Fig. 5. A screen shot of information retrieval function

Vocabulary management: Covers maintenance and management of the information thesaurus, the scope of the vocabulary and the regional tables. Users can add, delete and change keywords in these vocabularies.

Word frequency statistics and automatic vocabulary updates: The IPP computes the word frequency of each indexing word. If an indexing word does not appear in any vocabulary database, or its frequency exceeds the threshold, the system automatically enters it into the vocabulary database.

3.2.6 Data Export Function

The IPP provides an interface for exchanging data with external databases. It can output information as XML format documents, TRS format documents and ISO 2709 format documents.

4 Operating Environment

The platform is a cross-platform software product developed in the Java language on a B/S architecture. Its operating environment is as follows.

Hardware configuration:
CPU: Pentium 4 or above, clock speed above 1 GHz
Memory: more than 2 GB
Hard disk: more than 20 GB
Video card: supports a resolution of 1024 × 768 pixels, true color, refresh frequency 85 Hz, video memory more than 32 MB
Display: standard display, larger than 15 inches

Software configuration:
Operating system: Windows 2000, Windows XP, Redhat, or Linux 9.0
Database: Oracle, MySQL, or SQL Server
Development software: Java 1.6
Application: IE 6.0 or above

Fig. 6. A screen shot of system management functions

5 Conclusion

The user logs in to the information processing platform with a login name and password and clicks the system menu to carry out the related operations. The platform automatically crawled and annotated 10,000 documents from websites and from documents provided by the Agricultural Information Institute of the Chinese Academy of Agricultural Sciences, China Agricultural University and other units. The documents cover literature on agricultural engineering, plant protection, soil nutrients, food economy, vegetable gardening, and animal husbandry and veterinary science. We compared the automatically annotated results with the results of manual annotation, and the comparison showed that the method is very effective.