The Study and Implementation of Text-to-Speech System for Agricultural Information

Similar documents
THE STUDY AND IMPLEMENTATION OF TEXT-TO-SPEECH SYSTEM FOR AGRICULTURAL INFORMATION

The Study and Implementation of Text-to-Speech System for Agricultural Information

Zigbee Wireless Sensor Network Nodes Deployment Strategy for Digital Agricultural Data Acquisition

Tacked Link List - An Improved Linked List for Advance Resource Reservation

Application of RMAN Backup Technology in the Agricultural Products Wholesale Market System

Design and Implementation of Agro-technical Extension Information System Based on Cloud Storage

Fault-Tolerant Storage Servers for the Databases of Redundant Web Servers in a Computing Grid

Multimedia CTI Services for Telecommunication Systems

The New Territory of Lightweight Security in a Cloud Computing Environment

A Resource Discovery Algorithm in Mobile Grid Computing based on IP-paging Scheme

Visualization of Virtual Plants Growth Based on Open L-System

Research on the Knowledge Based Parameterized CAD System of Wheat and Rice Combine Chassis

Linux: Understanding Process-Level Power Consumption

Change Detection System for the Maintenance of Automated Testing

Quality of Service Enhancement by Using an Integer Bloom Filter Based Data Deduplication Mechanism in the Cloud Storage Environment

BoxPlot++ Zeina Azmeh, Fady Hamoui, Marianne Huchard. To cite this version: HAL Id: lirmm

Hardware Acceleration for Measurements in 100 Gb/s Networks

LaHC at CLEF 2015 SBS Lab

Natural Language Based User Interface for On-Demand Service Composition

Every 3-connected, essentially 11-connected line graph is hamiltonian

DANCer: Dynamic Attributed Network with Community Structure Generator

Open Digital Forms. Hiep Le, Thomas Rebele, Fabian Suchanek. HAL Id: hal

Recommendation-Based Trust Model in P2P Network Environment

Setup of epiphytic assistance systems with SEPIA

Prototype Selection Methods for On-line HWR

YAM++ : A multi-strategy based approach for Ontology matching task

Relabeling nodes according to the structure of the graph

Service Reconfiguration in the DANAH Assistive System

Mokka, main guidelines and future

QAKiS: an Open Domain QA System based on Relational Patterns

Linked data from your pocket: The Android RDFContentProvider

A Logical Pattern for Integrating Business Intelligence into Information Systems Design

AgOnt: Ontology for Agriculture Internet of Things

FIT IoT-LAB: The Largest IoT Open Experimental Testbed

An FCA Framework for Knowledge Discovery in SPARQL Query Answers

Blind Browsing on Hand-Held Devices: Touching the Web... to Understand it Better

Analysis and Implementation of Embedded SNMP Agent

Real-Time and Resilient Intrusion Detection: A Flow-Based Approach

A Voronoi-Based Hybrid Meshing Method

A New Method for Mining High Average Utility Itemsets

Taking Benefit from the User Density in Large Cities for Delivering SMS

KeyGlasses : Semi-transparent keys to optimize text input on virtual keyboard

Application of Artificial Neural Network to Predict Static Loads on an Aircraft Rib

Search-Based Testing for Embedded Telecom Software with Complex Input Structures

Representation of Finite Games as Network Congestion Games

NP versus PSPACE. Frank Vega. To cite this version: HAL Id: hal

Comparator: A Tool for Quantifying Behavioural Compatibility

SIM-Mee - Mobilizing your social network

Open Source Software Developer and Project Networks

Scalewelis: a Scalable Query-based Faceted Search System on Top of SPARQL Endpoints

Robust IP and UDP-lite header recovery for packetized multimedia transmission

UsiXML Extension for Awareness Support

Malware models for network and service management

DSM GENERATION FROM STEREOSCOPIC IMAGERY FOR DAMAGE MAPPING, APPLICATION ON THE TOHOKU TSUNAMI

From Microsoft Word 2003 to Microsoft Word 2007: Design Heuristics, Design Flaws and Lessons Learnt

BugMaps-Granger: A Tool for Causality Analysis between Source Code Metrics and Bugs

Real-time FEM based control of soft surgical robots

Simulations of VANET Scenarios with OPNET and SUMO

How to simulate a volume-controlled flooding with mathematical morphology operators?

Regularization parameter estimation for non-negative hyperspectral image deconvolution:supplementary material

Assisted Policy Management for SPARQL Endpoints Access Control

MUTE: A Peer-to-Peer Web-based Real-time Collaborative Editor

A Practical Evaluation Method of Network Traffic Load for Capacity Planning

Study on Feebly Open Set with Respect to an Ideal Topological Spaces

Pipeline and Data Parallel Hybrid Indexing Algorithm for Multi-core Platform

The Animation Loop Station: Near Real-Time Animation Production

HySCaS: Hybrid Stereoscopic Calibration Software

Catalogue of architectural patterns characterized by constraint components, Version 1.0

A Methodology for Improving Software Design Lifecycle in Embedded Control Systems

YANG-Based Configuration Modeling - The SecSIP IPS Case Study

Preliminary analysis of the drive system of the CTA LST Telescope and its integration in the whole PLC architecture

Reverse-engineering of UML 2.0 Sequence Diagrams from Execution Traces

Very Tight Coupling between LTE and WiFi: a Practical Analysis

An Experimental Assessment of the 2D Visibility Complex

Modularity for Java and How OSGi Can Help

An Implementation of a Paper Based Authentication Using HC2D Barcode and Digital Signature

QuickRanking: Fast Algorithm For Sorting And Ranking Data

On Code Coverage of Extended FSM Based Test Suites: An Initial Assessment

Spectral Active Clustering of Remote Sensing Images

Comparison of radiosity and ray-tracing methods for coupled rooms

The Connectivity Order of Links

Mapping classifications and linking related classes through SciGator, a DDC-based browsing library interface

Decentralised and Privacy-Aware Learning of Traversal Time Models

Implementing Forensic Readiness Using Performance Monitoring Tools

Deformetrica: a software for statistical analysis of anatomical shapes

Teaching Encapsulation and Modularity in Object-Oriented Languages with Access Graphs

Scan chain encryption in Test Standards

Syrtis: New Perspectives for Semantic Web Adoption

On the Impact of Network Topology on Wireless Sensor Networks Performances - Illustration with Geographic Routing

X-Kaapi C programming interface

Comparison of spatial indexes

Structuring the First Steps of Requirements Elicitation

A General SDN-based IoT Framework with NVF Implementation

Stream Ciphers: A Practical Solution for Efficient Homomorphic-Ciphertext Compression

Branch-and-price algorithms for the Bi-Objective Vehicle Routing Problem with Time Windows

A case-based reasoning approach for unknown class invoice processing

Full Text Search Engine as Scalable k-nearest Neighbor Recommendation System

Traffic Grooming in Bidirectional WDM Ring Networks

A Component Data-Focused Method to Build the Executable Model for a DoDAF Compliant Architecture

Experimental Evaluation of an IEC Station Bus Communication Reliability

Transcription:

The Study and Implementation of Text-to-Speech System for Agricultural Information Huoguo Zheng, Haiyan Hu, Shihong Liu, Hong Meng To cite this version: Huoguo Zheng, Haiyan Hu, Shihong Liu, Hong Meng. The Study and Implementation of Textto-Speech System for Agricultural Information. Daoliang Li; Chunjiang Zhao. Third IFIP TC 12 International Conference on Computer and Computing Technologies in Agriculture III (CCTA), Oct 2009, Beijing, China. Springer, IFIP Advances in Information and Communication Technology, AICT-317, pp.291-296, 2010, Computer and Computing Technologies in Agriculture III.. HAL Id: hal-01055437 https://hal.inria.fr/hal-01055437 Submitted on 12 Aug 2014 HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Distributed under a Creative Commons Attribution 4.0 International License

THE STUDY AND IMPLEMENTATION OF TEXT-TO-SPEECH SYSTEM FOR AGRICULTURAL INFORMATION Huoguo Zheng,2,* 1, Haiyan Hu 1, 2,Shihong Liu 1, 2, Hong Meng 1, 2 1 Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing, P. R. China 100081 2. Key Laboratory of Digital Agricultural Early-warning Technology, Ministry of Agriculture, The People s Republic of China, Beijing, P. R. China 100081 * Corresponding author, Address: Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, Beijing, P. R. China, Tel: +86-10-82106263, Fax: +86-10-82106263, Email: huoguos@caas.net.cn Abstract: Keywords: The Broadcast and Television coverage has increased to more than 98% in china. Information services by radio have wide coverage, low cost, easy-to-grass-roots farmers to accept etc. characteristics. In order to play the better role of broadcast information service, as well as aim at the problem of lack of information resource in rural, we R & D the text-to-speech system. The system includes two parts, software and hardware device, both of them can translate text into audio file. The software subsystem was implemented basic on third-part middleware, and the hardware subsystem was realized with microelectronics technology. Results indicate that the hardware is better than software. The system has been applied in huailai city hebei province, which has conversed more than 8000 audio files as programming materials for the local radio station. text-to-speech;software;hardware device; microelectronics 1. INTRODUCTION After year s development, China has made the amazing progress in the development face the farmer's information service. Agricultural information service played a more positive role in solving the problem of "Farmers,

2 错误! 未找到引用源 Rural Areas and Agriculture Production (Guo Hongmin et al, 2007). In the recent two years, there had emerged a number of new agriculture information service patterns, which providing timely and accurate information for farmers by the media of TV, phone and network, such as integrated service pattern in Hebei province (Gao Jikui et al, 2005), farmer's mail in Zhejiang province (Wu Yitian et al, 2007), given number "12316" in Jilin province (Jilin farmer website, 2007), 110 in science and technology in Hainan province (Wu Yuanbin et al, 2007). However, we should see that the radio and TV are still playing the main role of information service infrastructure in rural. According to the latest statistics, China's radio and television coverage has increased to more than 98%. Compared with other services, broadcasting as a carrier of information services has a wide coverage and low cost, easy-to-grass-roots farmers to accept and so on. In the investigation and study we discovered that broadcasts the information service to emerge quietly in many places and the prospects for development are huge. To develop the broadcast information service well, we have researched and developed the rural broadcast information service system, including the network information acquisition, text-to-speech, the program arrangement; provide the information resource for the rural Broadcasting station. This article introduced the realization of text-to-speech system, which can transform the text information into the audio document, and provide program resource. 2. DESIGN OF TEXT-TO-SPEECH SYSTEM Text-to-speech system, also known as text speech synthesis system, concludes two sub-systems, TTS software and TTS hardware. The system s functions are shown in Figure 1. Both of them can construct audio database by means of translating the text user defined or export from database into audio. Fig.1 Text-to-speech system s functions

The Study and Implementation of Text-to-Speech System for Agricultural Information 3 2.1 Design of TTS software TTS software, which was implemented base on the third party middleware, can translate the user-defined text, text export from database into audio files. Its core function includes parameter setting, text-input, text-pretreatment and Text-to-Speech. 2.2 Design of TTS hardware TTS hardware sub-system includes the text speech synthesis hardware and the text-editing software. The synthesis hardware is overall system's core, which function is transforms the text information into the pronunciation, the text-editing software is the systematic input interface, which sent the text message to the hardware. Hardware must be associated with the computer through the USB interface to get power and adoption of the text-editing software to achieve text waiting to be converted between PC and hardware. 1 Design of hardware The hardware is the text-to-speech sub-system's foundation; its functions must be careful, comprehensive low-level hardware support. The hardware's connection principle is shown in Figure 2. The text speech synthesis hardware's main component include 8 microprocessor ATmega128 (Li Hua et al, 2008), the USB main line general connection chip, the XF-S4240 module, the low voltage audio power amplifier. Fig.2 The principle figure of text speech synthesis hardware

4 错误! 未找到引用源 2 Design of text-editing software The text-editing software is the controller of text data transmission between PC machine and the text speech synthesis hardware. After the process of edition, operation, pretreatment, the text can be accurately loaded, recognized, read by the outside loudspeaker. The test-editing software's major function includes: imports the text document (the.txt form), set special tags, preprocess the document, save the file and play voice. The text adopted by the special tags settings, you can effectively control the speed of play voice, tone, volume and so on, then have a clear, natural, accurate text-to-speech effect. 3. IMPLEMENTATION OF TEXT-TO-SPEECH SYSTEM 3.1 Implementation of text-to-speech software The text-to-speech software has implemented the following functions, text-edit, audition, audition pause, audition stop, save the text as file and translate the text into audio file, base on the object-oriented software design thought, as well as the consideration of the interface of the software. The work flow of the software is shown in Figure 3. Fig.3 The work flow of the text-to-speech software 3.2 Implementation of text-to-speech hardware 1 Implementation of software The hardware platform is composed of four components: the microprocessor, USB interface, the text speech synthesis module, as well as audio player. The hardware gets power and the text data by the USB interface from the computer, undergoes an 8-bit microprocessor system

The Study and Implementation of Text-to-Speech System for Agricultural Information 5 programming processing, to the information which receives carries on the correct judgment, the recognition again through the XF-S4240 text speech synthesis module, finally play the voice through the audio amplifier module. The audio can be recorded, preserved. The text speech synthesis hardware s work flow is shown in figure4. Fig.4 The work flow of text speech synthesis module The single-chip embedded program realized by ICCAVR (Ma Chao et al, 2007) programming in the course of hardware programming (Qin Changjiang et al, 2007), and its function is getting text data through USB interface by means of interrupts, then sent the text data to XF-S4240 module to synthesis pronunciation, in addition gains the transition status in interrupt service through the serial port, back to the PC machine in order to control the continued transmission of data. 2 Implementation of text-editing software Basic on the detailed analysis of whole text-to-speech system, we carry the work of design and realization on the text-editing software. According to the characteristic of the function, divides it into four modules: import text file, edit the tags, text speech synthesis and save the audio file. The text-editing software s work flow is shown in figure 5.

6 错误! 未找到引用源 Fig.5 The work flow of text-editing software While the text-to-speech system running, we input or import the text which will be deal into the pretreatment module of the text-editing software, set format tags and pre-standardized, and then sent the processed text to the hardware, to judge the accuracy, identification. Finally, the text will be converted to voice, then play. 4. DISCUSSIONS Results indicate that the hardware works better than software. The audio file transformed by hardware sub-system is smoother than software sub-system. The text-to-speech system has translated more than 8000 piece of agricultural science and technology information into audio file in huailai radio station hebei province. Those audio files provide for the local farmers as information service by broadcast. Good effect achieved, according to the local people. China's rural reform has entered a new period of development. The effect of agriculture information service in the promotion of rural development, agricultural efficiency and increase farmers in rural appear already. Agriculture information will play a more and more important role in rural area. All sectors of society participate in the growing movement. Under this kind of advantageous environment, how to display the function of traditional media fully, innovate rural information service pattern, is the question which the scientific research worker and the department of information service must consider.

The Study and Implementation of Text-to-Speech System for Agricultural Information 7 REFERENCES AIC Monaghan. Intonation in a text-to-speech conversion system. University of Edinburgh dissertation, 1991 Dennis H. Klatt. Review of text-to-speech conversion for English. The Journal of the Acoustical Society of America,1990(3):737-743 Guo hongmin, Pan yunhong. Agriculture information service practice and discussion[j]. Agricultural network Information, 2007,8(3) Gao jikui, Liu yongqiang, Cui jianyu. The three electricity and a hall information service system help the farmer be rich[j]. China agricultural information,2005(11) Jilin farmer website. 12316 new rural hotline resolve famer s problem[n]. Jilin agricultural and rural economic information, 2007(7) Kain, A.Macon, M.W. Spectral voice conversion for text-to-speech synthesis. Proceedings of the 1998 IEEE International Conference on Volume 1,1998:285-288 Li hua, Mo wei, Wang baoyou. Design of CAN-GPRS Gateway Based on ATMEGA128[J]. Micro-computer information, 2008,(3) LS Lee, CY Tseng, M Ouh-Young. The synthesis rules in a Chinese text-to-speech system. IEEE Transactions on Acoustics, Speech and Signal processing, 1989(9) Ma chao. AVR Monolithic integrated circuit embedded system principle and application practice[m]. The press of Beijing University of Aeronautics and Astronautics, 2007 PAGE J. H. BREEN A. P. The Laureate text-to-speech system : architecture and applications. BT technology journal,1996(14):57-67 Qin changjiang, Yu ziquan, Li yuquan. Design and Implement of the Communication Bus Interface between PLD and AVR in VHDL[J]. Micro-computer information, 2008,6-2 Wu yitian. farmer mailbox -- practice new countryside by means of informationization[j]. The construction of informationization,2007(3) Wu yuanbin, Wang jinhua. Agricultural science and technology 110 is the effective way to service the "Farmers, Rural Areas, Agriculture Production [J]. Agricultural science and technology communication, 2007(1)