GIS Application based on Cloud Storage for Atmospheric Environmental Monitoring

Similar documents
Course Content MongoDB

The Establishment of Large Data Mining Platform Based on Cloud Computing. Wei CAI

Multisource Remote Sensing Data Mining System Construction in Cloud Computing Environment Dong YinDi 1, Liu ChengJun 1

Research on the value of search engine optimization based on Electronic Commerce WANG Yaping1, a

A data-driven framework for archiving and exploring social media data

The Design and Implementation of Disaster Recovery in Dual-active Cloud Center

The course modules of MongoDB developer and administrator online certification training:

New research on Key Technologies of unstructured data cloud storage

Huge Data Analysis and Processing Platform based on Hadoop Yuanbin LI1, a, Rong CHEN2

A Training Simulator for PD Detection Personnel

How to Scale MongoDB. Apr

The Design of Water Quality Monitoring Cloud Platform Based on. BS Architecture

An Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc. The study on magnanimous data-storage system based on cloud computing

Design of Coal Mine Power Supply Monitoring System

Comprehensive analysis and evaluation of big data for main transformer equipment based on PCA and Apriority

Intel Solid State Drive Data Center Family for PCIe* in Baidu s Data Center Environment

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. Study on secure data storage based on cloud computing ABSTRACT KEYWORDS

Research on Mass Image Storage Platform Based on Cloud Computing

MongoDB Architecture

The Design and Application of GIS Mathematical Model Database System with Meta-algorithm Li-Zhijiang

Web-based Building Energy Consumption Monitoring and Conservation Service DONG Wei1, a

Research on Heterogeneous Communication Network for Power Distribution Automation

The Application of CAN Bus in Intelligent Substation Automation System Yuehua HUANG 1, a, Ruiyong LIU 2, b, Peipei YANG 3, C, Dongxu XIANG 4,D

Research on Power Quality Monitoring and Analyzing System Based on Embedded Technology

Research and Application of Mobile Geographic Information Service Technology Based on JSP Chengtong GUO1, a, Yan YAO1,b

Design and Implementation of Inspection System for Lift Based on Android Platform Yan Zhang1, a, Yanping Hu2,b

Construction of SSI Framework Based on MVC Software Design Model Yongchang Rena, Yongzhe Mab

Design of Physical Education Management System Guoquan Zhang

Charting the Progress of Smart City Development in Shanghai

Open Access Research on Algorithms of Spatial-Temporal Multi-Channel Allocation Based on the Greedy Algorithm for Wireless Mesh Network

Application of Theory and Technology of Wireless Sensor Network System for Soil Environmental Monitoring

Chapter 24 NOSQL Databases and Big Data Storage Systems

Research on Technologies in Smart Substation

MongoDB Essentials - Level 2. Description. Course Duration: 2 Days. Course Authored by CloudThat

A New Model of Search Engine based on Cloud Computing

The Discussion of 500kV Centralized Monitoring System for Large Operation and Large Maintenance Mode

Application of Nonlinear Later TV Edition in Gigabit Ethernet. Hong Ma

MongoDB - a No SQL Database What you need to know as an Oracle DBA

Traffic Flow Prediction Based on the location of Big Data. Xijun Zhang, Zhanting Yuan

MongoDB Backup & Recovery Field Guide

Performance improvement of sharding in MongoDB using k-mean clustering algorithm

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

Oracle NoSQL Database

Research on Computer Network Virtual Laboratory based on ASP.NET. JIA Xuebin 1, a

ITG Software Engineering

DESIGN AND IMPLEMENTATION OF TOURIST WEBGIS BASED ON J2EE

Study on Personalized Recommendation Model of Internet Advertisement

Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky

Energy efficient optimization method for green data center based on cloud computing

Document Object Storage with MongoDB

MongoDB. copyright 2011 Trainologic LTD

A Security Audit Module for HBase

Intelligent Control of Micro Grid: A Big Data-Based Control Center

MongoDB in AWS (MongoDB as a DBaaS)

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo

A CEP based ETL method of active distribution network operation monitoring and controlling signal data

Yunfeng Zhang 1, Huan Wang 2, Jie Zhu 1 1 Computer Science & Engineering Department, North China Institute of Aerospace

Research and Design of Data Storage Scheme for Electric Power Big Data

Construction Scheme for Cloud Platform of NSFC Information System

Research on the Knowledge Representation Method of Instance Based on Functional Surface

Group13: Siddhant Deshmukh, Sudeep Rege, Sharmila Prakash, Dhanusha Varik

A Design of Remote Monitoring System based on 3G and Internet Technology

Study on Ontology-based Multi-technologies Supported Service-Oriented Architecture

Research Article Mobile Storage and Search Engine of Information Oriented to Food Cloud

Research and Design of Education and Teaching Resource Management System based on ASP.NET Technology

CIS 612 Advanced Topics in Database Big Data Project Lawrence Ni, Priya Patil, James Tench

EMC Documentum xdb. High-performance native XML database optimized for storing and querying large volumes of XML content

Decision analysis of the weather log by Hadoop

Remote monitoring system based on C/S and B/S mixed mode Kaibing Song1, a, Yinsong Wang2,band Dandan Shang3,c

Intelligent Computer Room Management Platform Based on RF Card

Research on Design and Application of Computer Database Quality Evaluation Model

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

SQL, NoSQL, MongoDB. CSE-291 (Cloud Computing) Fall 2016 Gregory Kesden

RESEARCH AND APPLICATION OF DATA MINING TECHNOLOGY ON CONSTRUCTION PROJECT COST CONTROL SYSTEM

Personalized Search for TV Programs Based on Software Man

The Research of the Digital Environmental Protection Intellectualization Platform

Analyzing and Improving Load Balancing Algorithm of MooseFS

Online Teaching Evaluation System Design and Implementation

Industrial system integration experts with combined 100+ years of experience in software development, integration and large project execution

Designing High-Performance Data Structures for MongoDB

A Fast and High Throughput SQL Query System for Big Data

Framework Research on Privacy Protection of PHR Owners in Medical Cloud System Based on Aggregation Key Encryption Algorithm

OSM-SVG Converting for Open Road Simulator

Rapid Modeling of Digital City Based on Sketchup

Accelerate Database Performance and Reduce Response Times in MongoDB Humongous Environments with the LSI Nytro MegaRAID Flash Accelerator Card

Every SAS Cloud has a Silver Lining. Letting your data reign in the cloud

Jargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems

Introduction to the Active Everywhere Database

Unstructured Data Migration and Dump Technology of Large-scale Enterprises

What is database? Types and Examples

The Analysis and Design of the Object-oriented System Li Xin 1, a

Research on Software Scheduling Technology Based on Multi-Buffered Parallel Encryption

IMAGINE Objective. The Future of Feature Extraction, Update & Change Mapping

A Multi-Tenant Framework for Multimedia Conference System

An adaptive container code character segmentation algorithm Yajie Zhu1, a, Chenglong Liang2, b

Processing Technology of Massive Human Health Data Based on Hadoop

Scaling MongoDB. Percona Webinar - Wed October 18th 11:00 AM PDT Adamo Tonete MongoDB Senior Service Technical Service Engineer.

Construction and Application of Cloud Data Center in University

The Design and Optimization of Database

Joint Center of Intelligent Computing George Mason University. Federal Geographic Data Committee (FGDC) ESIP 2011 Winter Meeting

Transcription:

3rd International Conference on Material, Mechanical and Manufacturing Engineering (IC3ME 2015) GIS Application based on Cloud Storage for Atmospheric Environmental Monitoring Mei Han 1, a *,Ke Feng 2,b 1 Jining University Jining Shandong China 2 Jining University Jining Shandong China a hanmeisd@126.com, b fk831@163.com Keywords: environmental monitoring, GIS, cloud storage, MongoDB Abstract. GIS has been introduced to Jining s air quality automatic monitoring system. The system provides the client with easily available services of high quality. In order to meet the need of big data storage of the monitoring system we designed the distributed cloud storage framework to deploy MongoDB storage service. We designed the interface programs of GIS applications. The interface program make the client's cluster access to the MongoDB service more convenient and efficient and lays a foundation for the GIS to access the storage system rapidly. Introduction China is one of the serious air polluted countries, and the air quality in Jining is not optimistic either. In March 2015, the average concentration of fine particulate matter(pm2.5) in Jining is 71 ug/m 3, the average concentration of inhalable particulate(pm10)is 153 ug/m 3, the average concentration of SO 2 is 58 ug/m 3, and the average concentration of NO 2 is 41 ug/m 3. The number of the days with visible blue sky and white clouds is only 12. Facing the increasingly serious environmental pollution, the environmental protection workers of Jining have a lot of hard work to do. The most arduous one is the real-time, accurate monitoring on air quality and its changing trends and the supervision of pollution sources and their control. As the requirement of environmental monitoring is increasing day by day, environmental monitoring instruments can not only complete the online real-time monitoring and analysis but also adapt to multi task and multi parameter measurement, change monitoring objects at any time, accomplish different monitoring tasks, analyze the data transmission and exchange among monitoring instruments in order to realize the real-time measurement and control. There are 7 automatic monitoring systems for environmental air quality in urban areas of Jining. Based on this, Jining promotes actively the construction of the automatic monitoring stations for the atmospheric environmental quality in its administrative countries or cities. It has been planning or constructing new automatic monitoring systems for environmental air quality in 10 administrative countries or cities. The automatic monitoring for environmental air quality in the whole city will be realized in the near future. In recent years, the computer technology and terminal equipment have been developing rapidly. GIS started a new field of application in environmental protection, effective utilization of resources, natural disaster forecast, and military command etc. The initial GIS managed geographic information data simply in a documentary way. In the 70s of the 20th century, with the development of computer cartography, the using of databases to store and manage geographic data gradually replaced the traditional GIS data management [1]. At the end of 20th century, the Internet developed vigorously. GIS focused on system and function also kept its focus on service applications. At present GIS is heading towards distributed data storage, networked applications and popularization. With the help of cloud computing, GIS applications pay more attention to the improvement of efficiency. It makes full use of network data resources to create the greater developing opportunities. 2015. The authors - Published by Atlantis Press 972

The Introduction of Geographic Information System to Environmental Monitoring Geographic Information System (GIS) combines the functions of computer acquisition, storage, management, analysis, display and application of geographic information together to form a dedicated system [3], which fuses computer graphics with database system. Besides, it has been applied extensively in the field of environmental monitoring. The technology of GIS can be used to analyze and process the data of environmental monitoring. With its unique spatial analysis and visual expression function, GIS can meet the requirements of information processing that environmental monitoring demands, and can provide its users with readily available services of high quality. GIS consists of the following five parts: hardware, software, spatial data, user and application model. It has 5 basic functions: data input, data edit, data storage and management, spatial query and spatial analysis, visual expression and output [4]. Among these functions, the key one is the spatial query and spatial analysis. The spatial analysis of GIS is unique, which makes it different from other computer graphic software. GIS is not only an information system, but also a management system for the analysis and description of geographical space [5]. Comparing with other computer technology, GIS has a relatively late start. But it is successfully applied in many fields [2] with its amazing development speed. According to the different ways of research and application, GIS can be divided into two types. One is to process the user s data by GIS. Another is to provide the user with services by dedicated GIS software which is further developed based on GIS. Cloud storage framework to build MongoDB GIS is an interdiscipline of geographic information technology and computer technology. Its rapid rise has led to the development of geographic spatial data storage. The status of big data in GIS applications is becoming increasingly important. The development of cloud computing drives the development of cloud storage services. With the development of cloud storage, NoSQL achieves rapid popularity. Many large enterprises and network giants select different database products according to their actual needs. MongoDB, as the leader of NoSQL, naturally becomes their preferred database. The author elaborates the method of spatial data storage from the aspects of GIS development, applications and characteristics. According to the development of NoSQL, MongoDB is chosen as the database to store the environment monitoring data. A distributed cloud storage framework is used to build MongoDB. The performance advantages of storing geographic spatial data based on Cloud Storage are completely demonstrated. MongoDB is a document database among the four types of databases of NoSQL. It has all the characteristics of NoSQL, and it plays a leading role in NoSQL with its rich functions. Some of its characteristics are different from other products of NoSQL. It has a high place in the practical applications of NoSQL. The service support of MongoDB can run under both Linux and Windows, no matter 32 bit or 64 bit. When running under the system of 32 bit, MongoDB has a limit of maximum file size, that is 2GB. There is no size limit to its documents while running under the system of 64 bit. MongoDB replaces the concept of "line" in the relational database to "document" model. Such a storage model would be more flexible. Its unique data storage structure makes it possible to nest a document or an array. The complex hierarchy can be represented by one record. What is stored in is the very loose data structure similar to JSON format of BSON. It is easy to adjust data structure and data storage model flexibly. Some complex types of data structure can be easily stored. MongoDB has a powerful language query manner similar to object-oriented query, which can satisfy most of the queries in relational database[6]. In order to simplify the management of the database, MongoDB s autonomous management of the server is designed. In operation, there is only one action to start the server at the very beginning. If the main server is out of work, MongoDB will start a backup server to complete the work of the main server. These are completed automatically under the MongoDB s internal mechanism. Meanwhile the backup server will be upgraded to the current service supporting 973

server. Under the distributed environment, the cluster server will automatically configure a new node in the case of knowing an add node. Deploy Storage Server MongoDB cluster consists of config server,, mongos, clients and arbiter etc. Specific configuration is analyzed and deployed based on the characteristics of different data and the amount of data. Fig. 1 show the structure of MongoDB s distributed cluste. shard 1 Shard 2 Shard 3 Shard 4 Replica Set config servers C 1 C 2 mongos mongos C 3 client Fig. 1 the structure of MongoDB s distributed cluste Config Server. It mainly stores the cluster metadata information, including each shard, shard s server and the basic information of chunk. There is a segment on config server to backup information, which ensures the consistency of the data on config server[7]. Mongod: It is the shard server of MongoDB. It mainly stores fragment data, namely data block. Shards: It is the fragment of MongoDB. Two s form a fragment, that is a group of data. The data in the two s that belong to one fragment is the same. A group of can be designed into a master-slave structure or mutual master-slave, which ensures the consistency of data. In addition, the data on each fragment belongs to certain range. For environmental monitoring data, the data of a city can be regarded as one fragment. According to the size of the data, the fragmentation deals with the load balancing by automatic segmentation or data s autonomous transfer. Mongos: It is the route server of MongoDB. It is mainly responsible for routing and coordination work as a console. So the whole cluster looks more like a database services system. In practical applications, there can be multiple mongos, and it can be placed on any server. After starting the mongos, it gets the metadata information of fragment or block from config servers, and then it responds to the clients requests. The content of the request is routed to the corresponding data storage server via shard server. Finally the searched results are fed back to the clients. In the deployment scheme of distributed cloud storage for the environment monitoring data, we adopts the method that the insertion and update are separated from the query and access and it deploys separately multiple reading and writing server. This separation method has solved the problem of the efficiency of monitoring data storage and the reliability and extension of the database service. Using the deployment architecture model of Replica Set + Sharding ensures the high usability of the storage of monitoring data in MongoDB[9]. The data of environmental monitoring is numerous. To update spatial data regularly in the change of geographic spatial information, this architecture opens up the possibility for the horizontal extension of the database. At the same time, if the fragment needs adding just to change the corresponding replset name. This is very convenient for the expansion of our data 974

storage. At present a lot of large data storages are basically using this architecture to deploy MongoDB storage system. The Designing of Clients Module Environmental monitoring data is stored on a distributed server of Cloud. If GIS application on the clients have a need to visit or access to environment monitoring data, it should be achieved through the interface program. Thus, We need to design a file interface system for this kind of storage architecture on the clients to adapt to the clients access requirements. And a mature and efficient interface program is rather important to promote the performance of the storage system. Our interface program is written in C++. By this program, we can write new environmental monitoring data and a large quantity of data initially, update data, query and delete outdated data according to different requirements. MongoDB Service Configuration Access. The interface program needs to be connected with the MongoDB server first. Distributed deployment requires users to open the MongoDB service, and then access to the Mongo DB service configuration through the program. In order to achieve access to the data, the program needs to handle the information of port, Ip and user ect. first, and then access to Mongo DB server. The program is connected to the Mongo DB database through the Get Connection() function. Specific configuration services are stored in the configuration file gis_conf. By obtaining the configuration information in gis_conf, the default database connection is obtained. Specific core pseudo codes are shown as Get Connection(): Get Connection() if(conf.empty()) { Read Config("gis_conf", CONF); } string host = CONF["host"]; string port = CONF["port"]; string database = CONF["db"]; string user = CONF["user"]; string password = CONF["password"]; DBClient Connection * dbconn = new DBClient Connection(true); conn(dbconn); dbconn ->connect(host + ":" + port); dbconn ->auth(db, user, password, err); return conn; } Write Data Interface Program. The single geographic spatial data or a batch of spatial data is written to the MongoDB storage files. Geographic spatial data and attribute data of geographic information are written to Mongo DB. Write () function and related parameters are as follows: int write (char *fname, unsigned int type,point center, Point * boundary, unsigned int boundary_size, Relation *relations, char *attributes, unsigned int attributes_size, File Object * fo) Among those, fname is file name, center is the center coordinate of spatial object, boundary is the anchor point coordinate of spatial geographical object, relations are the ones of an object and the other one; attributes are data sets of object s attributes, fo is a file object set. Read Data Interface Program. The specific implementation of the reading interface is based on the needs of GIS applications. The reading of environmental monitoring data can be divided into the following categories: query by file name, query by point including this point, query objects related to certain point by relation, and query all the objects included in the range by range. For different requirements of applications, different query interfaces are designed. Delete Data Interface Program. The deleting of interface program is similar to the reading of interface program, which requires to delete according to different fields. It can be divided into 3 975

categories: delete according to file name, delete object data including the point according to point, delete object data included in the range according to range. The client is the operating platform of environmental monitoring system. The sample data obtained from the server are displayed in various forms in the client, which provides data for monitoring analysis. The client is an analysis Center as well. All kinds of analysis functions can help to draw a conclusion. GIS application program is responsible for receiving the client s logic control to the program, proposing transfer requests, controlling of the data analysis methods and results showing forms. The designing and using of the interface program for GIS applications realizes the clients convenient access to the MongoDB s cluster service. At the same time, the access to the data is optimized at the code level. All those lay a solid foundation for the GIS to access the storage system rapidly. Summary The integrated application of distributed cloud storage framework and GIS system for environmental monitoring data has obvious advantages comparing with the traditional storage structure in dealing with large amounts of data. In a certain period of time, the system can meet the requirements of Jining s environmental monitoring system in the storage performance and efficiency. It will greatly promote the healthy development of Jining s environmental protection work. References [1] Quantum GIS. Development Team, 2012. Quantum GIS Geographic Information System. OpenSource Geospatial Foundation Project [J]. URL: [http://qgis.osgeo.org], 2011. [2] Kosters G, Pagel B U, Six H W. GIS-application development with Geo00A [J]. InternationalJournal of Geographical Information Science, 1997, 11(4): 307-335. [3] Zhong-wei Sun, Li-wei Shao, Chun-liang Hou. Information and communication technology on the development of GeographyJournal of Hebei Normal University: Natural Science Edition, 2004, 28(5): 533-537. [4] Jian-shong Li. Geographic Information System Principle [M], Wuhan University press, 2006. [5] De Smith M J, Goodchild M F, Longley P. Geospatial analysis: a comprehensive guide to principles, techniques and software tools [M]. Troubador Publishing Ltd, 2007. [6] Membrey P, Plugge E, Hawkins T. The definitive guide to MongoDB: the nosql database for cloud and desktop computing [M]. Apress, 2010. [7] Shuai Tian. Design and implementation of a large scale remote sensing data storage system based on MongoDB and HDFS [D]. Zhejiang University, 2013. [8] Yang C, Good child M, Huang Q, et a1.spatial cloud computing: how can the geospatial sciences use and help shape cloud computing [J]. International Journal of Digital Earth, 2011, 4(4): 305 一 329. [9] Lin J X,Chen C C,Wu X Z, etal. GeoKSGrid: A geographical knowledge grid with functions of spatial data mining and spatial decision[c].2011 IEEE International Conference on Spatial Dat a Mining and Geographical Knowledge Services(ICSDM),2011,121 一 126. 976