The Establishment of Large Data Mining Platform Based on Cloud Computing. Wei CAI
2017 International Conference on Electronic, Control, Automation and Mechanical Engineering (ECAME 2017)

101 University East Road, Nanning, Guangxi, China

Keywords: Cloud computing, Big data, Mining platform, Establish.

Abstract. Data generated in the internet era is massive and grows exponentially. Traditional data mining is centralized and serial, which makes it unsuitable for data at this scale. We therefore study data mining methods that improve the efficiency of information acquisition, analysis and processing, in order to renew data mining technology for the big data era.

The Concept of Big Data Mining Platform Based on Cloud Computing

Definition of Cloud Computing

To study how to establish a data mining platform in a cloud computing environment, we first need to clarify what cloud computing is. Cloud computing is a computing model that emerged from internet technology. Its main features are dynamic, scalable data processing in a virtualized environment, which gives it storage and data processing advantages that other technologies cannot match and makes it much easier for users to process data. Parallel computing and distributed computing are the two basic technologies on which cloud computing developed.

Data mining is a technology for extracting from massive data the effective and valuable information that users need, including knowledge, data and summary information. This information serves as guidance for users. Data mining is applied in many fields, most often for decision analysis.
Data mining has two kinds of tasks: prediction and description. A prediction task predicts the value of one target attribute; a description task summarizes the relationships within the data and describes them.

Big data, mentioned above, is a term for massive data. It is mainly used to define and describe the massive data of the data explosion era, and to improve the way users deal with such data, and with it efficiency, effectiveness and economy. The development of the mobile internet has produced big data, which can generate huge benefits and value for us. Beyond sheer volume, big data is characterized by many varieties, high value and fast processing speed. Cloud computing and big data therefore depend on each other: cloud computing is the foundation of big data, and big data is an important application of cloud computing.

Characteristics of Cloud Computing

One main feature of a cloud computing platform is virtualization: the information and data that users work with are virtual rather than physical. Cloud computing also lets users access the data and services they need at any time and from any location. In addition, cloud computing is economical: the equipment a cloud computing platform requires is cheap to procure. These characteristics explain why cloud computing is so widely used and so familiar.
In addition to the above three important characteristics, cloud computing is highly reliable and practical, so its popularity has grown steadily in recent years.

Concept of Data Mining Technology

Data mining is an innovative technology that has developed in the big data and cloud computing environment. Using advanced techniques, it obtains from tedious, complex data the valuable information that users need and can use. It can discover the knowledge hidden behind big data, so its data analysis and processing ability is strong, and it supports scientific decision-making by leadership. "Mining" here has a broad meaning: it covers distinguishing data, establishing structural associations, cluster analysis, evolution analysis and classification analysis. Its main application areas include e-commerce, telecommunications, medicine and the military.

Concept of Big Data

Big data refers to huge data, where "huge" has two aspects: large quantity and great variety. The things around us, whether an image or a gene, are all presented by the computer in the form of data.

Overview of Cloud Computing Data Mining Platform

Cloud computing has been applied and developed in many industries, the internet and mobile communication technology have developed greatly, and intelligent technology has also made great progress. On the basis of these rapidly developing technologies, data mining has advanced as well. It is moving toward digitalization, and research on digital network technology is also deepening.
The data mining technology in this paper applies unified, standard modeling to the cloud computing and data mining system, so that the data mining platform runs efficiently and yields the valuable data we need. Digital programming technology is one foundation of the data mining platform, and cloud computing technology is another. The platform involves many intelligent devices that are safe, reliable and low-carbon, and the network communication platform is its system foundation. The mining platform does not only mine data; it also samples, protects and records data and performs other processing. It adjusts the data intelligently according to the different needs of users and interacts with other systems, which improves its application functions.

Intelligent programming takes cloud computing technology as its platform and the "programming communication network and system" as its series standard. It defines a unified and standardized information interaction model for programming automation. This new model implements unified modeling on the mining platform, solves the problem of interaction between different devices, and lays a technical foundation for integrating and sharing information in intelligent programming.

The modeling method under the cloud computing environment is a standard modeling method. The system programming in intelligent programming integrates and shares the various kinds of data in big data, including state data, monitoring data and detection data, to form an information platform for integrated programming. Through this system programming, the data on the data mining platform can be kept complete, accurate and reliable.
In a cloud computing data mining platform, we need comprehensive monitoring, control and management of the programming system in order to track the operational status of the data. As the functions of intelligent systems become more complete, the structure of the data mining platform becomes more complex, and as the big data environment generates more and more information, users demand more of the reliability and sharing of data. To share these data, the functional modules of the mining platform need a database system to maintain them. The data come from many sources and differ in structure, type and form, and these differences strongly affect the automation of the programming system. The problems our data mining platform needs to solve are therefore how to transform complex data into structured data, and how to manage the data centrally, so that big data can be exchanged and shared over the network more efficiently.

The Importance of Cloud Computing in Big Data Mining

As noted above, the amount of data in the big data environment keeps increasing, and so does the difficulty of judging the value of the data around us. Within a large amount of data, only a small part is effective data that we can use. In the big data era, information matters more than ever, but the value we need is buried in a great deal of tedious data, and we must dig it out with mining technology. We call the massive data in big data low-value data; what we dig out is their potential value. The process of data mining is tedious and needs many statistical and analytical methods. Data mining traverses a large amount of data and performs various operations on it, which involves solving for or optimizing model parameters, in order to obtain valuable statistics. Done by hand, without a data mining platform, this would cost a great deal of time and energy.

The main problem in the big data environment is that the complexity of the data keeps rising while the capability of data processing technology falls further behind. This contradiction is becoming more and more apparent, and traditional data processing platforms and technologies no longer meet the needs of social development.
Cloud computing emerged at this point. It changed the slow and inefficient systems of the past: it has a very high capacity for dynamic resource allocation, and its virtualization and high efficiency meet the data processing requirements of the big data era very well. Data mining platforms in the big data era are therefore inseparable from cloud computing technology.

Cloud computing is called "cloud" because it processes large and complex amounts of data through "clouds", which are composed of many computers that pool computing power and storage capacity and offer them to customers. This also greatly improves the efficiency of data acquisition. Data mining builds on data acquisition: it processes a large amount of incomplete data and then screens and optimizes it, so that we can extract the information we need. Large amounts of data require the data mining platform to have strong storage capacity, so combining cloud computing with the data mining platform is inevitable. A cloud computing platform solves this problem: on the one hand it saves computing and storage costs, and on the other it greatly improves the storage efficiency of data. This is an important innovation over the traditional data mining platform.

There are many types of cloud computing data mining platform, but all serve one purpose: high-speed data mining and processing. This is where the concept of "parallel computing" comes in. We can view the cloud computing platform as a virtual resource pool that can carry out computation on large amounts of data. Cloud computing distributes the work across a number of computers, and resource utilization increases with this distribution.
On the basis of cloud computing technology, we need to build the data mining platform on a parallel computing architecture. This requires data fragmentation: the data is allocated to individual nodes, and the results are finally collected and maintained in a unified way through a system platform. The algorithm on each node is not fixed; each part of the data may be processed by a different algorithm. Parallel distributed algorithms make this mining technology more flexible, which previous mining platforms lacked. This new parallel computing data mining platform lets us operate on massive data.

Establishment of Large Data Mining Platform Based on Cloud Computing

Below, we discuss the technology of big data mining in the cloud computing environment.
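The data fragmentation scheme described above, in which data is allocated to nodes, processed in parallel and then collected centrally, can be illustrated with a minimal single-machine sketch. It is not the platform's implementation: threads stand in for cluster nodes, and the function names (`fragment`, `mine_shard`, `mine_in_parallel`) and the choice of item counting as the per-node algorithm are assumptions made for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def fragment(records, num_nodes):
    """Split records into num_nodes shards (round-robin allocation)."""
    return [records[i::num_nodes] for i in range(num_nodes)]


def mine_shard(shard):
    """Per-node work (hypothetical): count item occurrences in one shard."""
    return Counter(shard)


def mine_in_parallel(records, num_nodes=4):
    """Fragment the data, process each shard concurrently, merge the results."""
    shards = fragment(records, num_nodes)
    # Threads stand in for separate nodes in this sketch.
    with ThreadPoolExecutor(max_workers=num_nodes) as pool:
        partial_counts = list(pool.map(mine_shard, shards))
    # Unified collection step: combine the per-node results.
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total
```

Because each shard is processed independently, `mine_shard` could be swapped for a different function per node, which matches the point that the algorithm on each node is not fixed.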
Distributed Parallel Technology

Cloud computing technology is based on distributed file storage and parallel computing, and the efficiency of data processing relies on distributed file storage. An early and influential distributed file system of this kind is GFS, developed by Google Inc. in the United States. HDFS and KFS followed, with GFS as their theoretical foundation, and both are now widely applied in business and academia. For parallel computing, the MapReduce programming model developed by Google is the most widely used. It encapsulates details such as data distribution and task execution, but the processing logic must first be coded against the model, after which the user invokes it. This method of parallel computing does not fit every field. For example, it has difficulty with computations on relational data, mainly because it lacks mature data processing tools. We therefore need to further develop the tools required for parallel operations in order to broaden the range of data it can process. For processing massive data at low cost, parallel and distributed computing is the more applicable and more effective approach.

Cloud computing builds on parallel computing, distributed computing and grid computing, and is a realization of these computing concepts. As noted earlier, cloud computing distributes computing tasks across a large number of terminals, so that applications can obtain the computing power, storage and other aggregated service resources they need to meet users' needs.
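To make the MapReduce programming model mentioned above more concrete, here is a minimal in-memory sketch of its three phases (map, shuffle, reduce) applied to word counting. It runs on a single machine and does not use Hadoop or any Google API; the function names are illustrative assumptions only.

```python
from collections import defaultdict


def map_phase(record):
    """Map: emit a (key, 1) pair for every word in one input record."""
    for word in record.split():
        yield word.lower(), 1


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)


def map_reduce(records):
    """Run map, shuffle and reduce in sequence over the input records."""
    pairs = (pair for record in records for pair in map_phase(record))
    groups = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in groups.items())
```

In a real cluster the map and reduce calls run on different machines and the framework performs the shuffle; the user supplies only the two functions, which is the encapsulation the text refers to.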
According to experts in the field, cloud computing, in the internet big data environment, allocates computing tasks over a pool of computing resources on behalf of every user. These resources include computing power, storage capacity and other architectural forms; they are dynamic, variable and virtual, and are provided to customers as a service. In general, this amounts to a distributed data mining framework.

A cloud computing system can be divided, from top to bottom, into several parts, including a distributed file system, a parallel programming environment and distributed system management. The data acquisition layer is mainly responsible for collecting data from various data sources and making the data resources available. The data cleaning layer mainly handles data processing, redundant data processing, and extraction and transformation operations. The parallel analysis layer is mainly responsible for operations such as defining data dimensions and association rules.

Hadoop provides a platform that eases development for programmers and improves the efficiency of processing massive data. HDFS is its distributed file system for storing large data, with high reliability and strong fault tolerance. MapReduce is an efficient parallel programming model. Based on this programming model, the parallel data mining platform PDMiner was developed: HDFS stores the large amounts of data, and MapReduce is used mainly for preprocessing the data and running the data mining algorithms. PDMiner builds its data mining platform on parallel algorithms. Parallelism exists at two levels: different algorithms run side by side, and individual algorithms are also parallel internally.
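The operations attributed above to the data cleaning layer (handling incomplete data, removing redundant data, and transforming extracted values) might look like the following sketch. The record layout with `id` and `value` fields and the function name `clean` are hypothetical and chosen only to illustrate the idea; they do not come from PDMiner or Hadoop.

```python
def clean(records, required=("id", "value")):
    """Drop incomplete and duplicate records, then normalize value types.

    `records` is a list of dicts; the `id` and `value` field names are
    assumptions made for this illustration.
    """
    seen_ids = set()
    cleaned = []
    for rec in records:
        # Skip incomplete records: a required field is missing or empty.
        if any(rec.get(field) in (None, "") for field in required):
            continue
        # Skip redundant records: the same id has already been kept.
        if rec["id"] in seen_ids:
            continue
        seen_ids.add(rec["id"])
        # Transformation step: store the value as a float.
        cleaned.append({"id": rec["id"], "value": float(rec["value"])})
    return cleaned
```

The cleaned output is what a later analysis step would consume, in the same way that ETL results feed the mining algorithms.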
As a parallel data mining platform, PDMiner has an overall system architecture divided into four subsystems: the workflow subsystem, the user interface subsystem, the parallel ETL algorithm subsystem and the data mining algorithm subsystem. The workflow subsystem provides a variety of interfaces through which users define their mining tasks. The user interface subsystem mainly loads the parallel ETL and parallel data mining algorithms and lets users set algorithm parameters; its result display module is used to analyze the mining results, which in turn guide the decision-making of leadership. The core of PDMiner consists of the parallel ETL algorithm subsystem and the data mining algorithm subsystem, which process the data stored in HDFS. Finally, the results produced by the ETL algorithms can be used as input to the data mining algorithms.

Data Mining Algorithm

As the key technology of a large data mining platform, data mining algorithms draw on a number of fields, including statistics, artificial intelligence and pattern recognition. The methods we commonly use are statistical analysis, decision trees and neural networks. We consider each in turn. Statistical analysis examines statistical regularities through indicators such as the maximum and minimum, mean, variance and correlation, and is a relatively simple method. A decision tree classifies the data and thereby describes it simply and quickly. A neural network, through its strong self-learning, self-organizing and adaptive abilities, lets us find associations in the data and make predictions. Different algorithms thus have different strengths and suit different data processing tasks, and we can choose and combine algorithms according to the user's needs.

Figure 1. Data mining platform architecture based on cloud computing.

The Architecture and Implementation of Cloud Computing Data Mining Platform

The data mining platform is a new-generation transformation made possible by the distributed storage and distributed computing of cloud computing. The system framework adopts a three-tier architecture: the cloud computing support platform layer, the data mining capability layer and the data mining cloud service layer. The support platform layer mainly handles the underlying database operations. The capability layer mainly holds the classes and methods that process large amounts of data, and exposes interfaces to the service layer so that they are easy to call. The three-tier architecture has many advantages. Maintenance is convenient: if new features are added to the system, we only need to add methods and classes, and modifying the interface is simple. Layering the system also improves its security.
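The three-tier design can be sketched in a few lines, using the simple statistical indicators mentioned earlier (extreme values, mean, variance) as the one capability. This is an assumption-laden illustration, not the platform's code: the class names `SupportPlatform`, `CapabilityLayer` and `ServiceLayer`, the in-memory dictionary standing in for the database, and the request format are all hypothetical.

```python
import statistics


class SupportPlatform:
    """Bottom tier: stands in for storage (an in-memory dict in this sketch)."""

    def __init__(self):
        self._tables = {}

    def store(self, name, values):
        self._tables[name] = list(values)

    def load(self, name):
        return self._tables[name]


class CapabilityLayer:
    """Middle tier: data processing methods built on the support platform."""

    def __init__(self, platform):
        self._platform = platform

    def summarize(self, name):
        """Return simple statistical indicators for one stored table."""
        values = self._platform.load(name)
        return {
            "min": min(values),
            "max": max(values),
            "mean": statistics.mean(values),
            "variance": statistics.pvariance(values),  # population variance
        }


class ServiceLayer:
    """Top tier: the interface that callers use; it only delegates."""

    def __init__(self, capability):
        self._capability = capability

    def handle(self, request):
        if request["task"] == "summary":
            return self._capability.summarize(request["table"])
        raise ValueError(f"unknown task: {request['task']}")
```

Adding a new feature in this layout means adding a method to `CapabilityLayer` and one more branch in `ServiceLayer.handle`, which is the maintenance advantage the text describes.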
The resource database handles structured data, which reduces data complexity and gives high data independence. It can deal with massive data and supports extending and indexing data. We therefore use a relational database to model the data, mainly through the SCL configuration file. The database provides data mining, backup, recovery and other technologies, with which we can also process the data. After association processing, the information in SCL files can be imported into the database; this is the standard for the cloud computing data model. Finally the data needs to be exported, and in this way the cloud computing system achieves file reuse well. The table is the basic structure: it determines how each module in the database, including the display module and other functional modules, extracts data, and when the SSD file is described, the table can implement the various operations on the data.

Conclusion

In the big data age we must have technology for dealing with big data, and cloud computing came into being to meet this need. On this foundation, many data mining platforms have been developed, such as the parallel distributed data mining platform PDMiner. There are also many parallel data mining algorithms, including association rule, classification and clustering algorithms, and many data mining systems have been developed on cloud computing data mining application platforms. Much work is still needed to extend the data mining platform. At present its cost is high, and using it requires a certain level of technical skill. Cloud computing has further improved the data mining platform: it simplifies the data mining system, reduces cost and allows broad participation by users of all kinds. The data mining platform has thus made much progress, and in the face of the ever-changing field of data we need to keep improving it in step with the times.
An Indian Journal FULL PAPER. Trade Science Inc. Research on data mining clustering algorithm in cloud computing environments ABSTRACT KEYWORDS
[Type text] [Type text] [Type text] ISSN : 0974-7435 Volume 10 Issue 17 BioTechnology 2014 An Indian Journal FULL PAPER BTAIJ, 10(17), 2014 [9562-9566] Research on data mining clustering algorithm in cloud
More informationAn Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc. The study on magnanimous data-storage system based on cloud computing
[Type text] [Type text] [Type text] ISSN : 0974-7435 Volume 10 Issue 11 BioTechnology 2014 An Indian Journal FULL PAPER BTAIJ, 10(11), 2014 [5368-5376] The study on magnanimous data-storage system based
More informationHuge Data Analysis and Processing Platform based on Hadoop Yuanbin LI1, a, Rong CHEN2
2nd International Conference on Materials Science, Machinery and Energy Engineering (MSMEE 2017) Huge Data Analysis and Processing Platform based on Hadoop Yuanbin LI1, a, Rong CHEN2 1 Information Engineering
More informationNew research on Key Technologies of unstructured data cloud storage
2017 International Conference on Computing, Communications and Automation(I3CA 2017) New research on Key Technologies of unstructured data cloud storage Songqi Peng, Rengkui Liua, *, Futian Wang State
More informationDesign of student information system based on association algorithm and data mining technology. CaiYan, ChenHua
5th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering (ICMMCCE 2017) Design of student information system based on association algorithm and data mining technology
More informationOpen Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing Environments
Send Orders for Reprints to reprints@benthamscience.ae 368 The Open Automation and Control Systems Journal, 2014, 6, 368-373 Open Access Apriori Algorithm Research Based on Map-Reduce in Cloud Computing
More informationCrop Production Management Information System Design and Implementation
2016 International Conference on Computer, Mechatronics and Electronic Engineering (CMEE 2016) ISBN: 978-1-60595-406-6 Crop Production Management Information System Design and Implementation Na ZHANG *,
More informationDecision analysis of the weather log by Hadoop
Advances in Engineering Research (AER), volume 116 International Conference on Communication and Electronic Information Engineering (CEIE 2016) Decision analysis of the weather log by Hadoop Hao Wu Department
More informationResearch and Application of E-Commerce Recommendation System Based on Association Rules Algorithm
Research and Application of E-Commerce Recommendation System Based on Association Rules Algorithm Qingting Zhu 1*, Haifeng Lu 2 and Xinliang Xu 3 1 School of Computer Science and Software Engineering,
More informationMultisource Remote Sensing Data Mining System Construction in Cloud Computing Environment Dong YinDi 1, Liu ChengJun 1
4th International Conference on Computer, Mechatronics, Control and Electronic Engineering (ICCMCEE 2015) Multisource Remote Sensing Data Mining System Construction in Cloud Computing Environment Dong
More informationDesign and Realization of Data Mining System based on Web HE Defu1, a
4th International Conference on Machinery, Materials and Computing Technology (ICMMCT 2016) Design and Realization of Data Mining System based on Web HE Defu1, a 1 Department of Quartermaster, Wuhan Economics
More informationData Mining in the Application of E-Commerce Website
Data Mining in the Application of E-Commerce Website Gu Hongjiu ChongQing Industry Polytechnic College, 401120, China Abstract. With the development of computer technology and Internet technology, the
More informationThe Design and Implementation of Disaster Recovery in Dual-active Cloud Center
International Conference on Information Sciences, Machinery, Materials and Energy (ICISMME 2015) The Design and Implementation of Disaster Recovery in Dual-active Cloud Center Xiao Chen 1, a, Longjun Zhang
More informationDesign and Implementation of Agricultural Information Resources Vertical Search Engine Based on Nutch
619 A publication of CHEMICAL ENGINEERING TRANSACTIONS VOL. 51, 2016 Guest Editors: Tichun Wang, Hongyang Zhang, Lei Tian Copyright 2016, AIDIC Servizi S.r.l., ISBN 978-88-95608-43-3; ISSN 2283-9216 The
More informationYunfeng Zhang 1, Huan Wang 2, Jie Zhu 1 1 Computer Science & Engineering Department, North China Institute of Aerospace
[Type text] [Type text] [Type text] ISSN : 0974-7435 Volume 10 Issue 20 BioTechnology 2014 An Indian Journal FULL PAPER BTAIJ, 10(20), 2014 [12526-12531] Exploration on the data mining system construction
More informationThe power quality intelligent monitoring system based on cloud computing Jie Bai 1a, Changpo Song 2b
International Conference on Intelligent Systems Research and Mechatronics Engineering (ISRME 2015) The power quality intelligent monitoring system based on cloud computing Jie Bai 1a, Changpo Song 2b State
More informationWeb Data Mining based on Cloud Computing
Web Data Mining based on Cloud Computing Liangfei XUE 1 Dongfeng Yuan 2 Mingyan Jiang 3 Abstract With the recent success of cloud computing, data mining is going to be more accessible due to easier access
More informationHierarchy of knowledge BIG DATA 9/7/2017. Architecture
BIG DATA Architecture Hierarchy of knowledge Data: Element (fact, figure, etc.) which is basic information that can be to be based on decisions, reasoning, research and which is treated by the human or
More informationNext-generation IT Platforms Delivering New Value through Accumulation and Utilization of Big Data
Next-generation IT Platforms Delivering New Value through Accumulation and Utilization of Big Data 46 Next-generation IT Platforms Delivering New Value through Accumulation and Utilization of Big Data
More informationResearch on Mass Image Storage Platform Based on Cloud Computing
6th International Conference on Sensor Network and Computer Engineering (ICSNCE 2016) Research on Mass Image Storage Platform Based on Cloud Computing Xiaoqing Zhou1, a *, Jiaxiu Sun2, b and Zhiyong Zhou1,
More informationConstruction and Application of Cloud Data Center in University
International Conference on Logistics Engineering, Management and Computer Science (LEMCS 2014) Construction and Application of Cloud Data Center in University Hong Chai Institute of Railway Technology,
More informationStudy on the Application Analysis and Future Development of Data Mining Technology
Study on the Application Analysis and Future Development of Data Mining Technology Ge ZHU 1, Feng LIN 2,* 1 Department of Information Science and Technology, Heilongjiang University, Harbin 150080, China
More informationDesign and Implementation of High-Speed Real-Time Data Acquisition and Processing System based on FPGA
2nd International Conference on Social Science and Technology Education (ICSSTE 2016) Design and Implementation of High-Speed Real-Time Data Acquisition and Processing System based on FPGA Guojuan Zhou
More informationA Data Classification Algorithm of Internet of Things Based on Neural Network
A Data Classification Algorithm of Internet of Things Based on Neural Network https://doi.org/10.3991/ijoe.v13i09.7587 Zhenjun Li Hunan Radio and TV University, Hunan, China 278060389@qq.com Abstract To
More informationEnergy efficient optimization method for green data center based on cloud computing
4th ational Conference on Electrical, Electronics and Computer Engineering (CEECE 2015) Energy efficient optimization method for green data center based on cloud computing Runze WU1, a, Wenwei CHE1, b,
More informationResearch Article Mobile Storage and Search Engine of Information Oriented to Food Cloud
Advance Journal of Food Science and Technology 5(10): 1331-1336, 2013 DOI:10.19026/ajfst.5.3106 ISSN: 2042-4868; e-issn: 2042-4876 2013 Maxwell Scientific Publication Corp. Submitted: May 29, 2013 Accepted:
More informationResearch on Technologies in Smart Substation
Available online at www.sciencedirect.com Energy Procedia 12 (2011) 113 119 ICSGCE 2011: 27 30 September 2011, Chengdu, China Research on Technologies in Smart Substation Hongwei Li *, Lixin Wang Technology
More informationProcessing Technology of Massive Human Health Data Based on Hadoop
6th International Conference on Machinery, Materials, Environment, Biotechnology and Computer (MMEBC 2016) Processing Technology of Massive Human Health Data Based on Hadoop Miao Liu1, a, Junsheng Yu1,
More informationThe Application Research of Neural Network in Embedded Intelligent Detection
The Application Research of Neural Network in Embedded Intelligent Detection Xiaodong Liu 1, Dongzhou Ning 1, Hubin Deng 2, and Jinhua Wang 1 1 Compute Center of Nanchang University, 330039, Nanchang,
More informationResearch on Approach of Equipment Status and Operation Information Acquisition Based on Equipment Control Bus
Research on Approach of Equipment Status and Operation Information Acquisition Based on Equipment Control Bus Xu Li a, *, Chen Meng, Huixia Jiang, Cheng Wang Army Engineering University, Shijiazhuang 050003,
More informationBioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. Study on secure data storage based on cloud computing ABSTRACT KEYWORDS
[Type text] [Type text] [Type text] ISSN : 0974-7435 Volume 10 Issue 22 BioTechnology 2014 An Indian Journal FULL PAPER BTAIJ, 10(22), 2014 [13778-13783] Study on secure data storage based on cloud computing
More informationFrequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management
Frequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management Kranti Patil 1, Jayashree Fegade 2, Diksha Chiramade 3, Srujan Patil 4, Pradnya A. Vikhar 5 1,2,3,4,5 KCES
Framework Research on Privacy Protection of PHR Owners in Medical Cloud System Based on Aggregation Key Encryption Algorithm
Framework Research on Privacy Protection of PHR Owners in Medical Cloud System Based on Aggregation Key Encryption Algorithm Huiqi Zhao 1,2,3, Yinglong Wang 2,3*, Minglei Shu 2,3 1 Department of Information
Research on Online Education Interactive Application Based on Cloud Computing and Large Data
2018 International Conference on Computer Science and Biomedical Engineering (CSBIOE 2018) Research on Online Education Interactive Application Based on Cloud Computing and Large Data XU Guo1,a 1 China
Analysis on the technology improvement of the library network information retrieval efficiency
Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2014, 6(6):2198-2202 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 Analysis on the technology improvement of the
Application of Redundant Backup Technology in Network Security
2018 2nd International Conference on Systems, Computing, and Applications (SYSTCA 2018) Application of Redundant Backup Technology in Network Security Shuwen Deng1, Siping Hu*, 1, Dianhua Wang1, Limin
The Analysis and Design of the Object-oriented System Li Xin 1, a
International Conference on Materials Engineering and Information Technology Applications (MEITA 2015) The Analysis and Design of the Object-oriented System Li Xin 1, a 1 Shijiazhuang Vocational Technology
Research and Improvement of Apriori Algorithm Based on Hadoop
Research and Improvement of Apriori Algorithm Based on Hadoop Gao Pengfei a, Wang Jianguo b and Liu Pengcheng c School of Computer Science and Engineering Xi'an Technological University Xi'an, 710021,
Improvements and Implementation of Hierarchical Clustering based on Hadoop Jun Zhang1, a, Chunxiao Fan1, Yuexin Wu2,b, Ao Xiao1
3rd International Conference on Machinery, Materials and Information Technology Applications (ICMMITA 2015) Improvements and Implementation of Hierarchical Clustering based on Hadoop Jun Zhang1, a, Chunxiao
The Application of CAD/CAM in the Design of Industrial Products
2018 International Conference on Medicine, Biology, Materials and Manufacturing (ICMBMM 2018) The Application of CAD/CAM in the Design of Industrial Products Hequn Liu Xianning Vocational Technical College,
Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky
Implementation of a High-Performance Distributed Web Crawler and Big Data Applications with Husky The Chinese University of Hong Kong Abstract Husky is a distributed computing system, achieving outstanding
Rapid Modeling of Digital City Based on Sketchup
Journal of Mechanical Engineering Research and Developments ISSN: 1024-1752 Website: http://www.jmerd.org Vol. 38, No. 1, 2015, pp. 130-134 J. Y. Li *, H. L. Yuan, & C. Reithmeier Department of Architectural
Construction of SSI Framework Based on MVC Software Design Model Yongchang Rena, Yongzhe Mab
4th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering (ICMMCCE 2015) Construction of SSI Framework Based on MVC Software Design Model Yongchang Rena, Yongzhe Mab School
A New Model of Search Engine based on Cloud Computing
A New Model of Search Engine based on Cloud Computing DING Jian-li 1,2, YANG Bo 1 1. College of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, China 2. Tianjin Key
Data Mining Technology Based on Bayesian Network Structure Applied in Learning
, pp.67-71 http://dx.doi.org/10.14257/astl.2016.137.12 Data Mining Technology Based on Bayesian Network Structure Applied in Learning Chunhua Wang, Dong Han College of Information Engineering, Huanghuai
Construction Scheme for Cloud Platform of NSFC Information System
, pp.200-204 http://dx.doi.org/10.14257/astl.2016.138.40 Construction Scheme for Cloud Platform of NSFC Information System Jianjun Li 1, Jin Wang 1, Yuhui Zheng 2 1 Information Center, National Natural
The Application Research of Semantic Web Technology and Clickstream Data Mart in Tourism Electronic Commerce Website Bo Liu
International Conference on Education Technology, Management and Humanities Science (ETMHS 2015) The Application Research of Semantic Web Technology and Clickstream Data Mart in Tourism Electronic Commerce
Chapter 5: Summary and Conclusion CHAPTER 5 SUMMARY AND CONCLUSION. Chapter 1: Introduction
CHAPTER 5 SUMMARY AND CONCLUSION Chapter 1: Introduction Data mining is used to extract the hidden, potential, useful and valuable information from very large amount of data. Data mining tools can handle
Introduction to Hadoop and MapReduce
Introduction to Hadoop and MapReduce Antonino Virgillito THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION Large-scale Computation Traditional solutions for computing large
Research on Digital Library Platform Based on Cloud Computing
Research on Digital Library Platform Based on Cloud Computing Lingling Han and Lijie Wang Heibei Energy Institute of Vocation and Technology, Tangshan, Hebei, China hanlingling2002@126.com, wanglj509@163.com
Design of Coal Mine Power Supply Monitoring System
2nd International Conference on Electronics, Network and Computer Engineering (ICENCE 2016) Design of Coal Mine Power Supply Monitoring System Lei Shi 1, Guo Jin 2 and Jun Xu 3 1 2 Department of electronic
Analysis of Computer Network and Communication System
Journal of Networking and Telecommunications (2018) Original Research Article Analysis of Computer Network and Communication System Jingdong Wang, Sujia Luo, Jie Yuan School of Physics and Information Engineering,
ZTE Intelligent Wireless Network Solution
ZTE Intelligent Wireless Network Solution We are entering a new intelligent era. Under the development of new technologies like AI, cloud computing, big data, 5G and Internet of Things, we will truly realize
The Solutions to Some Key Problems of Solar Energy Output in the Belt and Road Yong-ping GAO 1,*, Li-li LIAO 2 and Yue-shun HE 3
2016 International Conference on Artificial Intelligence and Computer Science (AICS 2016) ISBN: 978-1-60595-411-0 The Solutions to Some Key Problems of Solar Energy Output in the Belt and Road Yong-ping
The Application of CAN Bus in Intelligent Substation Automation System Yuehua HUANG 1, a, Ruiyong LIU 2, b, Peipei YANG 3, C, Dongxu XIANG 4,D
International Power, Electronics and Materials Engineering Conference (IPEMEC 2015) The Application of CAN Bus in Intelligent Substation Automation System Yuehua HUANG 1, a, Ruiyong LIU 2, b, Peipei YANG
Design and Implementation of Laboratory Information Management System for Chemical Analysis. LI Qinghua1, a
Advances in Engineering Research (AER), volume 130 5th International Conference on Frontiers of Manufacturing Science and Measuring Technology (FMSMT 2017) Design and Implementation of Laboratory Information
Intelligent management of on-line video learning resources supported by Web-mining technology based on the practical application of VOD
World Transactions on Engineering and Technology Education Vol.13, No.3, 2015 2015 WIETE Intelligent management of on-line video learning resources supported by Web-mining technology based on the practical
Research on Computer Network Virtual Laboratory based on ASP.NET. JIA Xuebin 1, a
International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015) Research on Computer Network Virtual Laboratory based on ASP.NET JIA Xuebin 1, a 1 Department of Computer,
The Analysis and Implementation of the K - Means Algorithm Based on Hadoop Platform
Computer and Information Science; Vol. 11, No. 1; 2018 ISSN 1913-8989 E-ISSN 1913-8997 Published by Canadian Center of Science and Education The Analysis and Implementation of the K - Means Algorithm Based
The Design of Distributed File System Based on HDFS Yannan Wang 1, a, Shudong Zhang 2, b, Hui Liu 3, c
Applied Mechanics and Materials Online: 2013-09-27 ISSN: 1662-7482, Vols. 423-426, pp 2733-2736 doi:10.4028/www.scientific.net/amm.423-426.2733 2013 Trans Tech Publications, Switzerland The Design of Distributed
Construction of the Library Management System Based on Data Warehouse and OLAP Maoli Xu 1, a, Xiuying Li 2,b
Applied Mechanics and Materials Online: 2013-08-30 ISSN: 1662-7482, Vols. 380-384, pp 4796-4799 doi:10.4028/www.scientific.net/amm.380-384.4796 2013 Trans Tech Publications, Switzerland Construction of
Dynamic Data Placement Strategy in MapReduce-styled Data Processing Platform Hua-Ci WANG 1,a,*, Cai CHEN 2,b,*, Yi LIANG 3,c
2016 Joint International Conference on Service Science, Management and Engineering (SSME 2016) and International Conference on Information Science and Technology (IST 2016) ISBN: 978-1-60595-379-3 Dynamic
Power Big Data platform Based on Hadoop Technology
6th International Conference on Machinery, Materials, Environment, Biotechnology and Computer (MMEBC 2016) Power Big Data platform Based on Hadoop Technology Jilin Chen1, a, Nana Liu2, b, Yong Chen2, c
Obtaining Rough Set Approximation using MapReduce Technique in Data Mining
Obtaining Rough Set Approximation using MapReduce Technique in Data Mining Varda Dhande 1, Dr. B. K. Sarkar 2 1 M.E II yr student, Dept of Computer Engg, P.V.P.I.T Collage of Engineering Pune, Maharashtra,
Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?
Big data hype? Big Data: Hype or Hallelujah? Data Base and Data Mining Group of 2 Google Flu trends On the Internet February 2010 detected flu outbreak two weeks ahead of CDC data Nowcasting http://www.internetlivestats.com/
BIG DATA TESTING: A UNIFIED VIEW
http://core.ecu.edu/strg BIG DATA TESTING: A UNIFIED VIEW BY NAM THAI ECU, Computer Science Department, March 16, 2016 2/30 PRESENTATION CONTENT 1. Overview of Big Data A. 5 V s of Big Data B. Data generation
Implementation of Parallel CASINO Algorithm Based on MapReduce. Li Zhang a, Yijie Shi b
International Conference on Artificial Intelligence and Engineering Applications (AIEA 2016) Implementation of Parallel CASINO Algorithm Based on MapReduce Li Zhang a, Yijie Shi b State key laboratory
Twitter data Analytics using Distributed Computing
Twitter data Analytics using Distributed Computing Uma Narayanan Athrira Unnikrishnan Dr. Varghese Paul Dr. Shelbi Joseph Research Scholar M.tech Student Professor Assistant Professor Dept. of IT, SOE
GIS Application based on Cloud Storage for Atmospheric Environmental Monitoring
3rd International Conference on Material, Mechanical and Manufacturing Engineering (IC3ME 2015) GIS Application based on Cloud Storage for Atmospheric Environmental Monitoring Mei Han 1, a *,Ke Feng 2,b
Design and Implementation of Music Recommendation System Based on Hadoop
Design and Implementation of Music Recommendation System Based on Hadoop Zhao Yufeng School of Computer Science and Engineering Xi'an University of Technology Shaanxi, Xi an, China e-mail: zyfzy99@163.com
Remotely Sensed Image Processing Service Automatic Composition
Remotely Sensed Image Processing Service Automatic Composition Xiaoxia Yang Supervised by Qing Zhu State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University
Web Data mining-a Research area in Web usage mining
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 13, Issue 1 (Jul. - Aug. 2013), PP 22-26 Web Data mining-a Research area in Web usage mining 1 V.S.Thiyagarajan,
Efficient Algorithm for Frequent Itemset Generation in Big Data
Efficient Algorithm for Frequent Itemset Generation in Big Data Anbumalar Smilin V, Siddique Ibrahim S.P, Dr.M.Sivabalakrishnan P.G. Student, Department of Computer Science and Engineering, Kumaraguru
Application of Nonlinear Later TV Edition in Gigabit Ethernet. Hong Ma
3rd International Conference on Science and Social Research (ICSSR 2014) Application of Nonlinear Later TV Edition in Gigabit Ethernet Hong Ma Education and Training Department, Shaanxi Vocational and
Data- and Rule-Based Integrated Mechanism for Job Shop Scheduling
Data- and Rule-Based Integrated Mechanism for Job Shop Scheduling Yanhong Wang*, Dandan Ji Department of Information Science and Engineering, Shenyang University of Technology, Shenyang 187, China. * Corresponding
Data Clustering on the Parallel Hadoop MapReduce Model. Dimitrios Verraros
Data Clustering on the Parallel Hadoop MapReduce Model Dimitrios Verraros Overview The purpose of this thesis is to implement and benchmark the performance of a parallel K- means clustering algorithm on
Comprehensive analysis and evaluation of big data for main transformer equipment based on PCA and Apriority
IOP Conference Series: Earth and Environmental Science PAPER OPEN ACCESS Comprehensive analysis and evaluation of big data for main transformer equipment based on PCA and Apriority To cite this article:
A Study on Load Balancing Techniques for Task Allocation in Big Data Processing* Jin Xiaohong1,a, Li Hui1, b, Liu Yanjun1, c, Fan Yanfang1, d
International Forum on Mechanical, Control and Automation IFMCA 2016 A Study on Load Balancing Techniques for Task Allocation in Big Data Processing* Jin Xiaohong1,a, Li Hui1, b, Liu Yanjun1, c, Fan Yanfang1,
Research on Data Mining and Statistical Analysis Xiaoyao Lu1, a
6th International Conference on Machinery, Materials, Environment, Biotechnology and Computer (MMEBC 2016) Research on Data Mining and Statistical Analysis Xiaoyao Lu1, a 1 School of Statistics and Mathematics
System For Product Recommendation In E-Commerce Applications
International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 11, Issue 05 (May 2015), PP.52-56 System For Product Recommendation In E-Commerce
ANN-Based Modeling for Load and Main Steam Pressure Characteristics of a 600MW Supercritical Power Generating Unit
ANN-Based Modeling for Load and Main Steam Pressure Characteristics of a 600MW Supercritical Power Generating Unit Liangyu Ma, Zhiyuan Gao Automation Department, School of Control and Computer Engineering
AN EFFECTIVE DETECTION OF SATELLITE IMAGES VIA K-MEANS CLUSTERING ON HADOOP SYSTEM. Mengzhao Yang, Haibin Mei and Dongmei Huang
International Journal of Innovative Computing, Information and Control ICIC International c 2017 ISSN 1349-4198 Volume 13, Number 3, June 2017 pp. 1037 1046 AN EFFECTIVE DETECTION OF SATELLITE IMAGES VIA
Log Analysis Engine with Integration of Hadoop and Spark
Log Analysis Engine with Integration of Hadoop and Spark Abhiruchi Shinde 1, Neha Vautre 2, Prajakta Yadav 3, Sapna Kumari 4 1Abhiruchi Shinde,, Dept of Computer Engineering, SITS, Maharashtra, India 2Neha
Research and application on the Data mining technology in medical information systems. Jinhai Zhang
4th International Conference on Machinery, Materials and Computing Technology (ICMMCT 2016) Research and application on the Data mining technology in medical information systems Jinhai Zhang Marine college
Research on QR Code Image Pre-processing Algorithm under Complex Background
Scientific Journal of Information Engineering May 207, Volume 7, Issue, PP.-7 Research on QR Code Image Pre-processing Algorithm under Complex Background Lei Liu, Lin-li Zhou, Huifang Bao. Institute of
Wearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life
Wearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life Ch.Srilakshmi Asst Professor,Department of Information Technology R.M.D Engineering College, Kavaraipettai,
A Network-Based Management Information System for Animal Husbandry in Farms
A Network-Based Information System for Animal Husbandry in Farms Jing Han 1 and Xi Wang 2, 1 College of Information Technology, Heilongjiang August First Land Reclamation University, Daqing, Heilongjiang
FAST DATA RETRIEVAL USING MAP REDUCE: A CASE STUDY
, pp-01-05 FAST DATA RETRIEVAL USING MAP REDUCE: A CASE STUDY Ravin Ahuja 1, Anindya Lahiri 2, Nitesh Jain 3, Aditya Gabrani 4 1 Corresponding Author PhD scholar with the Department of Computer Engineering,
ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V
ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V WHITE PAPER Create the Data Center of the Future Accelerate
CAMPSNA: A Cloud Assisted Mobile Peer to Peer Social Network Architecture
CAMPSNA: A Cloud Assisted Mobile Peer to Peer Social Network Architecture Yuan-ni Liu Hong Tang, Guo-feng Zhao The School of Communication and Information Engineering of ChongQing University of Posts and
Design and Implementation of Networked CNC Machine DNC System in Colleges and Universities Based on Internet Plus
5th International Conference on Mechatronics, Materials, Chemistry and Computer Engineering (ICMMCCE 2017) Design and Implementation of Networked CNC Machine DNC System in Colleges and Universities Based
Improved Balanced Parallel FP-Growth with MapReduce Qing YANG 1,a, Fei-Yang DU 2,b, Xi ZHU 1,c, Cheng-Gong JIANG *
2016 Joint International Conference on Artificial Intelligence and Computer Engineering (AICE 2016) and International Conference on Network and Communication Security (NCS 2016) ISBN: 978-1-60595-362-5
Integration of information security and network data mining technology in the era of big data
Acta Technica 62 No. 1A/2017, 157 166 c 2017 Institute of Thermomechanics CAS, v.v.i. Integration of information security and network data mining technology in the era of big data Lu Li 1 Abstract. The
The Comparative Study of Machine Learning Algorithms in Text Data Classification*
The Comparative Study of Machine Learning Algorithms in Text Data Classification* Wang Xin School of Science, Beijing Information Science and Technology University Beijing, China Abstract Classification
Housing Estates Information Management System Based on .NET. Jianliang Min
3rd International Conference on Management, Education, Information and Control (MEICI 205) Housing Estates Information Management System Based on.et Jianliang Min College of Information Engineering, Jiangxi
Research on the Establishment and Analysis of Small Business Networks
2018 2nd International Conference on Systems, Computing, and Applications (SYSTCA 2018) Research on the Establishment and Analysis of Small Business Networks Guozhen Sang 1 School of Network Security and
Traffic Flow Prediction Based on the location of Big Data. Xijun Zhang, Zhanting Yuan
5th International Conference on Civil Engineering and Transportation (ICCET 205) Traffic Flow Prediction Based on the location of Big Data Xijun Zhang, Zhanting Yuan Lanzhou Univ Technol, Coll Elect &
Exploiting and Gaining New Insights for Big Data Analysis
Exploiting and Gaining New Insights for Big Data Analysis K.Vishnu Vandana Assistant Professor, Dept. of CSE Science, Kurnool, Andhra Pradesh. S. Yunus Basha Assistant Professor, Dept.of CSE Sciences,
Chapter 3. Foundations of Business Intelligence: Databases and Information Management
Chapter 3 Foundations of Business Intelligence: Databases and Information Management THE DATA HIERARCHY TRADITIONAL FILE PROCESSING Organizing Data in a Traditional File Environment Problems with the traditional