GPU. Study of Automatic GPU Offloading Technology for Open IoT

Size: px

Start display at page:

Download "GPU. Study of Automatic GPU Offloading Technology for Open IoT"

Rodger Horn
5 years ago
Views:

1 THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. SC ( ) IoT GPU NTT IoT IoT IoT Tacit Computing Tacit Computing GPU Tacit Computing GPU C/C IoT, GPGPU Tacit Computing Study of Automatic GPU Offloading Technology for Open IoT Yoji YAMATO, Tatsuya DEMIZU, Hirofumi NOGUCHI, and Misao KATAOKA Network Service Systems Laboratories, NTT Corporation, , Midori-cho, Musashino-shi, Tokyo yamato.yoji@lab.ntt.co.jp Abstract IoT technologies have been progressed. Now Open IoT concept has attracted attentions which achieve various IoT services by integrating horizontal separated devices and services. For Open IoT era, we have proposed the Tacit Computing technology to discover the devices with necessary data for users on demand and use them dynamically. However, existing Tacit Computing does not care about performance and operation cost. Therefore in this paper, we propose an automatic GPU offloading technology as an elementary technology of Tacit Computing which uses Genetic Algorithm to extract appropriate offload loop statements to improve performances. evaluate a C/C++ matrix manipulation to verify effectiveness of GPU offloading and confirm more than 35 times performances within 1 hour tuning time. Key words Open IoT, GPGPU, Tacit Computing, Genetic Algorithm, Automatic Offloading We 1. IoT Internet of Things [1]- [18] [19]- [39] IoT Open IoT Open IoT CPU IoT CPU GPU Graphics Processing Unit FPGA Field Programmable Gate Array Amazon Web Services (AWS) GPU FPGA Microsoft FPGA Open IoT 1

2 CUDA Compute Unified Device Architecture, OpenCL Open Computing Language GPU FPGA IoT Open IoT Open IoT GPU FPGA Open IoT Tacit Computing [40] Tacit Computing Open IoT GPU FPGA CPU C/C++ GPU 2. GPU GPGPU General Purpose GPU CUDA CUDA GPGPU GPU, FPGA, CPU OpenCL CUDA OpenCL C GPU CPU CUDA OpenCL GPGPU OpenACC [41] PGI [42] OpenACC C/C++/Fortran OpenACC PGI GPU CPU GPU OpenCL, CUDA, OpenACC GPU Intel for GPU CPU-GPU GPU OpenCL CUDA PGI [43] for for GPU for 3. Tacit Computing GPU 3. 1 Tacit Computing Tacit Computing Open IoT 3 1) [40] 3 Tacit Computing IoT Tacit Computing Tacit Computing OpenCV IoT Tacit Computing Semantic Web RDF/OWL Semantic Web Services [44]- [87] 2

Computing 2 1-1 Tacit Computing 1-2 Tacit Computing FFT (Fast Fourier Transformation) 1-3 Tacit

3 1 Tacit Computing 3. 2 Tacit Computing Tacit Computing Open IoT Tacit Computing Tacit Computing Tacit Computing Tacit Computing Tacit Computing 1-2 Tacit Computing FFT (Fast Fourier Transformation) 1-3 Tacit Computing FFT GPU FPGA 1-4 Tacit Computing GPU FPGA DB 2-5 IoT 3. 3 GA GPU Tacit Computing GPU GPU GPU GPU IoT GPU 2 GPU CPU GPU PGI for IoT for for GA for GA 3

4 3 GA GPU GA GA Simple GA Simple GA 1, 0 1 GA for for GPU 1 0 M 1, 0 Pc Pm T GA GPU C/C++ GPU PGI C/C++ OSS proprietary C/C++ OSS GPU PGI PGI OpenACC C/C++/Fortran for OpenACC #pragma acc kernels GPU GPU for for Perl 5 C/C++ C/C++ C/C++ for CPU GPU for for #pragma acc kernels for for break for for #pragma a a 1 0 a C/C++ C/C++ GPU PGI GA GA GA C/C++ GA IoT 4

5 4 5 GA Open IoT GA Simple GA 12 M 12 T 12 ( ) 1/2 Pc 0.9 Pm GPU NVIDIA Quadro K5200 NVIDIA Quadro K5200 CUDA 2304 PGI CUDA Toolkit GA CPU 5 12 GA 0 CPU GA GA 20 CPU 6. Open IoT GPU Open IoT GPU GA C/C++ OSS PGI 12 1 CPU 35 GA [1] Y. Yamato, et al., "Predictive Maintenance Platform with Sound Stream Analysis in Edges," Journal of Information Processing, Vol.25, pp , [2] T. Demizu, et al., "Data Management and Packet Transmission Method based on Receivers Attributes," IEEE WF-IoT 2018, [3] H. Noguchi, et al., "Autonomous Device Identification Architecture for Internet of Things," IEEE WF-IoT 2018, [4] Y. Yamato, et al., "Maintenance of Business Machines with Edge and Cloud Collaboration," IEEJ Transactions on Electrical and Electronic Engineering, Vol.13, No.8, [5] M. Kataoka, et al., "Tacit computing and its application for open IoT era," IEEE CCNC 2018, pp.1-5, [6] Y. Yamato, et al., "A study of coordination logic description and execution for dynamic device coordination services," The 2018 IEICE General Conference, BS-4-1, [7] "hitoe," NTT Vol.29, No.7, pp.13-18, [8] Y. Yamato, "Study of Vital Data Analysis Platform Using Wearable Sensor," IEICE Technical Report, SC , Mar [9] Y. Yamato, et al., "A Study of Optimizing Heterogeneous Resources for Open IoT," IEICE Technical Report, SC , Nov [10] Y. Yamato, et al., "Analyzing machine noise for real time maintenance," ICGIP 2016, [11] Y. Yamato, "Proposal of Vital Data Analysis Platform using Wearable Sensor," ICIAE 2017, pp , [12] " ERP,", Vol.116, No.201, pp.19-22, [13] Y. Yamato, et al., "Proposal of Lambda Architecture Adoption for Real Time Predictive Maintenance," IEEE CANDAR 2016, pp , [14] Y. Yamato, et al., "Proposal of Real Time Predictive Maintenance Platform with 3D Printer for Business Vehicles," International Journal of Information and Electronics Engineering, Vol.6, No.5, pp , [15] Y. Yamato, et al.,"security Camera Movie and ERP Data Matching System to Prevent Theft," IEEE CCNC 2017, pp , [16] Y. Yamato, "A study of posture judgement on vehicles using wearable 5

6 acceleration sensor," The 2017 IEICE General Conference, BS-3-1, [17] Y. Yamato, "Experiments of posture estimation on vehicles using wearable acceleration sensors," IEEE BigDataSecurity 2017, [18] Y. Yamato, et al., "Proposal of Shoplifting Prevention Service Using Image Analysis and ERP Check," IEEJ Transactions on Electrical and Electronic Engineering, Vol.12, pp , [19] Y. Yamato, et al., "Fast and Reliable Restoration Method of Virtual Resources on OpenStack," IEEE Transactions on Cloud Computing, 2015 [20] Y. Yamato, "Server Structure Proposal and Automatic Verification Technology on IaaS Cloud of Plural Type Servers," NETs2015, [21] Y. Yamato, "Proposal of Optimum Application Deployment Technology for Heterogeneous IaaS Cloud," WCSE 2016, pp.34-37, [22] Y. Yamato, "Use case study of HDD-SSD hybrid storage, distributed storage and HDD storage on OpenStack," IDEAS 15, pp , [23] Y. Yamato, et al.,"fast Restoration Method of Virtual Resources on OpenStack," IEEE CCNC 2015, pp , [24] Y. Yamato, et al., "Survey of Public IaaS Cloud Computing API," IEEJ Transactions on Electronics, Information and Systems, Vol.132, pp , [25] Y. Yamato, "Implementation Evaluation of Amazon SQS Compatible Functions," IEEJ Transactions on Electronics, Information and Systems, Vol.135, pp , [26] Y. Yamato, et al.,"development of Template Management Technology for Easy Deployment of Virtual Resources on OpenStack," Journal of Cloud Computing, Springer, 3:7, June [27] Y. Yamato, "Automatic verification for plural virtual machines patches," IEEE ICUFN 2015, pp , [28] Y. Yamato, "Cloud Storage Application Area of HDD-SSD Hybrid Storage, Distributed Storage and HDD Storage," IEEJ Transactions on Electrical and Electronic Engineering, Vol.11, pp , [29] Y. Yamato, et al., "Evaluation of Agile Software Development Method for Carrier Cloud Service Platform Development," IEICE Transactions on Information & Systems, Vol.E97-D, pp , [30] Y. Yamato, et al., "Software Maintenance Evaluation of Agile Software Development Method Based on OpenStack," IEICE Transactions on Information & Systems, Vol.E98-D, pp , [31] Y. Yamato, et al., "Development of Resource Management Server for Production IaaS Services Based on OpenStack," Journal of Information Processing, Vol.23, pp.58-66, [32] Y. Yamato, "Performance-Aware Server Architecture Recommendation and Automatic Performance Verification Technology on IaaS Cloud," Service Oriented Computing and Applications, Springer, [33] Y. Yamato, "Server Selection, Configuration and Reconfiguration Technology for IaaS Cloud with Multiple Server Types," Journal of Network and Systems Management, Springer, [34] ",", Vol.J95-B, pp , [35] ",", Vol.15, No.3, pp.3-8, [36] Y. Yamato, "Automatic verification technology of software patches for user virtual environments on IaaS cloud," Journal of Cloud Computing, Springer, 4:4, Feb [37] Y. Yamato, "Automatic system test technology of user virtual machine software patch on IaaS cloud," IEEJ Transactions on Electrical and Electronic Engineering, Vol.10, pp , [38] Y. Yamato, "Optimum Application Deployment Technology for Heterogeneous IaaS Cloud," Journal of Information Processing, Vol.25, pp.56-58, [39] Y. Yamato, "OpenStack Hypervisor, Container and Baremetal Servers Performance Comparison," IEICE Communication Express, Vol.4, pp , [40] Y. Yamato, et al., "A study to optimize heterogenesous resources for Open IoT," IEEE CANDAR 2017, pp , Nov [41] S. Wienke, et al., "OpenACC-first experiences with real-world applications," Euro-Par 2012 Parallel Processing, pp , [42] M. Wolfe, "Implementing the PGI accelerator model," ACM the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, pp.43-50, [43] Y. Tanaka, et al., "Evaluation of Optimization Method for Fortran Codes with GPU Automatic Parallelization Compiler," IPSJ SIG Technical Report, 2011(9), pp.1-6, [44] H. Sunaga, et al., "Service Delivery Platform Architecture for the Next-Generation Network," ICIN2008, Oct [45] Y. Yokohata, et al., "Service Composition Architecture for Programmability and Flexibility in Ubiquitous Communication Networks," IEEE SAINTW 06, pp , [46] ",", Vol.J91-B, pp , [47] Y. Yamato, et al., "Development of Service Control Server for Web- Telecom Coordination Service," IEEE ICWS 2008, pp , [48] "Web-,", Vol.J91-B, pp , [49] "Web Web,", Vol.49, pp , [50] Y. Yokohata, et al., "Context-Aware Content-Provision Service for Shopping Malls Based on Ubiquitous Service-Oriented Network Framework and Authentication and Access Control Agent Framework," IEEE CCNC 2006, pp , [51], ",", Vol.48, pp , [52] Y. Nakano, et al., "Method of creating web services from web applications," IEEE SOCA 2007, pp.65-71, [53] Y. Yamato, et al., "Context-Aware Service Composition and Component Change-over using Semantic Web Techniques," IEEE ICWS 2007, pp , [54] Y. Yamato, et al., "Context-aware Ubiquitous Service Composition Technology," IFIP Research and Practical Issues of Enterprise Information Systems (CONFENIS 2006), pp.51-61, [55] Y. Miyagi, et al., "Support System for Developing Web Service Composition Scenarios without Need for Web Application Skills," IEEE SAINT 09, pp , [56] ",", Vol.105, No.407, pp.35-40, [57] ",", Vol.105, No.127, pp.5-8, [58] Y. Yamato, et al., "Study and Evaluation of Context-Aware Service Composition and Change-Over Using BPEL Engine and Semantic Web Techniques," IEEE CCNC 2008, pp , [59] H. Sunaga, et al., "Ubiquitous Life Creation through Service Composition Technologies," WTC2006, May [60] Y. Yamato, et al., "Study of Service Processing Agent for Context- Aware Service Coordination," IEEE SCC 2008, pp , [61] M. Takemoto, et al. "Service-composition Method and Its Implementation in Service-provision Architecture for Ubiquitous Computing Environments," IPSJ Journal, Vol.46, pp , [62] "BPEL,", Vol.51, pp , [63] M. Takemoto, et al.,"service Elements and Service Templates for Adaptive Service Composition in a Ubiquitous Computing Environment," IEEE APCC 2003, Vol.1, pp , [64] Y. Nakano, et al.,"effective Web-Service Creation Mechanism for Ubiquitous Service Oriented Architecture," IEEE CEC/EEE 2006, pp.85, [65] Y. Nakano, et al., "Web-Service-Based Avatar Service Modeling in the Next Generation Network," IEEE APSITT 2008, pp.52-57, [66] Y. Yamato, "Method of Service Template Generation on a Service Coordination Framework," UCS2004, Nov [67] ",", Vol.48, pp , [68] "BPEL,", Vol.J91-B, pp , [69] ",,",, [70] Y. Yamato, et al., "Peer-to-Peer Content Searching Method using Evaluation of Semantic Vectors," IEEE CCNC 2006, pp , [71] Y. Yamato, et al., "Peer-to-Peer Non-document Content Searching Method Using User Evaluation of Semantic Vectors," IEICE Transactions on Communications, Vol.E89-B, pp , [72] ",", Vol.104, No.691, pp , [73] ",", Vol.103, No.575, pp.1-6, [74] ",", Vol.116, No.287, pp.49-52, [75] M. Takemoto, et al., "A Service-Coordination Framework for Application to Ubiquitous and Context-aware Computing Environments," The 66th IPSJ General Conference, pp , [76] Y. Yamato, "P2P Content Searching Method Using Evaluation of Semantic Vectors," IEEE SAINT 2005 Workshops, pp , [77] "SOAP-REST,", Vol.51, pp , [78] "BPEL,", Vol.6, pp , [79] ",", Vol.4, pp , [80] "Service Delivery Platform,", Vol.J93-B, pp , 2010 [81] "," 66, pp , [82] "Grid,", Vol.105, No.178, pp.49-54, [83] " SOAP-REST,", Vol.109, No.3, pp.83-86, [84] "R&D Web Web," NTT Vol.20, No.6, pp.40-43, [85] "Web," (DEWS 2007), [86] M. Takemoto, et al., "A Future Network Service based on PC Grid Computing and P2P File Sharing: YOHEIX (Your Own Hybrid Environment for Information Exchange)," IPSJ SIG Technical Report, 2004, No.89 (2004-DPS-119), pp.91-96, [87] ",", Vol.J91-B, pp ,

Server Selection, Configuration and Reconfiguration Technology for IaaS Cloud with Multiple Server Types

J Netw Syst Manage (2018) 26:339 360 https://doi.org/10.1007/s10922-017-9418-z Server Selection, Configuration and Reconfiguration Technology for IaaS Cloud with Multiple Server Types Yoji Yamato 1 Received: