Power Aware Data Driven Distributed Simulation on Micro-Cluster Platforms

Similar documents
Energy Aware Dynamic Data Driven Distributed Traffic Simulations

Micronet SmartHub. Platform Datasheet. The Next Generation Rugged On-Board Computing Platform. Preliminary February, 2019.

Exploring Task Parallelism for Heterogeneous Systems Using Multicore Task Management API

Embedded Computing without Compromise. Evolution of the Rugged GPGPU Computer Session: SIL7127 Dan Mor PLM -Aitech Systems GTC Israel 2017

Ad Hoc Distributed Simulation of Surface Transportation Systems

IoT Based Traffic Signalling System

Transforming Transport Infrastructure with GPU- Accelerated Machine Learning Yang Lu and Shaun Howell

A176 Cyclone. GPGPU Fanless Small FF RediBuilt Supercomputer. IT and Instrumentation for industry. Aitech I/O

Virginia Tech Research Center Arlington, Virginia, USA

Enhancement of CoAP Packet Delivery Performance for Internet of Things. Hang Liu

The Power of the DragonBoard 410c Enabled by IBM Watson IoT. Presented by Robert Schwarz

THE LEADER IN VISUAL COMPUTING

TS1010C 10.1" Tablet PC Surf

ARM and x86 on Qseven & COM Express Mini. Zeljko Loncaric, Marketing Engineer, congatec AG

Fog Computing Design Principles for Industrial IoT Applications. Sudhanshu Gaur Director, Industrial IoT R&D Hitachi America, Ltd.

Mobile Middleware Course. Mobile Platforms and Middleware. Sasu Tarkoma

The TechNexion Difference

HiSilicon STB Push-to-Talk Development Kit for Amazon AVS

Portable Work Zone Data Collection. Blaine Van Dyke ODOT ITS Designer

CLOUDCAST User Guide. Website :

FiPS and M2DC: Novel Architectures for Reconfigurable Hyperscale Servers

Tiny GPU Cluster for Big Spatial Data: A Preliminary Performance Evaluation

Building blocks for 64-bit Systems Development of System IP in ARM

The TechNexion Difference

Introduction to Networked Embedded Systems and Course Description. Song Han Office: ITEB 355

A176 C clone. GPGPU Fanless Small FF RediBuilt Supercomputer. Aitech

An Energy Efficient and Delay Aware Data Collection Protocol in Heterogeneous Wireless Sensor Networks A Review

Energy-Aware CPU Frequency Scaling for Mobile Video Streaming

IOT-GATE-iMX7 Datasheet

F28HS Hardware-Software Interface: Systems Programming

MQTT Protocol Support. Cloud Ready Gateway. Modular Architecture

Benchmarking Real-World In-Vehicle Applications

Vehicular traffic estimation through bluetooth detection the open-source way

Multiple OS Platforms

Temporally Adaptive A* Algorithm on Time Dependent Transportation Network

Enabling opportunistic resources sharing in mobile Operating Systems

Introduction to Distributed Systems

Embedded Linux Conference San Diego 2016

Scalable Selective Traffic Congestion Notification

Experiences Using Tegra K1 and X1 for Highly Energy Efficient Computing

DS504/CS586: Big Data Analytics Data Pre-processing and Cleaning Prof. Yanhua Li

Roadmap of Tablet in Windows OS

Building A Resilient and Secure Power Grid A UCF Microgrid Demo

Data-Driven Vehicle Trajectory Prediction

MicroFuge: A Middleware Approach to Providing Performance Isolation in Cloud Storage Systems

LAUNCH. X-431 PADII Product Introduction

IoT Intro. Fernando Solano Warsaw University of Technology

Deep Neural Network Enhanced VSLAM Landmark Selection

RaceCapture/Pro 3. Next generation of data, telemetry, and control. Live-streaming to Podium

TrajAnalytics: A software system for visual analysis of urban trajectory data

Nvidia Jetson TX2 and its Software Toolset. João Fernandes 2017/2018

Leveraging Mobile GPUs for Flexible High-speed Wireless Communication

AIMS Things in a Fog (TGIF): A Framework to Support Multidomain Research in the Internet of Things

Carmela Comito and Domenico Talia. University of Calabria & ICAR-CNR

Introduction Video Camera Support Option TREQ Platform Datasheet. January Rev. 2 A-317 Video Getting Started Guide 1 / 6

Reduction Deliverable Intelligent V2V and V2I communication

AUTOMATED GARBAGE COLLECTION USING GPS AND GSM. Shobana G 1, Sureshkumar R 2

Experimental Evaluation of YouTube Performance on MPTCP-based LTE-WLAN Integration

QwikBox. Live Video Capture and Upload Device Group 25

A NEW COMPUTING ERA JENSEN HUANG, FOUNDER & CEO GTC CHINA 2017

WHY APPLICATIONS ARE STILL DRAINING OUR BATTERIES and how we can help. Aaron Schulman & Sachin Katti

Tablet MODECOM FreeTAB 1010 IPS IC

Realization of a low energy HPC platform powered by renewables - A case study: Technical, numerical and implementation aspects

Intel Research mote. Ralph Kling Intel Corporation Research Santa Clara, CA

LAUNCH. X-431 PRO3 Product Introduction

Distributed Energy-Aware Routing Protocol

Energy efficient mapping of virtual machines

I-Series Interactive Signage Displays and EloView Platform

Smartphone Evolution and Revolution

CSMA based Medium Access Control for Wireless Sensor Network

Proceedings of the 2014 Winter Simulation Conference A. Tolk, S. Y. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds.

IoE Workshop. Marc Khayat, CCIE #41288, Leonard Janer Technical Managers. June 16

CUDA on ARM Update. Developing Accelerated Applications on ARM. Bas Aarts and Donald Becker

TR An Overview of NVIDIA Tegra K1 Architecture. Ang Li, Radu Serban, Dan Negrut

Making Mobile 5G a Commercial Reality. Peter Carson Senior Director Product Marketing Qualcomm Technologies, Inc.

Wireless Ordering System

PRELIMINARY COMPEX SYSTEMS

Distributed Pervasive Systems

Small is the New Big: Data Analytics on the Edge

A+B. Approaches Get Mature. How to evaluate? Self-*, Autonomic Communication. Virtualization f 3. f 4 f 1 f 2 CCN. Loc/ID Split Functional Composition

Mobile & IoT Market Trends and Memory Requirements

ClearStream. Prototyping 40 Gbps Transparent End-to-End Connectivity. Cosmin Dumitru! Ralph Koning! Cees de Laat! and many others (see posters)!

OCP Engineering Workshop - Telco

Quick Start Guide. SABRE Platform for Smart Devices Based on the i.mx 6 Series

6.S062: Mobile and Sensor Computing aka IoT Systems

Embracing the Next Wave

A Lightweight YOLOv2:

CS268: Beyond TCP Congestion Control

Mobile and Sensor Systems

About the Need to Power Instrument the Linux Kernel

Application Layer Switching: A Deployable Technique for Providing Quality of Service

Intelligent Video Analytics for Urban Management

STRAW - An integrated mobility & traffic model for vehicular ad-hoc networks

4G and 5G Cellular Technologies Enable Intelligent Transportation Use Cases

The 5G Business Potential. Terminsstart Telekom 2017 Monika Byléhn, 5G marketing director

ZP-PI335GK-BT User s Manual

USER MANUAL FOR MF0200 GATEWAY BOX VERSION 2.0

Quick Start Guide Multisensory Enablement Kit i.mx 8QuadXPlus MEK CPU Board. Based on i.mx 8QuadXPlus Applications Processor

Raspberry Pi Introduction

Yafit Snir Arindam Guha Cadence Design Systems, Inc. Accelerating System level Verification of SOC Designs with MIPI Interfaces

Transcription:

EAGER Power Aware Data Driven Distributed Simulation on Micro-Cluster Platforms Richard Fujimoto Georgia Institute of Technology

Project Team Faculty Richard Fujimoto (Distributed Simulation) Angshuman Guin (Transportation) Michael Hunter (Transportation) Richard Vuduc (High Performance Computing) Students Aradhya Biswas (Computational Science & Engineering) Sara Karamati (Computational Science & Engineering) Philip Pecher (Industrial & Systems Engineering) SaBra Neal (Computational Science & Engineering)

Motivation: Micro-Clusters and Energy Computation per unit energy doubling every 1.5 years Increasing computational capability of mobile computing devices Explosion in mobile devices Proliferation of low cost battery-powered drones and sensors Suggests trend: push more computing closer to data Micro-cluster: High performance cluster computer constructed from power efficient components operating in energy constrained environments Power and energy consumption are major concerns

A DDDAS Application: Adaptive Sensor Networks Autonomous team of mobile sensors monitoring an evolving physical system Spreading forest fire Dispersing cloud plume Traffic in a city Each mobile device includes Sensors Ability to participate in P2P wireless communication Predictive data driven simulation Processing Loop 1. Sense: construct current system state 2. Predict: distributed simulation to estimate future state 3. Adapt: relocate sensors

Research Goals Data Driven Distributed Simulation Power Aware Computing Micro-Cluster Platforms Transportation system applications (e.g., vehicle tracking, congestion management) Evaluate energy, time tradeoffs in data driven distributed simulation middleware Synchronization algorithms Evaluate accuracy, energy, time tradeoffs for data-driven transportation analytics Simulation, destination prediction, route computation Data tradeoffs Develop approaches for efficient energy use Evaluate approaches experimentally via benchmarks and prototypes

CPU Micro-cluster Testbed System Jetson TK1 ARM A15 (32-bit, 2.3 GHz, 4+1 cores) Jetson TX1 ARM A57 (64-bit, 1.9 GHz, 4 cores) GPU 192 core Kepler, 326 GF/s (peak) 256 core Maxwell, 1 TF/s Memory 2 GB LPDDR3 4 GB LPDDR4 Storage 16 GB emmc 16 GB emmc Networking Ethernet 802.11ac / BT Form Factor Dev board Module with 400 pin connector I/O USB, HDMI, Serial Provided separately Release Date 2014 2015 Micro-Cluster Server Hardware Jetson TK1 Boards (TX1 under evaluation) Tunable core (10x), memory (3x) frequencies PowerMon2 measurement system Client Hardware Quadcore Qualcomm MSM8974 Snapdragon Processor (LG Nexus 5 cellular phone)

Transportation System Monitoring 140 9 10 120 100 80 5 8 60 6 40 7 20 4 0 3 1 2 Portable bluetooth detectors Record ID of passing vehicles Extract vehicle route information

Distributed Simulation Middleware Creating Energy Profiles* Techniques developed to measure energy profiles of distributed simulations CMB synchronization algorithm Queueing network benchmark * submitted for publication ** IEEE Conference on Distributed Simulation & Real-Time Applications Synchronization Algorithms** Chandy/Misra/Bryant null message algorithm YAWNS: algorithm based on barrier synchronizations Different behaviors observed Energy for synchronization algorithms not studied before

Parallel Shortest Path Algorithm Frequency scaling Minimum and maximum parallelism tuning Variation in parallelism leads to inefficient use of energy (Gunrock) Modified Algorithm 45% better performance 20% lower peak power

Route Prediction Analytics GPS trajectories of 10,357 taxis (Beijing) 15M data points: Feb. 2 to Feb. 8, 2008 Average sampling interval 177s (623m on avg) Trips are not marked within taxi tours 1.25 km Source: T-Drive User Guide (Yu Zheng, Microsoft) 1.2 1 1.25 km? 0.8 0.6 0.4 0.2 0 Modified Model with Order-2 Bias Original Destination Prediction Model used for Trajectory Prediction 1 2 3 4 5 6 7 8 Sensor Count Route Prediction Accuracy improved using dynamic data Original approach (Krumm) based on past trajectory of single vehicle Modified approach uses dynamic traffic data

Status & Research Plans Testbeds Micro-cluster, power measurement set up Validated power measurements Bluetooth vehicle detection operational; Next: WiFi Distributed simulation middleware Energy profiling techniques developed, validated Preliminary synchronization algorithm measurement Next: Application benchmark suite; predictive models Transportation analytics Benchmarked graph algorithms, simulations, destination prediction algorithms Next: Data-driven tradeoffs, automated energy tuning

DDDAS Workshops and Meetings Workshop on Research Challenges in Modeling & Simulation for Engineering Complex Systems Arlington, Virginia, January 13-14, 2016 DDDAS Workshop in connection with ACM- SIGSIM PADS conference (Jin, Fujimoto) Banff, Canada, May 16, 2016 Paper deadline: February 1, 2016 Winter Simulation Conference Crystal City, Virginia, December 12-14, 2016 Paper deadline: April 1, 2016