ARM Intelligent Power Allocation

Size: px
Start display at page:

Download "ARM Intelligent Power Allocation"

Transcription

1 ARM Intelligent Power Allocation 1

2 Agenda Background and Motivation What is ARM Intelligent Power Allocation? Results Status and Conclusions 2

3 Power Consumption Scenarios The illustration to the right depicts 3 distinct types of power consumption: 1. Short term responsiveness Power consumption allowed to significantly exceed sustainable max power of the SoC for relatively short periods E.g. Web Browsing, UI update, HDR processing (first few shots) Power/Performance Burst for responsiveness T >= Tjmax, Tskin Sustained performance Max Sustainable Power Power Optimised Low End 2. Sustained workload Goal is to maximise performance at, or within, the sustainable power envelope Adapt performance allocation to major power consuming entities E.g. Gaming, long benchmarks, video processing Energy constrained use cases Operates under the radar of intelligent power allocation E.g. low power audio A successful thermal management scheme will maximize the user experience for types1 and 2. Time 3

4 Power Consumption Scenarios The illustration to the right depicts 3 distinct types of power consumption: 1. Short term responsiveness Power consumption allowed to significantly exceed sustainable max power of the SoC for relatively short periods E.g. Web Browsing, UI update, HDR processing (first few shots) Power/Performance Burst for responsiveness T >= Tjmax, Tskin Sustained performance Max Sustainable Power Power Optimised Low End 2. Sustained workload Goal is to maximise performance at, or within, the sustainable power envelope Adapt performance allocation to major power consuming entities E.g. Gaming, long benchmarks, video processing Energy constrained use cases Operates under the radar of intelligent power allocation E.g. low power audio A successful thermal management scheme will maximize the user experience for types1 and 2. Time 4

5 Power Consumption Scenarios The illustration to the right depicts 3 distinct types of power consumption: 1. Short term responsiveness Power consumption allowed to significantly exceed sustainable max power of the SoC for relatively short periods E.g. Web Browsing, UI update, HDR processing (first few shots) Power/Performance Burst for responsiveness T >= Tjmax, Tskin Sustained performance Max Sustainable Power Power Optimised Low End 2. Sustained workload Goal is to maximise performance at, or within, the sustainable power envelope Adapt performance allocation to major power consuming entities E.g. Gaming, long benchmarks, video processing Energy constrained use cases Operates under the radar of intelligent power allocation E.g. low power audio A successful thermal management scheme will maximize the user experience for types1 and 2. Time 5

6 Power Consumption Scenarios The illustration to the right depicts 3 distinct types of power consumption: 1. Short term responsiveness Power consumption allowed to significantly exceed sustainable max power of the SoC for relatively short periods E.g. Web Browsing, UI update, HDR processing (first few shots) Power/Performance Burst for responsiveness T >= Tjmax, Tskin Sustained performance Max Sustainable Power Power Optimised Low End 2. Sustained workload Goal is to maximise performance at, or within, the sustainable power envelope Adapt performance allocation to major power consuming entities E.g. Gaming, long benchmarks, video processing Energy constrained use cases Operates under the radar of intelligent power allocation E.g. low power audio A successful thermal management scheme will maximize the user experience for types1 and 2. Time 6

7 Power Consumption Scenarios The illustration to the right depicts 3 distinct types of power consumption: 1. Short term responsiveness Power consumption allowed to significantly exceed sustainable max power of the SoC for relatively short periods E.g. Web Browsing, UI update, HDR processing (first few shots) Power/Performance Burst for responsiveness T >= Tjmax, Tskin Sustained performance Max Sustainable Power Power Optimised Low End 2. Sustained workload Goal is to maximise performance at, or within, the sustainable power envelope Adapt performance allocation to major power consuming entities E.g. Gaming, long benchmarks, video processing Energy constrained use cases Operates under the radar of intelligent power allocation E.g. low power audio A successful thermal management scheme will maximize the user experience for types 1 and 2. Time 7

8 Current Linux Thermal Framework Thermal trip mechanism Trip2 Trip1 Designed to turn on fans, and/or reduce operating frequency Activates on temperature overshoot Does not accommodate power partitioning between different parts of SoC: GPU, CPU, DSP, video Reactive and static 8

9 IPA Thermal Management technology Proactive vs. Reactive Thermal Management Fixed response to temperature crossing a threshold (Reactive) vs. continuously adapting response based on power consumption and thermal headroom (Proactive) Static vs. Dynamic Partitioning A fixed policy for each component based on prior knowledge of its power consumption vs. a dynamic policy that takes into account component behaviour over time Maximising performance within thermal limits is critical Closed-loop control to dynamically allocate power budget, taking into account: Thermal headroom Real-time performance requirements 9

10 Agenda Background and Motivation What is ARM Intelligent Power Allocation? Results Status and Conclusions 10

11 ARM Intelligent Power Allocation Power to Heat Tdie Tskin Performance Requests big LITTLE GPU Real-time CPU & GPU Performance requests SoC SoC IPA Elements: Temperature control Power estimation Performance allocation big LITTLE GPU Allocated Performance Dynamic Allocation by: Performance required Thermal headroom 11

12 Thermal Management Approach Granted Performance Power Arbiter performs two tasks: 1. Keeps system within thermal envelope Proactive power budget via closed loop control Exploits thermal headroom CPU GPU Perf request Perf metrics Granted Performance Perf request Power model Power model Delta Power Estimate Power Estimate Delta Power Estimate PID Power Arbiter Policy Config 2. Dynamic power allocation per actor, based on Performance demand Power models Perf metrics Granted Performance Perf request Power model Power Estimate Delta Power Estimate Power arbitration Temp. Monitor 12 Actors Perf metrics Power Estimate

13 PID Controller and Power Allocation K_d control_temp e t x K_i D tdp Performance Requests Actor 1 Actor N S e e x I S S Pmax Power Allocation Actor1..N weight Actor1..N maxpower x P -rest_of_soc_power Actor 1 Actor N Performance Granted -Temp K_po/pu PID Controller Provides a power budget, based on current and control temp 13

14 PID Controller and Power Allocation K_d control_temp S e -Temp x K_po/pu e t K_i Power Allocation based on: Requested performance e x S Power budget I Policy Produces final output power x D P tdp S Pmax -rest_of_soc_power Performance Requests Actor 1 Actor N Power Allocation Actor 1 Actor N Performance Granted Actor1..N weight Actor1..N maxpower 14

15 Agenda Background and Motivation What is ARM Intelligent Power Allocation? Results Status and Conclusions 17

16 Configuration Platform: Odroid XU3: 4xA15, 4xA7 ARM Mali-T628 Aims: Target control temperature 65 o C Critical temperature 69 o C Static policy: GPU: 4 Trip points, one for each frequency At each trip point, max frequency is reduced Trips at 56, 57, 58, 59 o C A15/A7: 3 trip points at 60, 62, 64 o C IPA: Control temp = 65 o C Hotplug actioned if reached 69 o C Higher priority for A7 18

17 Intelligent Power Allocation on Games Unity The Chase: T/ C 31% higher FPS with IPA Time Enlighten Transporter: T/ C 8% higher FPS with IPA Time 19

18 Intelligent Power Allocation SOLID Temperature Control Unity the Chase: T/ C Enlighten Transporter: T/ C 20

19 % Improvement on Consecutive Benchmark Runs IPA vs. Static 21

20 Running Frequency Intelligent Power Allocation in Action IPA dynamically allocates power to CPU clusters or GPU, based on load Temperature determines available power You set Temperature, IPA caps Frequencies Load determines how power is divided between CPUs (big and LITTLE) and GPU GLB Trex HD [3 Runs] Max OPP big Max OPP LITTLE big LITTLE GPU Max OPP GPU Median filtered chart for clarity Time (s) 22

21 Running Frequency Intelligent Power Allocation in Action IPA dynamically allocates power to CPU clusters or GPU, based on load Temperature determines available power You set Temperature, IPA caps Frequencies Load determines how power is divided between CPUs (big and LITTLE) and GPU GLB Trex HD [3 Runs] Max OPP big Max OPP LITTLE big LITTLE GPU Max OPP GPU 23 Median filtered chart for clarity Device is still cool There are no constraints on frequency Every actor runs at max frequency Time (s)

22 Running Frequency Intelligent Power Allocation in Action IPA dynamically allocates power to CPU clusters or GPU, based on load Temperature determines available power You set Temperature, IPA caps Frequencies Load determines how power is divided between CPUs (big and LITTLE) and GPU GLB Trex HD [3 Runs] Max OPP big Max OPP LITTLE big LITTLE GPU Max OPP GPU Median filtered chart for clarity High load on GPU Low load on CPU GPU gets most of the power Time (s) 24

23 Running Frequency Intelligent Power Allocation in Action IPA dynamically allocates power to CPU clusters or GPU, based on load Temperature determines available power You set Temperature, IPA caps Frequencies Load determines how power is divided between CPUs (big and LITTLE) and GPU GLB Trex HD [3 Runs] Max OPP big Max OPP LITTLE big LITTLE GPU Max OPP GPU 25 Median filtered chart for clarity High load on CPU Low load on GPU CPU gets most of the power Time (s)

24 Running Frequency Intelligent Power Allocation in Action IPA dynamically allocates power to CPU clusters or GPU, based on load Temperature determines available power You set Temperature, IPA caps Frequencies Load determines how power is divided between CPUs (big and LITTLE) and GPU GLB Trex HD [3 Runs] Max OPP big big LITTLE GPU Max OPP LITTLE Max OPP GPU 26 Median filtered chart for clarity As the device gets hotter IPA reduces available power to actors to maintain temperature control Time (s)

25 Agenda Background and Motivation What is ARM Intelligent Power Allocation? Results Status and Conclusions 27

26 IPA upstreaming Status ARM has produced a Linux OS implementation of IPA Demonstrates key concepts and shows benefit on real workloads ARM contributing IPA to Linux kernel RFCs have been sent to linux-pm Latest: v5 RFC: Supported by Mali DDK Check out IPA code for ARM Juno release (includes GPU integration) (tag lsk-3.10-armlt-juno ) Upstream timescales are subject to variation, depending on community feedback ARM, Linaro, ARM partners all contributing & reviewing 28

27 Conclusions ARM Intelligent Power Allocation is designed to maximise performance in the thermal envelope: Proactively adjusts available power budget, based on device temperature Allocates power dynamically among actors, based on performance demand ARM Intelligent Power Allocation shows solid thermal control and better performance than existing Linux thermal framework We are providing Intelligent Power Allocation to the community to as baseline for best-in-class thermal management 29

28 Thank You The trademarks featured in this presentation are registered and/or unregistered trademarks of ARM Limited (or its subsidiaries) in the EU and/or elsewhere. All rights reserved. Any other marks featured may be trademarks of their respective owners 30

ARM Vision for Thermal Management and Energy Aware Scheduling on Linux

ARM Vision for Thermal Management and Energy Aware Scheduling on Linux ARM Vision for Management and Energy Aware Scheduling on Linux Charles Garcia-Tobin, Software Power Architect, ARM Thomas Molgaard, Director of Product Management, ARM ARM Tech Symposia China 2015 November

More information

Intelligent Power Allocation for Consumer & Embedded Thermal Control

Intelligent Power Allocation for Consumer & Embedded Thermal Control Intelligent Power Allocation for Consumer & Embedded Thermal Control Ian Rickards ARM Ltd, Cambridge UK ELC San Diego 5-April-2016 Existing Linux Thermal Framework Trip1 Trip0 Thermal trip mechanism using

More information

ARM big.little Technology Unleashed An Improved User Experience Delivered

ARM big.little Technology Unleashed An Improved User Experience Delivered ARM big.little Technology Unleashed An Improved User Experience Delivered Govind Wathan Product Specialist Cortex -A Mobile & Consumer CPU Products 1 Agenda Introduction to big.little Technology Benefits

More information

Dell Dynamic Power Mode: An Introduction to Power Limits

Dell Dynamic Power Mode: An Introduction to Power Limits Dell Dynamic Power Mode: An Introduction to Power Limits By: Alex Shows, Client Performance Engineering Managing system power is critical to balancing performance, battery life, and operating temperatures.

More information

MediaTek CorePilot. Heterogeneous Multi-Processing Technology. Delivering extreme compute performance with maximum power efficiency

MediaTek CorePilot. Heterogeneous Multi-Processing Technology. Delivering extreme compute performance with maximum power efficiency MediaTek CorePilot Heterogeneous Multi-Processing Technology Delivering extreme compute performance with maximum power efficiency In July 2013, MediaTek delivered the industry s first mobile system on

More information

The Challenges of System Design. Raising Performance and Reducing Power Consumption

The Challenges of System Design. Raising Performance and Reducing Power Consumption The Challenges of System Design Raising Performance and Reducing Power Consumption 1 Agenda The key challenges Visibility for software optimisation Efficiency for improved PPA 2 Product Challenge - Software

More information

Artificial Intelligence Enriched User Experience with ARM Technologies

Artificial Intelligence Enriched User Experience with ARM Technologies Artificial Intelligence Enriched User Experience with ARM Technologies Daniel Heo Senior Segment Manager Mobile, BSG, ARM ARM Tech Forum Singapore July 12 th 2017 Global AI survey: the world is ready 71

More information

Building Ultra-Low Power Wearable SoCs

Building Ultra-Low Power Wearable SoCs Building Ultra-Low Power Wearable SoCs 1 Wearable noun An item that can be worn adjective Easy to wear, suitable for wearing 2 Wearable Opportunity: Fastest Growing Market Segment Projected Growth from

More information

Cut Power Consumption by 5x Without Losing Performance

Cut Power Consumption by 5x Without Losing Performance Cut Power Consumption by 5x Without Losing Performance A big.little Software Strategy Klaas van Gend FAE, Trainer & Consultant The mandatory Klaas-in-a-Plane picture 2 October 10, 2014 LINUXCON EUROPE

More information

Energy efficient mapping of virtual machines

Energy efficient mapping of virtual machines GreenDays@Lille Energy efficient mapping of virtual machines Violaine Villebonnet Thursday 28th November 2013 Supervisor : Georges DA COSTA 2 Current approaches for energy savings in cloud Several actions

More information

A Study on C-group controlled big.little Architecture

A Study on C-group controlled big.little Architecture A Study on C-group controlled big.little Architecture Renesas Electronics Corporation New Solutions Platform Business Division Renesas Solutions Corporation Advanced Software Platform Development Department

More information

Profiling and Debugging OpenCL Applications with ARM Development Tools. October 2014

Profiling and Debugging OpenCL Applications with ARM Development Tools. October 2014 Profiling and Debugging OpenCL Applications with ARM Development Tools October 2014 1 Agenda 1. Introduction to GPU Compute 2. ARM Development Solutions 3. Mali GPU Architecture 4. Using ARM DS-5 Streamline

More information

ARM instruction sets and CPUs for wide-ranging applications

ARM instruction sets and CPUs for wide-ranging applications ARM instruction sets and CPUs for wide-ranging applications Chris Turner Director, CPU technology marketing ARM Tech Forum Taipei July 4 th 2017 ARM computing is everywhere #1 shipping GPU in the world

More information

ARM Multimedia IP: working together to drive down system power and bandwidth

ARM Multimedia IP: working together to drive down system power and bandwidth ARM Multimedia IP: working together to drive down system power and bandwidth Speaker: Robert Kong ARM China FAE Author: Sean Ellis ARM Architect 1 Agenda System power overview Bandwidth, bandwidth, bandwidth!

More information

Building blocks for 64-bit Systems Development of System IP in ARM

Building blocks for 64-bit Systems Development of System IP in ARM Building blocks for 64-bit Systems Development of System IP in ARM Research seminar @ University of York January 2015 Stuart Kenny stuart.kenny@arm.com 1 2 64-bit Mobile Devices The Mobile Consumer Expects

More information

MediaTek CorePilot 2.0. Delivering extreme compute performance with maximum power efficiency

MediaTek CorePilot 2.0. Delivering extreme compute performance with maximum power efficiency MediaTek CorePilot 2.0 Heterogeneous Computing Technology Delivering extreme compute performance with maximum power efficiency In July 2013, MediaTek delivered the industry s first mobile system on a chip

More information

Achieving Console Quality Games on Mobile

Achieving Console Quality Games on Mobile Achieving Console Quality Games on Mobile Peter Harris, Senior Principal Engineer, ARM Unai Landa, CTO, Digital Legends Jon Kirkham, Staff Engineer, ARM GDC 2017 Agenda Premium smartphone in 2017 ARM Cortex

More information

TZMP-1 Software Reference Implementation. Ken Liu 2018-Mar-12

TZMP-1 Software Reference Implementation. Ken Liu 2018-Mar-12 TZMP-1 Software Reference Implementation Ken Liu 2018-Mar-12 2018 Arm Limited Content DRM Applications and Secure Video Path Regular Secure Video Path Design with Trustzone TZMP1 Design Concepts Reference

More information

Enabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager

Enabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager Enabling a Richer Multimedia Experience with GPU Compute Roberto Mijat Visual Computing Marketing Manager 1 What is GPU Compute Operating System and most application processing continue to reside on the

More information

Exploring System Coherency and Maximizing Performance of Mobile Memory Systems

Exploring System Coherency and Maximizing Performance of Mobile Memory Systems Exploring System Coherency and Maximizing Performance of Mobile Memory Systems Shanghai: William Orme, Strategic Marketing Manager of SSG Beijing & Shenzhen: Mayank Sharma, Product Manager of SSG ARM Tech

More information

Optimizing Cache Coherent Subsystem Architecture for Heterogeneous Multicore SoCs

Optimizing Cache Coherent Subsystem Architecture for Heterogeneous Multicore SoCs Optimizing Cache Coherent Subsystem Architecture for Heterogeneous Multicore SoCs Niu Feng Technical Specialist, ARM Tech Symposia 2016 Agenda Introduction Challenges: Optimizing cache coherent subsystem

More information

for Power Energy and

for Power Energy and Engineered for Power Management: Dell PowerEdge Servers Are Designed to Help Save Energy and Reduce Costs ABSTRACT Keeping up with the rising cost of energy is one of the greatest challenges facing IT

More information

8205-E6C ENERGY STAR Power and Performance Data Sheet

8205-E6C ENERGY STAR Power and Performance Data Sheet 8205-E6C ENERGY STAR Power and Performance Data Sheet ii 8205-E6C ENERGY STAR Power and Performance Data Sheet Contents 8205-E6C ENERGY STAR Power and Performance Data Sheet........ 1 iii iv 8205-E6C ENERGY

More information

Dynamic Power Optimization for Higher Server Density Racks A Baidu Case Study with Intel Dynamic Power Technology

Dynamic Power Optimization for Higher Server Density Racks A Baidu Case Study with Intel Dynamic Power Technology Dynamic Power Optimization for Higher Server Density Racks A Baidu Case Study with Intel Dynamic Power Technology Executive Summary Intel s Digital Enterprise Group partnered with Baidu.com conducted a

More information

8233-E8B 3x6-core ENERGY STAR Power and Performance Data Sheet

8233-E8B 3x6-core ENERGY STAR Power and Performance Data Sheet 8233-E8B 3x6-core ENERGY STAR Power and Performance Data Sheet ii 8233-E8B 3x6-core ENERGY STAR Power and Performance Data Sheet Contents 8233-E8B 3x6-core ENERGY STAR Power and Performance Data Sheet...

More information

Mali-G72 Enabling tomorrow s technology today

Mali-G72 Enabling tomorrow s technology today Mali-G72 Enabling tomorrow s technology today Alan Tsai Senior Regional Marketing Manager Media Processing Group, ARM ARM Tech Forum Taipei July 4 th 2017 Mali High Performance GPU success 2 Mali-G71 in

More information

POWER-AWARE SOFTWARE ON ARM. Paul Fox

POWER-AWARE SOFTWARE ON ARM. Paul Fox POWER-AWARE SOFTWARE ON ARM Paul Fox OUTLINE MOTIVATION LINUX POWER MANAGEMENT INTERFACES A UNIFIED POWER MANAGEMENT SYSTEM EXPERIMENTAL RESULTS AND FUTURE WORK 2 MOTIVATION MOTIVATION» ARM SoCs designed

More information

Tizen Power Management Service with PASS (Power-Aware System Service)

Tizen Power Management Service with PASS (Power-Aware System Service) Tizen Power Management Service with PASS (Power-Aware System Service) 1 Chanwoo Choi cw00.choi@samsung.com S/W R&D Center, Samsung Electronics Copyright 2017 Samsung. All Rights Reserved. Contents Power-Management

More information

i.mx 7 Dual/Solo Product Lifetime Usage

i.mx 7 Dual/Solo Product Lifetime Usage NXP Semiconductors Document Number: AN5334 Application Note Rev. 1, 05/2017 i.mx 7 Dual/Solo Product Lifetime Usage 1. Introduction This document describes the estimated product lifetimes for the i.mx

More information

ARMv8-A CPU Architecture Overview

ARMv8-A CPU Architecture Overview ARMv8-A CPU Architecture Overview Chris Shore Training Manager, ARM ARM Game Developer Day, London 03/12/2015 Chris Shore ARM Training Manager With ARM for 16 years Managing customer training for 15 years

More information

PowerExecutive. Tom Brey IBM Agenda. Why PowerExecutive. Fundamentals of PowerExecutive. - The Data Center Power/Cooling Crisis.

PowerExecutive. Tom Brey IBM Agenda. Why PowerExecutive. Fundamentals of PowerExecutive. - The Data Center Power/Cooling Crisis. PowerExecutive IBM Agenda Why PowerExecutive - The Data Center Power/Cooling Crisis Fundamentals of PowerExecutive 1 The Data Center Power/Cooling Crisis Customers want more IT processing cycles to run

More information

The Benefits of GPU Compute on ARM Mali GPUs

The Benefits of GPU Compute on ARM Mali GPUs The Benefits of GPU Compute on ARM Mali GPUs Tim Hartley 1 SEMICON Europa 2014 ARM Introduction World leading semiconductor IP Founded in 1990 1060 processor licenses sold to more than 350 companies >

More information

Power Capping Linux. Len Brown, Jacob Pan, Srinivas Pandruvada

Power Capping Linux. Len Brown, Jacob Pan, Srinivas Pandruvada Power Capping Linux Len Brown, Jacob Pan, Srinivas Pandruvada Agenda Context System Power Management Issues Power Capping Overview Power capping participants Recommendation Linux Power Capping Framework

More information

PM-QoS? Naah..It is PnP QoS

PM-QoS? Naah..It is PnP QoS PM-QoS? Naah..It is PnP QoS Sundar Iyer, Mark Gross, Premanand Sakarda, Ajaya Durg, Muthukumar Kalyan, Anand Bodas, Manoj Dawarwadikar Mobile & Comms. Group, Intel Special Thanks to: Ticky Thakkar, Jasmin

More information

Towards Power Management for FreeBSD

Towards Power Management for FreeBSD Towards Power Management for FreeBSD Robin Randhawa robin.randhawa@arm.com FreeBSD Developer Summit Computer Laboratory University of Cambridge August 2015 Agenda An overview of Energy Aware Scheduling

More information

Srinivas Chennupaty. Intel Corporation, 2018

Srinivas Chennupaty. Intel Corporation, 2018 Srinivas Chennupaty Intel Corporation, 2018 Notices and Disclaimers This document contains information on products, services and/or processes in development. All information provided here is subject to

More information

Efficient Evaluation and Management of Temperature and Reliability for Multiprocessor Systems

Efficient Evaluation and Management of Temperature and Reliability for Multiprocessor Systems Efficient Evaluation and Management of Temperature and Reliability for Multiprocessor Systems Ayse K. Coskun Electrical and Computer Engineering Department Boston University http://people.bu.edu/acoskun

More information

Developing the Bifrost GPU architecture for mainstream graphics

Developing the Bifrost GPU architecture for mainstream graphics Developing the Bifrost GPU architecture for mainstream graphics Anand Patel Senior Product Manager, Media Processing Group ARM Tech Symposia India December 7 th 2016 Graphics processing drivers Virtual

More information

Advanced IP solutions enabling the autonomous driving revolution

Advanced IP solutions enabling the autonomous driving revolution Advanced IP solutions enabling the autonomous driving revolution Chris Turner Director, Emerging Technology & Strategy, Embedded & Automotive Arm Shanghai, Beijing, Shenzhen Arm Tech Symposia 2017 Agenda

More information

An introduction to Machine Learning silicon

An introduction to Machine Learning silicon An introduction to Machine Learning silicon November 28 2017 Insight for Technology Investors AI/ML terminology Artificial Intelligence Machine Learning Deep Learning Algorithms: CNNs, RNNs, etc. Additional

More information

Designing, developing, debugging ARM Cortex-A and Cortex-M heterogeneous multi-processor systems

Designing, developing, debugging ARM Cortex-A and Cortex-M heterogeneous multi-processor systems Designing, developing, debugging ARM and heterogeneous multi-processor systems Kinjal Dave Senior Product Manager, ARM ARM Tech Symposia India December 7 th 2016 Topics Introduction System design Software

More information

DVFS Space Exploration in Power-Constrained Processing-in-Memory Systems

DVFS Space Exploration in Power-Constrained Processing-in-Memory Systems DVFS Space Exploration in Power-Constrained Processing-in-Memory Systems Marko Scrbak and Krishna M. Kavi Computer Systems Research Laboratory Department of Computer Science & Engineering University of

More information

Moorestown Platform: Based on Lincroft SoC Designed for Next Generation Smartphones

Moorestown Platform: Based on Lincroft SoC Designed for Next Generation Smartphones Moorestown Platform: Based on Lincroft SoC Designed for Next Generation Smartphones HOT CHIPS 2009 August 24 2009 Rajesh Patel Lead Architect, Lincroft SoC Intel Corporation Legal Disclaimer INFORMATION

More information

Square Pegs in Round holes. Paweł Moll

Square Pegs in Round holes. Paweł Moll Square Pegs in Round holes or or System System Level Level Performance Performance Data Data and and perf perf Paweł Moll 1 The plan Problem definition s Systems perf and non-s Examples

More information

Jae Wook Lee. SIC R&D Lab. LG Electronics

Jae Wook Lee. SIC R&D Lab. LG Electronics Jae Wook Lee SIC R&D Lab. LG Electronics Contents Introduction Why power validation on mobile application processor? Then, what to validate? Who is in charge of validation? Power Validation Components

More information

Next Generation Visual Computing

Next Generation Visual Computing Next Generation Visual Computing (Making GPU Computing a Reality with Mali ) Taipei, 18 June 2013 Roberto Mijat ARM Addressing Computational Challenges Trends Growing display sizes and resolutions Increasing

More information

The mobile computing evolution. The Griffin architecture. Memory enhancements. Power management. Thermal management

The mobile computing evolution. The Griffin architecture. Memory enhancements. Power management. Thermal management Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing evolution The Griffin architecture Memory enhancements Power management

More information

Designing Security & Trust into Connected Devices

Designing Security & Trust into Connected Devices Designing Security & Trust into Connected Devices Eric Wang Senior Technical Marketing Manager Shenzhen / ARM Tech Forum / The Ritz-Carlton June 14, 2016 Agenda Introduction Security Foundations on Cortex-A

More information

F28HS Hardware-Software Interface: Systems Programming

F28HS Hardware-Software Interface: Systems Programming F28HS Hardware-Software Interface: Systems Programming Hans-Wolfgang Loidl School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh Semester 2 2017/18 0 No proprietary software has

More information

QoS Handling with DVFS (CPUfreq & Devfreq)

QoS Handling with DVFS (CPUfreq & Devfreq) QoS Handling with DVFS (CPUfreq & Devfreq) MyungJoo Ham SW Center, 1 Performance Issues of DVFS Performance Sucks w/ DVFS! Battery-life Still Matters More Devices (components) w/ DVFS More Performance

More information

How to Build Optimized ML Applications with Arm Software

How to Build Optimized ML Applications with Arm Software How to Build Optimized ML Applications with Arm Software Arm Technical Symposia 2018 Arm K.K. Senior FAE Ryuji Tanaka Overview Today we will talk about applied machine learning (ML) on Arm. My aim for

More information

R goes Mobile: Efficient Scheduling for Parallel R Programs on Heterogeneous Embedded Systems

R goes Mobile: Efficient Scheduling for Parallel R Programs on Heterogeneous Embedded Systems R goes Mobile: Efficient Scheduling for Parallel R Programs on Heterogeneous Embedded Systems, Andreas Lang Olaf Neugebauer, Peter Marwedel 03/07/2017 SFB 876 Parallel Machine Learning Algorithms Challenge:

More information

Power Management in Intel Architecture Servers

Power Management in Intel Architecture Servers Power Management in Intel Architecture Servers White Paper Intel Architecture Servers During the last decade, Intel has added several new technologies that enable users to improve the power efficiency

More information

HPC projects. Grischa Bolls

HPC projects. Grischa Bolls HPC projects Grischa Bolls Outline Why projects? 7th Framework Programme Infrastructure stack IDataCool, CoolMuc Mont-Blanc Poject Deep Project Exa2Green Project 2 Why projects? Pave the way for exascale

More information

Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases. Steve Steele, ARM

Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases. Steve Steele, ARM Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases Steve Steele, ARM 1 Today s Computational Challenges Trends Growing display sizes and resolutions, richer

More information

How to Build Optimized ML Applications with Arm Software

How to Build Optimized ML Applications with Arm Software How to Build Optimized ML Applications with Arm Software Arm Technical Symposia 2018 ML Group Overview Today we will talk about applied machine learning (ML) on Arm. My aim for today is to show you just

More information

Enabling Arm DynamIQ support. Dan Handley (Arm) Ionela Voinescu (Arm) Vincent Guittot (Linaro)

Enabling Arm DynamIQ support. Dan Handley (Arm) Ionela Voinescu (Arm) Vincent Guittot (Linaro) Enabling Arm DynamIQ support Dan Handley (Arm) Ionela Voinescu (Arm) Vincent Guittot (Linaro) Agenda DynamIQ introduction DynamIQ and Arm Trusted Firmware OS Power Management with DynamIQ L3 partial power-down

More information

UTILIZING A BIG.LITTLE TM SOLUTION IN AUTOMOTIVE

UTILIZING A BIG.LITTLE TM SOLUTION IN AUTOMOTIVE UTILIZING A BIG.LITTLE TM SOLUTION IN AUTOMOTIVE JUN. 20, 2018 YOSHIYUKI ITO AUTOMOTIVE INFORMATION SOLUTION BUSINESS DIVISION RENESAS ELECTRONICS CORPORATION Today s Topics & Goal Requirement for big.little

More information

Optimizing and Profiling Unity Games for Mobile Platforms. Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June

Optimizing and Profiling Unity Games for Mobile Platforms. Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June Optimizing and Profiling Unity Games for Mobile Platforms Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June 1 Agenda Introduction ARM and the presenter Preliminary knowledge

More information

Power management for in-vehicle infotainment systems

Power management for in-vehicle infotainment systems Automotive Linux Summit 2017 Power management for in-vehicle infotainment systems 2017/05/31 Takahiko Gomi Automotive Information Solution Business Division Renesas Electronics Corporation 1 Who am I?

More information

Comprehensive Arm Solutions for Innovative Machine Learning (ML) and Computer Vision (CV) Applications

Comprehensive Arm Solutions for Innovative Machine Learning (ML) and Computer Vision (CV) Applications Comprehensive Arm Solutions for Innovative Machine Learning (ML) and Computer Vision (CV) Applications Helena Zheng ML Group, Arm Arm Technical Symposia 2017, Taipei Machine Learning is a Subset of Artificial

More information

Integrating CPU and GPU, The ARM Methodology. Edvard Sørgård, Senior Principal Graphics Architect, ARM Ian Rickards, Senior Product Manager, ARM

Integrating CPU and GPU, The ARM Methodology. Edvard Sørgård, Senior Principal Graphics Architect, ARM Ian Rickards, Senior Product Manager, ARM Integrating CPU and GPU, The ARM Methodology Edvard Sørgård, Senior Principal Graphics Architect, ARM Ian Rickards, Senior Product Manager, ARM The ARM Business Model Global leader in the development of

More information

SANDPIPER: BLACK-BOX AND GRAY-BOX STRATEGIES FOR VIRTUAL MACHINE MIGRATION

SANDPIPER: BLACK-BOX AND GRAY-BOX STRATEGIES FOR VIRTUAL MACHINE MIGRATION SANDPIPER: BLACK-BOX AND GRAY-BOX STRATEGIES FOR VIRTUAL MACHINE MIGRATION Timothy Wood, Prashant Shenoy, Arun Venkataramani, and Mazin Yousif * University of Massachusetts Amherst * Intel, Portland Data

More information

Programming for Multicore & ARM big.little Technology. Ed Plowman Director of Solutions Architecture Media Processing Group, ARM

Programming for Multicore & ARM big.little Technology. Ed Plowman Director of Solutions Architecture Media Processing Group, ARM Programming for Multicore & ARM big.little Technology Ed Plowman Director of Solutions Architecture Media Processing Group, ARM 1 Multicore & ARM big.little Technology The case for multiprocessing Platform

More information

Energy Efficient Big Data Processing at the Software Level

Energy Efficient Big Data Processing at the Software Level 2014/9/19 Energy Efficient Big Data Processing at the Software Level Da-Qi Ren, Zane Wei Huawei US R&D Center Santa Clara, CA 95050 Power Measurement on Big Data Systems 1. If the System Under Test (SUT)

More information

Helio X20: The First Tri-Gear Mobile SoC with CorePilot 3.0 Technology

Helio X20: The First Tri-Gear Mobile SoC with CorePilot 3.0 Technology Helio X20: The First Tri-Gear Mobile SoC with CorePilot 3.0 Technology Tsung-Yao Lin, g-hsien Lee, Loda Chou, Clavin Peng, Jih-g Hsu, Jia-g Chen, John-CC Chen, Alex Chiou, Artis Chiu, David Lee, Carrie

More information

LCA14-104: GTS- A solution to support ARM s big.little technology. Mon-3-Mar, 11:15am, Mathieu Poirier

LCA14-104: GTS- A solution to support ARM s big.little technology. Mon-3-Mar, 11:15am, Mathieu Poirier LCA14-104: GTS- A solution to support ARM s big.little technology Mon-3-Mar, 11:15am, Mathieu Poirier Today s Presentation: Things to know about Global Task Scheduling (GTS). MP patchset description and

More information

IT Level Power Provisioning Business Continuity and Efficiency at NTT

IT Level Power Provisioning Business Continuity and Efficiency at NTT IT Level Power Provisioning Business Continuity and Efficiency at NTT Henry M.L. Wong Intel Eco-Technology Program Office Environment Global CO 2 Emissions ICT 2% 98% Source: The Climate Group Economic

More information

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis EU H2020 FETHPC project ANTAREX (g.a. 671623) EU FP7 ERC Project MULTITHERMAN (g.a.291125) HPC

More information

Accurate and Stable Empirical CPU Power Modelling for Multi- and Many-Core Systems

Accurate and Stable Empirical CPU Power Modelling for Multi- and Many-Core Systems Accurate and Stable Empirical CPU Power Modelling for Multi- and Many-Core Systems Matthew J. Walker*, Stephan Diestelhorst, Geoff V. Merrett* and Bashir M. Al-Hashimi* *University of Southampton Arm Ltd.

More information

Reliable Power and Thermal Management in The Data Center

Reliable Power and Thermal Management in The Data Center Reliable Power and Thermal Management in The Data Center Deva Bodas Corporation April 19, 2004 Deva.Bodas@.com 1 Agenda 2 Data center manageability challenges & trends Current state of power & thermal

More information

SAPPHIRE TOXIC R9 280X 3GB GDDR5

SAPPHIRE TOXIC R9 280X 3GB GDDR5 SAPPHIRE TOXIC R9 280X 3GB GDDR5 Specification Display Support Output GPU Video Memory Dimension Software Accessory 5 x Maximum Display Monitor(s) support 1 x HDMI (with 3D) 2 x Mini-DisplayPort 1 x Single-Link

More information

INSIDE THE PROTOTYPE BODENTYPE DATA CENTER: OPENSOURCE MONITORING OF OPENSOURCE HARDWARE

INSIDE THE PROTOTYPE BODENTYPE DATA CENTER: OPENSOURCE MONITORING OF OPENSOURCE HARDWARE INSIDE THE PROTOTYPE BODENTYPE DATA CENTER: OPENSOURCE MONITORING OF OPENSOURCE HARDWARE Dr Jon Summers Scientific Leader in Data Centers, RISE Adjunct Professor of Fluid Mechanics, Lulea University of

More information

Power C a p p i n g. Ali Larijani Microsoft F/W Engineering Manager. David Locklear Intel Platform Architect

Power C a p p i n g. Ali Larijani Microsoft F/W Engineering Manager. David Locklear Intel Platform Architect OPEN CLOUD SERVER PROJECT OLYMPUS Power C a p p i n g Ali Larijani Microsoft F/W Engineering Manager David Locklear Intel Platform Architect Power Capping Agenda: Power capping benefits at Data center

More information

Mali-G72: Enabling tomorrow s technology today

Mali-G72: Enabling tomorrow s technology today Mali-G72: Enabling tomorrow s technology today Ploutarchos Galatsopoulos Senior Product Manager Media Processing Group, ARM ARM Tech Forum Korea June 28 th 2017 ARM Mali: The world s #1 shipping GPU ~50%

More information

Dominick Lovicott Enterprise Thermal Engineering. One Dell Way One Dell Way Round Rock, Texas

Dominick Lovicott Enterprise Thermal Engineering. One Dell Way One Dell Way Round Rock, Texas One Dell Way One Dell Way Round Rock, Texas 78682 www.dell.com DELL ENTERPRISE WHITEPAPER THERMAL DESIGN OF THE DELL POWEREDGE T610, R610, AND R710 SERVERS Dominick Lovicott Enterprise Thermal Engineering

More information

Benchmark of XU4 with Active Cooler and XU4Q with Passive Cooler

Benchmark of XU4 with Active Cooler and XU4Q with Passive Cooler 2017/12/07 20:24 1/8 Benchmark of XU4 with Active Cooler and XU4Q with Passive Cooler Benchmark of XU4 with Active Cooler and XU4Q with Passive Cooler This wiki page shows benchmark test results to analysis

More information

Quantifying the Energy Cost of Data Movement for Emerging Smartphone Workloads on Mobile Platforms

Quantifying the Energy Cost of Data Movement for Emerging Smartphone Workloads on Mobile Platforms Quantifying the Energy Cost of Data Movement for Emerging Smartphone Workloads on Mobile Platforms Arizona State University Dhinakaran Pandiyan(dpandiya@asu.edu) and Carole-Jean Wu(carole-jean.wu@asu.edu

More information

System Design of Kepler Based HPC Solutions. Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering.

System Design of Kepler Based HPC Solutions. Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering. System Design of Kepler Based HPC Solutions Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering. Introduction The System Level View K20 GPU is a powerful parallel processor! K20 has

More information

Enable AI on Mobile Devices

Enable AI on Mobile Devices Enable AI on Mobile Devices Scott Wang 王舒翀 Senior Segment Manager Mobile, BSG ARM Tech Forum 2017 14 th June 2017, Shenzhen AI is moving from core to edge Ubiquitous AI Safe and autonomous Mixed reality

More information

ARM processors driving automotive innovation

ARM processors driving automotive innovation ARM processors driving automotive innovation Chris Turner Director of advanced technology marketing, CPU group ARM tech forums, Seoul and Taipei June/July 2016 The ultimate intelligent connected device

More information

Core 2 vs I-series. How Far Have We Really Come?

Core 2 vs I-series. How Far Have We Really Come? Core 2 vs I-series How Far Have We Really Come? Appendix 1. Introduction 2. Road map 3. General specifications 4. Hardware subtleties 5. Technology difference 6. Advantages of the new architecture 7. Conclusion

More information

Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors

Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors Solution Brief December, 2018 2018 Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors HIGHLIGHTS o The AMD EPYC SoC brings a new balance to the datacenter. Utilizing an x86-architecture,

More information

ARM Performance Libraries Current and future interests

ARM Performance Libraries Current and future interests ARM Performance Libraries Current and future interests Chris Goodyer Senior Engineering Manager, HPC Software Workshop on Batched, Reproducible, and Reduced Precision BLAS 25 th February 2017 ARM Performance

More information

Dell OpenManage Power Center s Power Policies for 12 th -Generation Servers

Dell OpenManage Power Center s Power Policies for 12 th -Generation Servers Dell OpenManage Power Center s Power Policies for 12 th -Generation Servers This Dell white paper describes the advantages of using the Dell OpenManage Power Center to set power policies in a data center.

More information

Oracle Database 10g Resource Manager. An Oracle White Paper October 2005

Oracle Database 10g Resource Manager. An Oracle White Paper October 2005 Oracle Database 10g Resource Manager An Oracle White Paper October 2005 Oracle Database 10g Resource Manager INTRODUCTION... 3 SYSTEM AND RESOURCE MANAGEMENT... 3 ESTABLISHING RESOURCE PLANS AND POLICIES...

More information

OpenManage Power Center Demo Guide for https://demos.dell.com

OpenManage Power Center Demo Guide for https://demos.dell.com OpenManage Power Center Demo Guide for https://demos.dell.com Contents Introduction... 3 Lab 1 Demo Environment... 6 Lab 2 Change the default settings... 7 Lab 3 Discover the devices... 8 Lab 4 Group Creation

More information

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis EU H2020 FETHPC project ANTAREX (g.a. 671623) EU FP7 ERC Project MULTITHERMAN (g.a.291125) EETHPC,

More information

HPE Datacenter Care for SAP and SAP HANA Datacenter Care Addendum

HPE Datacenter Care for SAP and SAP HANA Datacenter Care Addendum HPE Datacenter Care for SAP and SAP HANA Datacenter Care Addendum This addendum to the HPE Datacenter Care Service data sheet describes HPE Datacenter Care SAP and SAP HANA service features, which are

More information

Intel Speed Select Technology Base Frequency - Enhancing Performance

Intel Speed Select Technology Base Frequency - Enhancing Performance Intel Speed Select Technology Base Frequency - Enhancing Performance Application Note April 2019 Document Number: 338928-001 You may not use or facilitate the use of this document in connection with any

More information

Data Center Energy Efficiency Using Intel Intelligent Power Node Manager and Intel Data Center Manager

Data Center Energy Efficiency Using Intel Intelligent Power Node Manager and Intel Data Center Manager Data Center Energy Efficiency Using Intel Intelligent Power Node Manager and Intel Data Center Manager Deploying Intel Intelligent Power Node Manager and Intel Data Center Manager with a proper power policy

More information

LCA14-412: GPGPU on ARM SoC. Thu 6 March, 2.00pm, T.Gall, G.Pitney

LCA14-412: GPGPU on ARM SoC. Thu 6 March, 2.00pm, T.Gall, G.Pitney LCA14-412: GPGPU on ARM SoC Thu 6 March, 2.00pm, T.Gall, G.Pitney Agenda Shamrock - Gil Pitney sqlite accelerated with OpenCL - Tom Gall GPGPU Goals Recognizing that: GPUs are much more energy efficient

More information

CHALLENGES OF TODAY'S COMPLEX SOC: PERFORMANCE VERIFICATION PANKAJ SINGH, MALATHI CHIKKANNA

CHALLENGES OF TODAY'S COMPLEX SOC: PERFORMANCE VERIFICATION PANKAJ SINGH, MALATHI CHIKKANNA CHALLENGES OF TODAY'S COMPLEX SOC: PERFORMANCE VERIFICATION PANKAJ SINGH, MALATHI CHIKKANNA INTRODUCTION Rapid progress in Semiconductor Technology Numerous circuits soldered ona printed circuit board

More information

Efficient Programming for Multicore Processor Heterogeneity: OpenMP Versus OmpSs

Efficient Programming for Multicore Processor Heterogeneity: OpenMP Versus OmpSs Efficient Programming for Multicore Processor Heterogeneity: OpenMP Versus OmpSs Anastasiia Butko, Lawrence Berkeley National Laboratory F. Bruguier, A. Gamatié, G Sassatelli, LIRMM/CNRS/UM 2 Heterogeneity:

More information

Each Milliwatt Matters

Each Milliwatt Matters Each Milliwatt Matters Ultra High Efficiency Application Processors Govind Wathan Product Manager, CPG ARM Tech Symposia China 2015 November 2015 Ultra High Efficiency Processors Used in Diverse Markets

More information

fitlet-rm specifications

fitlet-rm specifications fitlet-rm specifications fitlet-rm Rugged and Robust fitlet-rm is a ruggedized miniature fanless PC housed in an all-metal housing. It provides excellent durability at extreme temperatures and under conditions

More information

Intel Atom Processor D2000 Series and N2000 Series Embedded Application Power Guideline Addendum January 2012

Intel Atom Processor D2000 Series and N2000 Series Embedded Application Power Guideline Addendum January 2012 Intel Atom Processor D2000 Series and N2000 Series Embedded Application Power Guideline Addendum January 2012 Document Number: 326673-001 Background INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION

More information

LPGPU2 Font Renderer App

LPGPU2 Font Renderer App LPGPU2 Font Renderer App Performance Analysis Introduction As part of LPGPU2 Work Package 3, a font rendering app was developed to research the profiling characteristics of different font rendering algorithms.

More information

Tips and Tricks: Designing low power Native and WebApps. Harita Chilukuri and Abhishek Dhanotia

Tips and Tricks: Designing low power Native and WebApps. Harita Chilukuri and Abhishek Dhanotia Tips and Tricks: Designing low power Native and WebApps Harita Chilukuri and Abhishek Dhanotia Acknowledgements William Baughman for his help with the browser analysis Ross Burton & Thomas Wood for information

More information

Inside VR on Mobile. Sam Martin Graphics Architect GDC 2016

Inside VR on Mobile. Sam Martin Graphics Architect GDC 2016 Inside VR on Mobile Sam Martin Graphics Architect GDC 2016 VR Today Emerging technology Main mobile VR ecosystems Google Cardboard Samsung GearVR In this talk: Latency Multiple views Performance tuning

More information