The Evolution of the ARM Architecture Towards Big Data and the Data-Centre
|
|
- Theodore Lynch
- 5 years ago
- Views:
Transcription
1 The Evolution of the ARM Architecture Towards Big Data and the Data-Centre 8th Workshop on Virtualization in High-Performance Cloud Computing (VHPC'13) held in conjunction with SC 13, Denver, Colorado John Goodacre Director, Technology and Systems ARM Processor Division ARM Cambridge 1
2 Abstract In recent years the rapid evolution of the ARM Architecture has lead to a potential inflection point where the architectural and compute capabilities the ARM partners deliver into the low cost, low power, consumer focused products has direct applicability to the requirement of compute in the data centre and the processing of big data. This talk will review these capabilities and the discuss the potential impact to the data centre. Finally, we'll take a glance into a project that aims to use a holistic approach to compute, ranging from 3D fabrication of an application specific, scalable compute element with virtualized access to system level resources so to balance to cost and requirement between compute, IO and memory. 2
3 ARM: Technology Supplier Advanced digital products are incorporating more and more ARM technology from processor and multimedia IP to software Processor IP Design of the brain of the chip Physical IP Software development tools Design of the building blocks of the chip 3
4 ARM IP Business Model Our customers are primarily the Semiconductor partners We sell mostly to their OEM We work with our customer to support the ecosystem 4
5 Various IP Licensing Models Arch. Subscription Implementation Term Single Use Design Foundry Program Higher license value Higher IP value Greater commercial / technical interaction Increasing internal investment by partner Increasing ARM / Partner business knowledge Greater flexibility Greater commercial / technical interaction Platform licenses End User license Free Library Program DesignStart Academic / Research Processors Physical IP 5
6 ARM Architecture Evolution 6
7 Performance, Functionality ARM Applications Processors Cortex-A Series Low-Power Leadership Cortex-A15 >2GHz+ in 28HPM Virtualization 1TB physical addressing big.little with Cortex-A7 Cortex-A57 ARMv8 64-bit Same power, 3x the performance of todays superphones 40LP Cortex-A8 Cortex-A5 Cortex-A9 Shipping in mobile since nd generation 1-4X SMP 4x1750DMIPS@700MHz+ in Cortex-A53 ARMv8 64-bit Same performance, ¼ the power of todays superphones Cortex-A7 1/5 the power of Cortex-A15 Architectural alignment with Cortex-A Future 7
8 ARM VIRTUALIZATION EXTENSIONS 8
9 Capabilities introduced in Cortex-A15 Full backwards compatibility with the Cortex-A9 Supporting the ARMv7 Architecture Addition of Virtualization Extension (VE) Run multiple OS binary instances simultaneously Isolates multiple work environments and data Supporting Large Physical Addressing Extensions (LPAE) Ability to use up to 1TB of physical memory With AMBA 4 System Coherency (AMBA-ACE) Other cached devices can be coherent with processor Many core multiprocessor scalability Basis of concurrent big.little Processing 9
10 Large Physical Addressing 40-bit physical addressing Virtual memory (apps and OS) still has 32bit address space However support more memory resident applications Offering up to 1 TB of physical address space Supporting multiple 32bit OS images through virtualization What does this mean for ARM based systems? More applications resident at the same time Multiple resident virtualized operating systems Common global physical address in many-core 10
11 Virtualization Extensions: The Basics New Non-secure level of privilege to hold Hypervisor Hyp mode New mechanisms avoid the need Hypervisor intervention for: Guest OS Interrupt masking bits Guest OS page table management Guest OS Device Drivers due to Hypervisor memory relocation Guest OS communication with the interrupt controller (GIC) New traps into Hyp mode for: ID register accesses and idling (WFI/WFE) Miscellaneous difficult System Control Register cases New mechanisms to improve: Guest OS Load/Store emulation by the Hypervisor Emulation of trapped instructions through syndromes 11
12 How does ARM do Virtualization Extensions to the v7-a Architecture, available on the Cortex -A15, Cortex-A7 CPUs and future processors Second stage of address translation (separate page tables) Functionality for virtualizing interrupts inside the Interrupt Controller Functionality for virtualizing all CPU features, including CP15 Option of a MMU within the system to help virtualize IO Hypervisor runs in new Hyp exception mode / privilege HVC (Hypervisor Call) instruction to enter Hyp mode Uses previously unused entry (0X14 offset) in vector table for hypervisor traps Hyp mode exception link register, SPSR, stack pointer Hypervisor Control Register (HCR) marks virtualized resources Hypervisor Syndrome Register (HSR) for Hyp mode entry reason 12
13 Exceptions Exception Returns Virtualization: Third Privilege Guest OS same kernel/user privilege structure HYP mode higher privilege than OS kernel level VMM controls wide range of OS accesses Hardware maintains TZ security (4 th privilege) Non-secure State User Mode (Non-privileged) 1 Secure State App1 App2 App1 App2 Secure Apps Guest Operating System1 Guest Operating System2 Supervisor Mode (Privileged) 2 Secure Operating System Virtual Machine Monitor / Hypervisor Hyp Mode (More Privileged) 3 TrustZone Secure Monitor (Highest Privilege) 13
14 Virtual address map of each application Physical Address Map Memory the Classic Resource Before virtualisation the OS owns the memory Allocates areas of memory to the different applications Virtual Memory commonly used in rich operating systems Translation s from translation table (owned by the OS) 14
15 Virtual Memory in Two Stages Stage 1 translation owned by each Guest OS Stage 2 translation owned by the VMM Hardware has 2-stage memory translation Tables from Guest OS translate VA to IPA Second set of tables from VMM translate IPA to PA Allows aborts to be routed to appropriate software layer Virtual address (VA) map of each App on each Guest OS 15 Intermediate Physical address map of each Guest OS (IPA) Physical Address (PA) Map
16 Classic Issue: Interrupts An Interrupt might need to be routed to one of: Current or different GuestOS Hypervisor OS/RTOS running in the secure TrustZone environment Basic model of the ARM virtualisation extensions: Physical interrupts are taken initially in the Hypervisor If the Interrupt should go to a GuestOS : Hypervisor maps a virtual interrupt for that GuestOS App1 App2 Virtual Interrupt App1 Guest OS 1 App2 App1 Guest OS 2 App2 Operating System Physical Interrupt Physical Interrupt VMM System without virtualisation System with virtualisation 16
17 Interrupt Virtualization Virtualisation Extensions provides : Registers to hold the Virtual Interrupt CPSR.{I,A,F} bits in the GuestOS only appling to that OS Physical Interrupts are not masked by the CPSR.{I.A.F} bits GuestOS changes to I,A,F no longer need to be trapped Mechanism to route all physical interrupts to Monitor Mode Already utilized in TrustZone technology based devices Virtual Interrupts are routed to the Non-secure IRQ/FIQ/Abort Guest OS manipulates a virtualized interrupt controller Actually available in the Cortex-A9 to aid paravirtualization support for interrupts 17
18 Virtual Interrupt Controller New Virtual GIC Interface has been Architected ISR of GuestOS interacts with the virtual controller Pending and Active interrupt lists for each GuestOS Interacts with the physical GIC in hardware Creates Virtual Interrupts only when priority indicates it is necessary GuestOS ISRs therefore do not need calls for: Determining interrupt to take [Read of the Interrupt Acknowledge] Marking the end of an interrupt [Sending EOI] Changing CPU Interrupt Priority Mask [Current Priority] 18
19 Virtual GIC (Global Interrupt Controller) GIC now has separate sets of internal registers: Physical registers and virtual registers Non-virtualized system and hypervisor access the physical registers Virtual machines access the virtual registers Guest OS functionality does not change when accessing the vgic Virtual registers are remapped by hypervisor so that the Guest OS thinks it is accessing the physical registers GIC registers and functionality are identical Hypervisor can set IRQs as virtual in the HCR Interrupts are configured to generate a Hypervisor trap Hypervisor can deliver an interrupt to a CPU running a virtual process using register lists of interrupts 19
20 Resource Ownership Software-only approaches Access to resources by GuestOS intercepted by the VMM VMM interprets the GuestOS s intent Provides its own mechanism to meet that intent Mechanism of interception varies Paravirtualisation adds a hypercall to the source code Binary translation adds a hypercall to the binary Exceptions in Hardware provide an trapping of operations Hypercalls can be more efficient More of the intent to be expressed in a single VMM entry Hardware assisted approaches: Provide further indirection to resources Accelerating trapped operations by syndrome information 20
21 Helping with Virtual Devices ARM I/O handling uses memory mapped devices Reads and Writes to the Device registers have specific side-effects Creating Virtual Devices requires emulation: Typically reads/writes to devices have to trap to the VMM VMM interprets the operation and performs emulation Perfect virtualization means all possible devices loads/stores emulated Fetching and interpreting emulated load/store is performance intensive Syndrome information on aborts available for some loads/stores Syndrome unpacks key information about the instruction Source/Destination register, Size of data transfer, Size of the instruction, SignExtension etc If syndrome not available, then fetching of the instruction for emulation still required 21
22 Devices and Memory Providing address translation for devices is important Allows unmodified device drivers in the GuestOS If the device can access memory, GuestOS will program it in IPA ARM virtualisation adds option for a System MMU Enables second stage memory translations in the system A System MMU could also provide stage 1 translations Allows devices to be programmed into guest s VA space System MMU natural fit for the processor ACP port ARM defining a common programming model Intent is for the system MMU to be hardware managed using Distributed Virtual Messages found in AMBA 4 ACE 22
23 System MMU 23
24 Virtualization and TrustZone TrustZone coexists alongside a VMM Normal World App1 (EPG) App2 App1 (Flash) App2 Guest Operating System1 Guest Operating System2 Virtual Machine Monitor (VMM) or Hypervisor Secure Monitor Secure Apps Secure RTOS Secure World HARDWARE (Memory, ARM CPU, I/O Devices) TrustZone offers a specialised type of Virtualisation Only 2 Worlds not extendable (except through paravirtualization) Although VMM can also span both worlds Fourth privilege level is provided by CPU s secure monitor mode Non-symmetrical - The two Worlds are not equal Secure world can access both worlds (33bit addressing) 24
25 Spanning Hypervisor Framework ARM Non-Secure Execution Environment User Exception Level (Application Layer) A P P API A P P A P P A P P API ARM Secure Execution Environment Secured Services Domain SoC SoC Management Domains (ST/TEI) Resource API Platform Resources NoC smmu Coherence Kernel/Supervisor Exception Level Open OS & Drivers Driver (OpenMP RT) Open OS (Linux) Resource Driver RTOS/uKernel Para-virtualization with platform service API s Virtualized TZ Call Secured hw timer (tick) HYP(ervisor) Exception Level Accelerators (eg DSP / GPU) Virtualized TZ Call Open Platform Hypervisor Full Virtualization (KVM) Heterogeneous TZ Monitor Exception Level TrustZone Monitor Enabling secure and mix-criticality systems through virtualization This project in ARM is in part funded by ICT-Virtical, a European project supported under the Seventh Framework Programme (7FP) for research and technological development Project website 25
26 Flexibility Through Design Same core technology thousands of application specific solutions Built by hundreds of ARM technology licensee partners 26
27 Bridge IO Options Targeted IO Advantage of a Targeted Solution Discrete Processor System On Chip Specific Memory Types ARM business model enables efficiency of system design as In addition to processor design Targeted Memory Types One-size can not fit all applications Compromise across all important metric, compute vs memory vs IO Energy overhead from abstractions (eg PCIe) between IO and CPU ARM model supports hundreds of companies building hundreds of application optimized System on Chip solutions The challenge is ROI cost of development vs. market opportunity 27
28 COTS-on-Silicon 40+ billion cores to date 100+ billion by Approx 10B additional core in
29 Scalability Concept: Compute Unit Coherent view into memory hierarchy of this compute unit Compute Unit Local Coherent Interconnect Path to local memory addressable by this unit Path to address locations not accessible by this unit The Compute Unit unifies a number of localized compute resources Potentially both general purpose and other local heterogeneous accelerators Provides coherent and symmetric access across local resources Enabling a SMP capable operating system and resource sharing Each Compute Unit is registered at a partition within a system s global address space (GAS), including units with heterogeneous capability Any unit can access any remote location in the GAS (including cached) DMA can transfer between (virtually address cached) memory partitions 29
30 Example: Heterogeneous Compute Unit Access to remote partitions AMBA Extensions Interface (Master) Cortex-A15 Cluster CPU 0 L2 Cache CPU 2 CPU 1 CPU 3 Cortex-A7 Cluster CPU 0 L2 Cache CPU 2 CPU 1 CPU 3 Mali T600 series GPU Shader Core 0 Shader Core 1 L2 Cache SMMU Shader Core 2 Shader Core 3 Debug & Trace NIC 400 CoreSight Resources Mgt System Power JTAG & Trace PMIC/ APB Bus Cache Coherent Interconnect (CCI-400) AMBA Extensions Interface (Master) Unified coherent virtual memory DMC-400 LPDDR2/DDR3 Controller OpenCL TBA - H.S.A. NIC 400 TrustZone enabled 2012 Compute Subsystem Shared pointers Shared data Shared dispatch On-Chip Memories (RAM, ROM) Base Peripheral DDR PHY or DDR Memory Local memory partition AMBA Extensions Interface (Slave) Requests from remote partitions 30
31 EUROSERVER Vision: Started Sept 13 Local I/O and Remote Routing Cortex-A15 Cluster Cortex-A7 Cluster CPU 0 CPU 2 CPU 0 CPU 2 L2 Cache L2 Cache CPU 1 CPU 3 CPU 1 CPU 3 Cache Coherent Interconnect (CCI-400) AMBA Extensions Interface (Master) Mali T600 series GPU Shader Core 0 Shader Core 1 L2 Cache Shader Core 2 Shader Core 3 SMMU AMBA Extensions Interface (Master) Debug & Trace CoreSight Resources Mgt System Power NIC 400 JTAG & Trace PMIC/ APB Bus 1) Start with a standard scalable compute unit die Compute Node Multi-Chip-Module Compute Chiplet Active Interposer (2D SoC + TSV) Bumps TrustZone enabled Compute Chiplet Package Substrate DMC Compute Subsystem LPDDR2/DDR3 Controller DDR PHY or DDR Memory AMBA Extensions Interface (Slave) On-Chip Memories (RAM, ROM) NIC 400 Base Peripheral 2) Connect a few together along with its local memory partition Memories 3) Put a few on a board With shared virtualized IO s Consumer supply 256xCPU 4) In future, integrate into a single MCM Balls Non-Volatile Memory Compute Node (64xCPU) Compute Node Compute Node Configurable Multi- Protocol I/O Interfaces Delivering a result 100x better than today Compute Node 31
32 EUROSERVER: System Hierarchy Component Hierarchy Per hierarchy Software view Design Choice Rack Shelves 8-16 System shmem MPI Base board Chassis shelf 2-4 Clustering (supporting virtually mapped shmem) Power / cooling density Thermal density and local storage requirement Mezzanine board IO Specialized node 2-8 processor boards Virtually mapped regions, hostos Balance of IO vs communication Processor Board Daughter board 2 nodes < 100ns latency Virtual mapped regions, guestos Physical sizing Packaged 2D TSV flipchip Compute Node 2-8 Units Virtual shared memory (ccnuma) Balance compute vs communication Micro-bumped Silicon chiplet Compute Unit 4-16 CPU + GPGPU Virtual memory Coherent SMP Balance compute again memory MCM in Package Local Memory 8-32GB per unit Unified memory Primary Memory per processor 32
33 Prototype Example Configuration Processor Board (local router + remote bridge) Remote routing Up to 4x 100Gb/s optical or 8x 40Gb/s Ethernet Processor Board (local router + remote bridge) Technology Dependent 4 16 Nodes / board 40Gb/s between nodes 2Tb/s switching Node Node Node Node CPU / node Unit Unit Unit Unit Unit Unit Unit Unit 4-16GB per unit 2 48 P0 P1 Flat Global Address Space 8x CPU per unit, 4x unit per node, 8xG3 PCIe/40Gb/s per node, 9x nodes per local router Latency from unit to unit via processor board ~ 0.5 us, ability to read/write L2 cache of remote unit 33
34 Use of Virtualization IO Multiple nodes share the physical cost of utility interfaces Multiple Access Mac used to share high-speed interfaces NIC masters directly into each guests local memory partition Processing IO Node is the VMM host operating as a ccnuma machine over multiple physical compute nodes Capable of 1:1 allocation between hardware and guest Memory Any guest can be allocated memory up to the entire system image Numa cost rules help with shared memory access latencies Communication Direct processor read/write semantic between system memory Shared memory MPI with DMA for long haul transfers 34
35 Summary ARM Architecture Has evolved into a full 64bit, secure, high-performance solution capable of hardware accelerated virtualization ARM Licensee Design both the processor core and system to target the requirements of specific applications and markets The flexibility opportunity of ARM based design for HPC hasn t really started We see integration of the high-speed IO around the ARM core We see scalable connectivity being integrated on the device The full opportunities of integrated system design has yet to be realized for enterprise and HPC requirement 35
ARM the Company ARM the Research Collaborator
UMIC Day 13 ARM the Company ARM the Research Collaborator John Goodacre Director Technology and Systems Aachen 15 th October 2013 1 The ARM Vision A world where all electronic products and services are
More informationBuilding High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink. Robert Kaye
Building High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink Robert Kaye 1 Agenda Once upon a time ARM designed systems Compute trends Bringing it all together with CoreLink 400
More informationBuilding blocks for 64-bit Systems Development of System IP in ARM
Building blocks for 64-bit Systems Development of System IP in ARM Research seminar @ University of York January 2015 Stuart Kenny stuart.kenny@arm.com 1 2 64-bit Mobile Devices The Mobile Consumer Expects
More informationExploring System Coherency and Maximizing Performance of Mobile Memory Systems
Exploring System Coherency and Maximizing Performance of Mobile Memory Systems Shanghai: William Orme, Strategic Marketing Manager of SSG Beijing & Shenzhen: Mayank Sharma, Product Manager of SSG ARM Tech
More informationNext Generation Enterprise Solutions from ARM
Next Generation Enterprise Solutions from ARM Ian Forsyth Director Product Marketing Enterprise and Infrastructure Applications Processor Product Line Ian.forsyth@arm.com 1 Enterprise Trends IT is the
More informationDesigning, developing, debugging ARM Cortex-A and Cortex-M heterogeneous multi-processor systems
Designing, developing, debugging ARM and heterogeneous multi-processor systems Kinjal Dave Senior Product Manager, ARM ARM Tech Symposia India December 7 th 2016 Topics Introduction System design Software
More informationARMv8-A Software Development
ARMv8-A Software Development Course Description ARMv8-A software development is a 4 days ARM official course. The course goes into great depth and provides all necessary know-how to develop software for
More informationCortex-A15 MPCore Software Development
Cortex-A15 MPCore Software Development Course Description Cortex-A15 MPCore software development is a 4 days ARM official course. The course goes into great depth and provides all necessary know-how to
More informationLecture 5. KVM for ARM. Christoffer Dall and Jason Nieh. 5 November, Operating Systems Practical. OSP Lecture 5, KVM for ARM 1/42
Lecture 5 KVM for ARM Christoffer Dall and Jason Nieh Operating Systems Practical 5 November, 2014 OSP Lecture 5, KVM for ARM 1/42 Contents Virtualization KVM Virtualization on ARM KVM/ARM: System architecture
More informationThe Next Steps in the Evolution of Embedded Processors
The Next Steps in the Evolution of Embedded Processors Terry Kim Staff FAE, ARM Korea ARM Tech Forum Singapore July 12 th 2017 Cortex-M Processors Serving Connected Applications Energy grid Automotive
More informationMaximizing heterogeneous system performance with ARM interconnect and CCIX
Maximizing heterogeneous system performance with ARM interconnect and CCIX Neil Parris, Director of product marketing Systems and software group, ARM Teratec June 2017 Intelligent flexible cloud to enable
More informationPorting bhyve on ARM. Mihai Carabas, Peter Grehan BSDCan 2016 University of Ottawa Ottawa, Canada June 10 11, 2016
Porting bhyve on ARM Mihai Carabas, Peter Grehan {mihai,grehan}@freebsd.org BSDCan 2016 University of Ottawa Ottawa, Canada June 10 11, 2016 About me University POLITEHNICA of Bucharest PhD Student: virtualization
More informationDesigning Security & Trust into Connected Devices
Designing Security & Trust into Connected Devices Eric Wang Sr. Technical Marketing Manager Tech Symposia China 2015 November 2015 Agenda Introduction Security Foundations on ARM Cortex -M Security Foundations
More informationBig.LITTLE Processing with ARM Cortex -A15 & Cortex-A7
Big.LITTLE Processing with ARM Cortex -A15 & Cortex-A7 Improving Energy Efficiency in High-Performance Mobile Platforms Peter Greenhalgh, ARM September 2011 This paper presents the rationale and design
More informationARM big.little Technology Unleashed An Improved User Experience Delivered
ARM big.little Technology Unleashed An Improved User Experience Delivered Govind Wathan Product Specialist Cortex -A Mobile & Consumer CPU Products 1 Agenda Introduction to big.little Technology Benefits
More informationDesigning Security & Trust into Connected Devices
Designing Security & Trust into Connected Devices Rob Coombs Security Marketing Director TechCon 11/10/15 Agenda Introduction Security Foundations on Cortex-M Security Foundations on Cortex-A Use cases
More informationDesigning Security & Trust into Connected Devices
Designing Security & Trust into Connected Devices Eric Wang Senior Technical Marketing Manager Shenzhen / ARM Tech Forum / The Ritz-Carlton June 14, 2016 Agenda Introduction Security Foundations on Cortex-A
More informationUEFI ARM Update. UEFI PlugFest March 18-22, 2013 Andrew N. Sloss (ARM, Inc.) presented by
presented by UEFI ARM Update UEFI PlugFest March 18-22, 2013 Andrew N. Sloss (ARM, Inc.) Updated 2011-06-01 UEFI Spring PlugFest March 2013 www.uefi.org 1 AGENDA economics technology status summary questions
More informationCopyright 2016 Xilinx
Zynq Architecture Zynq Vivado 2015.4 Version This material exempt per Department of Commerce license exception TSU Objectives After completing this module, you will be able to: Identify the basic building
More informationSystem Virtual Machines
System Virtual Machines Outline Need and genesis of system Virtual Machines Basic concepts User Interface and Appearance State Management Resource Control Bare Metal and Hosted Virtual Machines Co-designed
More informationSystem Virtual Machines
System Virtual Machines Outline Need and genesis of system Virtual Machines Basic concepts User Interface and Appearance State Management Resource Control Bare Metal and Hosted Virtual Machines Co-designed
More information3D Graphics in Future Mobile Devices. Steve Steele, ARM
3D Graphics in Future Mobile Devices Steve Steele, ARM Market Trends Mobile Computing Market Growth Volume in millions Mobile Computing Market Trends 1600 Smart Mobile Device Shipments (Smartphones and
More informationPerformance Optimization for an ARM Cortex-A53 System Using Software Workloads and Cycle Accurate Models. Jason Andrews
Performance Optimization for an ARM Cortex-A53 System Using Software Workloads and Cycle Accurate Models Jason Andrews Agenda System Performance Analysis IP Configuration System Creation Methodology: Create,
More informationKeyStone II. CorePac Overview
KeyStone II ARM Cortex A15 CorePac Overview ARM A15 CorePac in KeyStone II Standard ARM Cortex A15 MPCore processor Cortex A15 MPCore version r2p2 Quad core, dual core, and single core variants 4096kB
More informationSilver Bullet of Virtualization. Challenges and Concerns. May 27, 2013 v1.0
Silver Bullet of Virtualization. Challenges and Concerns May 27, 2013 v1.0 Agenda Introduction / Motivation Background Use Cases / Scenarios Open Questions / Problems Q & A COGENT EMBEDDED 2 Introduction
More informationARM Processors for Embedded Applications
ARM Processors for Embedded Applications Roadmap for ARM Processors ARM Architecture Basics ARM Families AMBA Architecture 1 Current ARM Core Families ARM7: Hard cores and Soft cores Cache with MPU or
More informationThe Challenges of System Design. Raising Performance and Reducing Power Consumption
The Challenges of System Design Raising Performance and Reducing Power Consumption 1 Agenda The key challenges Visibility for software optimisation Efficiency for improved PPA 2 Product Challenge - Software
More informationRA3 - Cortex-A15 implementation
Formation Cortex-A15 implementation: This course covers Cortex-A15 high-end ARM CPU - Processeurs ARM: ARM Cores RA3 - Cortex-A15 implementation This course covers Cortex-A15 high-end ARM CPU OBJECTIVES
More informationARM CORTEX-R52. Target Audience: Engineers and technicians who develop SoCs and systems based on the ARM Cortex-R52 architecture.
ARM CORTEX-R52 Course Family: ARMv8-R Cortex-R CPU Target Audience: Engineers and technicians who develop SoCs and systems based on the ARM Cortex-R52 architecture. Duration: 4 days Prerequisites and related
More informationCortex-A75 and Cortex-A55 DynamIQ processors Powering applications from mobile to autonomous driving
Cortex-A75 and Cortex- DynamIQ processors Powering applications from mobile to autonomous driving Lionel Belnet Sr. Product Manager Arm Arm Tech Symposia 2017 Agenda Market growth and trends DynamIQ technology
More informationGetting the Most out of Advanced ARM IP. ARM Technology Symposia November 2013
Getting the Most out of Advanced ARM IP ARM Technology Symposia November 2013 Evolving System Requirements Processor Advances big.little Multicore Unicore DSP Cortex -R7 Block are now Sub-Systems Cortex
More informationARM instruction sets and CPUs for wide-ranging applications
ARM instruction sets and CPUs for wide-ranging applications Chris Turner Director, CPU technology marketing ARM Tech Forum Taipei July 4 th 2017 ARM computing is everywhere #1 shipping GPU in the world
More informationMobile & IoT Market Trends and Memory Requirements
Mobile & IoT Market Trends and Memory Requirements JEDEC Mobile & IOT Forum Ivan H. P. Lin ARM Segment Marketing Copyright ARM 2016 Outline Wearable & IoT Market Opportunities Challenges in Wearables &
More informationMobile & IoT Market Trends and Memory Requirements
Mobile & IoT Market Trends and Memory Requirements JEDEC Mobile & IOT Forum Copyright 2016 [ARM Inc.] Outline Wearable & IoT Market Opportunity Challenges in Wearables & IoT Market ARM technology tackles
More informationBuilding supercomputers from embedded technologies
http://www.montblanc-project.eu Building supercomputers from embedded technologies Alex Ramirez Barcelona Supercomputing Center Technical Coordinator This project and the research leading to these results
More informationOptimizing ARM SoC s with Carbon Performance Analysis Kits. ARM Technical Symposia, Fall 2014 Andy Ladd
Optimizing ARM SoC s with Carbon Performance Analysis Kits ARM Technical Symposia, Fall 2014 Andy Ladd Evolving System Requirements Processor Advances big.little Multicore Unicore DSP Cortex -R7 Block
More informationIntegrating CPU and GPU, The ARM Methodology. Edvard Sørgård, Senior Principal Graphics Architect, ARM Ian Rickards, Senior Product Manager, ARM
Integrating CPU and GPU, The ARM Methodology Edvard Sørgård, Senior Principal Graphics Architect, ARM Ian Rickards, Senior Product Manager, ARM The ARM Business Model Global leader in the development of
More informationModeling Performance Use Cases with Traffic Profiles Over ARM AMBA Interfaces
Modeling Performance Use Cases with Traffic Profiles Over ARM AMBA Interfaces Li Chen, Staff AE Cadence China Agenda Performance Challenges Current Approaches Traffic Profiles Intro Traffic Profiles Implementation
More informationProfiling and Debugging OpenCL Applications with ARM Development Tools. October 2014
Profiling and Debugging OpenCL Applications with ARM Development Tools October 2014 1 Agenda 1. Introduction to GPU Compute 2. ARM Development Solutions 3. Mali GPU Architecture 4. Using ARM DS-5 Streamline
More informationEC H2020 dredbox: Seminar School at INSA Rennes
EC H2020 dredbox: Seminar School at INSA Rennes contact@virtualopensystems.com www.virtualopensystems.com Pierre LUCAS 2017-11-22 Open Part 1: Open Company Overview 2 OpenOpen Confidential & Proprietary
More informationBeyond Hardware IP An overview of Arm development solutions
Beyond Hardware IP An overview of Arm development solutions 2018 Arm Limited Arm Technical Symposia 2018 Advanced first design cost (US$ million) IC design complexity and cost aren t slowing down 542.2
More informationThe ARM Cortex-A9 Processors
The ARM Cortex-A9 Processors This whitepaper describes the details of the latest high performance processor design within the common ARM Cortex applications profile ARM Cortex-A9 MPCore processor: A multicore
More informationMobile & IoT Market Trends and Memory Requirements
Mobile & IoT Market Trends and Memory Requirements JEDEC Mobile & IOT Forum Daniel Heo ARM Segment Marketing Copyright ARM 2016 Outline Wearable & IoT Market Opportunities Challenges in Wearables & IoT
More informationEach Milliwatt Matters
Each Milliwatt Matters Ultra High Efficiency Application Processors Govind Wathan Product Manager, CPG ARM Tech Symposia China 2015 November 2015 Ultra High Efficiency Processors Used in Diverse Markets
More informationNested Virtualization and Server Consolidation
Nested Virtualization and Server Consolidation Vara Varavithya Department of Electrical Engineering, KMUTNB varavithya@gmail.com 1 Outline Virtualization & Background Nested Virtualization Hybrid-Nested
More informationHETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE
HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE Haibo Xie, Ph.D. Chief HSA Evangelist AMD China OUTLINE: The Challenges with Computing Today Introducing Heterogeneous System Architecture (HSA)
More informationOn-chip Networks Enable the Dark Silicon Advantage. Drew Wingard CTO & Co-founder Sonics, Inc.
On-chip Networks Enable the Dark Silicon Advantage Drew Wingard CTO & Co-founder Sonics, Inc. Agenda Sonics history and corporate summary Power challenges in advanced SoCs General power management techniques
More informationBUD17-301: KVM/ARM Nested Virtualization. Christoffer Dall
BUD17-301: KVM/ARM Nested Virtualization Christoffer Dall Nested Virtualization VM VM VM App App App App App VM App Hypervisor Hypervisor Hardware Terminology Nested VM VM Nested VM L2 App App App App
More informationKVM/ARM. Marc Zyngier LPC 12
KVM/ARM Marc Zyngier LPC 12 For example: if a processor is in Supervisor mode and Secure state, it is in Secure Supervisor mode ARM Architecture if a processor is Virtualization
More informationVirtualization, Xen and Denali
Virtualization, Xen and Denali Susmit Shannigrahi November 9, 2011 Susmit Shannigrahi () Virtualization, Xen and Denali November 9, 2011 1 / 70 Introduction Virtualization is the technology to allow two
More informationG Disco. Robert Grimm New York University
G22.3250-001 Disco Robert Grimm New York University The Three Questions! What is the problem?! What is new or different?! What are the contributions and limitations? Background: ccnuma! Cache-coherent
More informationEffective System Design with ARM System IP
Effective System Design with ARM System IP Mentor Technical Forum 2009 Serge Poublan Product Marketing Manager ARM 1 Higher level of integration WiFi Platform OS Graphic 13 days standby Bluetooth MP3 Camera
More informationNegotiating the Maze Getting the most out of memory systems today and tomorrow. Robert Kaye
Negotiating the Maze Getting the most out of memory systems today and tomorrow Robert Kaye 1 System on Chip Memory Systems Systems use external memory Large address space Low cost-per-bit Large interface
More informationA Secure and Connected Intelligent Future. Ian Smythe Senior Director Marketing, Client Business Arm Tech Symposia 2017
A Secure and Connected Intelligent Future 1 2017 Arm Copyright Limited Arm 2017 Ian Smythe Senior Director Marketing, Client Business Arm Tech Symposia 2017 Arm: The Industry s Architecture of Choice 50
More informationVirtualization. Pradipta De
Virtualization Pradipta De pradipta.de@sunykorea.ac.kr Today s Topic Virtualization Basics System Virtualization Techniques CSE506: Ext Filesystem 2 Virtualization? A virtual machine (VM) is an emulation
More informationToward a Memory-centric Architecture
Toward a Memory-centric Architecture Martin Fink EVP & Chief Technology Officer Western Digital Corporation August 8, 2017 1 SAFE HARBOR DISCLAIMERS Forward-Looking Statements This presentation contains
More informationModule 1: Virtualization. Types of Interfaces
Module 1: Virtualization Virtualization: extend or replace an existing interface to mimic the behavior of another system. Introduced in 1970s: run legacy software on newer mainframe hardware Handle platform
More informationHow Might Recently Formed System Interconnect Consortia Affect PM? Doug Voigt, SNIA TC
How Might Recently Formed System Interconnect Consortia Affect PM? Doug Voigt, SNIA TC Three Consortia Formed in Oct 2016 Gen-Z Open CAPI CCIX complex to rack scale memory fabric Cache coherent accelerator
More informationMulti-core microcontroller design with Cortex-M processors and CoreSight SoC
Multi-core microcontroller design with Cortex-M processors and CoreSight SoC Joseph Yiu, ARM Ian Johnson, ARM January 2013 Abstract: While the majority of Cortex -M processor-based microcontrollers are
More informationCMP Conference 20 th January Director of Business Development EMEA
CMP Conference 20 th January 2011 eric.lalardie@arm.com Director of Business Development EMEA +33 6 07 83 09 60 1 1 Unparalleled Applicability ARM Cortex Advanced Processors Architectural innovation, compatibility
More informationRapidIO.org Update. Mar RapidIO.org 1
RapidIO.org Update rickoco@rapidio.org Mar 2015 2015 RapidIO.org 1 Outline RapidIO Overview & Markets Data Center & HPC Communications Infrastructure Industrial Automation Military & Aerospace RapidIO.org
More informationThe Architecture of Virtual Machines Lecture for the Embedded Systems Course CSD, University of Crete (April 29, 2014)
The Architecture of Virtual Machines Lecture for the Embedded Systems Course CSD, University of Crete (April 29, 2014) ManolisMarazakis (maraz@ics.forth.gr) Institute of Computer Science (ICS) Foundation
More informationCortex-A75 and Cortex-A55 DynamIQ processors Powering applications from mobile to autonomous driving
Cortex-A75 and Cortex-A55 DynamIQ processors Powering applications from mobile to autonomous driving Stefan Rosinger Director, Product Management Arm Arm TechCon 2017 Agenda Market growth and trends DynamIQ
More informationCortex-A9 MPCore Software Development
Cortex-A9 MPCore Software Development Course Description Cortex-A9 MPCore software development is a 4 days ARM official course. The course goes into great depth and provides all necessary know-how to develop
More informationARM processors driving automotive innovation
ARM processors driving automotive innovation Chris Turner Director of advanced technology marketing, CPU group ARM tech forums, Seoul and Taipei June/July 2016 The ultimate intelligent connected device
More informationNested Virtualization on ARM
Nested Virtualization on ARM NEVE: Nested Virtualization Extensions Jin Tack Lim Christoffer Dall Shih-Wei Li Jason Nieh Marc Zyngier LEADING COLLABORATION IN THE ARM ECOSYSTEM jitack@cs.columbia.edu christoffer.dall@linaro.org
More informationThe ARM Cortex-M0 Processor Architecture Part-1
The ARM Cortex-M0 Processor Architecture Part-1 1 Module Syllabus ARM Architectures and Processors What is ARM Architecture ARM Processors Families ARM Cortex-M Series Family Cortex-M0 Processor ARM Processor
More informationComputer-System Organization (cont.)
Computer-System Organization (cont.) Interrupt time line for a single process doing output. Interrupts are an important part of a computer architecture. Each computer design has its own interrupt mechanism,
More informationI/O and virtualization
I/O and virtualization CSE-C3200 Operating systems Autumn 2015 (I), Lecture 8 Vesa Hirvisalo Today I/O management Control of I/O Data transfers, DMA (Direct Memory Access) Buffering Single buffering Double
More informationRapidIO.org Update.
RapidIO.org Update rickoco@rapidio.org June 2015 2015 RapidIO.org 1 Outline RapidIO Overview Benefits Interconnect Comparison Ecosystem System Challenges RapidIO Markets Data Center & HPC Communications
More informationThe Cortex-A15 Verification Story
The Cortex-A15 Verification Story Bill Greene Micah McDaniel December 7, 2011 1 2 WHAT IS CORTEX-A15? Cortex-A15: Next Generation Leadership Cortex-A class multi-processor 40bit physical addressing (1TB)
More informationThe Challenges of X86 Hardware Virtualization. GCC- Virtualization: Rajeev Wankar 36
The Challenges of X86 Hardware Virtualization GCC- Virtualization: Rajeev Wankar 36 The Challenges of X86 Hardware Virtualization X86 operating systems are designed to run directly on the bare-metal hardware,
More informationChapter 5. Introduction ARM Cortex series
Chapter 5 Introduction ARM Cortex series 5.1 ARM Cortex series variants 5.2 ARM Cortex A series 5.3 ARM Cortex R series 5.4 ARM Cortex M series 5.5 Comparison of Cortex M series with 8/16 bit MCUs 51 5.1
More informationHSA foundation! Advanced Topics on Heterogeneous System Architectures. Politecnico di Milano! Seminar Room A. Alario! 23 November, 2015!
Advanced Topics on Heterogeneous System Architectures HSA foundation! Politecnico di Milano! Seminar Room A. Alario! 23 November, 2015! Antonio R. Miele! Marco D. Santambrogio! Politecnico di Milano! 2
More informationThe Next Steps in the Evolution of ARM Cortex-M
The Next Steps in the Evolution of ARM Cortex-M Joseph Yiu Senior Embedded Technology Manager CPU Group ARM Tech Symposia China 2015 November 2015 Trust & Device Integrity from Sensor to Server 2 ARM 2015
More informationAnalyzing and Debugging Performance Issues with Advanced ARM CoreLink System IP Components
Analyzing and Debugging Performance Issues with Advanced ARM CoreLink System IP Components By William Orme, Strategic Marketing Manager, ARM Ltd. and Nick Heaton, Senior Solutions Architect, Cadence Finding
More informationThe Bifrost GPU architecture and the ARM Mali-G71 GPU
The Bifrost GPU architecture and the ARM Mali-G71 GPU Jem Davies ARM Fellow and VP of Technology Hot Chips 28 Aug 2016 Introduction to ARM Soft IP ARM licenses Soft IP cores (amongst other things) to our
More informationHETEROGENOUS COMPUTE IN A QUAD CORE CPU
HETEROGENOUS COMPUTE IN A QUAD CORE CPU Cyril Jean Director Embedded Systems Solutions Microsemi, a Microchip Company https://tmt.knect365.com/risc-v-summit @risc_v, a wholly owned subsidiary of Microchip
More informationA unified multicore programming model
A unified multicore programming model Simplifying multicore migration By Sven Brehmer Abstract There are a number of different multicore architectures and programming models available, making it challenging
More informationChapter 1 Computer System Overview
Operating Systems: Internals and Design Principles Chapter 1 Computer System Overview Ninth Edition By William Stallings Operating System Exploits the hardware resources of one or more processors Provides
More informationCortex-A15 MPCore Software Development
Cortex-A15 MPCore Software Development תיאור הקורס קורסDevelopment Cortex-A15 MPCore Software הינו הקורסהרשמי שלחברת ARM בן 4 ימים, מעמיקמאודומכסהאתכלהנושאיםהקשוריםבפיתוחתוכנה לפלטפורמותמבוססותליבתMPCore.Cortex-A15
More informationCSC 5930/9010 Cloud S & P: Virtualization
CSC 5930/9010 Cloud S & P: Virtualization Professor Henry Carter Fall 2016 Recap Network traffic can be encrypted at different layers depending on application needs TLS: transport layer IPsec: network
More informationFour Components of a Computer System
Four Components of a Computer System Operating System Concepts Essentials 2nd Edition 1.1 Silberschatz, Galvin and Gagne 2013 Operating System Definition OS is a resource allocator Manages all resources
More informationBringing Intelligence to Enterprise Storage Drives
Bringing Intelligence to Enterprise Storage Drives Neil Werdmuller Director Storage Solutions Arm Santa Clara, CA 1 Who am I? 28 years experience in embedded Lead the storage solutions team Work closely
More informationCSE 120 Principles of Operating Systems
CSE 120 Principles of Operating Systems Spring 2018 Lecture 16: Virtual Machine Monitors Geoffrey M. Voelker Virtual Machine Monitors 2 Virtual Machine Monitors Virtual Machine Monitors (VMMs) are a hot
More informationChapter 5 C. Virtual machines
Chapter 5 C Virtual machines Virtual Machines Host computer emulates guest operating system and machine resources Improved isolation of multiple guests Avoids security and reliability problems Aids sharing
More informationHardware Software Bring-Up Solutions for ARM v7/v8-based Designs. August 2015
Hardware Software Bring-Up Solutions for ARM v7/v8-based Designs August 2015 SPMI USB 2.0 SLIMbus RFFE LPDDR 2 LPDDR 3 emmc 4.5 UFS SD 3.0 SD 4.0 UFS Bare Metal Software DSP Software Bare Metal Software
More informationOptimizing Cache Coherent Subsystem Architecture for Heterogeneous Multicore SoCs
Optimizing Cache Coherent Subsystem Architecture for Heterogeneous Multicore SoCs Niu Feng Technical Specialist, ARM Tech Symposia 2016 Agenda Introduction Challenges: Optimizing cache coherent subsystem
More informationARMv8: The Next Generation. Minlin Fan & Zenon Xiu December 8, 2015
ARMv8: The Next Generation Minlin Fan & Zenon Xiu December 8, 2015 1 Introducing Ourselves Minlin Fan Application Engineering Manager Zenon Xiu Application Engineering Software Team Lead 2 ARM Partner
More informationCS 152 Computer Architecture and Engineering
CS 152 Computer Architecture and Engineering Lecture 12 -- Virtual Memory 2014-2-27 John Lazzaro (not a prof - John is always OK) TA: Eric Love www-inst.eecs.berkeley.edu/~cs152/ Play: CS 152 L12: Virtual
More informationSoC Platforms and CPU Cores
SoC Platforms and CPU Cores COE838: Systems on Chip Design http://www.ee.ryerson.ca/~courses/coe838/ Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrical and Computer Engineering Ryerson University
More informationVirtualization. Operating Systems, 2016, Meni Adler, Danny Hendler & Amnon Meisels
Virtualization Operating Systems, 2016, Meni Adler, Danny Hendler & Amnon Meisels 1 What is virtualization? Creating a virtual version of something o Hardware, operating system, application, network, memory,
More informationBringing Intelligence to Enterprise Storage Drives
Bringing Intelligence to Enterprise Storage Drives Neil Werdmuller Director Storage Solutions Arm Santa Clara, CA 1 Who am I? 28 years experience in embedded Lead the storage solutions team Work closely
More informationHSA Foundation! Advanced Topics on Heterogeneous System Architectures. Politecnico di Milano! Seminar Room (Bld 20)! 15 December, 2017!
Advanced Topics on Heterogeneous System Architectures HSA Foundation! Politecnico di Milano! Seminar Room (Bld 20)! 15 December, 2017! Antonio R. Miele! Marco D. Santambrogio! Politecnico di Milano! 2
More information6.9. Communicating to the Outside World: Cluster Networking
6.9 Communicating to the Outside World: Cluster Networking This online section describes the networking hardware and software used to connect the nodes of cluster together. As there are whole books and
More information«Real Time Embedded systems» Multi Masters Systems
«Real Time Embedded systems» Multi Masters Systems rene.beuchat@epfl.ch LAP/ISIM/IC/EPFL Chargé de cours rene.beuchat@hesge.ch LSN/hepia Prof. HES 1 Multi Master on Chip On a System On Chip, Master can
More informationVeloce2 the Enterprise Verification Platform. Simon Chen Emulation Business Development Director Mentor Graphics
Veloce2 the Enterprise Verification Platform Simon Chen Emulation Business Development Director Mentor Graphics Agenda Emulation Use Modes Veloce Overview ARM case study Conclusion 2 Veloce Emulation Use
More informationGen-Z Memory-Driven Computing
Gen-Z Memory-Driven Computing Our vision for the future of computing Patrick Demichel Distinguished Technologist Explosive growth of data More Data Need answers FAST! Value of Analyzed Data 2005 0.1ZB
More informationCortex-A5 MPCore Software Development
Cortex-A5 MPCore Software Development תיאורהקורס קורסDevelopment Cortex-A5 MPCore Software הינו הקורס הרשמי שלחברת ARM בן 4 ימים, מעמיקמאודומכסהאתכלהנושאיםהקשוריםבפיתוחתוכנה לפלטפורמותמבוססותליבת.Cortex-A5
More informationNested Virtualizationon ARM
Nested Virtualizationon ARM NEVE: Nested Virtualization Extensions Jin Tack Lim Christoffer Dall Shih-Wei Li Jason Nieh Marc Zyngier LEADING COLLABORATION IN THE ARM ECOSYSTEM jitack@cs.columbia.edu christoffer.dall@linaro.org
More information