Jeff Stuecheli, PhD IBM Power Systems IBM Systems & Technology Group Development International Business Machines Corporation

Transcription:

Jeff Stuecheli, PhD
IBM Power Systems
IBM Systems & Technology Group Development
2013 International Business Machines Corporation

                     POWER5      POWER6      POWER7           POWER7+
                     2004        2007        2010             2012
Technology           130nm SOI   65nm SOI    45nm SOI eDRAM   32nm SOI eDRAM
Compute
  Cores              2           2           8                8
  Threads            SMT2        SMT2        SMT4             SMT4
Caching
  On-chip            1.9MB       8MB         2 + 32MB         2 + 80MB
  Off-chip           36MB        32MB        None             None
Die Bandwidth
  Sust. Mem.         15GB/s      30GB/s      100GB/s          100GB/s
  Peak I/O           6GB/s       20GB/s      40GB/s           40GB/s

POWER8: Today's Topic

Leadership Performance
  Increase core throughput at single thread, SMT2, SMT4, and SMT8 level
  Large step in per-socket performance
  Enable more robust multisocket scaling
System Innovation
  Higher capacity cache hierarchy and highly threaded processor
  Enhanced memory bandwidth, capacity, and expansion
  Dynamic code optimization
  Hardware-accelerated virtual memory management
Open System Innovation
  Coherent Accelerator Processor Interface (CAPI)
  Agnostic memory interface
  Open system software

Cores
  12 cores (SMT8)
  8 dispatch, 10 issue, 16 exec pipe
  2X internal data flows/queues
  Enhanced prefetching
  64K data cache, 32K instruction cache
Accelerators
  Crypto & memory expansion
  Transactional Memory
  VMM assist
  Data Move / VM Mobility
Technology
  22nm SOI, eDRAM, 15 ML, 650 mm²
Energy Management
  On-chip Power Management Micro-controller
  Integrated Per-core VRM
  Critical Path Monitors
Caches
  512 KB SRAM L2 / core
  96 MB eDRAM shared L3
  Up to 128 MB eDRAM L4 (off-chip)
Memory per Die
  Up to 230 GB/s sustained bandwidth
Die Bus Interfaces
  Durable open memory attach interface
  Integrated PCIe Gen3
  SMP Interconnect
  CAPI (Coherent Accelerator Processor Interface)

[Die photo labels: Mem. Ctrl., 8M L3 Region, SMP Links, Accelerators, L3 Cache & Chip Interconnect, PCIe]

Execution Improvement vs. POWER7
  SMT4 -> SMT8
  8 dispatch
  10 issue
  16 execution pipes: 2 FXU, 2 LSU, 2 LU, 4 FPU, 2 VMX, 1 Crypto, 1 DFU, 1 CR, 1 BR
  Larger issue queues (4 x 16-entry)
  Larger global completion, Load/Store reorder
  Improved branch prediction
  Improved unaligned storage access
Larger Caching Structures vs. POWER7
  2x L1 data cache (64 KB)
  2x outstanding data cache misses
  4x translation cache
Wider Load/Store
  32B -> 64B L2 to L1 data bus
  2x data cache to execution dataflow
Enhanced Prefetch
  Instruction speculation awareness
  Data prefetch depth awareness
  Adaptive bandwidth awareness
  Topology awareness
Performance vs. POWER7
  ~1.6x Single Thread
  ~2x Max SMT
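As a quick consistency check on the slide's pipe breakdown, the per-unit counts can be summed and compared with the stated total of 16 execution pipes. This is only an illustrative sketch; the dictionary name is made up for the example, and the counts are copied from the slide text.

```python
# Execution-pipe counts exactly as listed on the slide.
# The name `pipes` is a hypothetical label used only for this sketch.
pipes = {
    "FXU": 2,     # fixed-point units
    "LSU": 2,     # load/store units
    "LU": 2,      # load units
    "FPU": 4,     # floating-point units
    "VMX": 2,     # vector units
    "Crypto": 1,
    "DFU": 1,     # decimal floating-point unit
    "CR": 1,      # condition-register unit
    "BR": 1,      # branch unit
}

total_pipes = sum(pipes.values())
print(f"total execution pipes: {total_pipes}")
```

The sum agrees with the "16 execution pipes" figure quoted on the slide.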

[Figure labels: IFU, ISU, FXU, LSU, DFU, VSU]

L2: 512 KB 8 way per core
L3: 96 MB (12 x 8 MB 8 way Bank)
NUCA Cache policy (Non-Uniform Cache Architecture)
  Scalable bandwidth and latency
  Migrate hot lines to local L2, then local L3 (replicate L2 contained footprint)
Chip Interconnect: 150 GB/sec x 12 segments per direction = 3.6 TB/sec
[Figure labels: twelve L3 banks arranged around the chip interconnect, with Memory, SMP, Acc, and PCIe blocks]
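The capacity and interconnect figures on this slide can be reproduced with simple arithmetic. The sketch below assumes that the 512 KB per-core cache is the L2, and that the 3.6 TB/sec total counts two directions (150 GB/sec x 12 segments gives 1.8 TB/sec for a single direction). All variable names are illustrative.

```python
CORES = 12

# Cache capacity: 512 KB per core (taken here to be the L2) and 8 MB L3 per bank.
L2_KB_PER_CORE = 512
L3_MB_PER_BANK = 8
l2_total_mb = CORES * L2_KB_PER_CORE / 1024   # total L2 across the chip, in MB
l3_total_mb = CORES * L3_MB_PER_BANK          # total L3 across the chip, in MB

# Chip interconnect: 150 GB/sec per segment, 12 segments per direction.
SEGMENT_GB_PER_S = 150
SEGMENTS_PER_DIRECTION = 12
DIRECTIONS = 2  # assumption: the slide's 3.6 TB/sec sums both directions

per_direction_gb_s = SEGMENT_GB_PER_S * SEGMENTS_PER_DIRECTION
total_tb_s = per_direction_gb_s * DIRECTIONS / 1000

print(f"L2 total: {l2_total_mb} MB, L3 total: {l3_total_mb} MB")
print(f"interconnect: {per_direction_gb_s} GB/s per direction, {total_tb_s} TB/s total")
```

Under these assumptions the totals come out to 6 MB of L2, 96 MB of L3, and 3.6 TB/sec of interconnect bandwidth, which matches the slide.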

[Cache bandwidth figure; link labels in GB/sec: 256, 64, 128, 128, 128, 64]
GB/sec shown assuming 4 GHz
Product frequency will vary based on model type
Across 12 core chip
  4 TB/sec L2 BW
  3 TB/sec L3 BW
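The per-link figures follow from bytes per cycle multiplied by clock frequency. The sketch below is a hedged reconstruction: the link widths (64, 16, and 32 bytes per cycle) are inferred by dividing the slide's GB/sec labels by the assumed 4 GHz clock, and the assignment of labels to specific links is an assumption, since the figure itself did not survive in the transcript.

```python
FREQ_GHZ = 4.0   # the slide's stated assumption
CORES = 12

def gb_per_s(bytes_per_cycle, freq_ghz=FREQ_GHZ):
    """Bandwidth in GB/s for a link moving `bytes_per_cycle` each clock."""
    return bytes_per_cycle * freq_ghz

# Assumed link widths in bytes per cycle (inferred, not stated in the source).
l2_links = {"l2_to_l1_load": 64, "l1_to_l2_store": 16}   # 256 and 64 GB/s
l3_links = {"l3_to_l2": 32, "l2_to_l3": 32}              # 128 and 128 GB/s

per_core_l2 = sum(gb_per_s(w) for w in l2_links.values())
per_core_l3 = sum(gb_per_s(w) for w in l3_links.values())

chip_l2_tb_s = CORES * per_core_l2 / 1000
chip_l3_tb_s = CORES * per_core_l3 / 1000

print(f"per core: L2 {per_core_l2} GB/s, L3 {per_core_l3} GB/s")
print(f"chip:     L2 {chip_l2_tb_s:.2f} TB/s, L3 {chip_l3_tb_s:.2f} TB/s")
```

With these assumed widths the chip-level totals are about 3.84 TB/sec for L2 and 3.07 TB/sec for L3, consistent with the rounded 4 TB/sec and 3 TB/sec on the slide. The remaining labels in the figure (a further 128 and 64) are not modeled here.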

[Figure labels: POWER8 Processor, Memory, DIMM, Form factors]
Up to 4 high speed channels, each running up to 9.6 Gb/s
Up to 16 total DDR ports

with 16MB of Cache
[Figure labels: DRAM Chips, Memory Buffer, DDR Interfaces, Scheduler & Management, 16MB Memory Cache, POWER8 Link]
Intelligence Moved into Memory
  Scheduling logic, caching structures
  Energy Mgmt, RAS decision point
  Formerly on Processor
  Moved to Memory Buffer
Processor Interface
  9.6 GB/s high speed interface
  More robust RAS
  On-the-fly lane isolation/repair
  Extensible for innovation build-out
Performance Value
  End-to-end fastpath and data retry (latency)
  Cache latency/bandwidth, partial updates
  Cache write scheduling, prefetch, energy
  22nm SOI for optimal performance / energy
  15 metal levels (latency, bandwidth)

Native PCIe Gen 3 Support
  Direct processor integration
  Replaces proprietary GX/Bridge
  Low latency
  Gen3 x16 bandwidth (16 GB/s)
Transport Layer for CAPI Protocol
  Coherently attached devices connect to processor via PCIe
  Protocol encapsulated in PCIe
[Figure labels: POWER7, GX Bus, I/O Bridge, PCIe G2, PCI Device; POWER8, PCIe G3, PCI Device]

Virtual Addressing
  Accelerator can work with same memory addresses that the processors use
  Pointers de-referenced same as the host application
  Removes OS & device driver overhead
Hardware Managed Cache Coherence
  Enables the accelerator to participate in locks as a normal thread
  Lowers latency over IO communication model
Customizable Hardware Application Accelerator
  Specific system SW, middleware, or user application
  Written to durable interface provided by PSL
PCIe Gen 3
  Transport for encapsulated messages
Processor Service Layer (PSL)
  Present robust, durable interfaces to applications
  Offload complexity / content from CAPP
[Figure labels: POWER8, Coherence Bus, CAPP, PSL, Custom Hardware Application, FPGA or ASIC]

                     POWER5      POWER6      POWER7           POWER7+          POWER8
                     2004        2007        2010             2012
Technology           130nm SOI   65nm SOI    45nm SOI eDRAM   32nm SOI eDRAM   22nm SOI eDRAM
Compute
  Cores              2           2           8                8                12
  Threads            SMT2        SMT2        SMT4             SMT4             SMT8
Caching
  On-chip            1.9MB       8MB         2 + 32MB         2 + 80MB         6 + 96MB
  Off-chip           36MB        32MB        None             None             128MB
Die Bandwidth
  Sust. Mem.         15GB/s      30GB/s      100GB/s          100GB/s          Up to 230GB/s
  Peak I/O           6GB/s       20GB/s      40GB/s           40GB/s           Up to 96GB/s

Systems Design Solutions a New Conversation
Industry Solutions
Business & Predictive Analytics
Cognitive Computing
IBM Watson
Stream Computing
Real-time Analytics
Natural Language Learning
Continuous data load
Massive IO bandwidth
Parallel processing
1,000+ Concurrent Queries
Large-scale memory processing
Open & flexible infrastructure
Available on premise or through the Cloud

POWER8 Differentiation for Analytics
  Massive capacity and bandwidth to memory and IO
  Large caches with massive bandwidth
  Strong single thread
  SMT8, many threads to hide memory latency
    Graph traversals
  Transactional memory enables efficient thread scaling
  CAPI accelerators
    Enables heterogeneous compute (GPU, FPGA, etc.)
Big Data, Analytics, Cognitive Computing
Synergy with IBM Software, Driving Optimization Across the Stack

giving ecosystem partners a license to innovate
[Logos: Mellanox, TYAN, IBM, Google, NVIDIA; OpenPOWER, Open Innovation]
OpenPOWER will enable hyper-scale cloud data centers to rethink their approach to technology.
Member companies will use POWER for custom open servers and components for Linux-based cloud data centers.
For the first time, OpenPOWER ecosystem partners can optimize the interactions of server building blocks (microprocessors, networking, I/O & other components) to tune performance.

Significant Performance at Thread, Core, and System
Optimization for VM Density & Efficiency
Strong Enablement of Autonomic System Optimization
Excellent Big Data Analytics Capability

Thank You!

Special notices
This document was developed for IBM offerings in the United States as of the date of publication. IBM may not make these offerings available in other countries, and the information is subject to change without notice. Consult your local IBM business contact for information on the IBM offerings available in your area. Information in this document concerning non-IBM products was obtained from the suppliers of these products or other public sources. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. IBM may have patents or pending patent applications covering subject matter in this document. The furnishing of this document does not give you any license to these patents. Send license inquiries, in writing, to IBM Director of Licensing, IBM Corporation, New Castle Drive, Armonk, NY 10504-1785 USA. All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only. The information contained in this document has not been submitted to any formal IBM test and is provided "AS IS" with no warranties or guarantees either expressed or implied. All examples cited or described in this document are presented as illustrations of the manner in which some IBM products can be used and the results that may be achieved. Actual environmental costs and performance characteristics will vary depending on individual client configurations and conditions. IBM Global Financing offerings are provided through IBM Credit Corporation in the United States and other IBM subsidiaries and divisions worldwide to qualified commercial and government clients. Rates are based on a client's credit rating, financing terms, offering type, equipment type and options, and may vary by country. Other restrictions may apply. Rates and offerings are subject to change, extension or withdrawal without notice.
IBM is not responsible for printing errors in this document that result in pricing or information inaccuracies. IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply. Any performance data contained in this document was determined in a controlled environment. Actual results may vary significantly and are dependent on many factors including system hardware configuration and software design and configuration. Some measurements quoted in this document may have been made on development-level systems. There is no guarantee these measurements will be the same on generally available systems. Some measurements quoted in this document may have been estimated through extrapolation. Users of this document should verify the applicable data for their specific environment.

Special notices (cont.)
IBM, the IBM logo, ibm.com, AIX, AIX (logo), AIX 5L, AIX 6 (logo), AS/400, BladeCenter, Blue Gene, ClusterProven, DB2, ESCON, i5/OS, i5/OS (logo), IBM Business Partner (logo), IntelliStation, LoadLeveler, Lotus, Lotus Notes, Notes, Operating System/400, OS/400, PartnerLink, PartnerWorld, PowerPC, pSeries, Rational, RISC System/6000, RS/6000, THINK, Tivoli, Tivoli (logo), Tivoli Management Environment, WebSphere, xSeries, z/OS, zSeries, Active Memory, Balanced Warehouse, CacheFlow, Cool Blue, IBM Watson, IBM Systems Director VMControl, pureScale, Turbo, Chiphopper, Cloudscape, DB2 Universal Database, DS4000, DS6000, DS8000, EnergyScale, Enterprise Workload Manager, General Parallel File System, GPFS, HACMP, HACMP/6000, HASM, IBM Systems Director Active Energy Manager, iSeries, Micro-Partitioning, POWER, PowerLinux, PowerExecutive, PowerVM, PowerVM (logo), PowerHA, Power Architecture, Power Everywhere, Power Family, POWER Hypervisor, Power Systems, Power Systems (logo), Power Systems Software, Power Systems Software (logo), POWER2, POWER3, POWER4, POWER4+, POWER5, POWER5+, POWER6, POWER6+, POWER7, POWER7+, Systems, System i, System p, System p5, System Storage, System z, TME 10, Workload Partitions Manager and X-Architecture are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was published. Such trademarks may also be registered or common law trademarks in other countries. A full list of U.S. trademarks owned by IBM may be found at: http://www.ibm.com/legal/copytrade.shtml.
Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, and/or other countries. AltiVec is a trademark of Freescale Semiconductor, Inc. AMD Opteron is a trademark of Advanced Micro Devices, Inc. InfiniBand, InfiniBand Trade Association and the InfiniBand design marks are trademarks and/or service marks of the InfiniBand Trade Association. Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. IT Infrastructure Library is a registered trademark of the Central Computer and Telecommunications Agency which is now part of the Office of Government Commerce. Java and all Java-based trademarks and logos are trademarks or registered trademarks of Oracle and/or its affiliates. Linear Tape-Open, LTO, the LTO Logo, Ultrium, and the Ultrium logo are trademarks of HP, IBM Corp. and Quantum in the U.S. and other countries. Linux is a registered trademark of Linus Torvalds in the United States, other countries or both. PowerLinux uses the registered trademark Linux pursuant to a sublicense from LMI, the exclusive licensee of Linus Torvalds, owner of the Linux mark on a worldwide basis. Microsoft, Windows and the Windows logo are registered trademarks of Microsoft Corporation in the United States, other countries or both. NetBench is a registered trademark of Ziff Davis Media in the United States, other countries or both. SPECint, SPECfp, SPECjbb, SPECweb, SPECjAppServer, SPEC OMP, SPECviewperf, SPECapc, SPEChpc, SPECjvm, SPECmail, SPECimap and SPECsfs are trademarks of the Standard Performance Evaluation Corp (SPEC). The Power Architecture and Power.org wordmarks and the Power and Power.org logos and related marks are trademarks and service marks licensed by Power.org. 
TPC-C and TPC-H are trademarks of the Transaction Performance Processing Council (TPPC). UNIX is a registered trademark of The Open Group in the United States, other countries or both. Other company, product and service names may be trademarks or service marks of others.