Online Webinar. June 16, Intel SDK for opencl applications. Intel Media SDK

Size: px
Start display at page:

Download "Online Webinar. June 16, Intel SDK for opencl applications. Intel Media SDK"

Transcription

1 Online Webinar June 16, 2016 Intel Media SDK Intel SDK for opencl applications

2 Welcome! Meet our Experts Jeff McAllister Media Software Senior Technical Consulting Engineer Developer Products Division Robert Ioffe OpenCL Software Technical Consulting Engineer Developer Products Division Intel Software & Services Group 2 2

3 What We ll Cover Today Introduction Intel Processor Graphics -the Other Side of the Chip Programming Model Examples Intel OpenCL & Media SDKs Better Together Q&A 3 3

4 Take advantage of advanced media accelerators, graphics processing & programmable graphics

5 Intel CPUs + Intel Processor Graphics (GPUs): A Winning Combination Intel HD Graphics 6 th Gen Intel Core Processor If you re only targeting CPUs, you re not fully utilizing the chip See Technical Specifications for hardware requirements & other details. Not shown Intel Xeon Processor Graphics 5

6 Hardware Overview Graphics Technology Highlights Execution Units (EUs) = general purpose cores EUs, samplers, caches, etc. in slices Fixed function is in unslice edram adds cache, increases bandwidth Naming Convention Intel HD Graphics Intel Iris Graphics Intel Iris Pro Graphics 24 EUs 1x FF 48 EUs 2x FF edram 72 EUs 2x FF edram Other Names GT2 4+2 GT3 2+3e GT3e,GT4e 4+4e Summary Good Better Best Fixed Function (VDBox, VEBox) GT2 GT4 GT3 6 Look for Intel Processor Graphics at ark.intel.com 6

7 Intel SDK for OpenCL Applications & Intel Media SDK: Better together for video apps Codecs AVC/H264, HEVC/H265, MPEG2, MJPEG Filters Resize/crop, color conversion, deinterlace, composition/blend, FRC, telecine/interlace reversal Increasing GPGPU + fixed function = high performance custom pipelines 7 7

8 Takeaways Intel Media SDK & Intel SDK for OpenCL Applications used together allows programmers to exploit the full computational capabilities of Intel processors by utilizing CPU, GPU & fixed function hardware. Intel offers drivers, runtimes/compilers & a suite of tools allowing video application developers to create, build, debug, analyze & optimize. Intel tools & hardware help you create exciting new applications & usages for image/video processing, virtual & augmented reality, artificial intelligence, robotics, & machine learning (including deep learning). 8 8

9 Get the most out of Intel platforms with Intel Software Development Tools

10 Intel Software Development Tools Technical Computing Embedded Systems & IoT (CPU) Heterogeneous Tools (GPU) New Interface (future) Intel RealSense SDK Intel System Studio Intel IoT Developer Kit Intel Media SDK Intel SDK for OpenCL Applications 10 10

11 Developer Hardware Access GPU Components EUs Media Fixed Function VDbox/MFX VEbox/VQE Media Accelerator Model (GPU) Intel Media Server Studio Intel SDK for OpenCL Applications Intel Media SDK Core Core Ring System Agent LLC (edram) Core CPU Components LLC Core LLC Traditional CPU Model Intel Parallel Studio Intel System Studio CPU Compilers, performance libraries, etc

12 Media Accelerator vs. Traditional Model Comparison Media Application Decode Frame Processing Encode App MSDK Decode DXVA/ VAAPI MSDK VPP OpenCL GPU MSDK Encode VAAPI, OpenCL (+ VME extension) Library CPU or App CPU Driver GPU Driver Driver Driver CPU (Traditional Model) VDBOX EUs, VEBOX EUs, VDBOX GPU (Accelerator Model, includes CPU) 12 12

13 Main loop Initialization OpenCL & Media SDK Programming Models Application (written C, C++, ) Kernels to Accelerate (OpenCL C) Parameters (from header) Parameters (in & out) Parameters kernel void dot ( ) { int tid = get_global_id(0); c[tid] = dot(a[tid], b[tid]); } init init init OpenCL API Khronos ICD stream Decode frame VPP frame Encode stream Compiler Front End (Clang) SPIR Back End (LLVM) Runtime Intel Media SDK GPU Device CPU Device Coprocessor Device Intel SDK for OpenCL Applications 13

14 Intel Media SDK Value Propositions Acceleration: Utilize full capabilities of Intel processors. Easy optimizations: Simple high level asynchronous API no need for specialized developers to code to the metal. Multi-platform/OS support: Utilize Intel Xeon and Core in Windows* and Linux* Future proofed: Align with Intel hardware strategy for the longterm. Optimize for new hardware without code changes BenefitsHardware (TTM): Automatically utilize latest hardware capabilities on Linux & Windows without multiple implementations. Write in Windows, deploy on Linux or vice versa. Fully utilize Intel hardware capabilities for more powerful, cost efficient devices/platforms. Client: Better experience, longer battery life Server: Higher density = more streams per server & lower space/power costs ImpactsTime-to-Market Disruptive technology: more streams per server with lower space/power costs changes the rules of the game and makes new things possible. 14 One unified API for codecs & frame processing across hardware generations 14

15 3D FF Codecs + Frame Processing use Fixed Function + EUs Video Encoding ENC= EU+VDBox VME (MB type, motion vectors, bit budget/brc) PAK = VDBox (residue packing & entropy coding) VDENC = low power encode (6 th Generation Core & forward) VPP Video Decoding BSD=VDBox decode Media Fixed Function VDBOX VEBOX EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU EU Sampler Sampler Sampler Caches VPHal Video Processing Hardware Acceleration Layer VEBox Deinterlacing Denoise (Luma/Chroma) Frame Rate Conversion Color space conversions Composition/alpha blending Scaling 15 15

16 Application Basic Structure of a Media SDK-optimized Application Initialize Session, set parameters Query + Allocate Main loop Find free surface Q stages: decode, VPP Encode Sync Retrieve output Drain loop Same as above Clean up, exit 16 16

17 SDKs More Efficient Together Problem Solution GPU CPU Decode X Process Encode X GPU CPU Decode Process Encode Expensive GPU<->CPU copy/sync Entire pipeline on GPU for greatest efficiency Key Intel Media SDK Component Intel SDK for OpenCL Applications Component 17 17

18 Hardware View (OpenCL perspective) Extensions: Built-in fixed function access Load function in app init clcreateprogramwithbuiltinkernels set parameters (src/ref images, outputs, ) enqueuendrangekernel as usual GPGPU for EUs Low level C subset/spir provides instructions 2x 128-bit SIMD FPUs (8 32 bit float or int operations per cycle) Images (I/O for MSDK) read through sampler Buffers read through data port On many processors EU capabilities higher than CPUs 18 18

19 Motion Estimation Extensions Two extensions for hardware video motion estimation Host APIs & built-in kernels & associated constant definitions cl_intel_motion_estimation (1.0) Defines an accelerator for cl_intel_accelerator extension Provides a built-in kernel for Intel Graphics Hardware motion estimation cl_intel_advanced_motion_estimation (2.0) New built-in kernel for shape decisions Intra/Inter, bi/uni-directional prediction, partitioning, partition size Available Today: Khronos Spec & Sample software.intel.com/en-us/articles/intro-to-advanced-motion-estimation-extension-for-opencl 19

20 Application Media SDK+OpenCL Application Initialize Session, set parameters Compile/load OpenCL kernels Find free surface Query + Allocate (add clcreatefromdx9mediasu rfacekhr) Q stages: decode, VPP, Main loop Sync clenqueueacquiredx9me diasurfaceskhr Launch kernel set params, clenqueuendrangekernel clenqueuereleasedx9me diasurfaceskhr 20 20

21

22 How-To 1. A minimal example (code walk through) 2. Use case: Color conversion filter 3. Use case: BRC Advantages: A few simple steps to keep your pipeline unified Constant workload boosts GPU to peak frequency Free your CPUs for other tasks Create new capabilities not possible otherwise in small space/power/cost constraints Fixed Function Performance Your innovation via GPGPU New possibilities! 22 22

23 A Simple Kernel kernel void Flip( read_only image2d_t src, write_only image2d_t dst, int h) { int2 coord_src = (int2)(get_global_id(0), get_global_id(1)); int2 coord_dst; coord_dst.x = coord_src.x; coord_dst.y = h - coord_src.y - 1; Init Decode Process } uint4 pixel = read_imageui(src, CLK_FILTER_NEAREST, coord_src); write_imageui(dst, coord_dst, pixel.xyzw); Inner loop, only 4 pixels 23 23

24 Kernel Components (parameters) kernel void Flip( read_only image2d_t src, write_only image2d_t dst, int h) { int2 coord_src = (int2)(get_global_id(0), get_global_id(1)); int2 coord_dst; kernel name + parameters coord_dst.x = coord_src.x; coord_dst.y = h - coord_src.y - 1; } uint4 pixel = read_imageui(src, CLK_FILTER_NEAREST, coord_src); write_imageui(dst, coord_dst, pixel.xyzw); 24 24

25 Kernel Components (coordinate calculations) kernel void Flip( read_only image2d_t src, write_only image2d_t dst, int h) { int2 coord_src = (int2)(get_global_id(0), get_global_id(1)); int2 coord_dst; coord_dst.x = coord_src.x; coord_dst.y = h - coord_src.y - 1; where to get data, where to put it } uint4 pixel = read_imageui(src, CLK_FILTER_NEAREST, coord_src); write_imageui(dst, coord_dst, pixel.xyzw); 25 25

26 Kernel Components (read data in) kernel void Flip( read_only image2d_t src, write_only image2d_t dst, int h) { int2 coord_src = (int2)(get_global_id(0), get_global_id(1)); int2 coord_dst; } coord_dst.x = coord_src.x; coord_dst.y = h - coord_src.y - 1; uint4 pixel = read_imageui(src, CLK_FILTER_NEAREST, coord_src); write_imageui(dst, coord_dst, pixel.xyzw); fn to read from sampler, not automatic like traditional CPU model 26 26

27 Kernel Components (operation + output) kernel void Flip( read_only image2d_t src, write_only image2d_t dst, int h) { int2 coord_src = (int2)(get_global_id(0), get_global_id(1)); int2 coord_dst; coord_dst.x = coord_src.x; coord_dst.y = h - coord_src.y - 1; Combine steps fn to write back } uint4 pixel = read_imageui(src, CLK_FILTER_NEAREST, coord_src); //do something more interesting here write_imageui(dst, coord_dst, pixel.xyzw); 27

28 Decode Steps for (;;) { sts = mfxdec.decodeframeasync(&mfxbsin, &pmfxsurfaces[nindex], &pmfxframesurfout, &syncp); Init Decode Process if (MFX_WRN_DEVICE_BUSY == sts) {MSDK_SLEEP(1); continue;} if (MFX_ERR_MORE_DATA == sts) {sts = ReadBitStreamData(&mfxBSin, fsource); continue;} if (MFX_ERR_MORE_SURFACE == sts) {nindex = getfreesurfaceindex(pmfxsurfaces, numsurfaces); continue;} if (MFX_WRN_VIDEO_PARAM_CHANGED == sts) continue; //todo: handle resolution changes, etc. if (MFX_ERR_NONE == sts) break; //new frame is ready after sync } ready sts = session.syncoperation(syncp, 60000); // Wait until decoded frame is Media SDK decode may loop through several states before completing 28 28

29 More input Input finished Simplified Decode Flow Initialize Main loop DecodeFrameAsync Drain loop DecodeFrameAsync Expected Return Codes for DecodeFrameAsync MFX_ERR_MORE_SURFACE A new surface is required to proceed this is where decode will write its output MFX_ERR_MORE_DATA (bitstream in) (null in) More input bitstream data is required to proceed MFX_WRN_DEVICE_BUSY MFX_ERR_MORE _SURFACE MFX_ERR_MORE _SURFACE Hardware device is unable to respond. This is an expected output for normal operation & should clear shortly. If this state persists more than a few milliseconds, this may indicate a problem. MFX_WRN_VIDEO_PARAM_CHANGED MFX_ERR_MORE _DATA The SDK decoder parsed a new sequence header. Decoding can continue with existing frame buffers. The app can optionally retrieve new video parameters by calling MFXVideoDECODE_GetVideoParam. Other Finish (MFX_ERR_MORE_DATA indicates all surfaces drained) Other error codes may be bugs. Contact Intel Support for more info

30 Launching the OpenCL Kernel Init Decode Process clenqueueacquiredx9mediasurfaceskhr(m_clqueue, 4, m_oclsurfaces, 0, NULL, NULL); // Y plane clsetkernelarg(m_clkernel, 0, sizeof(cl_mem), &m_poclbuffers[ocl_in][mididxin].ocl_y); // In clsetkernelarg(m_clkernel, 1, sizeof(cl_mem), &m_poclbuffers[ocl_out][mididxout].ocl_y); // Out clsetkernelarg(m_clkernel, 2, sizeof(cl_int), &m_framesizey[1]); // Frame height clenqueuendrangekernel(m_clqueue, m_clkernel, 2, NULL, m_globalworksizey, m_localworksizey, 0, NULL, NULL); // UV plane clsetkernelarg(m_clkernel, 0, sizeof(cl_mem), &m_poclbuffers[ocl_in][mididxin].ocl_uv); // In clsetkernelarg(m_clkernel, 1, sizeof(cl_mem), &m_poclbuffers[ocl_out][mididxout].ocl_uv); // Out clsetkernelarg(m_clkernel, 2, sizeof(cl_int), &m_framesizeuv[1]); // Frame height clsts = clenqueuendrangekernel(m_clqueue, m_clkernel, 2, NULL, m_globalworksizeuv, m_localworksizeuv, 0, NULL, NULL); clsts = clenqueuereleasedx9mediasurfaceskhr(m_clqueue, 4, m_oclsurfaces, 0, NULL, NULL); 30 30

31 Use Case: Color Format Conversion Filter R G B R G B R Y0 Y1 Y2 Y3 other filters? Encode U0 V0 U1 V1 RGB3 (packed format) NV12 (planar format) kernel void RGB3_to_NV12( read_only image2d_t src, write_only image2d_t Ydst, write_only image2d_t UVdst, ) { coord calculations read_imageui(src) (calculate Y,U,V from src RGB) For additional efficiency chain more operations here write_imageui(ydst) write_imageui(uvdst) } 31 31

32 Use Case: Bitrate Control (BRC) Two paths to improve codec quality 1. Core algorithms 2. Bitrate control VME data for QP decisions Encode Motion estimation kernel output and custom quality metrics can be combined in novel ways to build custom BRC with your own secret sauce. pre-check frames Custom quality metric filter Decode bitstream 32 32

33 Visualize FF+GPGPU+CPUs working together with Intel VTune Amplifier 33

34

35 Why OpenCL + Intel Media SDK? Decode What Media SDK Covers (high level) Process OpenCL Covers Fuller range/lower level Encode Media SDK provides optimized implementations for Codecs Frame Processing Operations For video processing tasks not in Media SDK s scope, extend with OpenCL Make use of growing GPU capabilities Keep pipelines on GPU Example uses: color conversions, custom bitrate control Fixed Function Performance Add your innovation via GPGPU Build something awesome! 35

36 Unleash the Full Range of Gen Graphics Capabilities for Video Processing Applications CALL TO ACTION Develop fast, efficient, competitive media apps - access media accelerators in Intel GPUs Servers & Embedded: Intel Media Server Studio free Community Edition For clients: Intel Media SDK & Intel SDK for OpenCL Applications free standalone tools Get Started - use Media Samples & Tutorials 36 36

37 For more information, visit: Intel Media Server Studio & how-to articles Intel Media SDK & Intel SDK for OpenCL Applications The webinar materials & recording will be ed to you soon. Watch for more webinars in Q3 & Q4.

38 Legal Notices, Disclaimers & Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at intel.com. Tests document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit All information provided here is subject to change without notice. Contact your Intel representative, sales office or distributor to obtain the latest Intel product specifications and roadmaps. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. The cost reduction scenarios described in this document are intended to enable you to get a better understanding of how the purchase of a given Intel product, combined with a number of situation-specific variables, might affect your future cost and savings. Nothing in this document should be interpreted as either a promise of or contract for a given level of costs. INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Intel, the Intel logo, Xeon, Core, Iris Pro, and VTune are trademarks of Intel Corporation in the U.S. and other countries. Intel s compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #

39

40 How to Get the Intel Media SDK Intel Media Server Studio 3 Editions (includes Free Community) Platform / Device Targets Select SKUs of Intel Xeon & Core processor-based platforms Applications for media, communications infrastructure, video processing/conferencing, digital surveillance, video cloud & data center Format Support - HEVC, AVC, MPEG-2, MPEG-Audio See Technical Specifications for System Requirements Intel Media SDK - FREE Platform / Device Targets Intel Core or Core M processors Select SKUs of Intel Celeron, Pentium & Atom processors with Intel HD Graphics supporting Intel Quick Sync Video Client devices Desktop/mobile applications OS - Windows only* See Technical Specifications for System Requirements software.intel.com/intel-media-server-studio software.intel.com/media-sdk 40

41 Intel Media Server Studio Editions At a Glance Premium Feature/Components Community Edition Essentials Edition Professional Edition Media SDK Graphics Drivers Code Samples OpenCL Code Builder and Runtime Metrics Monitor (Linux* only) Intel Premier Support HEVC Decoder & Encoder, GPU Assist APIs Audio Decoder & Encoder Video Quality Caliper Intel VTune Amplifier Premium Telecine Interlace Reverser 41

42

More performance options

More performance options More performance options OpenCL, streaming media, and native coding options with INDE April 8, 2014 2014, Intel Corporation. All rights reserved. Intel, the Intel logo, Intel Inside, Intel Xeon, and Intel

More information

Sample for OpenCL* and DirectX* Video Acceleration Surface Sharing

Sample for OpenCL* and DirectX* Video Acceleration Surface Sharing Sample for OpenCL* and DirectX* Video Acceleration Surface Sharing User s Guide Intel SDK for OpenCL* Applications Sample Documentation Copyright 2010 2013 Intel Corporation All Rights Reserved Document

More information

Optimizing Film, Media with OpenCL & Intel Quick Sync Video

Optimizing Film, Media with OpenCL & Intel Quick Sync Video Optimizing Film, Media with OpenCL & Intel Quick Sync Video Petter Larsson, Senior Software Engineer Ryan Tabrah, Product Manager The Intel Vision Enriching the lives of every person on earth through technology

More information

OpenCL* and Microsoft DirectX* Video Acceleration Surface Sharing

OpenCL* and Microsoft DirectX* Video Acceleration Surface Sharing OpenCL* and Microsoft DirectX* Video Acceleration Surface Sharing Intel SDK for OpenCL* Applications Sample Documentation Copyright 2010 2012 Intel Corporation All Rights Reserved Document Number: 327281-001US

More information

Getting Started with Intel SDK for OpenCL Applications

Getting Started with Intel SDK for OpenCL Applications Getting Started with Intel SDK for OpenCL Applications Webinar #1 in the Three-part OpenCL Webinar Series July 11, 2012 Register Now for All Webinars in the Series Welcome to Getting Started with Intel

More information

Intel Media Server Studio Professional Edition for Linux*

Intel Media Server Studio Professional Edition for Linux* Intel Media Server Studio 2015 R4 Professional Edition for Linux* Release Notes Overview What's New System Requirements Package Contents Installation Installation Folders Known Limitations Legal Information

More information

Intel Media Server Studio 2018 R1 Essentials Edition for Linux* Release Notes

Intel Media Server Studio 2018 R1 Essentials Edition for Linux* Release Notes Overview What's New Intel Media Server Studio 2018 R1 Essentials Edition for Linux* Release Notes System Requirements Package Contents Installation Installation Folders Known Limitations Legal Information

More information

OpenCL Driver for Intel HD, Iris, and Iris Pro Graphics for Linux -- Release Notes 1

OpenCL Driver for Intel HD, Iris, and Iris Pro Graphics for Linux -- Release Notes 1 OpenCL Driver for Intel HD, Iris, and Iris Pro Graphics for Linux -- Release Notes 1 Version Information This document covers the Intel OpenCL Linux graphics device driver version r4.1-build_id, hereafter

More information

Michael Kinsner, Dirk Seynhaeve IWOCL 2018

Michael Kinsner, Dirk Seynhaeve IWOCL 2018 Michael Kinsner, Dirk Seynhaeve IWOCL 2018 Topics 1. FPGA overview 2. Motivating application classes 3. Host pipes 4. Some data 2 FPGA: Fine-grained Massive Parallelism Intel Stratix 10 FPGA: Over 5 Million

More information

Intel SDK for OpenCL* - Sample for OpenCL* and Intel Media SDK Interoperability

Intel SDK for OpenCL* - Sample for OpenCL* and Intel Media SDK Interoperability Intel SDK for OpenCL* - Sample for OpenCL* and Intel Media SDK Interoperability User s Guide Copyright 2010 2012 Intel Corporation All Rights Reserved Document Number: 327283-001US Revision: 1.0 World

More information

Alexei Katranov. IWOCL '16, April 21, 2016, Vienna, Austria

Alexei Katranov. IWOCL '16, April 21, 2016, Vienna, Austria Alexei Katranov IWOCL '16, April 21, 2016, Vienna, Austria Hardware: customization, integration, heterogeneity Intel Processor Graphics CPU CPU CPU CPU Multicore CPU + integrated units for graphics, media

More information

Intel Xeon Phi Coprocessor. Technical Resources. Intel Xeon Phi Coprocessor Workshop Pawsey Centre & CSIRO, Aug Intel Xeon Phi Coprocessor

Intel Xeon Phi Coprocessor. Technical Resources. Intel Xeon Phi Coprocessor Workshop Pawsey Centre & CSIRO, Aug Intel Xeon Phi Coprocessor Technical Resources Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPETY RIGHTS

More information

OpenMP * 4 Support in Clang * / LLVM * Andrey Bokhanko, Intel

OpenMP * 4 Support in Clang * / LLVM * Andrey Bokhanko, Intel OpenMP * 4 Support in Clang * / LLVM * Andrey Bokhanko, Intel Clang * : An Excellent C++ Compiler LLVM * : Collection of modular and reusable compiler and toolchain technologies Created by Chris Lattner

More information

Intel and Badaboom Video File Transcoding

Intel and Badaboom Video File Transcoding Solutions Intel and Badaboom Video File Transcoding Introduction Intel Quick Sync Video, built right into 2 nd generation Intel Core processors, is breakthrough hardware acceleration that lets the user

More information

Debugging and Analyzing Programs using the Intercept Layer for OpenCL Applications

Debugging and Analyzing Programs using the Intercept Layer for OpenCL Applications Debugging and Analyzing Programs using the Intercept Layer for OpenCL Applications Ben Ashbaugh IWOCL 2018 https://github.com/intel/opencl-intercept-layer Why am I here? Intercept Layer for OpenCL Applications

More information

Intel Media Server Studio 2017 R3 Essentials Edition for Linux* Release Notes

Intel Media Server Studio 2017 R3 Essentials Edition for Linux* Release Notes Overview What's New Intel Media Server Studio 2017 R3 Essentials Edition for Linux* Release Notes System Requirements Package Contents Installation Installation Folders Known Limitations Legal Information

More information

Jomar Silva Technical Evangelist

Jomar Silva Technical Evangelist Jomar Silva Technical Evangelist Agenda Introduction Intel Graphics Performance Analyzers: what is it, where do I get it, and how do I use it? Intel GPA with VR What devices can I use Intel GPA with and

More information

MICHAL MROZEK ZBIGNIEW ZDANOWICZ

MICHAL MROZEK ZBIGNIEW ZDANOWICZ MICHAL MROZEK ZBIGNIEW ZDANOWICZ Legal Notices and Disclaimers INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY

More information

Arnon Peleg Visual Computing Products Management

Arnon Peleg Visual Computing Products Management Optimizing Visual Computing Applications on 3 rd Generation Intel Core Processor Family Intel Visual Computing Tools Helping Make Your Game & Media Applications Faster, Faster Arnon Peleg Visual Computing

More information

Intel Iris Graphics Quick Sync Video Innovation Behind Quality and Performance Leadership

Intel Iris Graphics Quick Sync Video Innovation Behind Quality and Performance Leadership Intel Iris Graphics Quick Sync Video Innovation Behind Quality and Performance Leadership Dr. Wen-Fu Kao and Dr. Ryan Lei Intel Visual & Parallel Group Media Architecture Legal INFORMATION IN THIS DOCUMENT

More information

Bitonic Sorting Intel OpenCL SDK Sample Documentation

Bitonic Sorting Intel OpenCL SDK Sample Documentation Intel OpenCL SDK Sample Documentation Document Number: 325262-002US Legal Information INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL

More information

Installation Guide and Release Notes

Installation Guide and Release Notes Intel C++ Studio XE 2013 for Windows* Installation Guide and Release Notes Document number: 323805-003US 26 June 2013 Table of Contents 1 Introduction... 1 1.1 What s New... 2 1.1.1 Changes since Intel

More information

Bitonic Sorting. Intel SDK for OpenCL* Applications Sample Documentation. Copyright Intel Corporation. All Rights Reserved

Bitonic Sorting. Intel SDK for OpenCL* Applications Sample Documentation. Copyright Intel Corporation. All Rights Reserved Intel SDK for OpenCL* Applications Sample Documentation Copyright 2010 2012 Intel Corporation All Rights Reserved Document Number: 325262-002US Revision: 1.3 World Wide Web: http://www.intel.com Document

More information

Crosstalk between VMs. Alexander Komarov, Application Engineer Software and Services Group Developer Relations Division EMEA

Crosstalk between VMs. Alexander Komarov, Application Engineer Software and Services Group Developer Relations Division EMEA Crosstalk between VMs Alexander Komarov, Application Engineer Software and Services Group Developer Relations Division EMEA 2 September 2015 Legal Disclaimer & Optimization Notice INFORMATION IN THIS DOCUMENT

More information

Graphics Performance Analyzer for Android

Graphics Performance Analyzer for Android Graphics Performance Analyzer for Android 1 What you will learn from this slide deck Detailed optimization workflow of Graphics Performance Analyzer Android* System Analysis Only Please see subsequent

More information

Intel Media Server Studio 2018 R1 - HEVC Decoder and Encoder Release Notes (Version )

Intel Media Server Studio 2018 R1 - HEVC Decoder and Encoder Release Notes (Version ) Intel Media Server Studio 2018 R1 - HEVC Decoder and Encoder Release Notes (Version 1.0.10) Overview New Features System Requirements Installation Installation Folders How To Use Supported Formats Known

More information

Agenda. Optimization Notice Copyright 2017, Intel Corporation. All rights reserved. *Other names and brands may be claimed as the property of others.

Agenda. Optimization Notice Copyright 2017, Intel Corporation. All rights reserved. *Other names and brands may be claimed as the property of others. Agenda VTune Amplifier XE OpenMP* Analysis: answering on customers questions about performance in the same language a program was written in Concepts, metrics and technology inside VTune Amplifier XE OpenMP

More information

Real World Development examples of systems / iot

Real World Development examples of systems / iot Real World Development examples of systems / iot Intel Software Developer Conference Seoul 2017 Jon Kim Software Consulting Engineer Contents IOT end-to-end Scalability with Intel x86 Architect Real World

More information

Collecting OpenCL*-related Metrics with Intel Graphics Performance Analyzers

Collecting OpenCL*-related Metrics with Intel Graphics Performance Analyzers Collecting OpenCL*-related Metrics with Intel Graphics Performance Analyzers Collecting Important OpenCL*-related Metrics with Intel GPA System Analyzer Introduction Intel SDK for OpenCL* Applications

More information

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit Ben Walker Data Center Group Intel Corporation 2018 Storage Developer Conference. Intel Corporation. All Rights Reserved. 1 Notices

More information

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. Munara Tolubaeva Technical Consulting Engineer 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. notices and disclaimers Intel technologies features and benefits depend

More information

Intel s Architecture for NFV

Intel s Architecture for NFV Intel s Architecture for NFV Evolution from specialized technology to mainstream programming Net Futures 2015 Network applications Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION

More information

Intel Parallel Studio XE 2011 for Windows* Installation Guide and Release Notes

Intel Parallel Studio XE 2011 for Windows* Installation Guide and Release Notes Intel Parallel Studio XE 2011 for Windows* Installation Guide and Release Notes Document number: 323803-001US 4 May 2011 Table of Contents 1 Introduction... 1 1.1 What s New... 2 1.2 Product Contents...

More information

Vectorization Advisor: getting started

Vectorization Advisor: getting started Vectorization Advisor: getting started Before you analyze Run GUI or Command Line Set-up environment Linux: source /advixe-vars.sh Windows: \advixe-vars.bat Run GUI or Command

More information

H.J. Lu, Sunil K Pandey. Intel. November, 2018

H.J. Lu, Sunil K Pandey. Intel. November, 2018 H.J. Lu, Sunil K Pandey Intel November, 2018 Issues with Run-time Library on IA Memory, string and math functions in today s glibc are optimized for today s Intel processors: AVX/AVX2/AVX512 FMA It takes

More information

Becca Paren Cluster Systems Engineer Software and Services Group. May 2017

Becca Paren Cluster Systems Engineer Software and Services Group. May 2017 Becca Paren Cluster Systems Engineer Software and Services Group May 2017 Clusters are complex systems! Challenge is to reduce this complexity barrier for: Cluster architects System administrators Application

More information

Expressing and Analyzing Dependencies in your C++ Application

Expressing and Analyzing Dependencies in your C++ Application Expressing and Analyzing Dependencies in your C++ Application Pablo Reble, Software Engineer Developer Products Division Software and Services Group, Intel Agenda TBB and Flow Graph extensions Composable

More information

Guy Blank Intel Corporation, Israel March 27-28, 2017 European LLVM Developers Meeting Saarland Informatics Campus, Saarbrücken, Germany

Guy Blank Intel Corporation, Israel March 27-28, 2017 European LLVM Developers Meeting Saarland Informatics Campus, Saarbrücken, Germany Guy Blank Intel Corporation, Israel March 27-28, 2017 European LLVM Developers Meeting Saarland Informatics Campus, Saarbrücken, Germany Motivation C AVX2 AVX512 New instructions utilized! Scalar performance

More information

Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant

Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant Parallel is the Path Forward Intel Xeon and Intel Xeon Phi Product Families are both going parallel Intel Xeon processor

More information

Intel Open Source HD Graphics, Intel Iris Graphics, and Intel Iris Pro Graphics

Intel Open Source HD Graphics, Intel Iris Graphics, and Intel Iris Pro Graphics Intel Open Source HD Graphics, Intel Iris Graphics, and Intel Iris Pro Graphics Programmer's Reference Manual For the 2015-2016 Intel Core Processors, Celeron Processors, and Pentium Processors based on

More information

Also a new Metro style sample application which showcases media files transcoding with parameter control using Metro design language is introduced.

Also a new Metro style sample application which showcases media files transcoding with parameter control using Metro design language is introduced. Intel Media Software Development Kit 2012 R3 Release Notes (Version 3.5.915.45249) Overview New Features System Requirements Package Contents Installation Known Limitations Other Limitations Legal Information

More information

IXPUG 16. Dmitry Durnov, Intel MPI team

IXPUG 16. Dmitry Durnov, Intel MPI team IXPUG 16 Dmitry Durnov, Intel MPI team Agenda - Intel MPI 2017 Beta U1 product availability - New features overview - Competitive results - Useful links - Q/A 2 Intel MPI 2017 Beta U1 is available! Key

More information

LIBXSMM Library for small matrix multiplications. Intel High Performance and Throughput Computing (EMEA) Hans Pabst, March 12 th 2015

LIBXSMM Library for small matrix multiplications. Intel High Performance and Throughput Computing (EMEA) Hans Pabst, March 12 th 2015 LIBXSMM Library for small matrix multiplications. Intel High Performance and Throughput Computing (EMEA) Hans Pabst, March 12 th 2015 Abstract Library for small matrix-matrix multiplications targeting

More information

Krzysztof Laskowski, Intel Pavan K Lanka, Intel

Krzysztof Laskowski, Intel Pavan K Lanka, Intel Krzysztof Laskowski, Intel Pavan K Lanka, Intel Legal Notices and Disclaimers INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR

More information

Intel Cluster Checker 3.0 webinar

Intel Cluster Checker 3.0 webinar Intel Cluster Checker 3.0 webinar June 3, 2015 Christopher Heller Technical Consulting Engineer Q2, 2015 1 Introduction Intel Cluster Checker 3.0 is a systems tool for Linux high performance compute clusters

More information

Intel tools for High Performance Python 데이터분석및기타기능을위한고성능 Python

Intel tools for High Performance Python 데이터분석및기타기능을위한고성능 Python Intel tools for High Performance Python 데이터분석및기타기능을위한고성능 Python Python Landscape Adoption of Python continues to grow among domain specialists and developers for its productivity benefits Challenge#1:

More information

Installation Guide and Release Notes

Installation Guide and Release Notes Intel Parallel Studio XE 2013 for Linux* Installation Guide and Release Notes Document number: 323804-003US 10 March 2013 Table of Contents 1 Introduction... 1 1.1 What s New... 1 1.1.1 Changes since Intel

More information

Daniel Verkamp, Software Engineer

Daniel Verkamp, Software Engineer Daniel Verkamp, Software Engineer Notices and Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn

More information

Повышение энергоэффективности мобильных приложений путем их распараллеливания. Примеры. Владимир Полин

Повышение энергоэффективности мобильных приложений путем их распараллеливания. Примеры. Владимир Полин Повышение энергоэффективности мобильных приложений путем их распараллеливания. Примеры. Владимир Полин Legal Notices This presentation is for informational purposes only. INTEL MAKES NO WARRANTIES, EXPRESS

More information

Intel Stereo 3D SDK Developer s Guide. Alpha Release

Intel Stereo 3D SDK Developer s Guide. Alpha Release Intel Stereo 3D SDK Developer s Guide Alpha Release Contents Why Intel Stereo 3D SDK?... 3 HW and SW requirements... 3 Intel Stereo 3D SDK samples... 3 Developing Intel Stereo 3D SDK Applications... 4

More information

Using Intel VTune Amplifier XE and Inspector XE in.net environment

Using Intel VTune Amplifier XE and Inspector XE in.net environment Using Intel VTune Amplifier XE and Inspector XE in.net environment Levent Akyil Technical Computing, Analyzers and Runtime Software and Services group 1 Refresher - Intel VTune Amplifier XE Intel Inspector

More information

What s New August 2015

What s New August 2015 What s New August 2015 Significant New Features New Directory Structure OpenMP* 4.1 Extensions C11 Standard Support More C++14 Standard Support Fortran 2008 Submodules and IMPURE ELEMENTAL Further C Interoperability

More information

SDK API Reference Manual for VP8. API Version 1.12

SDK API Reference Manual for VP8. API Version 1.12 SDK API Reference Manual for VP8 API Version 1.12 LEGAL DISCLAIMER INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE,

More information

Intel Parallel Studio XE 2011 SP1 for Linux* Installation Guide and Release Notes

Intel Parallel Studio XE 2011 SP1 for Linux* Installation Guide and Release Notes Intel Parallel Studio XE 2011 SP1 for Linux* Installation Guide and Release Notes Document number: 323804-002US 21 June 2012 Table of Contents 1 Introduction... 1 1.1 What s New... 1 1.2 Product Contents...

More information

Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature. Intel Software Developer Conference London, 2017

Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature. Intel Software Developer Conference London, 2017 Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature Intel Software Developer Conference London, 2017 Agenda Vectorization is becoming more and more important What is

More information

Tuning Python Applications Can Dramatically Increase Performance

Tuning Python Applications Can Dramatically Increase Performance Tuning Python Applications Can Dramatically Increase Performance Vasilij Litvinov Software Engineer, Intel Legal Disclaimer & 2 INFORMATION IN THIS DOCUMENT IS PROVIDED AS IS. NO LICENSE, EXPRESS OR IMPLIED,

More information

Memory & Thread Debugger

Memory & Thread Debugger Memory & Thread Debugger Here is What Will Be Covered Overview Memory/Thread analysis New Features Deep dive into debugger integrations Demo Call to action Intel Confidential 2 Analysis Tools for Diagnosis

More information

Non-Volatile Memory Cache Enhancements: Turbo-Charging Client Platform Performance

Non-Volatile Memory Cache Enhancements: Turbo-Charging Client Platform Performance Non-Volatile Memory Cache Enhancements: Turbo-Charging Client Platform Performance By Robert E Larsen NVM Cache Product Line Manager Intel Corporation August 2008 1 Legal Disclaimer INFORMATION IN THIS

More information

Achieving High Performance. Jim Cownie Principal Engineer SSG/DPD/TCAR Multicore Challenge 2013

Achieving High Performance. Jim Cownie Principal Engineer SSG/DPD/TCAR Multicore Challenge 2013 Achieving High Performance Jim Cownie Principal Engineer SSG/DPD/TCAR Multicore Challenge 2013 Does Instruction Set Matter? We find that ARM and x86 processors are simply engineering design points optimized

More information

Välkommen. Intel Anders Huge

Välkommen. Intel Anders Huge Välkommen Intel Anders Huge Transformative Technology from Intel A n d e r s H u g e I n t e l Why intel INTEL CORPORATION 5 TRANSFORMING BUSINESS MODERN BUSINESS DEMANDS Intel VISION Accelerate workplace

More information

Mikhail Dvorskiy, Jim Cownie, Alexey Kukanov

Mikhail Dvorskiy, Jim Cownie, Alexey Kukanov Mikhail Dvorskiy, Jim Cownie, Alexey Kukanov What is the Parallel STL? C++17 C++ Next An extension of the C++ Standard Template Library algorithms with the execution policy argument Support for parallel

More information

Evolving Small Cells. Udayan Mukherjee Senior Principal Engineer and Director (Wireless Infrastructure)

Evolving Small Cells. Udayan Mukherjee Senior Principal Engineer and Director (Wireless Infrastructure) Evolving Small Cells Udayan Mukherjee Senior Principal Engineer and Director (Wireless Infrastructure) Intelligent Heterogeneous Network Optimum User Experience Fibre-optic Connected Macro Base stations

More information

Advances of Media Technology in Modern Computing. Dr. Hong Jiang, Intel Fellow

Advances of Media Technology in Modern Computing. Dr. Hong Jiang, Intel Fellow Advances of Media Technology in Modern Computing Dr. Hong Jiang, Intel Fellow Legal INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL

More information

Intel Graphics Virtualization Technology. Kevin Tian Graphics Virtualization Architect

Intel Graphics Virtualization Technology. Kevin Tian Graphics Virtualization Architect Intel Graphics Virtualization Technology Kevin Tian Graphics Virtualization Architect Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR

More information

Overview of Data Fitting Component in Intel Math Kernel Library (Intel MKL) Intel Corporation

Overview of Data Fitting Component in Intel Math Kernel Library (Intel MKL) Intel Corporation Overview of Data Fitting Component in Intel Math Kernel Library (Intel MKL) Intel Corporation Agenda 1D interpolation problem statement Computation flow Application areas Data fitting in Intel MKL Data

More information

INTEL PERCEPTUAL COMPUTING SDK. How To Use the Privacy Notification Tool

INTEL PERCEPTUAL COMPUTING SDK. How To Use the Privacy Notification Tool INTEL PERCEPTUAL COMPUTING SDK How To Use the Privacy Notification Tool LEGAL DISCLAIMER THIS DOCUMENT CONTAINS INFORMATION ON PRODUCTS IN THE DESIGN PHASE OF DEVELOPMENT. INFORMATION IN THIS DOCUMENT

More information

Desktop 4th Generation Intel Core, Intel Pentium, and Intel Celeron Processor Families and Intel Xeon Processor E3-1268L v3

Desktop 4th Generation Intel Core, Intel Pentium, and Intel Celeron Processor Families and Intel Xeon Processor E3-1268L v3 Desktop 4th Generation Intel Core, Intel Pentium, and Intel Celeron Processor Families and Intel Xeon Processor E3-1268L v3 Addendum May 2014 Document Number: 329174-004US Introduction INFORMATION IN THIS

More information

Intel Parallel Studio XE 2015

Intel Parallel Studio XE 2015 2015 Create faster code faster with this comprehensive parallel software development suite. Faster code: Boost applications performance that scales on today s and next-gen processors Create code faster:

More information

Intel Software Development Products Licensing & Programs Channel EMEA

Intel Software Development Products Licensing & Programs Channel EMEA Intel Software Development Products Licensing & Programs Channel EMEA Intel Software Development Products Advanced Performance Distributed Performance Intel Software Development Products Foundation of

More information

Kevin O Leary, Intel Technical Consulting Engineer

Kevin O Leary, Intel Technical Consulting Engineer Kevin O Leary, Intel Technical Consulting Engineer Moore s Law Is Going Strong Hardware performance continues to grow exponentially We think we can continue Moore's Law for at least another 10 years."

More information

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Family-Based Platforms Executive Summary Complex simulations of structural and systems performance, such as car crash simulations,

More information

ISA-L Performance Report Release Test Date: Sept 29 th 2017

ISA-L Performance Report Release Test Date: Sept 29 th 2017 Test Date: Sept 29 th 2017 Revision History Date Revision Comment Sept 29 th, 2017 1.0 Initial document for release 2 Contents Audience and Purpose... 4 Test setup:... 4 Intel Xeon Platinum 8180 Processor

More information

Bei Wang, Dmitry Prohorov and Carlos Rosales

Bei Wang, Dmitry Prohorov and Carlos Rosales Bei Wang, Dmitry Prohorov and Carlos Rosales Aspects of Application Performance What are the Aspects of Performance Intel Hardware Features Omni-Path Architecture MCDRAM 3D XPoint Many-core Xeon Phi AVX-512

More information

Intel Math Kernel Library (Intel MKL) BLAS. Victor Kostin Intel MKL Dense Solvers team manager

Intel Math Kernel Library (Intel MKL) BLAS. Victor Kostin Intel MKL Dense Solvers team manager Intel Math Kernel Library (Intel MKL) BLAS Victor Kostin Intel MKL Dense Solvers team manager Intel MKL BLAS/Sparse BLAS Original ( dense ) BLAS available from www.netlib.org Additionally Intel MKL provides

More information

High Dynamic Range Tone Mapping Post Processing Effect Multi-Device Version

High Dynamic Range Tone Mapping Post Processing Effect Multi-Device Version High Dynamic Range Tone Mapping Post Processing Effect Multi-Device Version Intel SDK for OpenCL* Application Sample Documentation Copyright 2010 2012 Intel Corporation All Rights Reserved Document Number:

More information

High Performance Computing The Essential Tool for a Knowledge Economy

High Performance Computing The Essential Tool for a Knowledge Economy High Performance Computing The Essential Tool for a Knowledge Economy Rajeeb Hazra Vice President & General Manager Technical Computing Group Datacenter & Connected Systems Group July 22 nd 2013 1 What

More information

Case Study. Optimizing an Illegal Image Filter System. Software. Intel Integrated Performance Primitives. High-Performance Computing

Case Study. Optimizing an Illegal Image Filter System. Software. Intel Integrated Performance Primitives. High-Performance Computing Case Study Software Optimizing an Illegal Image Filter System Intel Integrated Performance Primitives High-Performance Computing Tencent Doubles the Speed of its Illegal Image Filter System using SIMD

More information

Accelerate Machine Learning on macos with Intel Integrated Graphics. Hisham Chowdhury May 23, 2018

Accelerate Machine Learning on macos with Intel Integrated Graphics. Hisham Chowdhury May 23, 2018 Accelerate Machine Learning on macos with Intel Integrated Graphics Hisham Chowdhury May 23, 2018 Apple Machine Learning Stack Machine Learning Application 1 Machine Learning Application 2 Vision Natural

More information

Installation Guide and Release Notes

Installation Guide and Release Notes Installation Guide and Release Notes Document number: 321604-001US 19 October 2009 Table of Contents 1 Introduction... 1 1.1 Product Contents... 1 1.2 System Requirements... 2 1.3 Documentation... 3 1.4

More information

INTEL PENTIUM Gold AND CELERON PROCESSORS

INTEL PENTIUM Gold AND CELERON PROCESSORS INTEL PENTIUM Gold AND CELERON PROCESSORS Reliable performance and affordable priced PCs for value-oriented buyers Impressive performance for work and play. The new Pentium Gold processor provides great

More information

Jackson Marusarz Software Technical Consulting Engineer

Jackson Marusarz Software Technical Consulting Engineer Jackson Marusarz Software Technical Consulting Engineer What Will Be Covered Overview Memory/Thread analysis New Features Deep dive into debugger integrations Demo Call to action 2 Analysis Tools for Diagnosis

More information

Stanislav Bratanov; Roman Belenov; Ludmila Pakhomova 4/27/2015

Stanislav Bratanov; Roman Belenov; Ludmila Pakhomova 4/27/2015 Stanislav Bratanov; Roman Belenov; Ludmila Pakhomova 4/27/2015 What is Intel Processor Trace? Intel Processor Trace (Intel PT) provides hardware a means to trace branching, transaction, and timing information

More information

A Simple Path to Parallelism with Intel Cilk Plus

A Simple Path to Parallelism with Intel Cilk Plus Introduction This introductory tutorial describes how to use Intel Cilk Plus to simplify making taking advantage of vectorization and threading parallelism in your code. It provides a brief description

More information

Automatic Intra-Application Load Balancing for Heterogeneous Systems

Automatic Intra-Application Load Balancing for Heterogeneous Systems Automatic Intra-Application Load Balancing for Heterogeneous Systems Michael Boyer, Shuai Che, and Kevin Skadron Department of Computer Science University of Virginia Jayanth Gummaraju and Nuwan Jayasena

More information

Intel Stress Bitstreams and Encoder (Intel SBE) 2017 AVS2 Release Notes (Version 2.3)

Intel Stress Bitstreams and Encoder (Intel SBE) 2017 AVS2 Release Notes (Version 2.3) Intel Stress Bitstreams and Encoder (Intel SBE) 2017 AVS2 Release Notes (Version 2.3) Overview Changes History Installation Package Contents Known Limitations Attributions Legal Information Overview The

More information

April 2 nd, Bob Burroughs Director, HPC Solution Sales

April 2 nd, Bob Burroughs Director, HPC Solution Sales April 2 nd, 2019 Bob Burroughs Director, HPC Solution Sales Today - Introducing 2 nd Generation Intel Xeon Scalable Processors how Intel Speeds HPC performance Work Time System Peak Efficiency Software

More information

Kirill Rogozhin. Intel

Kirill Rogozhin. Intel Kirill Rogozhin Intel From Old HPC principle to modern performance model Old HPC principles: 1. Balance principle (e.g. Kung 1986) hw and software parameters altogether 2. Compute Density, intensity, machine

More information

Solid State Graphics (SSG) SDK Setup and Raw Video Player Guide

Solid State Graphics (SSG) SDK Setup and Raw Video Player Guide Solid State Graphics (SSG) SDK Setup and Raw Video Player Guide PAGE 1 Radeon Pro SSG SDK Setup To enable you to access the capabilities of the Radeon Pro SSG card, it comes with extensions for Microsoft

More information

mfxextavcreflistctrl structure was added to allow configuration of reference frame options for H.264 encoder and decoder. Used as extended buffer.

mfxextavcreflistctrl structure was added to allow configuration of reference frame options for H.264 encoder and decoder. Used as extended buffer. Intel Media Software Development Kit 2012 Release Notes (Version 3.0.774.38156) Overview New Features System Requirements Package Contents Installation Known Limitations Other Limitations Legal Information

More information

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland Markus Leberecht Senior Cloud Solution Architect, Intel Deutschland Andreas Schneider Distribution Sales Manager, Intel Deutschland Legal Disclaimers 2016 Intel Corporation. Intel, the Intel logo, Xeon

More information

Efficiently Introduce Threading using Intel TBB

Efficiently Introduce Threading using Intel TBB Introduction This guide will illustrate how to efficiently introduce threading using Intel Threading Building Blocks (Intel TBB), part of Intel Parallel Studio XE. It is a widely used, award-winning C++

More information

Intel Parallel Studio XE 2011 for Linux* Installation Guide and Release Notes

Intel Parallel Studio XE 2011 for Linux* Installation Guide and Release Notes Intel Parallel Studio XE 2011 for Linux* Installation Guide and Release Notes Document number: 323804-001US 8 October 2010 Table of Contents 1 Introduction... 1 1.1 Product Contents... 1 1.2 What s New...

More information

Achieving Peak Performance on Intel Hardware. Intel Software Developer Conference London, 2017

Achieving Peak Performance on Intel Hardware. Intel Software Developer Conference London, 2017 Achieving Peak Performance on Intel Hardware Intel Software Developer Conference London, 2017 Welcome Aims for the day You understand some of the critical features of Intel processors and other hardware

More information

Are You Insured Against Your Noisy Neighbor Sunku Ranganath, Intel Corporation Sridhar Rao, Spirent Communications

Are You Insured Against Your Noisy Neighbor Sunku Ranganath, Intel Corporation Sridhar Rao, Spirent Communications Are You Insured Against Your Noisy Neighbor Sunku Ranganath, Intel Corporation Sridhar Rao, Spirent Communications @SunkuRanganath, @ngignir Legal Disclaimer 2018 Intel Corporation. Intel, the Intel logo,

More information

Intel Parallel Studio XE 2015 Composer Edition for Linux* Installation Guide and Release Notes

Intel Parallel Studio XE 2015 Composer Edition for Linux* Installation Guide and Release Notes Intel Parallel Studio XE 2015 Composer Edition for Linux* Installation Guide and Release Notes 23 October 2014 Table of Contents 1 Introduction... 1 1.1 Product Contents... 2 1.2 Intel Debugger (IDB) is

More information

Intel Quick Sync Video Tec h nology on Intel Iris Graphics and Intel HD Gra p hics family Flexible Transcode Performance and Quality

Intel Quick Sync Video Tec h nology on Intel Iris Graphics and Intel HD Gra p hics family Flexible Transcode Performance and Quality WHITE PAPER Intel Quick Sync Video Intel Quick Sync Video Tec h nology on Intel Iris Graphics and Intel HD Gra p hics family Flexible Transcode Performance and Quality The 4th generation Intel Core Processor

More information

Intel Core TM i7-4702ec Processor for Communications Infrastructure

Intel Core TM i7-4702ec Processor for Communications Infrastructure Intel Core TM i7-4702ec Processor for Communications Infrastructure Application Power Guidelines Addendum May 2014 Document Number: 330009-001US Introduction INFORMATION IN THIS DOCUMENT IS PROVIDED IN

More information

Achieving 2.5X 1 Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization

Achieving 2.5X 1 Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization white paper Internet Discovery Artificial Intelligence (AI) Achieving.X Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization As one of the world s preeminent

More information

Software Occlusion Culling

Software Occlusion Culling Software Occlusion Culling Abstract This article details an algorithm and associated sample code for software occlusion culling which is available for download. The technique divides scene objects into

More information

Case Study: Optimizing King of Soldier* with Intel Graphics Performance Analyzers on Intel HD Graphics 4000

Case Study: Optimizing King of Soldier* with Intel Graphics Performance Analyzers on Intel HD Graphics 4000 Case Study: Optimizing King of Soldier* with Intel Graphics Performance Analyzers on Intel HD Graphics 4000 Intel Corporation: Cage Lu, Kiefer Kuah Giant Interactive Group, Inc.: Yu Nana Abstract The performance

More information