Intel VTune Performance Analyzer 9.1 for Windows* In-Depth


Contents

Deliver Faster Code
Optimize Multicore Performance
Highlights
Features
Performance: Advanced Optimization Features
Compatibility
New
Support

Deliver Faster Code

Deliver fast software on the latest 64-bit multicore systems running Microsoft Windows Vista*, Windows* XP or Windows Server*.

Locate performance bottlenecks without recompilation and with very low overhead (under 5%).
Analyze the results using a graphical interface with strong Visual Studio* and .NET integration.
Quickly drill down to the source to identify problematic lines of code.

Optimize Multicore Performance

Intel Thread Profiler included! Not getting the performance you expected after threading your application? Intel Thread Profiler helps you tune multi-threaded code for optimized performance on today's multicore processors.

NEW! Now supports the Intel Core i7 processor.
NEW! Performance profiling for dynamically generated code, JavaScript, and Flash.
Access to VTune Analyzer's Open Data Model.

[Figure: Microsoft Visual Studio* 2005 integration. asmo-na/eng/vtune/336206.htm]
[Figure: Tune for quad-core with Intel Thread Profiler. asmo-na/eng/vtune/336208.htm]
[Figure: Microsoft Windows Vista* compatible. asmo-na/eng/vtune/336200.htm]

Highlights

32- and 64-bit support for Microsoft Windows Vista, Windows XP and Windows Server.
Integrated with Microsoft Visual Studio 2005 or use standalone with Visual Studio 2008.
Supports the latest Intel quad-core processors.
Highlights thread overhead and synchronization impact with Intel Thread Profiler.
Multi-threading support for load balancing and idle time identification.
Works with all compiled languages and compilers (C, C++, Fortran, etc.).
Optimize Microsoft .NET applications, including C# and Visual Basic.
Tune remote Linux* systems from your Windows* desktop.

Features

Speed up your applications by using Intel VTune Performance Analyzer throughout the development process to produce faster, more efficient code.

See the Answers on Your Source

Source and disassembly views show you exactly which lines of code are taking the most time.

Performance: Advanced Optimization Features

Low Overhead Sampling Profiling

Event-based sampling finds your bottleneck with very low overhead (typically less than 5 percent). Identify problems such as cache misses, branch mispredictions, and bus bandwidth. Because it is system-wide, event-based sampling can be used to tune libraries and drivers as well as application programs. No special builds or instrumentation are required.

[Figure: Performance data displayed on the source and assembly code. asmo-na/eng/336445.htm]
[Figure: Time- and event-based sampling capability. asmo-na/eng/336443.htm]

Find the Critical Path Using Call Graph Profiling

Call Graph determines calling sequences and graphically displays the critical path. It also shows you the context of the bottleneck: to be effective, you often need to know not only where the application is spending its time, but how it got there. See which functions took the most time to process or were blocked the longest.

[Figure: Call Graph Profiling. The critical path is shown in red. Selecting a row in the table (light blue row) selects the matching function in the graph (light blue box) and scrolls the view to make the function visible. asmo-na/eng/336446.htm]
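As an illustration of the critical-path idea described for Call Graph, the sketch below finds the most expensive root-to-leaf call chain in a small call graph. It is not VTune analyzer code or output: the function names, the timing numbers, and the definition of "critical path" used here (the chain with the largest summed self time) are all assumptions made for the example.

```python
# Hypothetical call graph: caller -> list of callees (names invented for illustration).
calls = {
    "main": ["parse", "render"],
    "parse": ["read_file"],
    "render": ["shade", "blit"],
}

# Hypothetical self time per function, in milliseconds.
self_time = {
    "main": 5, "parse": 20, "read_file": 15,
    "render": 10, "shade": 60, "blit": 12,
}


def critical_path(calls, self_time, root):
    """Return (total_time, path) for the costliest root-to-leaf call chain.

    Assumption: the cost of a chain is the sum of the self times of the
    functions on it. Recursive edges are skipped so the walk terminates.
    """
    def visit(fn, on_stack):
        best_total, best_path = 0.0, []
        for callee in calls.get(fn, []):
            if callee in on_stack:  # ignore recursion back into the current chain
                continue
            total, path = visit(callee, on_stack | {callee})
            if total > best_total:
                best_total, best_path = total, path
        return self_time.get(fn, 0.0) + best_total, [fn] + best_path

    return visit(root, frozenset([root]))


total, path = critical_path(calls, self_time, "main")
print(" -> ".join(path), f"({total} ms)")
```

With this sample data the chain through `render` and `shade` dominates, which is the kind of context (how the program reached the hot function) that a flat list of hot functions does not give.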

Identify System Issues with Counter Monitor

Quickly identify system-level performance issues using the Counter Monitor to track system activity and resource consumption during runtime. For instance, it indicates whether reduced available memory or performance issues associated with file I/O slow down the application.

[Figure: Counter Monitor allows real-time performance viewing. asmo-na/eng/336450.htm]

Compatibility

Programming Language and Compiler Independent

VTune analyzer supports all compilers that follow industry standards, including Microsoft and Intel compilers for C, C++ and Fortran. It also supports Microsoft .NET, including C# and Visual Basic.

Processor Support

Supports the latest Intel processors, including Intel quad-core processors.

Windows* Command Line Capability

Save time by automating sampling: collect sampling data through the command line, independent of the Intel VTune Performance Analyzer GUI.

Tune Remote Linux* Systems from Your Windows Desktop

Collect performance data on a remote target Linux system and analyze the results on your controlling Windows system. Or, try the native Linux version of VTune Analyzer.

Tap Into an Expert Knowledge Base with Intel Tuning Assistant

Increase productivity using Intel Tuning Assistant to automatically provide advice based on an extensive knowledge base.

[Figure: Intel Tuning Assistant provides tuning advice. asmo-na/eng/336482.htm]

New

Note: Features listed as New are new since the last major release, 8.0. Some have been previewed in minor updates and beta releases.

Profile JavaScript* and Flash* Code

New profiling support in emerging internet browsers and other script-oriented products allows developers working with new JavaScript* or Flash* JIT technologies to analyze their code. Use the VTune analyzer to optimize for scalable performance of these codes on Windows* and Linux* to ensure the best end user experience with your application. VTune analyzer supports profiling JIT'd code when browser vendors add the required support. This enables deep performance analysis of these additional languages:

JavaScript / AJAX
Flash (ActionScript)

Check with your browser supplier for details on when their browser will enable support.

Profile Dynamically Generated Code

Many applications today emit their own runtime-generated or just-in-time (JIT) code. New profiling APIs in the VTune analyzer enable performance analysis of dynamic code and allow you to view annotated source code directly from the analysis results.
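To make the idea behind profiling dynamically generated code more concrete, here is a minimal sketch of the general technique: a runtime reports each piece of JIT-compiled code (address range plus a name) as it is emitted, and a sampling profiler later uses that table to attribute sampled instruction addresses to methods. This is not the VTune analyzer profiling API; the class `JitSymbolTable`, its methods, the addresses, and the method names are hypothetical and exist only for this example.

```python
import bisect
from collections import Counter


class JitSymbolTable:
    """Hypothetical registry of JIT-generated code regions (not a real VTune API)."""

    def __init__(self):
        self._starts = []   # sorted start addresses
        self._entries = []  # (start, size, name), kept parallel to _starts

    def method_loaded(self, start, size, name):
        """Called by the runtime when it emits a new block of generated code."""
        i = bisect.bisect_left(self._starts, start)
        self._starts.insert(i, start)
        self._entries.insert(i, (start, size, name))

    def resolve(self, address):
        """Map a sampled instruction address to a method name, or None."""
        i = bisect.bisect_right(self._starts, address) - 1
        if i >= 0:
            start, size, name = self._entries[i]
            if address < start + size:
                return name
        return None


# Invented example data: two generated methods and five sampled addresses.
table = JitSymbolTable()
table.method_loaded(0x1000, 0x80, "js:updateLayout")
table.method_loaded(0x2000, 0x40, "js:drawFrame")

samples = [0x1004, 0x1010, 0x2008, 0x3000, 0x107F]
hotspots = Counter(table.resolve(addr) or "<unknown>" for addr in samples)

for name, count in hotspots.most_common():
    print(f"{name}: {count} samples")
```

Without such a registration step, samples that land in generated code have no symbol to map to, which is why support has to come from the runtime or browser vendor as well as from the profiler.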

[Figure: See the profiling results on your JavaScript source. http://cache-www.intel.com/cd/00/00/40/10/401044_401044.gif]
[Figure: Visual Studio 2005 integration. asmo-na/eng/336193.htm]
[Figure: Microsoft Windows Vista*. asmo-na/eng/336191.htm]

Access to VTune Analyzer's Open Data Model

VTune analyzer can now support many different software platforms with performance sampling analysis. Use the new open data model APIs to combine the VTune analyzer's powerful GUI on Windows* or Linux* with data from your custom collector to analyze any application on a wide range of platforms.

Collect data on operating systems not directly supported by the VTune analyzer.
Supported Windows* Operating Systems
Supported Linux* Distributions
Collect data on embedded Intel hardware-based platforms.

Access to the Latest Experimental Technologies

VTune analyzer users have access to the latest experimental performance tuning technologies Intel has to offer. Visit whatif.intel.com and look for Intel Performance Tuning Utility and Intel Platform Modeling with Machine Learning. These tools include a number of exciting capabilities, including:

Statistical Call Tree: profiles with low overhead to detect where time is spent in your application.
Basic Block Analysis: displays hotspots with basic block granularity and generates a control flow graph for advanced analysis of an application, even without the source code.
Data Access Profiling: identifies memory hotspots and relates them to code hotspots.
Dependency Plots: visualize the relationships between metrics.
Event Rank: view the list of best predictors of performance using machine learning.
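The Statistical Call Tree entry above describes profiling by sampling rather than by instrumenting every call. As a rough sketch of that general approach, and not of how the Intel Performance Tuning Utility is implemented, the code below folds periodically sampled call stacks into a tree whose counts approximate where time is spent. The sample stacks and function names are invented for the example.

```python
def build_call_tree(stack_samples):
    """Fold sampled call stacks (outermost frame first) into a tree of counts."""
    root = {"name": "<root>", "samples": 0, "children": {}}
    for stack in stack_samples:
        node = root
        node["samples"] += 1
        for frame in stack:
            child = node["children"].setdefault(
                frame, {"name": frame, "samples": 0, "children": {}}
            )
            child["samples"] += 1  # inclusive count: frame was on the stack
            node = child
    return root


def format_tree(node, total, depth=0):
    """Render the tree as indented lines with each node's share of all samples."""
    share = 100 * node["samples"] / total
    lines = [f"{'  ' * depth}{node['name']}: {share:.0f}%"]
    for child in sorted(node["children"].values(), key=lambda c: -c["samples"]):
        lines.extend(format_tree(child, total, depth + 1))
    return lines


# Invented data: four stack samples taken at regular intervals.
samples = [
    ("main", "render", "shade"),
    ("main", "render", "shade"),
    ("main", "render", "blit"),
    ("main", "parse"),
]

tree = build_call_tree(samples)
print("\n".join(format_tree(tree, tree["samples"])))
```

Because only a small number of stacks are captured, the collection overhead stays low, at the cost of the percentages being statistical estimates rather than exact timings.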

Support

Intel Premier Support: Every purchase of an Intel Software Development Product includes a year of support services, which provides access to Intel Premier Support and all product updates during that time. Intel Premier Support gives you online access to technical notes, application notes, and documentation. For support issues specific to VTune analyzer, click on the support resources tab at https://premier.intel.com/premier

VTune analyzer is programming language and compiler independent. It supports all compilers that follow industry standards, including Microsoft and Intel compilers for C, C++ and Fortran. It also supports Microsoft .NET (including C# and Visual Basic).

© 2009, Intel Corporation. All rights reserved. Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and other countries. *Other names and brands may be claimed as the property of others.

0209/BLA/CMD/PDF 321523-001