Intel C++ Compiler Professional Edition 11.0 for Windows* In-Depth

Similar documents
Intel C++ Compiler Professional Edition 11.0 for Linux* In-Depth

Intel C++ Compiler Professional Edition 11.1 for Linux* In-Depth

Intel C++ Compiler Professional Edition 11.1 for Mac OS* X. In-Depth

Intel Visual Fortran Compiler Professional Edition 11.0 for Windows* In-Depth

Intel Threading Building Blocks (Intel TBB) 2.1. In-Depth

Intel Math Kernel Library 10.3

Installation Guide and Release Notes

Intel Parallel Studio XE 2011 for Windows* Installation Guide and Release Notes

Installation Guide and Release Notes

Efficiently Introduce Threading using Intel TBB

Intel Parallel Studio 2011

Intel Parallel Studio XE 2011 for Linux* Installation Guide and Release Notes

Intel Parallel Studio XE 2011 SP1 for Linux* Installation Guide and Release Notes

Oracle Developer Studio 12.6

More performance options

Intel Math Kernel Library

Graphics Performance Analyzer for Android

Intel Parallel Studio XE 2015

Installation Guide and Release Notes

Intel Performance Libraries

Intel PerfMon Performance Monitoring Hardware

David R. Mackay, Ph.D. Libraries play an important role in threading software to run faster on Intel multi-core platforms.

Installation Guide and Release Notes

A Simple Path to Parallelism with Intel Cilk Plus

Intel Array Building Blocks

Optimize an Existing Program by Introducing Parallelism

Intel Software Development Products Licensing & Programs Channel EMEA

Thread Profiler 2.0 Release Notes

Intel Software Development Products for High Performance Computing and Parallel Programming

Intel Parallel Studio XE 2015 Composer Edition for Linux* Installation Guide and Release Notes

Overview of Intel Parallel Studio XE

Sergey Maidanov. Software Engineering Manager for Intel Distribution for Python*

Installation Guide and Release Notes

Maximizing performance and scalability using Intel performance libraries

Intel C++ Compiler User's Guide With Support For The Streaming Simd Extensions 2

Intel Parallel Studio

Using the Intel Math Kernel Library (Intel MKL) and Intel Compilers to Obtain Run-to-Run Numerical Reproducible Results

Get an Easy Performance Boost Even with Unthreaded Apps. with Intel Parallel Studio XE for Windows*

Getting Started with Intel SDK for OpenCL Applications

Case Study. Optimizing an Illegal Image Filter System. Software. Intel Integrated Performance Primitives. High-Performance Computing

Getting Reproducible Results with Intel MKL

Using Intel Inspector XE 2011 with Fortran Applications

Intel Thread Checker 3.1 for Windows* Release Notes

Chao Yu, Technical Consulting Engineer, Intel IPP and MKL Team

Intel Atom Processor Based Platform Technologies. Intelligent Systems Group Intel Corporation

Debugging on Intel Platforms

Intel Parallel Studio

What s New August 2015

What s P. Thierry

Expressing and Analyzing Dependencies in your C++ Application

Using Intel Math Kernel Library with MathWorks* MATLAB* on Intel Xeon Phi Coprocessor System

Intel VTune Performance Analyzer 9.1 for Windows* In-Depth

From Serial to Parallel Intel Software Products for HPC

Using Intel VTune Amplifier XE for High Performance Computing

RTX64 Features by Release IZ-DOC-X R3

Intel System Studio 2014 Overview

Lecture 11 - Portability and Optimizations

WIND RIVER DIAB COMPILER

High Performance Computing Software Development Kit For Mac OS X In Depth Product Information

Intel Parallel Amplifier 2011

Intel MKL Sparse Solvers. Software Solutions Group - Developer Products Division

Alexei Katranov. IWOCL '16, April 21, 2016, Vienna, Austria

applications with SIMD and Hyper-Threading Technology by Chew Yean Yam Intel Corporation

Intel tools for High Performance Python 데이터분석및기타기능을위한고성능 Python

Parallel Programming Features in the Fortran Standard. Steve Lionel 12/4/2012

SDACCEL DEVELOPMENT ENVIRONMENT. The Xilinx SDAccel Development Environment. Bringing The Best Performance/Watt to the Data Center

Intel Math Kernel Library (Intel MKL) BLAS. Victor Kostin Intel MKL Dense Solvers team manager

Intel Thread Building Blocks

Installation Guide and Release Notes

Operating Systems (2INC0) 2018/19. Introduction (01) Dr. Tanir Ozcelebi. Courtesy of Prof. Dr. Johan Lukkien. System Architecture and Networking Group

Intel Parallel Amplifier

Expand Your HPC Market Reach and Grow Your Sales with Intel Cluster Ready

Brief notes on setting up semi-high performance computing environments. July 25, 2014

Software Tools for Software Developers and Programming Models

Parallel Programming on Larrabee. Tim Foley Intel Corp

Exploiting the Power of the Intel Compiler Suite. Dr. Mario Deilmann Intel Compiler and Languages Lab Software Solutions Group

This guide will show you how to use Intel Inspector XE to identify and fix resource leak errors in your programs before they start causing problems.

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit

Using Intel VTune Amplifier XE and Inspector XE in.net environment

Performance Tools for Technical Computing

Quick-Reference Guide to Optimization with Intel Compilers

Performance Analysis of BLAS Libraries in SuperLU_DIST for SuperLU_MCDT (Multi Core Distributed) Development

CSC573: TSHA Introduction to Accelerators

Understanding Dual-processors, Hyper-Threading Technology, and Multicore Systems

Eliminate Threading Errors to Improve Program Stability

Mikhail Dvorskiy, Jim Cownie, Alexey Kukanov

LENOVO BUSINESS SOLUTIONS AND INTEL OPTANE MEMORY

Computer Architecture and Structured Parallel Programming James Reinders, Intel

Intel Thread Building Blocks

RAD Studio XE Datasheet

EI 338: Computer Systems Engineering (Operating Systems & Computer Architecture)

Intel INDE Integrated Native Developer Experience

RTX64 Features by Release

Finding Non-trivial Opportunities for Parallelism in Existing Serial Code using OpenMP

Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant

Workshop on High Performance Computing (HPC08) School of Physics, IPM February 16-21, 2008 HPC tools: an overview

Also a new Metro style sample application which showcases media files transcoding with parameter control using Metro design language is introduced.

Intel Media Server Studio 2018 R1 - HEVC Decoder and Encoder Release Notes (Version )

Working with Metal Overview

Eliminate Memory Errors to Improve Program Stability

Transcription:

Intel C++ Compiler Professional Edition 11.0 for Windows* In-Depth

Contents Intel C++ Compiler Professional Edition for Windows*..... 3 Intel C++ Compiler Professional Edition At A Glance...3 Intel C++ Compiler Professional Edition Components:...3 Features...3 New in This Release..................................... 4 Technical Support...5 Compatibility...5 System Requirements at a Glance...5 2

Intel C++ Compiler Professional Edition for Windows* The features you need to create high-performance multi-threaded apps for multicore systems. The product includes: Intel C++ Compiler for Windows for IA-32, Intel 64, and IA-64 architectures Intel Threading Building Blocks Intel Integrated Performance Primitives Intel Math Kernel Library Compatible with Microsoft* Visual Studio* to protect your investment in how you develop. The Professional Edition offers a great price compared to the individual components. Attention Fortran developers Intel Compiler Suite Professional Edition for Windows includes everything listed above plus the Intel Visual Fortran Compiler for Windows. Take advantage of significant price savings over individual components. Intel C++ Compiler Professional Edition At A Glance Features Integration into Microsoft Visual Studio* 2005 and 2008 protects your investment in the way you develop software on and for Windows* based systems. Use Intel C++, Microsoft Visual C++* or both! Multi-Threaded Application Support including new in 11.0, OpenMP 3.0 (data- and now task-parallelism), and autoparallelization for simple and efficient software threading. Auto-vectorization parallelizes code to utilize the Streaming SIMD Extensions (SSE) instruction set architectures (SSE, SSE2, SSE3, SSSE3, and SSE4) of our latest processors. High-Performance Parallel Optimizer (HPO) restructures and optimizes loops to ensure that auto-vectorization, OpenMP, or auto-parallelization make best use of cache and memory accesses, SIMD instruction sets, and multiple cores. Compiles in a single pass, improving compile-time and producing more reliable code. Interprocedural Optimization (IPO) dramatically improves performance of small- to mid-sized functions, especially in programs containing calls within loops. IPO analysis gives feedback on vulnerabilities and coding errors, such as uninitialized variables or OpenMP API issues, which cannot be detected as well by other compilers. Profile-guided Optimization (PGO) improves application performance by reducing instruction-cache thrashing, reorganizing code layout, shrinking code size, and reducing branch mispredictions. Intel Threading Building Blocks is an award winning C++ template library that abstracts threads to tasks to create reliable, portable and scalable parallel applications. Intel TBB is the most efficient way to implement parallel applications and unleash multicore platform Intel Math Kernel Library includes optimized and scalable math routines for maximizing performance and seamlessly provides forward scaling from current to future many-core platforms. Intel C++ Compiler Professional Edition Components: The multi-threading and optimization technologies in Intel compilers and libraries give the Professional Edition its edge in helping you deliver high-performance applications for the latest multicore processors. 3 Intel Integrated Performance Primitives is an extensive library of multicore-ready, highly optimized software functions for multimedia data processing, and communications applications. New integrated, simplified installation gets you going with all capabilities quickly and easily. Simplified custom install makes it easy to identify just the components you want. Ongoing Premier Support now includes online community support forums to speed information flow, in addition to private, password-protected accounts. Technical support, interactive issues management, access to technical and application notes, product updates and more with every commercial and academic license.

New in This Release Feature New Processor Support Compiler and Debugger New exception handling C++ lambda functions OpenMP* 3.0 Improved valarray option SSE2 enabled by default Decimal floating point Thread-related front-end diagnostics Parallel compilation Intel Threading Building Blocks (Intel TBB) Loops, containers, mutexes, atomic operations, and more Task-oriented threading Threaded Performance Scalable Performance through data-parallelism Debug and release libraries Intel ATOM Processor - Create high-performance and battery saving applications for new Mobile Internet Devices! Same user-settable options produce betteroptimized code leading to improved application Already part of the next C++ standard, Intel C++ Lambda functions available now to simplify template libraries use with loop constructs like STL and Intel TBB. OpenMP raises the parallelism abstraction away from the API, simplifying threading and making code more portable. Previously limited to loop-based data-parallelism, the new 3.0 standard simplifies both data and task parallelism. Templates of array operations that enable low level hardware features to enhance application No source code change required to use! Take advantage of new Intel Streaming SIMD Extensions automatically through the compiler. No messy low-level coding to get the most from Intel processors. Resetable for other hosts/targets. IEEE 754R Standard implementation overcomes otherwise unavoidable precision issues implied by binary FP formats. Great for banking, accounting, billing, and e-commerce. Provides useful warnings about references and assignment to statically allocated variables and address references of statically allocated variables. Supports your build by appropriately allocating files to available processors to take advantage of multicore processors and speed you through your edit/compile/debug cycle. Intel TBB provides developers with high-level, STL-like library functions to take advantage of parallelism in existing or planned code. Covering loops, containers, mutex controls, atomic operations, complex task scheduling and more, Intel TBB simplifies threading, saves time and produces applications that scale as processors are added. Eliminate the tedium and inefficiencies of threading low-level, heavy constructs close to the hardware. Thread the way you think about your application and let the Intel TBB run-time library worry about the threads. Intel TBB focuses on the particular goal of parallelizing computationally intensive work, delivering higher-level, simpler solutions. Intel TBB emphasizes programming which scales well to larger numbers of processors by dividing the collection into smaller pieces. With data-parallel programming, program performance increases as you add processors. Intel TBB functions come in debug and release forms to support extensive internal checking before building the production version of your software with the release form of the Intel TBB function. This simplifies development and delivers great Feature Intel Integrated Performance Primitives (Intel IPP) Intel Core i7 and Atom processor support Deferred Mode Image Processing (DMIP) Layer Unified Image Codec (UIC) framework sample library Threaded Static libraries for all functional domains High level LZO data compression library + improved Zlib, gzip and bzip2 algorithms Data Integrity Functional Domain Variety of new functions and enhancements to Intel IPP samples Intel Threading Building Blocks (Intel TBB) Loops, containers, mutexes, atomic operations, and more Task-oriented threading Threaded Performance Scalable Performance through data-parallelism Debug and release libraries Continued support for new processors continues to future-proof your investment with assurance of support for each successive generation of processors. Introduced as a sample on top of Intel IPP libraries, this provides solutions for pipelined image operations on larger images, utilizes in-memory optimization and improves performance in a multithreading environment. Standardizes plug- and-play interfaces for various image codecs (JPEG, JPEG2000, etc.) to improve ease of implementation and maintenance. Provides higher performance threading to static library users. High performance implementations of most popular data compression algorithms using standard interfaces for ease of use. Reed-Solomon error correcting codes to preserve integrity of data in transmission, storage and encoding. Continued added value in response to user requests for added functionality including new signal and image processing transforms and codec enhancements. Intel TBB provides developers with high-level, STL-like library functions to take advantage of parallelism in existing or planned code. Covering loops, containers, mutex controls, atomic operations, complex task scheduling and more, Intel TBB simplifies threading, saves time and produces applications that scale as processors are added. Eliminate the tedium and inefficiencies of threading low-level, heavy constructs close to the hardware. Thread the way you think about your application and let the Intel TBB run-time library worry about the threads. Intel TBB focuses on the particular goal of parallelizing computationally intensive work, delivering higher-level, simpler solutions. Intel TBB emphasizes programming which scales well to larger numbers of processors by dividing the collection into smaller pieces. With data-parallel programming, program performance increases as you add processors. Intel TBB functions come in debug and release forms to support extensive internal checking before building the production version of your software with the release form of the Intel TBB function. This simplifies development and delivers great 4

Feature Intel Math Kernel Library (Intel MKL) New layered architecture New threading layer Discrete Fourier Transform Interface LAPACK enhancement VML extensions Sparse BLAS extensions Simplified installation New Online Support Community Processor Support The new architecture provides maximum support for different development environment configurations and processors in a single package. Link to the version of this layer that matches your development environment and rest assured that Intel MKL will not have threading incompatibilities with the threading in your application. The DftiCopyDescriptor function has been added for convenience when using the FFTs. The size of statically linked executables calling DFTI has been reduced significantly and complex storage is now available for real-to-real transforms. The capability to track and/or interrupt the progress of lengthy LAPACK computations has been added. A function called mkl_progress can be defined in a user application, which will be called regularly from a subset of the MKL LAPACK routines. With performance in mind, all VML functions are now threaded. And a new Enhance Performance mode is offered for applications where mathfunction inaccuracies don t dominate parameter inaccuracies (e.g., Monte Carlo simulations and media applications). Improvements include threaded level-3 sparse BLAS triangular solvers and support for all data types (single precision, complex and double complex). Streamlined, simplified complete installation for a seamless one-step installation of all components. Our enhanced online community support forums and knowledge base search capabilities help you find answers more quickly. This is in addition to private, password-protected accounts available with Premier Support. Go to the support section of the Web site for more information. The addition of support for Intel Atom processors continues to future-proof your investment with assurance of support for each successive generation of processors. That s a key advantage in a world where new hardware platforms come to market with awesome speed. For more details, see the Release Notes. Technical Support With the purchase of the product, you receive one year of technical support and product updates from Intel Premier Support, our interactive and password-protected issue management and communication web site. This premium support service allows you to submit questions, download product updates, and access technical notes, application notes, and other documentation. In addition, we have enhanced our user forums to provide a quick and easy first resource to help with most issues. Compatibility The Intel C++ Compiler for Windows is substantially standards compliant, including support for parallelization by means of OpenMP. Binaries are also substantially compatible with Microsoft Visual C++ binaries to support a mix and match interoperability capability. It also integrates with Microsoft Visual Studio 2005 and 2008 and provides expanded 32-bit and 64-bit multicore Intel processor support. The Intel C++ Compiler provides the following language conformances: ANSI/ISO standard for C language compilation (ISO/IEC 9899:1990) ANSI/ISO standard (ISO/IEC 14882:1998) for the C++ language OpenMP specification version 3.0 System Requirements at a Glance More specific information on installation requirements is available in the Release Notes but, at a glance, Intel C++ Compiler Professional Edition for Windows can be used on, and develop code for, Intel processors developed since the Intel Pentium 4 processor. Supported operating systems include Microsoft Windows* XP, Windows Vista* or Windows Server* 2003. Installation of all components in the product requires 2 GB free disk space. 2009, Intel Corporation. All rights reserved. Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and other countries. *Other names and brands may be claimed as the property of others. 0209/BLA/CMD/PDF 321503-001