IBM. Optimization and Programming Guide. IBM XL Fortran for AIX, V Version SC

Size: px
Start display at page:

Download "IBM. Optimization and Programming Guide. IBM XL Fortran for AIX, V Version SC"

Transcription

1 IBM XL Fortran for AIX, V IBM Optimization and Programming Guide Version SC

2

3 IBM XL Fortran for AIX, V IBM Optimization and Programming Guide Version SC

4 Note Before using this information and the product it supports, read the information in Notices on page 339. First edition This edition applies to IBM XL Fortran for AIX, V (Program 5765-J09; 5725-C74) and to all subsequent releases and modifications until otherwise indicated in new editions. Make sure you are using the correct edition for the level of the product. Copyright IBM Corporation 1990, US Government Users Restricted Rights Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

5 Contents About this document vii Who should read this document vii How to use this document vii How this document is organized vii Conventions viii Related information xii IBM XL Fortran information xii Standards and specifications xiii Other IBM information xiv Technical support xiv How to send your comments xiv Chapter 1. Optimizing your applications 1 Distinguishing between optimization and tuning.. 1 Steps in the optimization process Basic optimization Optimizing at level Optimizing at level Advanced optimization Optimizing at level An intermediate step: adding -qhot suboptions at level Optimizing at level Optimizing at level Specialized optimization techniques High-order transformation (HOT) Interprocedural analysis (IPA) Profile-directed feedback Handling table of contents (TOC) overflow Vector technology Using compiler reports to diagnose optimization opportunities Tracing procedures in your code Getting more performance Beyond performance: effective programming techniques Chapter 2. Tuning XL compiler applications Tuning for your target architecture Using -qarch Using -qtune Using -qcache Before you finish tuning Further option driven tuning Options for providing application characteristics 43 Options to control optimization transformations 46 Options to assist with performance analysis.. 47 Options that can inhibit performance Chapter 3. Advanced optimization concepts Aliasing Inlining Finding the right level of inlining Chapter 4. Managing code size Steps for reducing code size Compiler option influences on code size The -qipa compiler option The -qinline inlining option The -qhot compiler option The -qcompact compiler option Other influences on code size High activity areas Computed GOTOs and CASE constructs Code size with dynamic or static linking Chapter 5. Debugging optimized code 59 Understanding different results in optimized programs Debugging in the presence of optimization Using -qoptdebug to help debug optimized programs Chapter 6. Compiler-friendly programming techniques General practices Variables and pointers Arrays Choosing appropriate variable sizes Submodules (Fortran 2008) Chapter 7. High performance libraries 71 Using the Mathematical Acceleration Subsystem (MASS) libraries Using the scalar library Using the vector libraries Using the SIMD libraries Compiling and linking a program with MASS.. 84 Using the Basic Linear Algebra Subprograms BLAS BLAS function syntax Linking the libxlopt library Chapter 8. Parallel programming with XL Fortran Compiling your parallelized code The _OPENMP C preprocessor macro and conditional compilation Setting runtime options XLSMPOPTS Environment variables for OpenMP Optimizing your SMP code Developing and running SMP applications An introduction to parallelization directives Parallel region construct Work-sharing constructs Copyright IBM Corp. 1990, 2015 iii

6 Combined parallel work-sharing constructs Synchronization constructs Other OpenMP directives Non-OpenMP SMP directives Deprecated directive Detailed descriptions of parallelization directives 107 ATOMIC BARRIER CRITICAL / END CRITICAL DO / END DO DO SERIAL FLUSH MASTER / END MASTER ORDERED / END ORDERED PARALLEL / END PARALLEL PARALLEL DO / END PARALLEL DO PARALLEL SECTIONS / END PARALLEL SECTIONS PARALLEL WORKSHARE / END PARALLEL WORKSHARE SCHEDULE SECTIONS / END SECTIONS SINGLE / END SINGLE TASK / END TASK TASKWAIT TASKYIELD THREADLOCAL THREADPRIVATE WORKSHARE / END WORKSHARE Data sharing attribute rules Directive clauses COLLAPSE COPYIN COPYPRIVATE DEFAULT FINAL FIRSTPRIVATE IF LASTPRIVATE MERGEABLE NUM_THREADS ORDERED PRIVATE REDUCTION SCHEDULE SHARED UNTIED Routines for OpenMP omp_destroy_lock(svar) omp_destroy_nest_lock(nvar) omp_get_active_level() omp_get_ancestor_thread_num(level) omp_get_dynamic() omp_get_level() omp_get_max_active_levels() omp_get_max_threads() omp_get_nested() omp_get_num_procs() omp_get_num_threads() omp_get_schedule(kind, modifier) omp_get_team_size(level) omp_get_thread_limit() omp_get_thread_num() omp_get_wtick() omp_get_wtime() omp_in_final() omp_in_parallel() omp_init_lock(svar) omp_init_nest_lock(nvar) omp_set_dynamic(enable_expr) omp_set_lock(svar) omp_set_max_active_levels(max_levels) omp_set_nested(enable_expr) omp_set_nest_lock(nvar) omp_set_num_threads(number_of_threads_expr) 202 omp_set_schedule(kind, modifier) omp_test_lock(svar) omp_test_nest_lock(nvar) omp_unset_lock(svar) omp_unset_nest_lock(nvar) Pthreads Library Module Pthreads data structures, functions, and subroutines f_maketime(delay) f_pthread_attr_destroy(attr) f_pthread_attr_getdetachstate(attr, detach) f_pthread_attr_getguardsize(attr, guardsize) f_pthread_attr_getinheritsched(attr, inherit) f_pthread_attr_getschedparam(attr, param) f_pthread_attr_getschedpolicy(attr, policy) f_pthread_attr_getscope(attr, scope) f_pthread_attr_getstackaddr(attr, stackaddr) f_pthread_attr_getstacksize(attr, ssize) f_pthread_attr_init(attr) f_pthread_attr_setdetachstate(attr, detach) f_pthread_attr_setguardsize(attr, guardsize) f_pthread_attr_setinheritsched(attr, inherit) f_pthread_attr_setschedparam(attr, param) f_pthread_attr_setschedpolicy(attr, policy) f_pthread_attr_setscope(attr, scope) f_pthread_attr_setstackaddr(attr, stackaddr) f_pthread_attr_setstacksize(attr, ssize) f_pthread_attr_t f_pthread_cancel(thread) f_pthread_cleanup_pop(exec) f_pthread_cleanup_push(cleanup, flag, arg) f_pthread_cond_broadcast(cond) f_pthread_cond_destroy(cond) f_pthread_cond_init(cond, cattr) f_pthread_cond_signal(cond) f_pthread_cond_t f_pthread_cond_timedwait(cond, mutex, timeout) f_pthread_cond_wait(cond, mutex) f_pthread_condattr_destroy(cattr) f_pthread_condattr_getpshared(cattr, pshared) 229 f_pthread_condattr_init(cattr) f_pthread_condattr_setpshared(cattr, pshared) 230 f_pthread_condattr_t f_pthread_create(thread, attr, flag, ent, arg) f_pthread_detach(thread) f_pthread_equal(thread1, thread2) iv XL Fortran: Optimization and Programming Guide

7 f_pthread_exit(ret) f_pthread_getconcurrency() f_pthread_getschedparam(thread, policy, param) 235 f_pthread_getspecific(key, arg) f_pthread_join(thread, ret) f_pthread_key_create(key, dtr) f_pthread_key_delete(key) f_pthread_key_t f_pthread_kill(thread, sig) f_pthread_mutex_destroy(mutex) f_pthread_mutex_getprioceiling(mutex, old) f_pthread_mutex_init(mutex, mattr) f_pthread_mutex_lock(mutex) f_pthread_mutex_setprioceiling(mutex, new, old) 241 f_pthread_mutex_t f_pthread_mutex_trylock(mutex) f_pthread_mutex_unlock(mutex) f_pthread_mutexattr_destroy(mattr) f_pthread_mutexattr_getprioceiling(mattr, ceiling) f_pthread_mutexattr_getprotocol(mattr, proto) 244 f_pthread_mutexattr_getpshared(mattr, pshared) 245 f_pthread_mutexattr_gettype(mattr, type) f_pthread_mutexattr_init(mattr) f_pthread_mutexattr_setprioceiling(mattr, ceiling) f_pthread_mutexattr_setprotocol(mattr, proto) 247 f_pthread_mutexattr_setpshared(mattr, pshared) 248 f_pthread_mutexattr_settype(mattr, type) f_pthread_mutexattr_t f_pthread_once(once, initr) f_pthread_once_t f_pthread_rwlock_destroy(rwlock) f_pthread_rwlock_init(rwlock, rwattr) f_pthread_rwlock_rdlock(rwlock) f_pthread_rwlock_t f_pthread_rwlock_tryrdlock(rwlock) f_pthread_rwlock_trywrlock(rwlock) f_pthread_rwlock_unlock(rwlock) f_pthread_rwlock_wrlock(rwlock) f_pthread_rwlockattr_destroy(rwattr) f_pthread_rwlockattr_getpshared(rwattr, pshared) f_pthread_rwlockattr_init(rwattr) f_pthread_rwlockattr_setpshared(rwattr, pshared) f_pthread_rwlockattr_t f_pthread_self() f_pthread_setcancelstate(state, oldstate) f_pthread_setcanceltype(type, oldtype) f_pthread_setconcurrency(new_level) f_pthread_setschedparam(thread, policy, param) 261 f_pthread_setspecific(key, arg) f_pthread_t f_pthread_testcancel() f_sched_param f_sched_yield() f_timespec Chapter 9. Interlanguage calls Conventions for XL Fortran external names Mixed-language input and output Mixing Fortran and C Making calls to C functions work Passing data from one language to another Passing arguments between languages Passing global variables between languages Passing character types between languages Passing arrays between languages Passing pointers between languages Passing arguments by reference or by value Returning values from Fortran functions Arguments with the OPTIONAL attribute Type encoding and checking Assembler-level subroutine linkage conventions 276 The stack The Linkage Area The input parameter area The register save area The local stack area The output parameter area Linkage convention for argument passing Argument passing rules (by value) Order of arguments in argument list Linkage convention for function calls Pointers to functions Function values The stack floor Stack overflow Prolog and epilog Traceback Chapter 10. Implementation details of XL Fortran Input/Output (I/O) Implementation details of file formats File names Preconnected and Implicitly Connected Files File positioning Preserving the XL Fortran Version 2.3 file positioning I/O redirection How XL Fortran I/O interacts with pipes, special files, and links Default record lengths File permissions Selecting error messages and recovery actions Flushing I/O buffers Choosing locations and names for Input/Output files Naming files that are connected with no explicit name Naming scratch files Increasing throughput with logical volume I/O and data striping Logical volume I/O Data striping Asynchronous I/O Execution of an asychronous data transfer operation Usage Performance Compiler-generated temporary I/O items Contents v

8 System setup Linking Error handling XL Fortran thread-safe I/O library Synchronization of I/O operations Parallel I/O issues Use of I/O statements in signal handlers Asynchronous thread cancellation Chapter 11. Implementation details of XL Fortran floating-point processing. 311 IEEE floating-point overview Compiling for strict IEEE conformance IEEE single-precision and double-precision values IEEE extended-precision values Infinities and NaNs Exception-handling model Hardware-specific floating-point overview Single-precision and double-precision values 314 Extended-precision values How XL Fortran rounds floating-point calculations 316 Selecting the rounding mode Minimizing rounding errors Minimizing overall rounding Delaying rounding until run time Ensuring that the rounding mode is consistent 319 Duplicating the floating-point results of other systems Maximizing floating-point performance Detecting and trapping floating-point exceptions 321 Compiler features for trapping floating-point exceptions Operating system features for trapping floating-point exceptions Installing an exception handler Producing a core file Controlling the floating-point status and control register xlf_fp_util procedures fpgets and fpsets subroutines Sample programs for exception handling Causing exceptions for particular variables Minimizing the performance impact of floating-point exception trapping Chapter 12. Porting programs to XL Fortran Outline of the porting process Maintaining FORTRAN 77 source and object code 329 Portability of directives Common industry extensions that XL Fortran supports Mixing data types in statements Date and time routines Other libc routines Changing the default sizes of data types Name conflicts between your procedures and XL Fortran intrinsic procedures Reproducing results from other systems Chapter 13. Sample Fortran programs 333 Example 1 - XL Fortran source file Example 2 - valid C routine source file Example 3 - valid Fortran SMP source file Example 4 - invalid Fortran SMP source file Programming examples using the Pthreads library module Notices Trademarks Index vi XL Fortran: Optimization and Programming Guide

9 About this document This document is part of the IBM XL Fortran for AIX, V information suite. It provides both reference information and practical tips for using the optimization and tuning capabilities of XL Fortran to maximize application performance, as well as expanding on programming concepts such as I/O and interlanguage calls. Who should read this document How to use this document This document is for anyone who wants to exploit the capabilities of XL Fortran for optimizing and tuning Fortran programs. Readers should be familiar with their AIX operating system and have extensive Fortran programming experience with complex applications. However, users new to XL Fortran can still use this information to help them understand how the compiler's features can be used for effective program optimization. This guide focuses on specific programming and compilation techniques that can maximize XL Fortran application performance. It covers optimization and tuning strategies, recommended programming practices, and compilation procedures, debugging, and information about using XL Fortran advanced language features. This guide also contains cross-references to relevant topics of other reference guides in the XL Fortran information suite. Topics not described in this information are available as follows: v Installation, system requirements, last-minute updates: see the XL Fortran Installation Guide and product README file. v Overview of XL Fortran features: see the Getting Started with XL Fortran book. v Syntax, semantics, and implementation of the XL Fortran programming language: see the XL Fortran Language Reference book. v Compiler setup, compiling and running programs, compiler options, diagnostics: see the XL Fortran Compiler Reference book. v Operating system commands related to the use of the compiler: AIX Commands Reference, Volumes 1-6 and the information center for AIX operating systems. How this document is organized This guide includes the following topics: v Chapter 1, Optimizing your applications, on page 1 provides an overview of the optimization process. v Chapter 2, Tuning XL compiler applications, on page 39 discusses the compiler options available for optimizing and tuning code. v Chapter 3, Advanced optimization concepts, on page 49, Chapter 4, Managing code size, on page 53, and Chapter 5, Debugging optimized code, on page 59 discuss advanced techniques, such as optimizing loops and inlining code, and debug considerations for optimized code. Copyright IBM Corp. 1990, 2015 vii

10 v The following sections contain information about how to write optimization friendly and portable XL Fortran code that is interoperable with other languages. Also included is a description of XL Fortran support for OpenMP and SMP with guidelines for writing parallel code. Chapter 6, Compiler-friendly programming techniques, on page 65 Chapter 7, High performance libraries, on page 71 Chapter 8, Parallel programming with XL Fortran, on page 89 Chapter 9, Interlanguage calls, on page 265 v The following sections contain information about XL Fortran and its implementation that can be useful for new and experienced users alike, as well as those who want to move their existing Fortran applications to the XL Fortran compiler: Chapter 10, Implementation details of XL Fortran Input/Output (I/O), on page 291 Chapter 11, Implementation details of XL Fortran floating-point processing, on page 311 Chapter 12, Porting programs to XL Fortran, on page 329 v Chapter 13, Sample Fortran programs, on page 333 provides coding examples for XL Fortran. Conventions Typographical conventions The following table shows the typographical conventions used in the IBM XL Fortran for AIX, V information. Table 1. Typographical conventions Typeface Indicates Example lowercase bold Invocation commands, executable names, and compiler options. The compiler provides basic invocation commands, xlf, along with several other compiler invocation commands to support various Fortran language levels and compilation environments. The default file name for the executable program is a.out. italics underlining monospace UPPERCASE bold Parameters or variables whose actual names or values are to be supplied by the user. Italics are also used to introduce new terms. The default setting of a parameter of a compiler option or directive. Examples of program code, reference to program code, file names, path names, command strings, or user-defined names. Fortran programming keywords, statements, directives, and intrinsic procedures. Uppercase letters may also be used to indicate the minimum number of characters required to invoke a compiler option/suboption. Make sure that you update the size parameter if you return more than the size requested. nomaf maf To compile and optimize myprogram.f, enter: xlf myprogram.f -O3. The ASSERT directive applies only to the DO loop immediately following the directive, and not to any nested DO loops. viii XL Fortran: Optimization and Programming Guide

11 Qualifying elements (icons and bracket separators) In descriptions of language elements, this information uses icons and marked bracket separators to delineate the Fortran language standard text as follows: Table 2. Qualifying elements Icon F2008 F2008 Bracket separator text Fortran 2008 begins / Fortran 2008 ends Fortran 2003 begins / Fortran 2003 ends IBM extension begins / IBM extension ends Meaning The text describes an IBM XL Fortran implementation of the Fortran 2008 standard. The text describes an IBM XL Fortran implementation of the Fortran 2003 standard, and it applies to all later standards. The text describes a feature that is an IBM XL Fortran extension to the standard language specifications. TS TS TS begins / TS ends The text describes an IBM XL Fortran implementation of Technical Specification 29113, referred to as TS Note: If the information is marked with a Fortran language standard icon or bracket separators, it applies to this specific Fortran language standard and all later ones. If it is not marked, it applies to all Fortran language standards. Syntax diagrams Throughout this information, diagrams illustrate XL Fortran syntax. This section helps you to interpret and use those diagrams. v Read the syntax diagrams from left to right, from top to bottom, following the path of the line. The symbol indicates the beginning of a command, directive, or statement. The symbol indicates that the command, directive, or statement syntax is continued on the next line. The symbol indicates that a command, directive, or statement is continued from the previous line. The symbol indicates the end of a command, directive, or statement. Fragments, which are diagrams of syntactical units other than complete commands, directives, or statements, start with the symbol and end with the symbol. IBM XL Fortran extensions are marked by a number in the syntax diagram with an explanatory note immediately following the diagram. Program units, procedures, constructs, interface blocks and derived-type definitions consist of several individual statements. For such items, a box encloses the syntax representation, and individual syntax diagrams show the required order for the equivalent Fortran statements. v Required items are shown on the horizontal line (the main path): About this document ix

12 keyword required_argument v Optional items are shown below the main path: keyword optional_argument Note: Optional items (not in syntax diagrams) are enclosed by square brackets ([ and ]). For example, [UNIT=]u v If you can choose from two or more items, they are shown vertically, in a stack. If you must choose one of the items, one item of the stack is shown on the main path. keyword required_argument1 required_argument2 If choosing one of the items is optional, the entire stack is shown below the main path. keyword optional_argument1 optional_argument2 v An arrow returning to the left above the main line (a repeat arrow) indicates that you can make more than one choice from the stacked items or repeat an item. The separator character, if it is other than a blank, is also indicated:, keyword repeatable_argument v The item that is the default is shown above the main path. keyword default_argument alternate_argument v Keywords are shown in nonitalic letters and should be entered exactly as shown. v Variables are shown in italicized lowercase letters. They represent user-supplied names or values. If a variable or user-specified name ends in _list, you can provide a list of these terms separated by commas. v If punctuation marks, parentheses, arithmetic operators, or other such symbols are shown, you must enter them as part of the syntax. Sample syntax diagram The following is an example of a syntax diagram with an interpretation: x XL Fortran: Optimization and Programming Guide

13 (1) EXAMPLE char_constant a b c d, e name_list Notes: 1 IBM extension Interpret the diagram as follows: v Enter the keyword EXAMPLE. v EXAMPLE is an IBM extension. v Enter a value for char_constant. v Enter a value for a or b, but not for both. v Optionally, enter a value for c or d. v Enter at least one value for e. If you enter more than one value, you must put a comma between each. v Enter the value of at least one name for name_list. If you enter more than one value, you must put a comma between each. (The _list syntax is equivalent to the previous syntax for e.) How to read syntax statements Syntax statements are read from left to right: v Individual required arguments are shown with no special notation. v When you must make a choice between a set of alternatives, they are enclosed by { and } symbols. v Optional arguments are enclosed by [ and ] symbols. v When you can select from a group of choices, they are separated by characters. v Arguments that you can repeat are followed by ellipses (...). Example of a syntax statement EXAMPLE char_constant {a b}[c d]e[,e]... name_list{name_list}... The following list explains the syntax statement: v Enter the keyword EXAMPLE. v Enter a value for char_constant. v Enter a value for a or b, but not for both. v Optionally, enter a value for c or d. v Enter at least one value for e. If you enter more than one value, you must put a comma between each. v Optionally, enter the value of at least one name for name_list. If you enter more than one value, you must put a comma between each name. Note: The same example is used in both the syntax-statement and syntax-diagram representations. About this document xi

14 Related information Examples in this information The examples in this information, except where otherwise noted, are coded in a simple style that does not try to conserve storage, check for errors, achieve fast performance, or demonstrate all possible methods to achieve a specific result. The examples for installation information are labelled as either Example or Basic example. Basic examples are intended to document a procedure as it would be performed during a basic, or default, installation; these need little or no modification. Notes on the terminology used Some of the terminology in this information is shortened as follows: v The term free source form format often appears as free source form. v The term fixed source form format often appears as fixed source form. v The term XL Fortran often appears as XLF. The following sections provide related information for XL Fortran: IBM XL Fortran information XL Fortran provides product information in the following formats: v Quick Start Guide The Quick Start Guide (quickstart.pdf) is intended to get you started with IBM XL Fortran for AIX, V It is located by default in the XL Fortran directory and in the \quickstart directory of the installation DVD. v README files README files contain late-breaking information, including changes and corrections to the product information. README files are located by default in the XL Fortran directory and in the root directory of the installation DVD. v Installable man pages Man pages are provided for the compiler invocations and all command-line utilities provided with the product. Instructions for installing and accessing the man pages are provided in the IBM XL Fortran for AIX, V Installation Guide. v Online product documentation The fully searchable HTML-based documentation is viewable in IBM Knowledge Center at com.ibm.compilers.aix.doc/welcome.html. v PDF documents PDF documents are available on the web at docview.wss?uid=swg The following files comprise the full set of XL Fortran product information: Table 3. XL Fortran PDF files Document title IBM XL Fortran for AIX, V Installation Guide, SC PDF file name install.pdf Description Contains information for installing XL Fortran and configuring your environment for basic compilation and program execution. xii XL Fortran: Optimization and Programming Guide

15 Table 3. XL Fortran PDF files (continued) Document title Getting Started with IBM XL Fortran for AIX, V15.1.3, SC IBM XL Fortran for AIX, V Compiler Reference, SC IBM XL Fortran for AIX, V Language Reference, SC IBM XL Fortran for AIX, V Optimization and Programming Guide, SC PDF file name Description getstart.pdf Contains an introduction to the XL Fortran product, with information about setting up and configuring your environment, compiling and linking programs, and troubleshooting compilation errors. compiler.pdf Contains information about the various compiler options and environment variables. langref.pdf Contains information about the Fortran programming language as supported by IBM, including language extensions for portability and conformance to nonproprietary standards, compiler directives and intrinsic procedures. proguide.pdf Contains information on advanced programming topics, such as application porting, interlanguage calls, floating-point operations, input/output, application optimization and parallelization, and the XL Fortran high-performance libraries. To read a PDF file, use Adobe Reader. If you do not have Adobe Reader, you can download it (subject to license terms) from the Adobe website at More information related to XL Fortran, including IBM Redbooks publications, white papers, and other articles, is available on the web at support/docview.wss?uid=swg For more information about Fortran, see the Fortran café at communityview?communityuuid=b10932b4-0edd-4e61-89f2-6e478ccba9aa. Standards and specifications XL Fortran is designed to support the following standards and specifications. You can refer to these standards and specifications for precise definitions of some of the features found in this information. v American National Standard Programming Language FORTRAN, ANSI X v American National Standard Programming Language Fortran 90, ANSI X v ANSI/IEEE Standard for Binary Floating-Point Arithmetic, ANSI/IEEE Std v Federal (USA) Information Processing Standards Publication Fortran, FIPS PUB v Information technology - Programming languages - Fortran, ISO/IEC :1991. (This information uses its informal name, Fortran 90.) v Information technology - Programming languages - Fortran - Part 1: Base language, ISO/IEC :1997. (This information uses its informal name, Fortran 95.) v Information technology - Programming languages - Fortran - Part 1: Base language, ISO/IEC :2004. (This information uses its informal name, Fortran 2003.) v Information technology - Programming languages - Fortran - Part 1: Base language, ISO/IEC :2010. (This information uses its informal name, Fortran We currently provide partial support to this standard.) About this document xiii

16 v Information technology - Further interoperability of Fortran with C, ISO/IEC TS 29113:2012. (This information uses its informal name, Technical specification 29113, referred to as TS We currently provide partial support to this specification.) v Military Standard Fortran DOD Supplement to ANSI X , MIL-STD-1753 (United States of America, Department of Defense standard). Note that XL Fortran supports only those extensions documented in this standard that have also been subsequently incorporated into the Fortran 90 standard. v OpenMP Application Program Interface Version 3.1 (full support), and OpenMP Application Program Interface Version 4.0 (partial support), available at Other IBM information Technical support v Parallel Environment for AIX: Operation and Use v The IBM Systems Information Center, at infocenter/systems/index.jsp?topic=/com.ibm.aix.doc/doc/base/aixparent.htm, is a resource for AIX information. You can find the following books for your specific AIX system: AIX Commands Reference, Volumes 1-6 Technical Reference: Base Operating System and Extensions, Volumes 1 & 2 AIX National Language Support Guide and Reference AIX General Programming Concepts: Writing and Debugging Programs AIX Assembler Language Reference How to send your comments Additional technical support is available from the XL Fortran Support page at This page provides a portal with search capabilities to a large selection of Technotes and other support information. If you cannot find what you need, you can send an to compinfo@ca.ibm.com. For the latest information about XL Fortran, visit the product information site at Your feedback is important in helping us to provide accurate and high-quality information. If you have any comments about this information or any other XL Fortran information, send your comments to compinfo@ca.ibm.com. Be sure to include the name of the manual, the part number of the manual, the version of XL Fortran, and, if applicable, the specific location of the text you are commenting on (for example, a page number or table number). xiv XL Fortran: Optimization and Programming Guide

17 Chapter 1. Optimizing your applications The XL compilers enable development of high performance 32-bit and 64-bit applications by offering a comprehensive set of performance enhancing techniques that exploit the multilayered PowerPC architecture. These performance advantages depend on good programming techniques, thorough testing and debugging, followed by optimization, and tuning. Distinguishing between optimization and tuning You can use optimization and tuning separately or in combination to increase the performance of your application. Understanding the difference between them is the first step in understanding how the different levels, settings, and techniques can increase performance. Optimization Optimization is a compiler-driven process that searches for opportunities to restructure your source code and give your application better overall performance at run time, without significantly impacting development time. The XL compiler optimization suite, which you control using compiler options and directives, performs best on well-written source code that has already been through a thorough debugging and testing process. These optimization transformations can bring the following benefits: v Reduce the number of instructions that your application executes to perform critical operations. v Restructure your object code to make optimal use of the PowerPC architecture. v Improve memory subsystem usage. v Exploit the ability of the architecture to handle large amounts of shared memory parallelization. Each basic optimization technique can result in a performance benefit, although not all optimizations can benefit all applications. Consult the Steps in the optimization process on page 2 for an overview of the common sequence of steps that you can use to increase the performance of your application. Tuning Tuning is a user-driven process where you experiment with changes, for example to source code or compiler options, to make the compiler better optimize your program. While optimization applies general transformations designed to improve the performance of any application in any supported environment, tuning offers you opportunities to adjust specific characteristics or target execution environments of your application to improve its performance. Even at low optimization levels, tuning for your application and target architecture can have a positive impact on performance. With proper tuning, the compiler can make the following improvements: v Select more efficient machine instructions. v Generate instruction sequences that are more relevant to your application. v Select from more focussed optimizations to improve your code. Copyright IBM Corp. 1990,

18 For instructions, see Tuning XL compiler applications. Steps in the optimization process Basic optimization When you begin the optimization process, consider that not all optimization techniques suit all applications. Trade-offs sometimes occur between an increase in compile time, a reduction in debugging capability, and the improvements that optimization can provide. Learning about and experimenting with different optimization techniques can help you strike the right balance for your XL compiler applications while achieving the best possible performance. Also, though it is unnecessary to hand-optimize your code, compiler-friendly programming can be extremely beneficial to the optimization process. Unusual constructs can obscure the characteristics of your application and make performance optimization difficult. Use the steps in this section as a guide for optimizing your application. 1. The Basic optimization step begins your optimization processes at levels 0 and The Advanced optimization step exposes your application to more intense optimizations at levels 3, 4, and The High-order transformation (HOT) step can help you reduce loop execution time. 4. The Interprocedural analysis (IPA) step can optimize your entire application at once. 5. The Profile-directed feedback (PDF) step focuses optimizations on specific characteristics of your application. 6. The Debugging optimized code step can help you identify issues and problems that can occur with optimized code. 7. The Getting more performance section offers other strategies and tuning alternatives to compiler-driven optimization. The section Compiler-friendly programming techniques contains tips for writing more easily optimized source code. The XL compiler supports several levels of optimization, with each option level building on the levels below through increasingly aggressive transformations and consequently using more machine resources. Ensure that your application compiles and executes properly at low optimization levels before you try more aggressive optimizations. This topic discusses two optimizations levels, listed with complementary options in Table 4. The table also includes a column for compiler options that can have a performance benefit at that optimization level for some applications. Table 4. Basic optimizations Optimization level Additional options implied by default Complementary options -O0 None -qarch None -O2 -qmaxmem=8192 -qarch -qtune Other options with possible benefits -qmaxmem=-1 -qhot=level=0 2 XL Fortran: Optimization and Programming Guide

19 Note: Specifying -O without including a level implies -O2. Optimizing at level 0 Benefits at level 0 v Provides minimal performance improvement with minimal impact on machine resources v Exposes some source code problems that can be helpful in the debugging process Begin your optimization process at -O0, which the compiler already specifies by default. This level performs basic analytical optimization by removing obviously redundant code, and it can result in better compile time. It also ensures your code is algorithmically correct so you can move forward to more complex optimizations. -O0 also includes some redundant instruction elimination and constant folding. The -qfloat=nofold option can be used to suppress folding floating-point operations. Optimizing at this level accurately preserves all debugging information and can expose problems in existing code, such as uninitialized variables. Additionally, specifying -qarch at this level targets your application for a particular machine and can significantly improve performance by ensuring that your application takes advantage of all applicable architectural benefits. Note: For SMP programs, you need to add an additional option -qsmp=noopt. For more information about tuning, consult Tuning for Your Target Architecture. See "-O" in the XL Fortran Compiler Reference for information about the -O level syntax. Related information in the XL Fortran Compiler Reference -qarch Optimizing at level 2 Benefits at level 2 v Eliminates redundant code v Performs basic loop optimization v Structures code to take advantage of -qarch and -qtune settings After you successfully compile, execute, and debug your application using -O0, recompiling at -O2 opens your application to a set of comprehensive low-level transformations that apply to subprogram or compilation unit scopes and can include some inlining. Optimizations at -O2 attain a relative balance between increasing performance while limiting the impact on compilation time and system resources. You can increase the memory available to some of the optimizations in the -O2 portfolio by providing a larger value for the -qmaxmem option. Specifying -qmaxmem=-1 allows the optimizer to use memory as needed without checking for limits but does not change the transformations the optimizer applies to your application at -O2. Starting to tune at level 2 Choosing the right hardware architecture target or family of targets becomes even more important at -O2 and higher. By targeting the proper hardware, the optimizer can make the best use of the available hardware facilities. If you choose a family of Chapter 1. Optimizing your applications 3

20 Advanced optimization hardware targets, the -qtune option can direct the compiler to emit code that is consistent with the architecture choice and that can execute optimally on the chosen tuning hardware target. With this option, you can compile for a general set of targets and have the code run best on a particular target. For details on the -qarch and -qtune options, see Chapter 2, Tuning XL compiler applications, on page 39. The -O2 option can perform a number of additional optimizations as follows: v Common subexpression elimination: Eliminates redundant instructions v Constant propagation: Evaluates constant expressions at compile time v Dead code elimination: Eliminates instructions that a particular control flow does not reach or that generate an unused result v Dead store elimination: Eliminates unnecessary variable assignments v Global register allocation: Globally assigns user variables to registers v Value numbering: Simplifies algebraic expressions by eliminating redundant computations v Instruction scheduling for the target machine v Loop unrolling and software pipelining v Moving loop-invariant code out of loops v Simplifying control flow v Strength reduction and effective use of addressing modes v Widening: Merges adjacent load/stores and other operations v Pointer aliasing improvements to enhance other optimizations Even with -O2 optimizations, some useful information about your source code is made available to the debugger if you specify -g. Using a higher -g level increases the information provided to the debugger but reduces the optimization that can be done. Conversely, higher optimization levels can transform code to an extent to which debugging information is no longer accurate. The section on Chapter 5, Debugging optimized code, on page 59 discusses other debugging strategies in detail. See "-O" in the XL Fortran Compiler Reference for information on the -O level syntax. Higher optimization levels can have a tremendous impact on performance, but some trade-offs can occur in terms of code size, compile time, resource requirements, and numeric or algorithmic precision. After applying Basic optimization on page 2 and successfully compiling and executing your application, you can apply more powerful optimization tools. The XL compiler optimization portfolio includes many options for directing advanced optimization, and the transformations that your application undergoes are largely under your control. The discussion of each optimization level in Table 5 on page 5 includes information on the performance benefits and the possible trade-offs and information on how you can help guide the optimizer to find the best solutions for your application. 4 XL Fortran: Optimization and Programming Guide

21 Table 5. Advanced optimizations Optimization Level Additional options implied Complementary options Options with possible benefits -O3 -qnostrict -qmaxmem=-1 -qhot=level=0 -O4 -qnostrict -qmaxmem=-1 -qhot -qipa -qarch=auto -qtune=auto -qcache=auto -O5 All of -O4 -qipa=level=2 -qarch -qtune -qarch -qtune -qcache -qarch -qtune -qcache -qpdf -qpdf -qsmp=auto -qpdf -qsmp=auto When you compile programs with any of the following sets of options: v -qhot -qnostrict v -O3 -qhot v -O4 v -O5 the compiler automatically attempts to vectorize calls to system math functions by calling the equivalent vector functions in the Mathematical Acceleration Subsystem libraries (MASS), with the exceptions of functions vatan2, vsatan2, vdnint, vdint, vcosisin, vscosisin, vqdrt, vsqdrt, vrqdrt, vsrqdrt, vpopcnt4, and vpopcnt8. If the compiler cannot vectorize, it automatically tries to call the equivalent MASS scalar functions. For automatic vectorization or scalarization, the compiler uses versions of the MASS functions contained in the system library libxlopt.a. In addition to any of the preceding sets of options, when the -qipa option is in effect, if the compiler cannot vectorize, it tries to inline the MASS scalar functions before deciding to call them. Optimizing at level 3 Benefits at level 3 v In-depth Aliasing on page 49 analysis v Better loop scheduling v High-order loop analysis and transformations (-qhot=level=0) v Inlining of small procedures within a compilation unit by default v Eliminating implicit compile-time memory usage limits Specifying -O3 initiates more intense low-level transformations that remove many of the limitations present at -O2. For instance, the optimizer no longer checks for memory limits, by setting the default to -qmaxmem=-1. Additionally, optimizations encompass larger program regions and attempt more in-depth analysis. Although not all applications contain opportunities for the optimizer to provide a measurable increase in performance, most applications can benefit from this type of analysis. Chapter 1. Optimizing your applications 5

22 Potential trade-offs at level 3 With the in-depth analysis of -O3 comes a trade-off in terms of compilation time and memory resources. Also, because -O3 implies -qnostrict, the optimizer can alter certain floating-point semantics in your application to gain execution speed. This typically involves precision trade-offs as follows: v Reordering of floating-point computations v Reordering or elimination of possible exceptions, such as division by zero or overflow v Using alternative calculations that might give slightly less precise results or not handle infinities or NaNs in the same way You can still gain most of the -O3 benefits while preserving precise floating-point semantics by specifying -qstrict. Compiling with -qstrict is necessary if you require the same absolute precision in floating-point computational accuracy as you get with -O0, -O2, or -qnoopt results. The option -qstrict=ieeefp also ensures adherence to all IEEE semantics for floating-point operations. If your application is sensitive to floating-point exceptions or the order of evaluation for floating-point arithmetic, compiling with -qstrict, -qstrict=exceptions, or -qstrict=order helps to ensure accurate results. You should also consider the impact of the -qstrict=precision suboption group on floating-point computational accuracy. The precision suboption group includes the individual suboptions: subnormals, operationprecision, association, reductionorder, and library (described in the -qstrict option in the XL Fortran Compiler Reference). Without -qstrict, the difference in computation for any one source-level operation is very small in comparison to Basic optimization on page 2. Although a small difference can be compounded if the operation is in a loop structure where the difference becomes additive, most applications are not sensitive to the changes that can occur in floating-point semantics. For information about the -O level syntax, see "-O" in the XL Fortran Compiler Reference. An intermediate step: adding -qhot suboptions at level 3 At -O3, the optimization includes minimal -qhot loop transformations at level=0 to increase performance. To further increase your performance benefit from -qhot, increase the optimization aggressiveness by increasing the optimization level of -qhot. Try specifying -qhot without any suboptions or -qhot=level=1. The following -qhot suboptions can also provide additional performance benefits, depending on the characteristics of your application: v -qhot=vector to enable long vectorization v -qhot=arraypad to enable array padding v -qhot=fastmath to enable the replacement of math routines with those from the XLOPT library For more information about -qhot, see High-order transformation (HOT) on page 9. Conversely, if the application does not use loops processing arrays, which -qhot improves, you can improve compile speed significantly, usually with minimal performance loss by using -qnohot after -O3. 6 XL Fortran: Optimization and Programming Guide

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1.

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1. IBM XL Fortran for Linux, V15.1.1 IBM Optimization and Programming Guide for Little Endian Distributions Version 15.1.1 SC27-6600-00 IBM XL Fortran for Linux, V15.1.1 IBM Optimization and Programming

More information

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1.

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1. IBM XL Fortran for Linux, V15.1.3 IBM Optimization and Programming Guide for Little Endian Distributions Version 15.1.3 SC27-6600-02 IBM XL Fortran for Linux, V15.1.3 IBM Optimization and Programming

More information

Optimization and Programming Guide

Optimization and Programming Guide IBM XL Fortran for AIX, V12.1 Optimization and Programming Guide Version 12.1 SC23-8894-00 IBM XL Fortran for AIX, V12.1 Optimization and Programming Guide Version 12.1 SC23-8894-00 Note Before using

More information

Optimization and Programming Guide

Optimization and Programming Guide IBM XL Fortran Enterprise Edition for AIX, V11.1 Optimization and Programming Guide SC23-5836-00 IBM XL Fortran Enterprise Edition for AIX, V11.1 Optimization and Programming Guide SC23-5836-00 Note!

More information

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL Fortran for Linux, V16.1. Version 16.

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL Fortran for Linux, V16.1. Version 16. IBM XL Fortran for Linux, V16.1 IBM Optimization and Programming Guide for Little Endian Distributions Version 16.1 SC27-8049-00 IBM XL Fortran for Linux, V16.1 IBM Optimization and Programming Guide

More information

Optimization and Programming Guide

Optimization and Programming Guide IBM XL Fortran Enterprise Edition V10.1 for AIX Optimization and Programming Guide SC09-8010-00 IBM XL Fortran Enterprise Edition V10.1 for AIX Optimization and Programming Guide SC09-8010-00 Note! Before

More information

Optimization and Programming Guide

Optimization and Programming Guide IBM XL Fortran for Linux, V15.1 Optimization and Programming Guide Version 15.1 SC27-4256-00 IBM XL Fortran for Linux, V15.1 Optimization and Programming Guide Version 15.1 SC27-4256-00 Note Before using

More information

IBM. Getting Started with XL Fortran. IBM XL Fortran for AIX, V16.1. Version 16.1 SC

IBM. Getting Started with XL Fortran. IBM XL Fortran for AIX, V16.1. Version 16.1 SC IBM XL Fortran for AIX, V16.1 IBM Getting Started with XL Fortran Version 16.1 SC27-8056-00 IBM XL Fortran for AIX, V16.1 IBM Getting Started with XL Fortran Version 16.1 SC27-8056-00 Note Before using

More information

IBM. Getting Started with XL Fortran for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1.

IBM. Getting Started with XL Fortran for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1. IBM XL Fortran for Linux, V15.1.6 IBM Getting Started with XL Fortran for Little Endian Distributions Version 15.1.6 SC27-6620-05 IBM XL Fortran for Linux, V15.1.6 IBM Getting Started with XL Fortran

More information

IBM. Getting Started with XL Fortran for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1.

IBM. Getting Started with XL Fortran for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1. IBM XL Fortran for Linux, V15.1.1 IBM Getting Started with XL Fortran for Little Endian Distributions Version 15.1.1 SC27-6620-00 IBM XL Fortran for Linux, V15.1.1 IBM Getting Started with XL Fortran

More information

Getting Started with XL C

Getting Started with XL C IBM XL C for AIX, V12.1 Getting Started with XL C Version 12.1 SC14-7323-00 IBM XL C for AIX, V12.1 Getting Started with XL C Version 12.1 SC14-7323-00 Note Before using this information and the product

More information

IBM. Migration Guide for Little Endian Distributions. IBM XL Fortran for Linux, V Version GC

IBM. Migration Guide for Little Endian Distributions. IBM XL Fortran for Linux, V Version GC IBM XL Fortran for Linux, V15.1.6 IBM Migration Guide for Little Endian Distributions Version 15.1.6 GC27-8038-00 IBM XL Fortran for Linux, V15.1.6 IBM Migration Guide for Little Endian Distributions

More information

IBM. Installation Guide. IBM XL Fortran for AIX, V Version SC

IBM. Installation Guide. IBM XL Fortran for AIX, V Version SC IBM XL Fortran for AIX, V15.1.3 IBM Installation Guide Version 15.1.3 SC27-4243-02 IBM XL Fortran for AIX, V15.1.3 IBM Installation Guide Version 15.1.3 SC27-4243-02 Note Before using this information

More information

Code optimization with the IBM XL compilers on Power architectures IBM

Code optimization with the IBM XL compilers on Power architectures IBM Code optimization with the IBM XL compilers on Power architectures IBM December 2017 References in this document to IBM products, programs, or services do not imply that IBM intends to make these available

More information

IBM. Language Reference for Little Endian Distributions. IBM XL C/C++ for Linux, V16.1. Version 16.1 SC

IBM. Language Reference for Little Endian Distributions. IBM XL C/C++ for Linux, V16.1. Version 16.1 SC IBM XL C/C++ for Linux, V16.1 IBM Language Reference for Little Endian Distributions Version 16.1 SC27-8045-00 IBM XL C/C++ for Linux, V16.1 IBM Language Reference for Little Endian Distributions Version

More information

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL C/C++ for Linux, V Version 13.1.

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL C/C++ for Linux, V Version 13.1. IBM XL C/C++ for Linux, V13.1.6 IBM Optimization and Programming Guide for Little Endian Distributions Version 13.1.6 SC27-6560-05 IBM XL C/C++ for Linux, V13.1.6 IBM Optimization and Programming Guide

More information

IBM. Installation Guide. IBM XL C/C++ for Linux on z Systems, V1.2. Version 1.2 GC

IBM. Installation Guide. IBM XL C/C++ for Linux on z Systems, V1.2. Version 1.2 GC IBM XL C/C++ for Linux on z Systems, V1.2 IBM Installation Guide Version 1.2 GC27-5995-01 IBM XL C/C++ for Linux on z Systems, V1.2 IBM Installation Guide Version 1.2 GC27-5995-01 Note Before using this

More information

Optimising with the IBM compilers

Optimising with the IBM compilers Optimising with the IBM Overview Introduction Optimisation techniques compiler flags compiler hints code modifications Optimisation topics locals and globals conditionals data types CSE divides and square

More information

IBM. Migration Guide. IBM XL Fortran for AIX, V16.1. Version 16.1 GC

IBM. Migration Guide. IBM XL Fortran for AIX, V16.1. Version 16.1 GC IBM XL Fortran for AIX, V16.1 IBM Migration Guide Version 16.1 GC27-8052-00 IBM XL Fortran for AIX, V16.1 IBM Migration Guide Version 16.1 GC27-8052-00 Note Before using this information and the product

More information

IBM. Language Reference for Little Endian Distributions. IBM XL C/C++ for Linux, V Version SC

IBM. Language Reference for Little Endian Distributions. IBM XL C/C++ for Linux, V Version SC IBM XL C/C++ for Linux, V13.1.3 IBM Language Reference for Little Endian Distributions Version 13.1.3 SC27-6550-02 IBM XL C/C++ for Linux, V13.1.3 IBM Language Reference for Little Endian Distributions

More information

IBM. Language Reference. IBM XL C/C++ for Linux on z Systems, V1.2. Version 1.2 SC

IBM. Language Reference. IBM XL C/C++ for Linux on z Systems, V1.2. Version 1.2 SC IBM XL C/C++ for Linux on z Systems, V1.2 IBM Language Reference Version 1.2 SC27-5996-01 IBM XL C/C++ for Linux on z Systems, V1.2 IBM Language Reference Version 1.2 SC27-5996-01 Note Before using this

More information

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL C/C++ for Linux, V Version 13.1.

IBM. Optimization and Programming Guide for Little Endian Distributions. IBM XL C/C++ for Linux, V Version 13.1. IBM XL C/C++ for Linux, V13.1.3 IBM Optimization and Programming Guide for Little Endian Distributions Version 13.1.3 SC27-6560-02 IBM XL C/C++ for Linux, V13.1.3 IBM Optimization and Programming Guide

More information

Getting Started with XL C/C++

Getting Started with XL C/C++ IBM XL C/C++ Enterprise Edition V8.0 for AIX Getting Started with XL C/C++ SC09-7997-00 IBM XL C/C++ Enterprise Edition V8.0 for AIX Getting Started with XL C/C++ SC09-7997-00 Note! Before using this

More information

Upgrading XL Fortran Compilers

Upgrading XL Fortran Compilers Upgrading XL Fortran Compilers Oeriew Upgrading to the latest IBM XL Fortran compilers makes good business sense. Upgrading puts new capabilities into the hands of your programmers making them and your

More information

IBM XL C/C++ Alpha Edition for Multicore Acceleration. Using the single-source compiler

IBM XL C/C++ Alpha Edition for Multicore Acceleration. Using the single-source compiler IBM XL C/C++ Alpha Edition for Multicore Acceleration for Linux, V0.9 Using the single-source compiler IBM XL C/C++ Alpha Edition for Multicore Acceleration for Linux, V0.9 Using the single-source compiler

More information

IBM. Language Reference. IBM XL C/C++ for Linux on z Systems, V1.1. Version 1.1 SC

IBM. Language Reference. IBM XL C/C++ for Linux on z Systems, V1.1. Version 1.1 SC IBM XL C/C++ for Linux on z Systems, V1.1 IBM Language Reference Version 1.1 SC27-5996-00 IBM XL C/C++ for Linux on z Systems, V1.1 IBM Language Reference Version 1.1 SC27-5996-00 Note Before using this

More information

IBM. Optimization and Programming Guide. IBM XL C/C++ for Linux on z Systems, V1.2. Version 1.2 SC

IBM. Optimization and Programming Guide. IBM XL C/C++ for Linux on z Systems, V1.2. Version 1.2 SC IBM XL C/C++ for Linux on z Systems, V1.2 IBM Optimization and Programming Guide Version 1.2 SC27-5997-01 IBM XL C/C++ for Linux on z Systems, V1.2 IBM Optimization and Programming Guide Version 1.2 SC27-5997-01

More information

IBM. Language Reference. IBM XL C for AIX, V Version SC

IBM. Language Reference. IBM XL C for AIX, V Version SC IBM XL C for AIX, V13.1.2 IBM Language Reference Version 13.1.2 SC27-4240-01 IBM XL C for AIX, V13.1.2 IBM Language Reference Version 13.1.2 SC27-4240-01 Note Before using this information and the product

More information

IBM. Migration Guide for Little Endian Distributions. IBM XL C/C++ for Linux, V Version GC

IBM. Migration Guide for Little Endian Distributions. IBM XL C/C++ for Linux, V Version GC IBM XL C/C++ for Linux, V13.1.6 IBM Migration Guide for Little Endian Distributions Version 13.1.6 GC27-8036-00 IBM XL C/C++ for Linux, V13.1.6 IBM Migration Guide for Little Endian Distributions Version

More information

IBM. Getting Started with CUDA Fortran programming using XL Fortran for Little Endian Distributions. IBM XL Fortran for Linux, V16.1. Version 16.

IBM. Getting Started with CUDA Fortran programming using XL Fortran for Little Endian Distributions. IBM XL Fortran for Linux, V16.1. Version 16. IBM XL Fortran for Linux, V16.1 IBM Getting Started with CUDA Fortran programming using XL Fortran for Little Endian Distributions Version 16.1 GI13-3565-00 IBM XL Fortran for Linux, V16.1 IBM Getting

More information

IBM. Getting Started with CUDA Fortran programming using XL Fortran for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1.

IBM. Getting Started with CUDA Fortran programming using XL Fortran for Little Endian Distributions. IBM XL Fortran for Linux, V Version 15.1. IBM XL Fortran for Linux, V15.1.4 IBM Getting Started with CUDA Fortran programming using XL Fortran for Little Endian Distributions Version 15.1.4 GI13-3562-00 IBM XL Fortran for Linux, V15.1.4 IBM Getting

More information

OpenMP Application Program Interface

OpenMP Application Program Interface OpenMP Application Program Interface DRAFT Version.1.0-00a THIS IS A DRAFT AND NOT FOR PUBLICATION Copyright 1-0 OpenMP Architecture Review Board. Permission to copy without fee all or part of this material

More information

IBM. Code optimization with the IBM z/os XL C/C++ compiler. Introduction. z/os XL C/C++ compiler

IBM. Code optimization with the IBM z/os XL C/C++ compiler. Introduction. z/os XL C/C++ compiler IBM Code optimization with the IBM z/os XL C/C++ compiler Introduction The IBM z/os XL C/C++ compiler is built on an industry-wide reputation for robustness, versatility, and high level of standards compliance.

More information

IBM XL C for AIX, V12.1. Language Reference. Version 12.1 SC

IBM XL C for AIX, V12.1. Language Reference. Version 12.1 SC IBM XL C for AIX, V12.1 Language Reference Version 12.1 SC14-7326-00 IBM XL C for AIX, V12.1 Language Reference Version 12.1 SC14-7326-00 Note Before using this information and the product it supports,

More information

OpenMP Application Program Interface

OpenMP Application Program Interface OpenMP Application Program Interface Version.0 - RC - March 01 Public Review Release Candidate Copyright 1-01 OpenMP Architecture Review Board. Permission to copy without fee all or part of this material

More information

Getting Started with XL Fortran

Getting Started with XL Fortran IBM XL Fortran for Linux, V15.1 Getting Started with XL Fortran Version 15.1 SC27-4252-00 IBM XL Fortran for Linux, V15.1 Getting Started with XL Fortran Version 15.1 SC27-4252-00 Note Before using this

More information

IBM XL C for AIX, V11.1. Language Reference. Version 11.1 SC

IBM XL C for AIX, V11.1. Language Reference. Version 11.1 SC IBM XL C for AIX, V11.1 Language Reference Version 11.1 SC27-2477-00 IBM XL C for AIX, V11.1 Language Reference Version 11.1 SC27-2477-00 Note Before using this information and the product it supports,

More information

IBM XL C/C++ for Multicore Acceleration for Linux, V9.0

IBM XL C/C++ for Multicore Acceleration for Linux, V9.0 IBM XL C/C++ for Multicore Acceleration for Linux, V9.0 Language Reference SC23-8519-00 IBM XL C/C++ for Multicore Acceleration for Linux, V9.0 Language Reference SC23-8519-00 Note! Before using this

More information

IBM. Enterprise Systems Architecture/ Extended Configuration Principles of Operation. z/vm. Version 6 Release 4 SC

IBM. Enterprise Systems Architecture/ Extended Configuration Principles of Operation. z/vm. Version 6 Release 4 SC z/vm IBM Enterprise Systems Architecture/ Extended Configuration Principles of Operation Version 6 Release 4 SC24-6192-01 Note: Before you use this information and the product it supports, read the information

More information

IBM Tivoli Federated Identity Manager Version Installation Guide GC

IBM Tivoli Federated Identity Manager Version Installation Guide GC IBM Tivoli Federated Identity Manager Version 6.2.2 Installation Guide GC27-2718-01 IBM Tivoli Federated Identity Manager Version 6.2.2 Installation Guide GC27-2718-01 Note Before using this information

More information

Version Monitoring Agent User s Guide SC

Version Monitoring Agent User s Guide SC Tivoli IBM Tivoli Advanced Catalog Management for z/os Version 02.01.00 Monitoring Agent User s Guide SC23-7974-00 Tivoli IBM Tivoli Advanced Catalog Management for z/os Version 02.01.00 Monitoring Agent

More information

IBM XL Fortran Advanced Edition V8.1 for Mac OS X A new platform supported in the IBM XL Fortran family

IBM XL Fortran Advanced Edition V8.1 for Mac OS X A new platform supported in the IBM XL Fortran family Software Announcement January 13, 2004 IBM XL Fortran Advanced Edition V8.1 for Mac OS X A new platform supported in the IBM XL Fortran family Overview IBM extends the XL Fortran family to the Apple Mac

More information

About the Authors... iii Introduction... xvii. Chapter 1: System Software... 1

About the Authors... iii Introduction... xvii. Chapter 1: System Software... 1 Table of Contents About the Authors... iii Introduction... xvii Chapter 1: System Software... 1 1.1 Concept of System Software... 2 Types of Software Programs... 2 Software Programs and the Computing Machine...

More information

OpenMP Technical Report 3 on OpenMP 4.0 enhancements

OpenMP Technical Report 3 on OpenMP 4.0 enhancements OPENMP ARB OpenMP Technical Report on OpenMP.0 enhancements This Technical Report specifies OpenMP.0 enhancements that are candidates for a future OpenMP.1: (e.g. for asynchronous execution on and data

More information

Db2 Query Management Facility Version 12 Release 2. Installing and Managing Db2 QMF for TSO and CICS IBM GC

Db2 Query Management Facility Version 12 Release 2. Installing and Managing Db2 QMF for TSO and CICS IBM GC Db2 Query Management Facility Version 12 Release 2 Installing and Managing Db2 QMF for TSO and CICS IBM GC27-8877-02 Db2 Query Management Facility Version 12 Release 2 Installing and Managing Db2 QMF

More information

IBM. Installation Guide. IBM XL C/C++ for Linux, V Version GC

IBM. Installation Guide. IBM XL C/C++ for Linux, V Version GC IBM XL C/C++ for Linux, V13.1.5 IBM Installation Guide Version 13.1.5 GC27-6540-04 IBM XL C/C++ for Linux, V13.1.5 IBM Installation Guide Version 13.1.5 GC27-6540-04 Note Before using this information

More information

CROSSREF Manual. Tools and Utilities Library

CROSSREF Manual. Tools and Utilities Library Tools and Utilities Library CROSSREF Manual Abstract This manual describes the CROSSREF cross-referencing utility, including how to use it with C, COBOL 74, COBOL85, EXTENDED BASIC, FORTRAN, Pascal, SCREEN

More information

A Fast Review of C Essentials Part I

A Fast Review of C Essentials Part I A Fast Review of C Essentials Part I Structural Programming by Z. Cihan TAYSI Outline Program development C Essentials Functions Variables & constants Names Formatting Comments Preprocessor Data types

More information

Progress on OpenMP Specifications

Progress on OpenMP Specifications Progress on OpenMP Specifications Wednesday, November 13, 2012 Bronis R. de Supinski Chair, OpenMP Language Committee This work has been authored by Lawrence Livermore National Security, LLC under contract

More information

Implementation of Parallelization

Implementation of Parallelization Implementation of Parallelization OpenMP, PThreads and MPI Jascha Schewtschenko Institute of Cosmology and Gravitation, University of Portsmouth May 9, 2018 JAS (ICG, Portsmouth) Implementation of Parallelization

More information

OpenMP C and C++ Application Program Interface Version 1.0 October Document Number

OpenMP C and C++ Application Program Interface Version 1.0 October Document Number OpenMP C and C++ Application Program Interface Version 1.0 October 1998 Document Number 004 2229 001 Contents Page v Introduction [1] 1 Scope............................. 1 Definition of Terms.........................

More information

Application Programming

Application Programming Multicore Application Programming For Windows, Linux, and Oracle Solaris Darryl Gove AAddison-Wesley Upper Saddle River, NJ Boston Indianapolis San Francisco New York Toronto Montreal London Munich Paris

More information

IBM Tivoli Decision Support for z/os Version Messages and Problem Determination IBM SH

IBM Tivoli Decision Support for z/os Version Messages and Problem Determination IBM SH IBM Tivoli Decision Support for z/os Version 1.8.2 Messages and Problem Determination IBM SH19-6902-15 IBM Tivoli Decision Support for z/os Version 1.8.2 Messages and Problem Determination IBM SH19-6902-15

More information

Shared Memory Parallelism - OpenMP

Shared Memory Parallelism - OpenMP Shared Memory Parallelism - OpenMP Sathish Vadhiyar Credits/Sources: OpenMP C/C++ standard (openmp.org) OpenMP tutorial (http://www.llnl.gov/computing/tutorials/openmp/#introduction) OpenMP sc99 tutorial

More information

Error Message Reference

Error Message Reference Security Policy Manager Version 7.1 Error Message Reference GC23-9477-01 Security Policy Manager Version 7.1 Error Message Reference GC23-9477-01 Note Before using this information and the product it

More information

Introduction. No Optimization. Basic Optimizations. Normal Optimizations. Advanced Optimizations. Inter-Procedural Optimizations

Introduction. No Optimization. Basic Optimizations. Normal Optimizations. Advanced Optimizations. Inter-Procedural Optimizations Introduction Optimization options control compile time optimizations to generate an application with code that executes more quickly. Absoft Fortran 90/95 is an advanced optimizing compiler. Various optimizers

More information

IBM. Installation Guide. IBM XL Fortran for Linux, V Version GC

IBM. Installation Guide. IBM XL Fortran for Linux, V Version GC IBM XL Fortran for Linux, V15.1.6 IBM Installation Guide Version 15.1.6 GC27-6580-05 IBM XL Fortran for Linux, V15.1.6 IBM Installation Guide Version 15.1.6 GC27-6580-05 Note Before using this information

More information

Accelerated Library Framework for Hybrid-x86

Accelerated Library Framework for Hybrid-x86 Software Development Kit for Multicore Acceleration Version 3.0 Accelerated Library Framework for Hybrid-x86 Programmer s Guide and API Reference Version 1.0 DRAFT SC33-8406-00 Software Development Kit

More information

Introduction [1] 1. Directives [2] 7

Introduction [1] 1. Directives [2] 7 OpenMP Fortran Application Program Interface Version 2.0, November 2000 Contents Introduction [1] 1 Scope............................. 1 Glossary............................ 1 Execution Model.........................

More information

IBM XL C/C++ Enterprise Edition for AIX, V9.0. Programming SC

IBM XL C/C++ Enterprise Edition for AIX, V9.0. Programming SC IBM XL C/C++ Enterprise Edition for AIX, V9.0 Programming Guide SC23-5827-00 IBM XL C/C++ Enterprise Edition for AIX, V9.0 Programming Guide SC23-5827-00 Note! Before using this information and the product

More information

XL C/C++ Advanced Edition V6.0 for Mac OS X A new platform for the IBM family of C/C++ compilers

XL C/C++ Advanced Edition V6.0 for Mac OS X A new platform for the IBM family of C/C++ compilers Software Announcement January 13, 2004 XL C/C++ Advanced Edition V6.0 for Mac OS X A new platform for the IBM family of C/C++ compilers Overview XL C/C++ Advanced Edition for Mac OS X is an optimizing,

More information

MPI: A Message-Passing Interface Standard

MPI: A Message-Passing Interface Standard MPI: A Message-Passing Interface Standard Version 2.1 Message Passing Interface Forum June 23, 2008 Contents Acknowledgments xvl1 1 Introduction to MPI 1 1.1 Overview and Goals 1 1.2 Background of MPI-1.0

More information

More Advanced OpenMP. Saturday, January 30, 16

More Advanced OpenMP. Saturday, January 30, 16 More Advanced OpenMP This is an abbreviated form of Tim Mattson s and Larry Meadow s (both at Intel) SC 08 tutorial located at http:// openmp.org/mp-documents/omp-hands-on-sc08.pdf All errors are my responsibility

More information

Data Handling in OpenMP

Data Handling in OpenMP Data Handling in OpenMP Manipulate data by threads By private: a thread initializes and uses a variable alone Keep local copies, such as loop indices By firstprivate: a thread repeatedly reads a variable

More information

PGI Accelerator Programming Model for Fortran & C

PGI Accelerator Programming Model for Fortran & C PGI Accelerator Programming Model for Fortran & C The Portland Group Published: v1.3 November 2010 Contents 1. Introduction... 5 1.1 Scope... 5 1.2 Glossary... 5 1.3 Execution Model... 7 1.4 Memory Model...

More information

EPL372 Lab Exercise 5: Introduction to OpenMP

EPL372 Lab Exercise 5: Introduction to OpenMP EPL372 Lab Exercise 5: Introduction to OpenMP References: https://computing.llnl.gov/tutorials/openmp/ http://openmp.org/wp/openmp-specifications/ http://openmp.org/mp-documents/openmp-4.0-c.pdf http://openmp.org/mp-documents/openmp4.0.0.examples.pdf

More information

Getting Started with XL Fortran

Getting Started with XL Fortran IBM XL Fortran for Blue Gene/Q, V14.1 Getting Started with XL Fortran Version 14.1 GC14-7366-00 IBM XL Fortran for Blue Gene/Q, V14.1 Getting Started with XL Fortran Version 14.1 GC14-7366-00 Note Before

More information

Enhancements in OpenMP 2.0

Enhancements in OpenMP 2.0 Enhancements in mueller@hlrs.de University of Stuttgart High-Performance Computing-Center Stuttgart (HLRS) www.hlrs.de Slide 1 Outline Timeline Clarifications/Modifications New Features Slide 2 19. Enhancements

More information

IBM XL C/C++ Enterprise Edition supports POWER5 architecture

IBM XL C/C++ Enterprise Edition supports POWER5 architecture Software Announcement August 31, 2004 IBM XL C/C++ Enterprise Edition supports POWER5 architecture Overview IBM XL C/C++ Enterprise Edition V7.0 for AIX is an optimizing, standards-based compiler for the

More information

November IBM XL C/C++ Compilers Insights on Improving Your Application

November IBM XL C/C++ Compilers Insights on Improving Your Application November 2010 IBM XL C/C++ Compilers Insights on Improving Your Application Page 1 Table of Contents Purpose of this document...2 Overview...2 Performance...2 Figure 1:...3 Figure 2:...4 Exploiting the

More information

!OMP #pragma opm _OPENMP

!OMP #pragma opm _OPENMP Advanced OpenMP Lecture 12: Tips, tricks and gotchas Directives Mistyping the sentinel (e.g.!omp or #pragma opm ) typically raises no error message. Be careful! The macro _OPENMP is defined if code is

More information

Fundamentals of the Java Programming Language

Fundamentals of the Java Programming Language Fundamentals of the Java Programming Language Student Guide SL-110 REV E D61798GC10 Edition 1.0 2009 D62399 Copyright 2006, 2009, Oracle and/or its affiliates. All rights reserved. Disclaimer This document

More information

IBM. IBM XL C/C++ and XL Fortran compilers on Power architectures overview

IBM. IBM XL C/C++ and XL Fortran compilers on Power architectures overview IBM IBM XL C/C++ and XL Fortran compilers on Power architectures overview December 2017 References in this document to IBM products, programs, or services do not imply that IBM intends to make these available

More information

CSE 501: Compiler Construction. Course outline. Goals for language implementation. Why study compilers? Models of compilation

CSE 501: Compiler Construction. Course outline. Goals for language implementation. Why study compilers? Models of compilation CSE 501: Compiler Construction Course outline Main focus: program analysis and transformation how to represent programs? how to analyze programs? what to analyze? how to transform programs? what transformations

More information

IBM Tivoli Monitoring for Web Infrastructure: WebSphere Application Server. User s Guide. Version SC

IBM Tivoli Monitoring for Web Infrastructure: WebSphere Application Server. User s Guide. Version SC IBM Tivoli Monitoring for Web Infrastructure: WebSphere Application Server User s Guide Version 5.1.1 SC23-4705-01 IBM Tivoli Monitoring for Web Infrastructure: WebSphere Application Server User s Guide

More information

Barbara Chapman, Gabriele Jost, Ruud van der Pas

Barbara Chapman, Gabriele Jost, Ruud van der Pas Using OpenMP Portable Shared Memory Parallel Programming Barbara Chapman, Gabriele Jost, Ruud van der Pas The MIT Press Cambridge, Massachusetts London, England c 2008 Massachusetts Institute of Technology

More information

Tivoli Tivoli Decision Support for z/os

Tivoli Tivoli Decision Support for z/os Tivoli Tivoli Decision Support for z/os Version 1.8.1 Messages and Problem Determination SH19-6902-13 Tivoli Tivoli Decision Support for z/os Version 1.8.1 Messages and Problem Determination SH19-6902-13

More information

OpenMP 2. CSCI 4850/5850 High-Performance Computing Spring 2018

OpenMP 2. CSCI 4850/5850 High-Performance Computing Spring 2018 OpenMP 2 CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University Learning Objectives

More information

Overview: The OpenMP Programming Model

Overview: The OpenMP Programming Model Overview: The OpenMP Programming Model motivation and overview the parallel directive: clauses, equivalent pthread code, examples the for directive and scheduling of loop iterations Pi example in OpenMP

More information

Introduction to OpenMP. Tasks. N.M. Maclaren September 2017

Introduction to OpenMP. Tasks. N.M. Maclaren September 2017 2 OpenMP Tasks 2.1 Introduction Introduction to OpenMP Tasks N.M. Maclaren nmm1@cam.ac.uk September 2017 These were introduced by OpenMP 3.0 and use a slightly different parallelism model from the previous

More information

IBM. XL C/C++ Messages. z/os. Version 2 Release 3 GC

IBM. XL C/C++ Messages. z/os. Version 2 Release 3 GC z/os IBM XL C/C++ Messages Version 2 Release 3 GC14-7305-30 Note Before using this information and the product it supports, read the information in Notices on page 257. This edition applies to Version

More information

OPENMP TIPS, TRICKS AND GOTCHAS

OPENMP TIPS, TRICKS AND GOTCHAS OPENMP TIPS, TRICKS AND GOTCHAS Mark Bull EPCC, University of Edinburgh (and OpenMP ARB) markb@epcc.ed.ac.uk OpenMPCon 2015 OpenMPCon 2015 2 A bit of background I ve been teaching OpenMP for over 15 years

More information

COMP4300/8300: The OpenMP Programming Model. Alistair Rendell. Specifications maintained by OpenMP Architecture Review Board (ARB)

COMP4300/8300: The OpenMP Programming Model. Alistair Rendell. Specifications maintained by OpenMP Architecture Review Board (ARB) COMP4300/8300: The OpenMP Programming Model Alistair Rendell See: www.openmp.org Introduction to High Performance Computing for Scientists and Engineers, Hager and Wellein, Chapter 6 & 7 High Performance

More information

COMP4300/8300: The OpenMP Programming Model. Alistair Rendell

COMP4300/8300: The OpenMP Programming Model. Alistair Rendell COMP4300/8300: The OpenMP Programming Model Alistair Rendell See: www.openmp.org Introduction to High Performance Computing for Scientists and Engineers, Hager and Wellein, Chapter 6 & 7 High Performance

More information

Parallel and Distributed Programming. OpenMP

Parallel and Distributed Programming. OpenMP Parallel and Distributed Programming OpenMP OpenMP Portability of software SPMD model Detailed versions (bindings) for different programming languages Components: directives for compiler library functions

More information

Building a Runnable Program and Code Improvement. Dario Marasco, Greg Klepic, Tess DiStefano

Building a Runnable Program and Code Improvement. Dario Marasco, Greg Klepic, Tess DiStefano Building a Runnable Program and Code Improvement Dario Marasco, Greg Klepic, Tess DiStefano Building a Runnable Program Review Front end code Source code analysis Syntax tree Back end code Target code

More information

Introduction to Compilers HPC Workshop University of Kentucky May 9, 2007 May 10, 2007

Introduction to Compilers HPC Workshop University of Kentucky May 9, 2007 May 10, 2007 Introduction to Compilers HPC Workshop University of Kentucky May 9, 2007 May 10, 2007 Andrew Komornicki, Ph. D. Balaji Veeraraghavan, Ph. D. Agenda Introduction Availability of compilers, GNU, Intel and

More information

Shared Memory Parallelism using OpenMP

Shared Memory Parallelism using OpenMP Indian Institute of Science Bangalore, India भ रत य व ज ञ न स स थ न ब गल र, भ रत SE 292: High Performance Computing [3:0][Aug:2014] Shared Memory Parallelism using OpenMP Yogesh Simmhan Adapted from: o

More information

OPENMP TIPS, TRICKS AND GOTCHAS

OPENMP TIPS, TRICKS AND GOTCHAS OPENMP TIPS, TRICKS AND GOTCHAS OpenMPCon 2015 2 Directives Mistyping the sentinel (e.g.!omp or #pragma opm ) typically raises no error message. Be careful! Extra nasty if it is e.g. #pragma opm atomic

More information

Introduction to. Slides prepared by : Farzana Rahman 1

Introduction to. Slides prepared by : Farzana Rahman 1 Introduction to OpenMP Slides prepared by : Farzana Rahman 1 Definition of OpenMP Application Program Interface (API) for Shared Memory Parallel Programming Directive based approach with library support

More information

Preface... (vii) CHAPTER 1 INTRODUCTION TO COMPUTERS

Preface... (vii) CHAPTER 1 INTRODUCTION TO COMPUTERS Contents Preface... (vii) CHAPTER 1 INTRODUCTION TO COMPUTERS 1.1. INTRODUCTION TO COMPUTERS... 1 1.2. HISTORY OF C & C++... 3 1.3. DESIGN, DEVELOPMENT AND EXECUTION OF A PROGRAM... 3 1.4 TESTING OF PROGRAMS...

More information

Parallelising Scientific Codes Using OpenMP. Wadud Miah Research Computing Group

Parallelising Scientific Codes Using OpenMP. Wadud Miah Research Computing Group Parallelising Scientific Codes Using OpenMP Wadud Miah Research Computing Group Software Performance Lifecycle Scientific Programming Early scientific codes were mainly sequential and were executed on

More information

OpenACC 2.6 Proposed Features

OpenACC 2.6 Proposed Features OpenACC 2.6 Proposed Features OpenACC.org June, 2017 1 Introduction This document summarizes features and changes being proposed for the next version of the OpenACC Application Programming Interface, tentatively

More information

OpenMP API 4.5 Fortran Page 1

OpenMP API 4.5 Fortran Page 1 OpenMP API 4.5 Fortran Page 1 Fortran Directives and Constructs for Fortran OpenMP 4.5 API Fortran Syntax Reference Guide OpenMP Application Program Interface (API) is a portable, scalable model that gives

More information

Federated Identity Manager Business Gateway Version Configuration Guide GC

Federated Identity Manager Business Gateway Version Configuration Guide GC Tivoli Federated Identity Manager Business Gateway Version 6.2.1 Configuration Guide GC23-8614-00 Tivoli Federated Identity Manager Business Gateway Version 6.2.1 Configuration Guide GC23-8614-00 Note

More information

PGI Fortran & C Accelerator Programming Model. The Portland Group

PGI Fortran & C Accelerator Programming Model. The Portland Group PGI Fortran & C Accelerator Programming Model The Portland Group Published: v0.72 December 2008 Contents 1. Introduction...3 1.1 Scope...3 1.2 Glossary...3 1.3 Execution Model...4 1.4 Memory Model...5

More information

Absolute C++ Walter Savitch

Absolute C++ Walter Savitch Absolute C++ sixth edition Walter Savitch Global edition This page intentionally left blank Absolute C++, Global Edition Cover Title Page Copyright Page Preface Acknowledgments Brief Contents Contents

More information

INSTALLING INSTALLING INSTALLING

INSTALLING INSTALLING INSTALLING Concurrent F77 Version 7.3 Release Notes (PowerMAX) November 2006 0890536-7.3 READ READ READ ME ME ME BEFORE BEFORE BEFORE INSTALLING INSTALLING INSTALLING THIS THIS THIS PRODUCT PRODUCT PRODUCT Copyright

More information

Practical C++ Programming

Practical C++ Programming SECOND EDITION Practical C++ Programming Steve Oualline O'REILLY' Beijing Cambridge Farnham Koln Paris Sebastopol Taipei Tokyo Preface xv Part I. The Basics 1. What Is C++? 3 A Brief History of C++ 3 C++

More information

An Introduction to OpenMP

An Introduction to OpenMP An Introduction to OpenMP U N C L A S S I F I E D Slide 1 What Is OpenMP? OpenMP Is: An Application Program Interface (API) that may be used to explicitly direct multi-threaded, shared memory parallelism

More information