ACCELEWARE FDTD PERFORMANCE GUIDE
|
|
- Sydney McBride
- 5 years ago
- Views:
Transcription
1 Eight easy ways to speed up your simulation - January 21 Logan Maxwell, Mike Weldon
2 Copyright Notice All material herein is Acceleware copyright and shall not be reproduced, copied, forwarded, published or shared in any matter without prior written authorization from Acceleware. All rights reserved. Acceleware, the Acceleware logo and wordmark are registered trademarks and /or trademarks of Acceleware Corp. in the United States, Canada and other countries. All other trademarks are the property of their respective owners. Page 2 of 13
3 Table of Contents Overview 4 The Fundamentals of FDTD Performance on GPUs 5 1) Perfectly Matched Layers (PML) 6 2) Reads and Read Regions (Observations, DFT, Convergence, etc.) 7 3) Screen Savers 8 4) Simulation Orientation 9 5) Number of Materials 1 6) Dispersive Materials 11 7) Mesh Density 12 8) Windows Remote Log In 13 Page 3 of 13
4 Overview Introduction Fastest-possible FDTD simulation performance on GPU and multi-core hardware is a key objective for partners and end users alike. The hardware and Acceleware software libraries that make it run are obviously key determinants of the ultimate performance, but partners and end users can still have a large impact on the performance in ways that are not always obvious. This document outlines several key simulation parameters that impact simulation performance. Each case includes a brief description of the parameter, a plot illustrating the performance impact and tips on how to minimize any speed reduction. Improper use of these parameters whether intended or not, can reduce simulation speed by 5% or more. Understanding the suggestions in this document will help you avoid unnecessary reductions in performance and get the most out of all your simulations. Intended Audience Acceleware partners, end users and all those that are interested in running FDTD simulations optimally on GPUs and multi-core hardware. This document should be considered essential reading for engineers and scientists running FDTD simulations and will help make sure that they are always getting the most out of their simulation tools. Page 4 of 13
5 Performance (Mcell/s) The Fundamentals of FDTD Performance on GPUs 8 (Driver ) Knee Simulation: Cubic, 16 Lossy Dielectrics, 4 Layer PML Memory Limit GPU CPU Ramp Up Optimal Range Soft Memory Simulation Size (Mcell) Ramp Up: Knee: Optimal Range: GPU Memory Limit: Soft Memory: In this range the GPU is not using all of its compute resources and memory bandwidth efficiently. Secondly, PML takes up a large portion of the total simulation size and acts to slow the total simulation throughput. The knee is the point at which the performance levels off and the GPU is running optimally. This is the optimal range because the GPU has found a good balance between computation and communication. The goal of any GPU FDTD code is to maximize this region s breadth and magnitude. This is the point at which the GPU runs out of memory and CPU begins to solve the remaining calculations. In this area the CPU is solving the remaining calculations that the GPU does not have memory for. As simulation size goes further into soft memory, the closer the performance will get to 1% CPU performance. How to calculate throughput performance: Note: Simulation Size is not including PML cells Page 5 of 13
6 Performance (Mcells/s) Perfectly Matched Layers (PML) Adding PML (absorbing) boundary layers can reduce simulation performance by as much as 5% which would double run the time. The maximum simulation size the GPU is capable of running will also be partially reduced. PML cells require more memory than non-pml cells. That reduces simulation size. They also are more expensive to compute, which reduces performance. More significantly, we don't include PML cells when calculating capacity or speed. Small simulations are impacted more than larger simulations because PML cells represent the majority of the computational load. 8 (Driver ) PML Performance Simulation: Cubic, 16 Lossy Dielectrics 6 PML Layers Simulation Size (Mcells) - Minimize the number of layers of PML. - Understand how much PML your simulation requires and use no more than that. - Use maximum PML layers only when absolutely necessary. Page 6 of 13
7 Performance (MCells/s) Reads and Read Regions (Observations, DFT, Convergence, etc.) Reading field data during a simulation can dramatically impact performance. Field data is read when observing simulation output, convergence, for DFTs etc. How much of the volume is read and how frequently the volume is read both impact simulation performance. The chart below shows performance for different volumes of reads based on a percentage of the total volume. All six fields are read for each cell. We are sweeping the number of time steps between each read. Read Performance 5 (Driver ) Observations: Ex, Ey, Ez, Hx, Hy, Hz Simulation: 3 Mcells,Cubic, 16 Lossy Dielectrics, 4 Layer PML 4 % of Volume Read 3 2 % 25% 5% 75% 1% All Fields Read Every X Time Steps - Keep the read volume to a minimum; only observe the region (volume) that is of direct interest. - Read only as frequently as is necessary to achieve accurate power, DFT, SAR, optical generation, etc. results. - For optical generation, far field etc. only start to read after a simulation has converged. Page 7 of 13
8 Performance (Mcells/s) Screen Savers Screen savers, especially graphics intensive 3D types can decrease the performance of the simulation. Performance difference between no screen saver and basic screen saver is negligible. Smaller simulations experienced a greater percent decrease in performance. Occasionally, significantly worse performance is observed, and is abnormal. 8 Screen Saver Performance (Driver ) Simulation: Cubic, 16 Lossy Dielectrics, 4 Layer PML 6 Screen Saver 4 None Blank 3D Pipes Simulation Size (Mcells) - Use low detail screen savers, blank, or no screen saver. - Use the management settings to turn off the monitor instead of using a screen saver. - If you must use a screen saver, confirm your performance is not degraded by more than 1-2%. If it is worse, please contact Acceleware Page 8 of 13
9 Performance (Mcells/s) Simulation Orientation Single-GPU simulations where Z is the smallest dimension by a significant margin will experience a decrease in simulation performance and maximum simulation size. This is due to the way in which memory is allocated, this problem is not unique to GPU FDTD solutions - it is also present in CPU-only FDTD solvers. The example below shows an extreme case of smallest dimension, for less extreme cases the decrease in performance and max simulation size is smaller. Partitioning across multiple GPUs will change the effective simulation dimensions on each GPU, and hence the performance. Smallest dimension in the graph is 1% of the other dimensions. x y z a b b x smallest (a, b, b) y smallest (b, a, b) z smallest (b, b, a) 8 (Driver ) Smallest Performance Simulation: 16 Lossy Dielectrics, 4 Layer PML 6 4 Orientation: Cubic X Smallest Y Smallest Z Smallest Simulation Size (Mcells) - Rotate the simulation so that the Z is not the smallest dimension - Avoid extreme differences in dimension, cubic shows the best performance Page 9 of 13
10 Performance (Mcells/s) Number of Materials The number of materials can have a large impact on performance, up to a 2% decrease. The type of material can also have an effect on performance. For simulations with a variety of both E and H materials, the performance drop is more severe. 6 Number of Materials Performance (Driver ) Simulation: 3 Mcells. Cubic, 4 Layer PML E Materials H Materials E and H Materials Unique Materials (#) - If possible keep number of materials below Make sure that all the materials are necessary; some applications add arbitrary complexity by continually varying the number of materials. Page 1 of 13
11 Performance (Mcells/s) Dispersive Materials Dispersive materials have a large impact on simulation performance and maximum simulation size. Both the order (number of poles) of the dispersive materials and the total number of materials present will decrease performance. Higher order dispersive materials show worse performance, and higher numbers of dispersive materials will decrease performance. This applies to all dispersive materials types, Drude, Debye, Lorentz, Drude-Lorentz, etc. 7 Dispersive Performance (Driver ) Simulation: Cubic,, 4 Layer PML, Case 1 Case 2 Case 3 Case 4 Case Simulation Size (Mcells) Case 1 16 non-dispersive materials distributed evenly thought the entire simulation space Case 2 1 single-pole dispersive material occupies 4% of the total volume contiguously. Case 3 1 single-pole dispersive distributed evenly throughout the entire volume, 4% of the total volume is made up of dispersive materials. Case 4 16 Multi-pole dispersive materials distributed contiguously throughout 4% of the total volume. Case 5 16 Multi-pole dispersive materials distributed evenly throughout the entire volume, 4% of the total volume is made up of dispersive materials. - Restrict the total volume of dispersive materials in any simulation. - Use the minimum number of dispersive materials and volume to achieve desired result. Page 11 of 13
12 Simulation Time (h:mm:ss) Mesh Density Increasing the mesh density does not always yield more accurate results, however increasing the mesh density will always increase run time. This is for two reasons, one because there are more cells to compute, and two, because t in the simulation must also decrease to maintain simulation stability which increases the number of time steps required for a given number of periods. The chart below demonstrates the naive linear and actual effect of increasing mesh density on run time with a 1 Mcell simulation. Time to complete 6 periods :2: (Driver ) Simulation: Cubic, 16 Lossy Dielectrics, 4 Layer PML :16: :12: :8: Actual Naïve Linear :4: :: Simulation Size (Mcells) - Only increase mesh density if your simulation accuracy requires it. Page 12 of 13
13 Performance (Mcells/s) Windows Remote Log In Remote desktop software can have a large impact on simulation performance; it can exceed a 5% reduction in speed, give an error or not run at all. This happens because the desktop is virtualized and in some cases access to the GPU is limited or nonexistent. The desktop uses GPU resources which are needed for computation. 8 (Driver ) Ultra VNC Performance Simulation: Cubic, 16 Lossy Dielectrics, 4 Layer PML 6 VNC OFF 4 VNC ON Simulation Size (Mcells/s) - Use a KVM as they have no impact on performance. - Do not use remote desktop tools in general - If absolutely necessary use Ultra VNC, which still has some performance decrease as shown above. Page 13 of 13
Lenovo RAID Introduction Reference Information
Lenovo RAID Introduction Reference Information Using a Redundant Array of Independent Disks (RAID) to store data remains one of the most common and cost-efficient methods to increase server's storage performance,
More informationDisk to Disk Data File Backup and Restore.
Disk to Disk Data File Backup and Restore. Implementation Variations and Advantages with Tivoli Storage Manager and Tivoli SANergy software Dimitri Chernyshov September 26th, 2001 Data backup procedure.
More informationIncreasing Performance for PowerCenter Sessions that Use Partitions
Increasing Performance for PowerCenter Sessions that Use Partitions 1993-2015 Informatica LLC. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying,
More informationPowerVault MD3 SSD Cache Overview
PowerVault MD3 SSD Cache Overview A Dell Technical White Paper Dell Storage Engineering October 2015 A Dell Technical White Paper TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED AS IS, WITHOUT EXPRESS
More informationAn Introduction to the Finite Difference Time Domain (FDTD) Method & EMPIRE XCcel
An Introduction to the Finite Difference Time Domain (FDTD) Method & EMPIRE XCcel Simulation Model definition for FDTD DUT Port Simulation Box Graded Mesh six Boundary Conditions 1 FDTD Basics: Field components
More informationSimulation of Photo-Sensitive Devices with FDTD Method. Copyright 2008 Crosslight Software Inc.
Simulation of Photo-Sensitive Devices with FDTD Method Copyright 2008 Crosslight Software Inc. www.crosslight.com What is FDTD method? FDTD=Finite Difference Time Domain FDTD method solves Maxwell s equations
More informationKey Features for OptiFDTD 14
14.0 New Features Created to address the needs of research scientists, photonic engineers, professors and students; OptiFDTD satisfies the demand of users who are searching for a powerful yet easy to use
More informationAdobe Photoshop CS5: 64-bit Performance and Efficiency Measures
Pfeiffer Report Benchmark Analysis Adobe : 64-bit Performance and Efficiency Measures How support for larger memory configurations improves performance of imaging workflows. Executive Summary This report
More informationDefinition of RAID Levels
RAID The basic idea of RAID (Redundant Array of Independent Disks) is to combine multiple inexpensive disk drives into an array of disk drives to obtain performance, capacity and reliability that exceeds
More informationDirect Rendering. Direct Rendering Goals
May 2, 2005 Goals General Goals Small memory footprint Fast rendering High-quality results identical to those of Saffron V1 using distance-based anti-aliasing and alignment zones Goals Specific Goals Avoid
More informationAlcatel 1671 Service Connect
Alcatel 1671 Service Connect Service providers are looking for a solution that allows them to realize advanced capabilities today, while charting a clear migration strategy from traditional equipment to
More informationOver provisioning in solid state hard drives: benefits, design considerations, and trade-offs in its use
Over provisioning in solid state hard drives: benefits, design considerations, and trade-offs in its use Conditions of use: Intended to provide the reader with some background on over provisioning, this
More informationChapter 6 Memory 11/3/2015. Chapter 6 Objectives. 6.2 Types of Memory. 6.1 Introduction
Chapter 6 Objectives Chapter 6 Memory Master the concepts of hierarchical memory organization. Understand how each level of memory contributes to system performance, and how the performance is measured.
More informationThe term "physical drive" refers to a single hard disk module. Figure 1. Physical Drive
HP NetRAID Tutorial RAID Overview HP NetRAID Series adapters let you link multiple hard disk drives together and write data across them as if they were one large drive. With the HP NetRAID Series adapter,
More informationPerformance Consistency
White Paper Performance Consistency SanDIsk Corporation Corporate Headquarters 951 SanDisk Drive, Milpitas, CA 95035, U.S.A. Phone +1.408.801.1000 Fax +1.408.801.8657 www.sandisk.com Performance Consistency
More informationMemory Management. Memory Management Requirements
Memory Management Subdividing memory to accommodate multiple processes Memory needs to be allocated to ensure a reasonable supply of ready processes to consume available processor time 1 Memory Management
More informationWHITE PAPER ULTRA LOW LATENCY MICROWAVE THE NEED FOR SPEED: BEST PRACTICES FOR BUILDING ULTRA-LOW LATENCY MICROWAVE NETWORKS
THE NEED FOR SPEED: BEST PRACTICES FOR BUILDING ULTRA-LOW LATENCY MICROWAVE NETWORKS JULY 2012 TABLE OF CONTENTS INTRODUCTION... 3 MICROWAVE: THE NEW DEFAULT TRANSPORT FOR LOW LATENCY APPLICATIONS... 3
More informationFluid Mechanics Simulation Essentials R2014X
Fluid Mechanics Simulation Essentials R2014X About this Course Course objectives Upon completion of this course you will be able to: Set up and create CFD, CHT and FSI models in the 3DEXPERIENCE Platform
More informationNVMe SSDs A New Benchmark for OLTP Performance
A MICRON TECHNICAL BRIEF NVMe SSDs A New Benchmark for OLTP Performance OLTP Platforms Thrive on Fast, Consistent Results Delivered by NVMe SSDs Overview A recent DB-Engines ranking shows that Microsoft
More informationReal-Time Performance During CUDA A Demonstration and Analysis of RedHawk CUDA RT Optimizations
A Concurrent Real-Time White Paper 2881 Gateway Drive Pompano Beach, FL 33069 (954) 974-1700 www.concurrent-rt.com Real-Time Performance During CUDA A Demonstration and Analysis of RedHawk CUDA RT Optimizations
More informationSpeed Performance Passion for Innovation
Speed Performance Passion for Innovation MANUFACTURE CIARA s OEM and Appliance division provides technology solutions for ISV and Fortune 1000 companies. We have the ability to manage any OEM project throughout
More informationGeneral Objective:To understand the basic memory management of operating system. Specific Objectives: At the end of the unit you should be able to:
F2007/Unit6/1 UNIT 6 OBJECTIVES General Objective:To understand the basic memory management of operating system Specific Objectives: At the end of the unit you should be able to: define the memory management
More informationIntroduction to Abaqus/CFD for Multiphysics Applications
Introduction to Abaqus/CFD for Multiphysics Applications R 6.12 About this Course Course objectives Upon completion of this course you will be able to: Set up and create CFD and FSI models with Abaqus
More informationAdd to the ArcMap layout the Census dataset which are located in your Census folder.
Building Your Map To begin building your map, open ArcMap. Add to the ArcMap layout the Census dataset which are located in your Census folder. Right Click on the Labour_Occupation_Education shapefile
More informationHardware Acceleration for CST MICROWAVE STUDIO. Amy Dewis Channel Manager
Hardware Acceleration for CST MICROWAVE STUDIO Amy Dewis Channel Manager Agenda 1. Acceleware Overview 2. Why use Hardware Acceleration? 3. Current Performance, Features and Hardware 4. Upcoming Features
More informationTechnical Note P/N REV A01 March 29, 2007
EMC Symmetrix DMX-3 Best Practices Technical Note P/N 300-004-800 REV A01 March 29, 2007 This technical note contains information on these topics: Executive summary... 2 Introduction... 2 Tiered storage...
More informationIntel Xeon Scalable Family Balanced Memory Configurations
Front cover Intel Xeon Scalable Family Balanced Memory Configurations Last Update: 20 November 2017 Demonstrates three balanced memory guidelines for Intel Xeon Scalable processors Compares the performance
More informationEngineers can be significantly more productive when ANSYS Mechanical runs on CPUs with a high core count. Executive Summary
white paper Computer-Aided Engineering ANSYS Mechanical on Intel Xeon Processors Engineer Productivity Boosted by Higher-Core CPUs Engineers can be significantly more productive when ANSYS Mechanical runs
More informationSIMULATION OF AN IMPLANTED PIFA FOR A CARDIAC PACEMAKER WITH EFIELD FDTD AND HYBRID FDTD-FEM
1 SIMULATION OF AN IMPLANTED PIFA FOR A CARDIAC PACEMAKER WITH EFIELD FDTD AND HYBRID FDTD- Introduction Medical Implanted Communication Service (MICS) has received a lot of attention recently. The MICS
More informationWHITE PAPER: ENTERPRISE AVAILABILITY. Introduction to Adaptive Instrumentation with Symantec Indepth for J2EE Application Performance Management
WHITE PAPER: ENTERPRISE AVAILABILITY Introduction to Adaptive Instrumentation with Symantec Indepth for J2EE Application Performance Management White Paper: Enterprise Availability Introduction to Adaptive
More informationMultigrid Pattern. I. Problem. II. Driving Forces. III. Solution
Multigrid Pattern I. Problem Problem domain is decomposed into a set of geometric grids, where each element participates in a local computation followed by data exchanges with adjacent neighbors. The grids
More informationDELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE
WHITEPAPER DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE A Detailed Review ABSTRACT While tape has been the dominant storage medium for data protection for decades because of its low cost, it is steadily
More informationQlik Sense Performance Benchmark
Technical Brief Qlik Sense Performance Benchmark This technical brief outlines performance benchmarks for Qlik Sense and is based on a testing methodology called the Qlik Capacity Benchmark. This series
More informationAbaqus/CAE: Geometry Import and Meshing. About this Course
Abaqus/CAE: Geometry Import and Meshing R 6.12 About this Course Course objectives Upon completion of this course you will be able to: Import, edit, and repair CAD geometry. Import and edit orphan meshes.
More informationFind the right platform for your server needs
Product family guide Find the right platform for your server needs HP ProLiant Gen8 model transition guide The next generation There are mounting business pressures to support more applications and users,
More informationvsan 6.6 Performance Improvements First Published On: Last Updated On:
vsan 6.6 Performance Improvements First Published On: 07-24-2017 Last Updated On: 07-28-2017 1 Table of Contents 1. Overview 1.1.Executive Summary 1.2.Introduction 2. vsan Testing Configuration and Conditions
More informationFlash File Systems Overview
Table of contents 1.0 Overview 3 1.1 Flash architecture 3 1.1.1 Partitions 3 1.1.2 Blocks 3 1.2 Programming data 3 1.3 Data integrity 4 2.0 Flash file system functions 4 2.1 Wear leveling 4 2.2 Reclaim
More informationA Graphical User Interface (GUI) for Two-Dimensional Electromagnetic Scattering Problems
A Graphical User Interface (GUI) for Two-Dimensional Electromagnetic Scattering Problems Veysel Demir vdemir@olemiss.edu Mohamed Al Sharkawy malshark@olemiss.edu Atef Z. Elsherbeni atef@olemiss.edu Abstract
More informationFront Novell Guide to NLSP Migration
Front Novell Guide to NLSP Migration Front : 1 2 Novell Guide to NLSP Migration About This Guide.......................................... 7 Introduction..................................... 7 NetWare
More informationTable of Contents. HP A7173A PCI-X Dual Channel Ultra320 SCSI Host Bus Adapter. Performance Paper for HP PA-RISC Servers
HP A7173A PCI-X Dual Channel Ultra32 SCSI Host Bus Adapter Performance Paper for HP PA-RISC Servers Table of Contents Introduction...2 Executive Summary...2 Test Results...3 I/Ops...3 Service Demand...4
More informationEMC Celerra Replicator V2 with Silver Peak WAN Optimization
EMC Celerra Replicator V2 with Silver Peak WAN Optimization Applied Technology Abstract This white paper discusses the interoperability and performance of EMC Celerra Replicator V2 with Silver Peak s WAN
More informationVirtual EM Inc. Ann Arbor, Michigan, USA
Functional Description of the Architecture of a Special Purpose Processor for Orders of Magnitude Reduction in Run Time in Computational Electromagnetics Tayfun Özdemir Virtual EM Inc. Ann Arbor, Michigan,
More informationWHITE PAPER. Optimizing Virtual Platform Disk Performance
WHITE PAPER Optimizing Virtual Platform Disk Performance Optimizing Virtual Platform Disk Performance 1 The intensified demand for IT network efficiency and lower operating costs has been driving the phenomenal
More informationFlexCache Caching Architecture
NetApp Technical Report FlexCache Caching Architecture Marty Turner, NetApp July 2009 TR-3669 STORAGE CACHING ON NETAPP SYSTEMS This technical report provides an in-depth analysis of the FlexCache caching
More informationStellar performance for a virtualized world
IBM Systems and Technology IBM System Storage Stellar performance for a virtualized world IBM storage systems leverage VMware technology 2 Stellar performance for a virtualized world Highlights Leverages
More informationMIKE Powered by DHI. GPU - Guidelines
MIKE Powered by DHI GPU - Guidelines MIKE 2017 DHI headquarters Agern Allé 5 DK-2970 Hørsholm Denmark +45 4516 9200 Telephone +45 4516 9333 Support +45 4516 9292 Telefax mike@dhigroup.com www.mikepoweredbydhi.com
More informationMemory. Objectives. Introduction. 6.2 Types of Memory
Memory Objectives Master the concepts of hierarchical memory organization. Understand how each level of memory contributes to system performance, and how the performance is measured. Master the concepts
More informationLecture 7. Memory Management
Lecture 7 Memory Management 1 Lecture Contents 1. Memory Management Requirements 2. Memory Partitioning 3. Paging 4. Segmentation 2 Memory Memory is an array of words or bytes, each with its own address.
More informationA Diagonal Split-cell Model for the High-order Symplectic FDTD Scheme
PIERS ONLINE, VOL. 2, NO. 6, 2006 715 A Diagonal Split-cell Model for the High-order Symplectic FDTD Scheme Wei Sha, Xianliang Wu, and Mingsheng Chen Key Laboratory of Intelligent Computing & Signal Processing
More informationWorkshop 10-1: HPC for Finite Arrays
Workshop 10-1: HPC for Finite Arrays 2015.0 Release ANSYS HFSS for Antenna Design 1 2015 ANSYS, Inc. Getting Started Launching ANSYS Electronics Desktop 2015 Select Programs > ANSYS Electromagnetics >
More informationSEMBA: Broadband Electromagnetic Simulator
SEMBA: Broadband Electromagnetic Simulator Overview and Meshers Salvador Gonzalez García Luis Manuel Díaz Angulo Miguel David Ruiz Cabello Daniel Mateos Romero June 1st-3rd, 2016 Convention on Advances
More informationSpreadsheet definition: Starting a New Excel Worksheet: Navigating Through an Excel Worksheet
Copyright 1 99 Spreadsheet definition: A spreadsheet stores and manipulates data that lends itself to being stored in a table type format (e.g. Accounts, Science Experiments, Mathematical Trends, Statistics,
More informationIntroduction...2. Executive summary...2. Test results...3 IOPs...3 Service demand...3 Throughput...4 Scalability...5
A6826A PCI-X Dual Channel 2Gb/s Fibre Channel Adapter Performance Paper for Integrity Servers Table of contents Introduction...2 Executive summary...2 Test results...3 IOPs...3 Service demand...3 Throughput...4
More informationAnsoft HFSS Convergence
Data Max Delta Matrix Parameters Ansoft HFSS Choose from the Executive Commands window to view information about the solution. If you have solved for a driven solution, the following window appears: Maxwell
More informationCONTENTS Preface Introduction Finite Element Formulation Finite Element Mesh Truncation
Preface xi 1 Introduction 1 1.1 Numerical Simulation of Antennas 1 1.2 Finite Element Analysis Versus Other Numerical Methods 2 1.3 Frequency- Versus Time-Domain Simulations 5 1.4 Brief Review of Past
More informationPowerful features (1)
HFSS Overview Powerful features (1) Tangential Vector Finite Elements Provides only correct physical solutions with no spurious modes Transfinite Element Method Adaptive Meshing r E = t E γ i i ( x, y,
More informationMemory Hierarchy: Caches, Virtual Memory
Memory Hierarchy: Caches, Virtual Memory Readings: 5.1-5.4, 5.8 Big memories are slow Computer Fast memories are small Processor Memory Devices Control Input Datapath Output Need to get fast, big memories
More informationTest Report: Digital Rapids Transcode Manager Application with NetApp Media Content Management Solution
Technical Report Test Report: Digital Rapids Transcode Manager Application with NetApp Media Content Management Solution Jim Laing, NetApp July 2012 TR-4084 TABLE OF CONTENTS 1 Executive Summary... 3 2
More information3. Memory Management
Principles of Operating Systems CS 446/646 3. Memory Management René Doursat Department of Computer Science & Engineering University of Nevada, Reno Spring 2006 Principles of Operating Systems CS 446/646
More informationLATEST INTEL TECHNOLOGIES POWER NEW PERFORMANCE LEVELS ON VMWARE VSAN
LATEST INTEL TECHNOLOGIES POWER NEW PERFORMANCE LEVELS ON VMWARE VSAN Russ Fellows Enabling you to make the best technology decisions November 2017 EXECUTIVE OVERVIEW* The new Intel Xeon Scalable platform
More informationCavium FastLinQ 25GbE Intelligent Ethernet Adapters vs. Mellanox Adapters
Cavium FastLinQ 25GbE Intelligent Ethernet Adapters vs. Mellanox Adapters Cavium FastLinQ QL45000 25GbE adapters provide maximum performance and flexible bandwidth management to optimize virtualized servers
More informationFile Server Comparison: Executive Summary. Microsoft Windows NT Server 4.0 and Novell NetWare 5. Contents
File Server Comparison: Microsoft Windows NT Server 4.0 and Novell NetWare 5 Contents Executive Summary Updated: October 7, 1998 (PDF version 240 KB) Executive Summary Performance Analysis Price/Performance
More informationCisco Nexus 3548 Switch Performance Validation December 2012
Cisco Nexus 3548 Switch Performance Validation December 212 212 Spirent Cisco. All rights reserved. Page 1 Contents Executive Summary... 3 Test Bed... 4 How Testing Was Performed... 4 Test Results... 6
More information#1593: The top 10 things that can go wrong with an IBM Traveler Server
#1593: The top 10 things that can go wrong with an IBM Traveler Server plus how to detect and correct them Alan Forbes Acknowledgements and Disclaimer. Copyright IBM Corporation 2016. All rights reserved.
More informationDU _v01. September User Guide
NVIDIA MAXIMUS TECHNOLOGY FOR ANSYS MECHANICAL DU-06467-001_v01 September 2012 User Guide DOCUMENT CHANGE HISTORY DU-06467-001_v01 Version Date Authors Description of Change 01 August 3, 2012 Initial release
More informationCondusiv s V-locity VM Accelerates Exchange 2010 over 60% on Virtual Machines without Additional Hardware
openbench Labs Executive Briefing: March 13, 2013 Condusiv s V-locity VM Accelerates Exchange 2010 over 60% on Virtual Machines without Additional Hardware Optimizing I/O for Increased Throughput and Reduced
More informationMemory Management. Memory Management
Memory Management Chapter 7 1 Memory Management Subdividing memory to accommodate multiple processes Memory needs to be allocated efficiently to pack as many processes into memory as possible 2 1 Memory
More informationSCALING UP VS. SCALING OUT IN A QLIKVIEW ENVIRONMENT
SCALING UP VS. SCALING OUT IN A QLIKVIEW ENVIRONMENT QlikView Technical Brief February 2012 qlikview.com Introduction When it comes to the enterprise Business Discovery environments, the ability of the
More informationConsulting Solutions WHITE PAPER Citrix XenDesktop XenApp Planning Guide: Virtualization Best Practices
Consulting Solutions WHITE PAPER Citrix XenDesktop XenApp Planning Guide: Virtualization Best Practices www.citrix.com Overview Desktop virtualization comprises of many different types of virtual desktops.
More informationThorsten Liebig, Andreas Rennings, and Daniel Erni
8th Workshop on Numerical Methods for Optical Nanostructures, ETH Zürich, July 2-4, Zürich Switzerland, 2012 OpenEMS A Free and Open Source Cartesian and Cylindrical EC-FDTD Simulation Platform Supporting
More informationBroadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors
Solution Brief December, 2018 2018 Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors HIGHLIGHTS o The AMD EPYC SoC brings a new balance to the datacenter. Utilizing an x86-architecture,
More informationChapter 4 Determining Cell Size
Chapter 4 Determining Cell Size Chapter 4 Determining Cell Size The third tutorial is designed to give you a demonstration in using the Cell Size Calculator to obtain the optimal cell size for your circuit
More informationA Simple Method for Static Load Balancing of Parallel FDTD Codes Franek, Ondrej
Aalborg Universitet A Simple Method for Static Load Balancing of Parallel FDTD Codes Franek Ondrej Published in: Electromagnetics in Advanced Applications (ICEAA) 2016 International Conference on DOI (link
More informationMcAfee Endpoint Security for Servers Product Guide. (McAfee epolicy Orchestrator)
McAfee Endpoint Security for Servers 5.1.0 Product Guide (McAfee epolicy Orchestrator) COPYRIGHT Copyright 2018 McAfee, LLC TRADEMARK ATTRIBUTIONS McAfee and the McAfee logo, McAfee Active Protection,
More informationCopyright 2015 by Sean Connolly
1 Copyright 2015 by Sean Connolly All rights reserved. No part of this publication may be reproduced, distributed, or transmitted in any form or by any means, including photocopying, recording, or other
More informationIBM InfoSphere Data Replication s Change Data Capture (CDC) for DB2 LUW databases (Version ) Performance Evaluation and Analysis
Page 1 IBM InfoSphere Data Replication s Change Data Capture (CDC) for DB2 LUW databases (Version 10.2.1) Performance Evaluation and Analysis 2014 Prasa Urithirakodeeswaran Page 2 Contents Introduction...
More informationResource Guide Implementing QoS for WX/WXC Application Acceleration Platforms
Resource Guide Implementing QoS for WX/WXC Application Acceleration Platforms Juniper Networks, Inc. 1194 North Mathilda Avenue Sunnyvale, CA 94089 USA 408 745 2000 or 888 JUNIPER www.juniper.net Table
More informationIBM XIV Storage System
IBM XIV Storage System Technical Description IBM XIV Storage System Storage Reinvented Performance The IBM XIV Storage System offers a new level of high-end disk system performance and reliability. It
More informationDTAP200 DTSX200 Control Visualization Software Las2.0 Data Conversion Guide
User s Manual DTAP200 DTSX200 Control Visualization Software Las2.0 Data Conversion Guide 2nd Edition Blank Page < Introduction > i Introduction About this Manual Thank you for purchasing the DTSX200 Control
More informationBS2000/OSD DAB Disk Access Buffer Intelligent Caching with AutoDAB
BS2000/OSD DAB Disk Access Buffer Intelligent Caching with AutoDAB Issue June 2009 Pages 7 To cache or not to cache? That is not the question! Business-critical computing is typified by high performance
More informationOperating Systems (2INC0) 2017/18
Operating Systems (2INC0) 2017/18 Memory Management (09) Dr. Courtesy of Dr. I. Radovanovic, Dr. R. Mak (figures from Bic & Shaw) System Architecture and Networking Group Agenda Reminder: OS & resources
More informationSoftware within building physics and ground heat storage. HEAT3 version 7. A PC-program for heat transfer in three dimensions Update manual
Software within building physics and ground heat storage HEAT3 version 7 A PC-program for heat transfer in three dimensions Update manual June 15, 2015 BLOCON www.buildingphysics.com Contents 1. WHAT S
More informationFrequently Asked Questions. s620 SATA SSD Enterprise-Class Solid-State Device
Frequently Asked Questions s620 SATA SSD Enterprise-Class Solid-State Device Frequently Asked Questions Q: What about advanced data protection? A: In mission-critical enterprise and datacenter applications,
More informationParallel Programming Patterns Overview and Concepts
Parallel Programming Patterns Overview and Concepts Partners Funding Reusing this material This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 4.0 International License.
More informationTechnical Brief. AGP 8X Evolving the Graphics Interface
Technical Brief AGP 8X Evolving the Graphics Interface Increasing Graphics Bandwidth No one needs to be convinced that the overall PC experience is increasingly dependent on the efficient processing of
More informationOptimizing Tiered Storage Workloads with Precise for Storage Tiering
Applied Technology Abstract By introducing Enterprise Flash Drives to EMC Symmetrix and CLARiiON storage systems, EMC revitalized the importance of tiered storage. Optimizing the use of these tiers provides
More informationIf you are interested in taking advantage of this integrated approach, you must own licensed copies of both ASAP PRO and FDTD Solutions.
ASAP Technical Publication brofn1420_fdtd (July 15, 2005) Breault Research Organization has engaged in a partnership with Lumerical, an innovative technology company based in Vancouver, Canada. This collaboration
More informationNVIDIA GPU BOOST FOR TESLA
NVIDIA GPU BOOST FOR TESLA DA-06767-001_v02 January 2014 Application Note DOCUMENT CHANGE HISTORY DA-06767-001_v02 Version Date Authors Description of Change 01 March 28, 2013 GG, SM Initial Release 02
More informationTECHNOLOGY BRIEF. Compaq 8-Way Multiprocessing Architecture EXECUTIVE OVERVIEW CONTENTS
TECHNOLOGY BRIEF March 1999 Compaq Computer Corporation ISSD Technology Communications CONTENTS Executive Overview1 Notice2 Introduction 3 8-Way Architecture Overview 3 Processor and I/O Bus Design 4 Processor
More informationDesign and Evaluation of I/O Strategies for Parallel Pipelined STAP Applications
Design and Evaluation of I/O Strategies for Parallel Pipelined STAP Applications Wei-keng Liao Alok Choudhary ECE Department Northwestern University Evanston, IL Donald Weiner Pramod Varshney EECS Department
More informationGrand Central Dispatch
A better way to do multicore. (GCD) is a revolutionary approach to multicore computing. Woven throughout the fabric of Mac OS X version 10.6 Snow Leopard, GCD combines an easy-to-use programming model
More informationAN504: Memory Options and Performance on the Intel 955X Express Chip Set. John Beekley, VP Applications Engineering, Corsair Memory, Inc.
APPLICATIONS NOTE AN504: Memory Options and Performance on the Intel 955X Express Chip Set John Beekley, VP Applications Engineering, Corsair Memory, Inc. Introduction This white paper will examine memory
More informationMONITORING STORAGE PERFORMANCE OF IBM SVC SYSTEMS WITH SENTRY SOFTWARE
MONITORING STORAGE PERFORMANCE OF IBM SVC SYSTEMS WITH SENTRY SOFTWARE WHITE PAPER JULY 2018 INTRODUCTION The large number of components in the I/O path of an enterprise storage virtualization device such
More informationMonitor Qlik Sense sites. Qlik Sense Copyright QlikTech International AB. All rights reserved.
Monitor Qlik Sense sites Qlik Sense 2.1.2 Copyright 1993-2015 QlikTech International AB. All rights reserved. Copyright 1993-2015 QlikTech International AB. All rights reserved. Qlik, QlikTech, Qlik Sense,
More informationUsing Alluxio to Improve the Performance and Consistency of HDFS Clusters
ARTICLE Using Alluxio to Improve the Performance and Consistency of HDFS Clusters Calvin Jia Software Engineer at Alluxio Learn how Alluxio is used in clusters with co-located compute and storage to improve
More informationConsulting Solutions WHITE PAPER Citrix XenDesktop XenApp 6.x Planning Guide: Virtualization Best Practices
Consulting Solutions WHITE PAPER Citrix XenDesktop XenApp 6.x Planning Guide: Virtualization Best Practices www.citrix.com Table of Contents Overview... 3 Scalability... 3 Guidelines... 4 Operations...
More informationAbaqus/CAE: Geometry Import and Meshing. Abaqus 2018
Abaqus/CAE: Geometry Import and Meshing Abaqus 2018 About this Course Course objectives Upon completion of this course you will be able to: Import, edit, and repair CAD geometry. Import and edit orphan
More informationebook library PAGE 1 HOW TO OPTIMIZE TRANSLATIONS AND ACCELERATE TIME TO MARKET
ebook library PAGE 1 HOW TO OPTIMIZE TRANSLATIONS AND ACCELERATE TIME TO MARKET Aligning people, process and technology to improve quality and speed to market To succeed in the global business arena, companies
More information6.034 Quiz 2, Spring 2005
6.034 Quiz 2, Spring 2005 Open Book, Open Notes Name: Problem 1 (13 pts) 2 (8 pts) 3 (7 pts) 4 (9 pts) 5 (8 pts) 6 (16 pts) 7 (15 pts) 8 (12 pts) 9 (12 pts) Total (100 pts) Score 1 1 Decision Trees (13
More informationSUCCESS STORY HÄAGEN-DAZS JAPAN HÄAGEN-DAZS JAPAN DRAMATICALLY IMPROVES USER EXPERIENCE AND PRODUCTIVITY WITH NVIDIA GRID
SUCCESS STORY HÄAGEN-DAZS JAPAN HÄAGEN-DAZS JAPAN DRAMATICALLY IMPROVES USER EXPERIENCE AND PRODUCTIVITY WITH NVIDIA GRID Graphics-acceleration streamlines internal access to company website and productivity
More information