FEMAP/NX NASTRAN PERFORMANCE TUNING
1 FEMAP/NX NASTRAN PERFORMANCE TUNING Chris Teague - Saratech (949)
2 NX Nastran Hardware Performance History
Running Nastran in 1984:
- Cray Y-MP, 32 bits! (X-MP was only 24 bits)
- Four vector processors (167 MHz)
- 256 MB of RAM (note the MB; a PC had 256 KB)
- 333 Mflops per processor
- $3-$4 million, plus a special room
Comparison in 2016:
- 64 bits
- Dual core (1.85 GHz)
- 2 GB (2048 MB) of RAM
- 340 Mflops (single thread) / 613 Mflops (multi-thread)
- iPhone 6s, $689
- NX Nastran is currently not ported to iOS
Saratech proprietary and confidential
3 NX Nastran Hardware Performance History
Rack server in 2016:
- Dell R930, 64 bits
- Up to 4 processors with up to 18 cores each (72 cores total), or clock speeds up to 3.2 GHz
- 1.5 TB of RAM
- 1.8 Gflops per processor
- High-speed PCIe-based SSD (2.8 GB/s read / 2.2 GB/s write)
- $85K with 6.5 TB PCIe SSD, 1.5 TB RAM, and 4 x 3.2 GHz 4-core Xeon processors
Blade array system:
- Up to 30 blades, each configured like a single server
So how much faster do our Nastran jobs run with this huge increase in computing power?
4 NX Nastran Performance Tuning Tips
- What is LP-64 vs ILP-64?
- Hardware and OS Selection
- NX Nastran Scratch Drive
- I/O Performance, OS Settings
- Buffer Size
- Hyperthreading
- Element Iterative Solver
- SMP vs DMP
- GPGPU
5 NX NASTRAN LP-64 vs ILP-64
There are two 64-bit versions of NX Nastran:
- LP-64
  - Standard version when running through FEMAP
  - 4-byte words
  - 8 GB RAM limit
- ILP-64
  - Optional version when running through FEMAP
  - 8-byte words
  - 20 TB RAM limit, which in practice is the hardware RAM limit of the machine you are running on
When running NX Nastran on the command line, the L executables are ILP; the w executables bring up a file browser.
In some cases, ILP-64 may offer improved accuracy.
6 NX NASTRAN LP-64 vs ILP-64
- In general, the standard LP-64 version of NX Nastran is faster for models that do not need more than 8 GB of RAM allocated to the solver.
- Larger models that need more than 8 GB of RAM for the solver require the ILP-64 version and enough available RAM.
- For performance reasons, don't allocate more than about 50% of RAM to NX Nastran. The remaining RAM is needed for the OS and for I/O caching, which is a huge help to NX Nastran performance.
- This means that if you need the ILP-64 version of NX Nastran, you will want at least 16 GB of RAM. Larger models may require more. Sometimes LOTS more!
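The sizing rule on this slide can be sketched as a small helper. This is only an illustration, not part of NX Nastran or FEMAP: the function name `pick_executable` and its arguments are hypothetical, and the 8 GB limit and 50% ceiling are simply the figures quoted on the slides.

```python
LP64_LIMIT_GB = 8.0      # LP-64 solver memory limit quoted on the slides
MAX_RAM_FRACTION = 0.5   # slides: allocate no more than about 50% of RAM


def pick_executable(solver_mem_gb: float, physical_ram_gb: float) -> dict:
    """Suggest LP-64 or ILP-64 and check the ~50% RAM guideline."""
    if solver_mem_gb <= 0 or physical_ram_gb <= 0:
        raise ValueError("memory sizes must be positive")
    version = "LP-64" if solver_mem_gb <= LP64_LIMIT_GB else "ILP-64"
    return {
        "version": version,
        # RAM needed so that the solver allocation stays at or below 50%
        "min_recommended_ram_gb": solver_mem_gb / MAX_RAM_FRACTION,
        "within_guideline": solver_mem_gb <= MAX_RAM_FRACTION * physical_ram_gb,
    }


if __name__ == "__main__":
    # The 24 GB job on a 64 GB machine mentioned in the hardware slides
    print(pick_executable(solver_mem_gb=24, physical_ram_gb=64))
```

An 8 GB allocation gives a minimum of 16 GB of RAM, which matches the figure on the slide.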
7 Hardware and OS Selection
Processors
- Faster processors are good (though faster I/O is just as important, if not more so)
- A large L2 or L3 processor cache can improve performance (Xeon can help here)
- Multi-core is good, but (usually) don't choose more cores over fewer cores with a faster clock speed
  - Intel Xeon E v3, 2.3 GHz, 45M, 18 core
  - Intel Xeon E v3, 3.2 GHz, 45M, 4 core
Memory
- As much as the budget allows, and the fastest available
- Saratech ran a large job with mem=24 GB on a system with 64 GB of RAM. Nastran used up all available RAM for 2-3 hours, the extra being used for I/O caching (shown on the slide as a Task Manager graph).
8 Hardware and OS Selection
Disk
- SATA-based SSDs are significantly faster than mechanical drives
- PCIe-based SSDs are faster still, and are available in laptop, desktop, and server models. Examples:
  - SanDisk SX , 3.2 TB, 2.8 GB/s read, 2.2 GB/s write (servers)
  - Intel 750 Series, 1.2 TB, 2.5 GB/s read, 1.2 GB/s write (workstations)
Operating System
- Linux is generally faster than Windows on the same hardware due to the superior I/O on Linux
- Because of this, most HPC cluster systems run Linux
- Windows is more popular on the desktop due to the wide variety of applications that run on Windows
Pictured: Intel 750 Series PCIe SSD
9 Hardware and OS Selection
Priorities for getting the most performance for the least money:
- Maximum number of *fast* cores with large cache
- Add as much RAM as possible, and go for the fastest RAM allowed
- Maximize I/O bandwidth and disk speed
- Add GPU processing for some large dynamics problems (more on this later)
I always recommend at least two disks, and three if possible:
- Disk 1: Fast drive for OS and applications
- Disk 2: Very fast drive for NASTRAN and FEMAP scratch space (keep it empty when not running NX Nastran and FEMAP)
- Disk 3: Large drive for data storage
NASTRAN does so much disk I/O that it is better to give it its own drive for scratch files and make that drive as fast as possible: a PCIe SSD, or even a RAID of SSDs. We don't want the OS and application data needs to slow down our NASTRAN job.
10 NX Nastran Scratch Drive
- The Nastran scratch folder should point to a fast disk or a RAID array (RAID 0)
- Local disk drives are preferred
- A network-mounted NFS or SMB (Windows shared drive) connection will generally carry a significant performance penalty
- Even laptops can have two drives: try mSATA cards, or even PCIe in newer laptops
Pictured: SanDisk Fusion ioMemory SX, Samsung 850 EVO M.2 SSD
11 NX Nastran Scratch Drive
- You can set the NX Nastran scratch drive in the rc file
- The nastran rc file for FEMAP can be found in FEMAPv113/nastran/conf, where 113 is the version of FEMAP that you have installed
Sample from my laptop:
    Sdir=e:\scratch
    program=femap
    scr=yes
    buffsize=32769
    memory=.45*physical
    smem=20.0x
The E drive is a 512 GB SSD mSATA card.
Pictured: Samsung 850 EVO M.2 SSD
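For anyone who maintains several machines, the rc file contents can be generated rather than typed by hand. The sketch below only formats keyword=value lines; the helper name `build_rc_text` is hypothetical, and the keywords and values are copied from the laptop sample on the slide rather than from any authoritative keyword list.

```python
def build_rc_text(settings: dict) -> str:
    """Render keyword=value lines, one per line, in insertion order."""
    return "\n".join(f"{key}={value}" for key, value in settings.items()) + "\n"


# Values copied from the laptop sample on the slide.
laptop_settings = {
    "Sdir": r"e:\scratch",
    "program": "femap",
    "scr": "yes",
    "buffsize": 32769,
    "memory": ".45*physical",
    "smem": "20.0x",
}

if __name__ == "__main__":
    # Print the text instead of writing it, so nothing is overwritten by accident.
    print(build_rc_text(laptop_settings), end="")
```

Printing rather than writing the file is deliberate: the rc file location depends on the installed FEMAP version, so the output is meant to be reviewed and pasted in by hand.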
12 NX Nastran & FEMAP Scratch Drive in FEMAP Preferences
Screenshot callouts: FEMAP scratch drive, NX Nastran scratch drive
13 OS Settings: I/O Cache
- Reading from and writing to disk drives is much slower than RAM access, even with an SSD
- Data that has been written is typically read back soon
- Keeping information in memory instead of on disk reduces disk seek times
- Make use of unallocated memory as a disk buffer (I/O cache)
14 OS Settings: Enabling Disk I/O Cache
- Read cache is enabled by default on Linux and Windows
- On Linux, enable write cache using the hdparm command or equivalent
- On Windows, use the Device Manager property settings to enable write caching on the Nastran scratch drive, in the Policies tab
15 Buffer Size
- The NX Nastran buffer size is the size of each I/O unit
- The default size in NX Nastran 9 is 8193, which works well for small models (<100K DOF)
- For larger models (>400K DOF), increasing the buffer size to 32769 may help. This is the default in NX Nastran 10
- This can be done by editing the nastran rc file so that the line reads: Buffsize=32769
- The nastran rc file for FEMAP can be found in FEMAPv113/nastran/conf, where 113 is the version of FEMAP that you have installed
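The buffer size guidance can be written as a simple lookup. This is a rough sketch with a hypothetical function name. The slide gives no advice for models between 100K and 400K DOF, so keeping the smaller value in that range is an assumption made here, not a recommendation from the presentation.

```python
LARGE_MODEL_DOF = 400_000   # slide: larger buffer may help above 400K DOF
SMALL_BUFFSIZE = 8193       # NX Nastran 9 default per the slide
LARGE_BUFFSIZE = 32769      # NX Nastran 10 default per the slide


def suggest_buffsize(dof: int) -> int:
    """Return a buffsize value to try for a model with the given DOF count."""
    if dof < 0:
        raise ValueError("DOF count cannot be negative")
    if dof > LARGE_MODEL_DOF:
        return LARGE_BUFFSIZE
    # Assumption: below the 400K threshold, stay with the smaller buffer.
    return SMALL_BUFFSIZE


if __name__ == "__main__":
    for dof in (50_000, 250_000, 2_000_000):
        print(dof, "DOF -> Buffsize =", suggest_buffsize(dof))
```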
16 NX Nastran Settings: Memory
Starting with NX Nastran 10, the new default memory settings in the rcf file are:
- Memory=0.45*physical (45% of total RAM installed in the workstation)
- Smem=20.0X (20% of Memory in the line above)
- Buffpool=20.0X (same as Smem)
These settings are more appropriate for large models and machines with more RAM.
Inspect the F04 file to see whether you have optimum settings for your model.
Note: Unless SMEM is large enough to contain all scratch files, it is better to set it to zero.
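To see what these defaults mean on a given machine, the percentages can be worked through numerically. The sketch below follows the slide's reading of the settings (45% of physical RAM, then 20% of that figure each for Smem and Buffpool); the function name is hypothetical and the result is an estimate, not what NX Nastran itself reports.

```python
def default_memory_split(physical_ram_gb: float) -> dict:
    """Estimate the NX Nastran 10 default allocations, in GB."""
    if physical_ram_gb <= 0:
        raise ValueError("physical RAM must be positive")
    memory = 0.45 * physical_ram_gb   # Memory=0.45*physical
    smem = 0.20 * memory              # Smem=20.0X, read as 20% of Memory
    buffpool = 0.20 * memory          # Buffpool=20.0X, same as Smem
    return {
        "memory_gb": round(memory, 2),
        "smem_gb": round(smem, 2),
        "buffpool_gb": round(buffpool, 2),
    }


if __name__ == "__main__":
    for ram in (16, 64, 256):
        print(ram, "GB RAM ->", default_memory_split(ram))
```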
17 NX NASTRAN MEMORY LAYOUT
18 NX NASTRAN MEMORY
- The f04 file gives a summary of the memory that was allocated. The allocations correspond to the areas shown on the previous slide.
- Here is an example from a TET10 model of around 650,000 elements:
    ** PHYSICAL FILES LARGER THAN 2GB ARE SUPPORTED ON THIS PLATFORM
    0 ** MASTER DIRECTORIES ARE LOADED IN MEMORY.
    USER OPENCORE (HICORE) = WORDS
    EXECUTIVE SYSTEM WORK AREA = WORDS
    MASTER(RAM) = WORDS
    SCRATCH(MEM) AREA = WORDS ( 100 BUFFERS)
    BUFFER POOL AREA (GINO/EXEC) = WORDS ( 51 BUFFERS)
    TOTAL NX NASTRAN MEMORY LIMIT = WORDS
- This model was run with mem=1673mb
- Remember, LP-64 is 4 bytes per word, and ILP-64 is 8 bytes per word
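Because the f04 file reports memory in words while FEMAP and the mem= keyword use megabytes, a quick conversion helps when comparing the two. The sketch below assumes 1 MB = 1024 x 1024 bytes, which is an assumption made here and not something the slides state; the function name is hypothetical.

```python
BYTES_PER_WORD = {"LP-64": 4, "ILP-64": 8}   # word sizes quoted on the slides
BYTES_PER_MB = 1024 * 1024                   # assumption: binary megabytes


def mb_to_words(mem_mb: int, version: str = "LP-64") -> int:
    """Convert a memory request in MB to a word count for the given build."""
    return mem_mb * BYTES_PER_MB // BYTES_PER_WORD[version]


def words_to_mb(words: int, version: str = "LP-64") -> float:
    """Convert a word count from the f04 file back to MB."""
    return words * BYTES_PER_WORD[version] / BYTES_PER_MB


if __name__ == "__main__":
    # The mem=1673mb run from the slide, under both word sizes
    print(mb_to_words(1673, "LP-64"))
    print(mb_to_words(1673, "ILP-64"))
```

The same request maps to half as many words under ILP-64, so word counts from the two builds are not directly comparable.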
19 HOW MUCH MEMORY IS ENOUGH?
- Look in the f04 file for USER OPENCORE (HICORE).
- Compare it to the HIWATER usage toward the end of the f04 file.
- If HIWATER is getting close to or over HICORE, the job would likely benefit from more memory (mem=x).
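The HIWATER versus HICORE comparison can be expressed as a small check once both numbers have been read from the f04 file by hand. The function name is hypothetical, the word counts in the example are made up for illustration, and the 90% threshold for "getting close" is an assumption, since the slide does not give a number.

```python
def memory_headroom(hicore_words: int, hiwater_words: int,
                    warn_fraction: float = 0.9) -> str:
    """Classify HIWATER usage relative to HICORE as 'ok', 'close', or 'over'."""
    if hicore_words <= 0:
        raise ValueError("HICORE must be positive")
    usage = hiwater_words / hicore_words
    if usage >= 1.0:
        return "over"      # HIWATER at or above HICORE: raise mem=
    if usage >= warn_fraction:
        return "close"     # assumption: 90% or more counts as "getting close"
    return "ok"


if __name__ == "__main__":
    # Hypothetical word counts, not taken from a real f04 file
    print(memory_headroom(hicore_words=300_000_000, hiwater_words=285_000_000))
```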
20 SETTING MEMORY SIZE IN FEMAP
- FEMAP uses MB units, and memory can be set in the NASTRAN Executive and Solution Options form. The default of 0 uses NASTRAN's default from the rcf file.
- On Windows, don't allocate more than about 50% of the machine's physical memory, to avoid performance issues (swapping). Less may be better, since the remaining memory is used by Windows for I/O caching.
- The NX Nastran 10 default of 45% is pretty good for most cases until you get to workstations/servers with a large amount of RAM.
21 HYPERTHREADING
- Some modern Intel CPUs support Hyperthreading. Hyperthreading works like a virtual CPU, where one CPU can run two threads.
- There can be a performance advantage on some desktop applications, but it's very small.
- Nastran, like other Windows programs, sees the virtual CPU as a real CPU, since that is what Intel intended.
- Since NX NASTRAN is very CPU intensive, it expects the virtual CPU to perform like a real CPU, but it won't.
- NX NASTRAN will usually perform better if you turn off Hyperthreading. This is typically done in the BIOS.
- Some Xeon processors do not have Hyperthreading for this reason.
22 Element Iterative Solver
- For models that are mostly solid elements, the Element Iterative Solver can offer significant performance improvements (2-3x)
- It does not help with shell or bar elements, and it is ignored in dynamics solutions
- Set this in the Solution form
23 NX Nastran Linear Contact Solutions
- Specify the proper search distance
- Large search distances typically involve more active contacts for the first few iterations
24 Multiple CPUs: SMP vs DMP
- Shared Memory Parallel (SMP) is a single machine with multiple processors that share common memory and a common I/O system (disks), as shown in the figure on the slide.
- Distributed Memory Parallel (DMP) is a set of multiple machines, or a cluster, with one or more processors each, communicating over a network. Each machine has its own memory and its own I/O system.
Figure labels: SMP, DMP
25 DMP vs. SMP
SMP (Shared Memory Parallel)
- Common memory pool, common I/O pool
- Desktop/laptop hardware
- Scaling tapers off at 8 or so cores
- No extra license needed
DMP (Distributed Memory Parallel)
- Multiple machines with one or more processors communicating over a network (desktop/cluster)
- Each machine has its own memory and disk I/O
- Uses the Message Passing Interface (MPI), which must be installed in the OS
- Highly scalable
- Extra license needed
- Can now be supported with a Femap license
DMP solutions
- 101: Linear Statics
- 103: Normal Modes
- 105/108/111/112: Buckling, Direct/Modal Frequency, Modal Transient Response
- 200: Design Optimization
26 Multiple CPUs: SMP Setup in FEMAP
- To use multiple CPUs for a NASTRAN run, set the number of processors in FEMAP, right above the Solver Memory setting.
- If you are running NASTRAN on your desktop machine and want to keep using it for other work, leave one CPU available for other applications.
- This can also be done in the input file with: NASTRAN PARALLEL=x
- PARALLEL is also a command line option, and can be set in the rc file if you would like a default number of processors.
- There is no extra license needed for SMP.
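A starting value for PARALLEL can be derived from the CPU count using the two rules of thumb in these slides: leave one CPU free on a desktop, and expect SMP scaling to taper off at around 8 cores. The helper names below are hypothetical and the cap of 8 is only that rule of thumb, not a hard limit. Note that `os.cpu_count()` counts logical CPUs, so with Hyperthreading enabled it reports more than the number of physical cores.

```python
import os


def suggest_parallel(cpus: int, keep_free: int = 1, cap: int = 8) -> int:
    """Suggest a PARALLEL value: leave some CPUs free, and cap at ~8."""
    if cpus < 1:
        raise ValueError("cpus must be at least 1")
    return max(1, min(cap, cpus - keep_free))


def nastran_parallel_line(n: int) -> str:
    """Format the input-file statement shown on the slide."""
    return f"NASTRAN PARALLEL={n}"


if __name__ == "__main__":
    detected = os.cpu_count() or 1   # cpu_count() can return None
    print(nastran_parallel_line(suggest_parallel(detected)))
```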
27 AMD PROFESSIONAL GRAPHICS ADVANTAGE
INNOVATION
- Simultaneous render & compute
- Up to six 4K displays
- Intelligent power technologies
PERFORMANCE
- Application optimizations
- Latest API support
- PCIe 3.0 support
RELIABILITY
- 100+ app certifications
- Rock-solid drivers
- Three year warranty
Image courtesy of Siemens PLM Software
AMD Professional Graphics for NX August
28 AMD FIREPRO W-SERIES GRAPHICS PRODUCT STACK
AMD FirePro W-Series, recommended for NX/FEMAP:
- UHE: W GB GDDR5, 275W; W8100, 8GB GDDR5, 220W
- HE: W7100, 8GB GDDR5, 150W
- Midrange: W5100, 4GB GDDR5, <75W; W4100, 2GB GDDR5, LP, <50W
- 2D/3D Entry: W2100, 2GB DDR3, LP, 26W
AMD Professional Graphics for NX August 2015
29 The Right Solution for your PLM Workflow
Simulation, NX Nastran
Large Assemblies and Rendering
AMD FirePro™ W9100, AMD FirePro™ W8100, AMD FirePro™ W7100, AMD FirePro™ W5100
Design and Validation
AMD FirePro™ W4100
Drafting and Modeling
AMD FirePro™ W2100
Visualize, Review and Mark-up
Images courtesy of Siemens PLM Software
30 NX NASTRAN
- High performance GPUs and OpenCL accelerate modal frequency response calculations in NX Nastran.
- This solution makes it possible to compute a large number of modes over a wide frequency range, economically and efficiently.
- Results of the AMD FirePro OpenCL acceleration for NX Nastran Modal Frequency Response:
  - Up to 25x faster than serial
  - Up to 4x faster than the top of the line 24-core CPU run time
Ref.: Siemens 2012 NX CAE Symposium presentation, "Accelerating Modal Frequency Response in NX Nastran with AMD GPUs" by Hoffnung and Reymond
OpenCL-accelerated solution system configuration: Supermicro H8DGi-F dual Opteron motherboard, 24-core Magny-Cours, with AMD FirePro W
31 SCALABLE PROFESSIONAL GPU SOLUTIONS
- AMD provides a wide range of products for a wide range of software solutions
- Desktop Workstations
- Servers
- Mobile Workstations & Thin Clients
32 Using GPGPU with NX Nastran (OpenCL)
- For modal frequency response (SOL 111) with more than 5000 modes, and if you have a fast GPU card such as the AMD FirePro W9100, it may help to turn on GPGPU acceleration in the NASTRAN Executive and Solution Options form.
- NVIDIA Tesla K40 and Intel Xeon Phi 7120D are also supported by NX Nastran.
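The rule of thumb on this slide reduces to three conditions, which can be captured in a one-line check. The function name and arguments are hypothetical; whether a particular card counts as "fast" is left to the caller, since the slide only names example cards.

```python
def gpgpu_may_help(solution: int, num_modes: int, has_fast_gpu: bool) -> bool:
    """True when the slide's conditions for trying GPGPU acceleration hold."""
    # SOL 111 (modal frequency response), more than 5000 modes, and a fast GPU
    return solution == 111 and num_modes > 5000 and has_fast_gpu


if __name__ == "__main__":
    print(gpgpu_may_help(solution=111, num_modes=8000, has_fast_gpu=True))
    print(gpgpu_may_help(solution=103, num_modes=8000, has_fast_gpu=True))
```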
33 FEMAP Performance Graphics
- Performance graphics vs. regular graphics comparison
- Model: 6 million nodes / elements
- Action: full model display / group / full model display
34 Graphics Preferences - Options
- Hardware Acceleration: Turning this off disables the hardware driver. Do so if you are having significant graphics problems and want to find out whether the graphics driver is the cause.
- Performance Graphics (11.1 and higher): Uses a new graphics architecture to improve the performance of the initial draw and dynamic rotation. Needs OpenGL 4.2 or higher.
- Memory Optimization: Should be off unless your models are very big and swapping is occurring. In that case, turning it on can improve drawing speed; otherwise it will slow things down.
- Multi-Model Memory: Uses more memory to make switching between models faster.
- Auto Regenerate: Forces a redraw after every command. It's slower, but keeps the graphics up to date during modifications.
35 Graphics Preferences - OpenGL
- Enabling the Performance Graphics option can dramatically improve performance on models with a large number of:
  - Solids
  - Points
  - Nodes
  - Solid and Shell Elements
- Set Max VBO MB (Memory) to no more than 75% of your graphics card memory. The sample shown is for a graphics card with 2 GB of VRAM.
- Min VBO B is set to 1024 by default, and this should work well with most graphics cards.
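The 75% guideline for Max VBO MB is simple arithmetic, shown here for the 2 GB card mentioned on the slide. The function name is hypothetical, and treating 2 GB as 2048 MB is an assumption about how the card's memory is reported.

```python
def max_vbo_mb(vram_mb: int, fraction: float = 0.75) -> int:
    """Upper bound for the Max VBO MB setting: 75% of graphics card memory."""
    if vram_mb <= 0:
        raise ValueError("VRAM size must be positive")
    return int(vram_mb * fraction)


if __name__ == "__main__":
    print(max_vbo_mb(2048))   # 2 GB card, assuming 2048 MB
```

For the 2 GB card this gives an upper bound of 1536 MB.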
36 Graphics Preferences - Dynamic Rotation
Include in Dynamic Rotation options: switching off any of these options should improve performance. Some key options:
- Element Symbols: if you have a lot of lumped masses and springs
- Mesh Size: if you have a large number of curves with mesh sizes on
- Labels and Undeformed: switching these off helps performance
- Elements as Free Edges: this has a slight delay in starting and finishing dynamic rotation, but dynamic rotation is much quicker. For some models, e.g. a mesh on a sphere, there is no free edge and you will see nothing as the model rotates.
37 Graphics Card Performance Considerations
- Desktop resolution should be taken into consideration when using Femap. A very high screen resolution can increase the time animations need to generate and the time individual windows need to refresh. This is something to consider for Ultra HD (4K/2160p) monitors with resolutions of 3840x2160.
- If Femap appears to be having graphics errors, the cause could be the driver for your graphics card. Update the drivers for your graphics card often!
- Drivers from the manufacturer of the graphics chipset tend to be more stable than drivers from the maker of the graphics card (e.g. use an ATI or NVIDIA driver rather than an ASUS driver).
- You should also set your graphics card performance settings to the defaults. In some cases, setting a card for optimum performance for an application may cause Femap to crash.
38 Database Preferences
- The database memory limit is set to 20% of available system RAM by default. When FEMAP needs more, it swaps to the scratch disk, slowing things down.
- Increasing this number leaves less RAM available for FEMAP operations other than the database. In some cases it may be better to lower this number.
- The Max Cached Label must be set to an ID higher than any entity in the model.
- Changing the Open/Save method may improve read/write performance if you are experiencing slow performance. Clicking the Read/Write Test button automatically runs a test and determines the best setting for your hardware. The test takes about 1.2 GB of disk space and a few minutes.
Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA Pak Lui, Gilad Shainer, Brian Klaff Mellanox Technologies Abstract From concept to
More informationFujitsu VDI / vgpu Virtualization
Fujitsu VDI / vgpu Virtualization Antti Sirkiä Service Partner Manager, Certified Trainer Fujitsu, Product Business Unit Why Virtualization / Graphics Virtualization? :: GRAPHICS VIRTUALIZATION :: Multiple
More informationSolving Large Complex Problems. Efficient and Smart Solutions for Large Models
Solving Large Complex Problems Efficient and Smart Solutions for Large Models 1 ANSYS Structural Mechanics Solutions offers several techniques 2 Current trends in simulation show an increased need for
More informationPC-based data acquisition II
FYS3240 PC-based instrumentation and microcontrollers PC-based data acquisition II Data streaming to a storage device Spring 2015 Lecture 9 Bekkeng, 29.1.2015 Data streaming Data written to or read from
More informationCS24: INTRODUCTION TO COMPUTING SYSTEMS. Spring 2017 Lecture 13
CS24: INTRODUCTION TO COMPUTING SYSTEMS Spring 2017 Lecture 13 COMPUTER MEMORY So far, have viewed computer memory in a very simple way Two memory areas in our computer: The register file Small number
More informationGPUs and Emerging Architectures
GPUs and Emerging Architectures Mike Giles mike.giles@maths.ox.ac.uk Mathematical Institute, Oxford University e-infrastructure South Consortium Oxford e-research Centre Emerging Architectures p. 1 CPUs
More informationPerformance Pack. Benchmarking with PlanetPress Connect and PReS Connect
Performance Pack Benchmarking with PlanetPress Connect and PReS Connect Contents 2 Introduction 4 Benchmarking results 5 First scenario: Print production on demand 5 Throughput vs. Output Speed 6 Second
More informationAnalyzing Performance and Power of Applications on GPUs with Dell 12G Platforms. Dr. Jeffrey Layton Enterprise Technologist HPC
Analyzing Performance and Power of Applications on GPUs with Dell 12G Platforms Dr. Jeffrey Layton Enterprise Technologist HPC Why GPUs? GPUs have very high peak compute capability! 6-9X CPU Challenges
More informationA Comprehensive Study on the Performance of Implicit LS-DYNA
12 th International LS-DYNA Users Conference Computing Technologies(4) A Comprehensive Study on the Performance of Implicit LS-DYNA Yih-Yih Lin Hewlett-Packard Company Abstract This work addresses four
More information8/28/12. CSE 820 Graduate Computer Architecture. Richard Enbody. Dr. Enbody. 1 st Day 2
CSE 820 Graduate Computer Architecture Richard Enbody Dr. Enbody 1 st Day 2 1 Why Computer Architecture? Improve coding. Knowledge to make architectural choices. Ability to understand articles about architecture.
More informationSamsung Magician v4.8 Introduction and Installation Guide
Samsung Magician v4.8 Introduction and Installation Guide 1 Legal Disclaimer SAMSUNG ELECTRONICS RESERVES THE RIGHT TO CHANGE PRODUCTS, INFORMATION AND SPECIFICATIONS WITHOUT NOTICE. Products and specifications
More informationUsing Graphics Chips for General Purpose Computation
White Paper Using Graphics Chips for General Purpose Computation Document Version 0.1 May 12, 2010 442 Northlake Blvd. Altamonte Springs, FL 32701 (407) 262-7100 TABLE OF CONTENTS 1. INTRODUCTION....1
More informationMSC Nastran Explicit Nonlinear (SOL 700) on Advanced SGI Architectures
MSC Nastran Explicit Nonlinear (SOL 700) on Advanced SGI Architectures Presented By: Dr. Olivier Schreiber, Application Engineering, SGI Walter Schrauwen, Senior Engineer, Finite Element Development, MSC
More informationFaster Innovation - Accelerating SIMULIA Abaqus Simulations with NVIDIA GPUs. Baskar Rajagopalan Accelerated Computing, NVIDIA
Faster Innovation - Accelerating SIMULIA Abaqus Simulations with NVIDIA GPUs Baskar Rajagopalan Accelerated Computing, NVIDIA 1 Engineering & IT Challenges/Trends NVIDIA GPU Solutions AGENDA Abaqus GPU
More informationThe personal computer system uses the following hardware device types -
EIT, Author Gay Robertson, 2016 The personal computer system uses the following hardware device types - Input devices Input devices Processing devices Storage devices Processing Cycle Processing devices
More informationCSE 591/392: GPU Programming. Introduction. Klaus Mueller. Computer Science Department Stony Brook University
CSE 591/392: GPU Programming Introduction Klaus Mueller Computer Science Department Stony Brook University First: A Big Word of Thanks! to the millions of computer game enthusiasts worldwide Who demand
More information2
1 2 3 4 5 All resources: how fast, how many? If all the CPUs are pegged, that s as fast as you can go. CPUs have followed Moore s law, the rest of the system hasn t. Not everything can be made threaded,
More informationBuilding NVLink for Developers
Building NVLink for Developers Unleashing programmatic, architectural and performance capabilities for accelerated computing Why NVLink TM? Simpler, Better and Faster Simplified Programming No specialized
More informationBest practices to achieve optimal memory allocation and remote desktop user experience
E-Guide Best practices to achieve optimal memory allocation and remote desktop user experience Many virtual machines don t fully utilize their available RAM, just like they don t fully utilize their available
More informationNode Hardware. Performance Convergence
Node Hardware Improved microprocessor performance means availability of desktop PCs with performance of workstations (and of supercomputers of 10 years ago) at significanty lower cost Parallel supercomputers
More informationGPU Architecture. Alan Gray EPCC The University of Edinburgh
GPU Architecture Alan Gray EPCC The University of Edinburgh Outline Why do we want/need accelerators such as GPUs? Architectural reasons for accelerator performance advantages Latest GPU Products From
More informationThree OPTIMIZING. Your System for Photoshop. Tuning for Performance
Three OPTIMIZING Your System for Photoshop Tuning for Performance 72 Power, Speed & Automation with Adobe Photoshop This chapter goes beyond speeding up how you can work faster in Photoshop to how to make
More informationDell EMC Ready Bundle for HPC Digital Manufacturing ANSYS Performance
Dell EMC Ready Bundle for HPC Digital Manufacturing ANSYS Performance This Dell EMC technical white paper discusses performance benchmarking results and analysis for ANSYS Mechanical, ANSYS Fluent, and
More informationgoals review some basic concepts and terminology
goals review some basic concepts and terminology understand the major components of a computer and issues surrounding selection CPU motherboard (mainboard) and chipset storage and other high-performance
More informationIntroduction to GPU hardware and to CUDA
Introduction to GPU hardware and to CUDA Philip Blakely Laboratory for Scientific Computing, University of Cambridge Philip Blakely (LSC) GPU introduction 1 / 35 Course outline Introduction to GPU hardware
More informationRepresentation of the interested Bidders / vendors. Form no. T2 (TECHNICAL MINIMUM SPECIFICATIONS)
Sr. no. Clause no./page No. Item & Specification in the tender Bidder / Vendor s representation Response to the Bidders Page No.12 1 Chassis: 5U Rack Mountable or Higher Please consider Minimum 2U Rack
More informationPedraforca: a First ARM + GPU Cluster for HPC
www.bsc.es Pedraforca: a First ARM + GPU Cluster for HPC Nikola Puzovic, Alex Ramirez We ve hit the power wall ALL computers are limited by power consumption Energy-efficient approaches Multi-core Fujitsu
More informationThe Mont-Blanc approach towards Exascale
http://www.montblanc-project.eu The Mont-Blanc approach towards Exascale Alex Ramirez Barcelona Supercomputing Center Disclaimer: Not only I speak for myself... All references to unavailable products are
More informationSystem recommendations for version 17.1
System recommendations for version 17.1 This article contains information about recommended hardware resources and network environments for version 17.1 of Sage 300 Construction and Real Estate. NOTE:
More informationHigh Performance Computing with Accelerators
High Performance Computing with Accelerators Volodymyr Kindratenko Innovative Systems Laboratory @ NCSA Institute for Advanced Computing Applications and Technologies (IACAT) National Center for Supercomputing
More informationLENOVO WORKSTATIONS. The best designed workstations ever.
LENOVO WORKSTATIONS The best designed workstations ever. THINKSTATION P910 vs. COMPETITION EXTREME POWER AND PERFORMANCE P910 FLEX TECHNOLOGY Lenovo ThinkStations feature unique FLEX Technology, providing
More informationCIT 668: System Architecture. Computer Systems Architecture
CIT 668: System Architecture Computer Systems Architecture 1. System Components Topics 2. Bandwidth and Latency 3. Processor 4. Memory 5. Storage 6. Network 7. Operating System 8. Performance Implications
More informationAccelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing
Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product
More informationIllinois Proposal Considerations Greg Bauer
- 2016 Greg Bauer Support model Blue Waters provides traditional Partner Consulting as part of its User Services. Standard service requests for assistance with porting, debugging, allocation issues, and
More informationMinerva. Performance & Burn In Test Rev AD903A/AD903D Converter Card. Table of Contents. 1. Overview
Minerva AD903A/AD903D Converter Card Performance & Burn In Test Rev. 1.0 Table of Contents 1. Overview 2. Performance Measurement Tools and Results 2.1 Test Platform 2.2 Test target and Used SATA III SSD
More informationn N c CIni.o ewsrg.au
@NCInews NCI and Raijin National Computational Infrastructure 2 Our Partners General purpose, highly parallel processors High FLOPs/watt and FLOPs/$ Unit of execution Kernel Separate memory subsystem GPGPU
More informationDell Precision Workstations
Dell Precision Workstations #1 Workstation brand in the world! Celebrating 20 years of great minds using great machines. For 20 years, Dell Precision has been delivering innovative, high performance workstations
More informationECE 571 Advanced Microprocessor-Based Design Lecture 20
ECE 571 Advanced Microprocessor-Based Design Lecture 20 Vince Weaver http://www.eece.maine.edu/~vweaver vincent.weaver@maine.edu 12 April 2016 Project/HW Reminder Homework #9 was posted 1 Raspberry Pi
More informationSamsung V-NAND SSD 970 EVO Plus
Samsung V-NAND SSD 970 EVO Plus 2019 1 DISCLAIMER SAMSUNG ELECTRONICS RESERVES THE RIGHT TO CHANGE PRODUCTS, INFORMATION AND SPECIFICATIONS WITHOUT NOTICE. Products and specifications discussed herein
More informationStorage Devices for Database Systems
Storage Devices for Database Systems 5DV120 Database System Principles Umeå University Department of Computing Science Stephen J. Hegner hegner@cs.umu.se http://www.cs.umu.se/~hegner Storage Devices for
More informationVirtual Security Server
Data Sheet VSS Virtual Security Server Security clients anytime, anywhere, any device CENTRALIZED CLIENT MANAGEMENT UP TO 50% LESS BANDWIDTH UP TO 80 VIDEO STREAMS MOBILE ACCESS INTEGRATED SECURITY SYSTEMS
More informationLAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015
LAMMPS-KOKKOS Performance Benchmark and Profiling September 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox, NVIDIA
More informationMINERVA. Performance & Burn In Test Rev AD912A Interposer Card. Table of Contents. 1. Overview
MINERVA AD912A Interposer Card Performance & Burn In Test Rev. 1.0 Table of Contents 1. Overview 2. Performance Measurement Tools and Results 2.1 Test Platform 2.2 Test target and Used msata III SSD 2.3
More informationThe knight makes his play for the crown Phi & Omni-Path Glenn Rosenberg Computer Insights UK 2016
The knight makes his play for the crown Phi & Omni-Path Glenn Rosenberg Computer Insights UK 2016 2016 Supermicro 15 Minutes Two Swim Lanes Intel Phi Roadmap & SKUs Phi in the TOP500 Use Cases Supermicro
More informationCME 213 S PRING Eric Darve
CME 213 S PRING 2017 Eric Darve Summary of previous lectures Pthreads: low-level multi-threaded programming OpenMP: simplified interface based on #pragma, adapted to scientific computing OpenMP for and
More informationFuture Trends in Hardware and Software for use in Simulation
Future Trends in Hardware and Software for use in Simulation Steve Feldman VP/IT, CD-adapco April, 2009 HighPerformanceComputing Building Blocks CPU I/O Interconnect Software General CPU Maximum clock
More informationMicrosoft SQL Server 2012 Fast Track Reference Configuration Using PowerEdge R720 and EqualLogic PS6110XV Arrays
Microsoft SQL Server 2012 Fast Track Reference Configuration Using PowerEdge R720 and EqualLogic PS6110XV Arrays This whitepaper describes Dell Microsoft SQL Server Fast Track reference architecture configurations
More information