BLAST on Intel EM64T Architecture

Size: px
Start display at page:

Download "BLAST on Intel EM64T Architecture"

Transcription

1 Performance Characterization of BLAST on Intel EM64T Architecture Intel Extended Memory 64 Technology (EM64T) is designed to provide enhancements to IA-32 architecture that can support 64-bit address extensions. This article introduces EM64T, including a discussion of different features and modes of operation. In addition, this article examines the performance of the basic local alignment search tool (BLAST) in EM64T s various operating modes on a Dell PowerEdge 1850 server. BY GARIMA KOCHHAR, KALYANA CHADALAVADA, AMINA SAIFY, AND RIZWAN ALI Several high-performance computing (HPC) applications from the domains of simulation, modeling, and image rendering can benefit from a larger addressable memory space than is currently available on 32-bit systems. HPC applications from the fields of cryptography and scientific computing may also benefit from a 64-bit processor architecture. Intel Extended Memory 64 Technology (EM64T) 1 architecture can help provide a way for such applications to take advantage of a 64-bit address space without requiring the applications to be ported to an entirely new 64-bit architecture an effort that typically involves recoding and retesting, which can potentially absorb significant resources. 2 In this article, the basic local alignment search tool (BLAST) is used as an example application to introduce the performance characteristics of EM64T architecture. BLAST is an application used in biological research that provides a method to rapidly search nucleotide and protein databases. Overview of EM64T Introduced in early 2004, Intel EM64T is a 64-bit extension architecture that was previously referred to as Clackamas Technology (CT) or IA-32 extension (IA32e). The idea behind the EM64T design is to allow applications to address larger amounts of memory than 32-bit processors and to support the coexistence of 32-bit and 64-bit applications in a 64-bit address space. A processor that is EM64T-capable requires an EM64T-aware BIOS. The EM64T specification calls for no other platform changes. An EM64T-capable processor is designed to support both 32-bit and 64-bit 3 operating systems, although only one operating system (OS) can be active on a physical server at a time. On a server running an EM64T OS, both 32-bit and 64-bit applications can be active at the same time. Apart from the capability to address a larger memory space natively than 32-bit processors and support 64-bit applications, an EM64T processor also includes features and enhancements such as Intel Hyper-Threading Technology, the Streaming SIMD (single instruction, multiple data) Extensions 3 (SSE3) instruction set, an 800 MHz frontside bus (FSB), and Demand-Based Switching (DBS) power management (see Figure 1). Compared to previous-generation Intel Xeon processors, which were based on 0.13 micron process technology, 1 For more information about Intel EM64T, visit and 2,3 With the advent of the Intel EM64T architecture, Intel now provides two distinct paths for 64-bit computing: EM64T and Itanium (IA-64). Because the actual chip architectures and instruction sets are different, the two processor chips require different versions of operating systems. This article discusses the EM64T version of operating system. 46 POWER SOLUTIONS Reprinted from Dell Power Solutions, October Copyright 2004 Dell Inc. All rights reserved. October 2004

2 Function Feature Description 64-bit processing EM64T EM64T includes extended memory addressability (that is, 64-bit pointers and registers); eight additional registers for SSE3; eight additional general-purpose registers; and double-precision integer support. IA32e mode (discussed in the section Different operating modes of an EM64T processor in this article) will support both 32-bit and 64-bit applications side by side under a 64-bit operating system. Bus bandwidth 800 MHz The FSB provides the processor with more bandwidth FSB and faster access to memory than in previousgeneration Intel Xeon processors, which offered a 533 MHz FSB. Instruction set SSE3 SIMD programming processes more data with fewer instructions compared to the traditional single instruction, single data (SISD) processors, thus helping to improve performance. SSE3 enables accelerated processing of complex arithmetic and video decoding. Power DBS DBS minimizes energy consumption and heat by management changing processor performance or power state (that is, voltage or clock frequency) in response to processor utilization. EM64T-capable Intel Xeon processors are based on a smaller, 90 nm line geometry allowing designers to fit denser circuitry on the same size processor to help enable enhanced clock frequencies. Furthermore, Intel has doubled the level 1 (L1) and level 2 (L2) cache sizes on EM64T compared to the previous-generation Intel Xeon processors. EM64T s L1 data cache has been increased to 16 KB and its L2 cache has been increased to 1 MB up from the 8 KB L1 data cache and 512 KB L2 cache on previous-generation Intel Xeon processors. Different operating modes of an EM64T processor Installing a 32-bit OS on a server with 32-bit drivers and 32-bit compilers enables support only for 32-bit applications. This setup, known as Legacy mode, represents an IA-32 environment that does not support 64-bit applications. Installing the 64-bit EM64T OS with 64-bit drivers and both 32-bit and 64-bit compilers enables support for 32-bit applications as well as 64-bit applications simultaneously on the same server. This is IA32e mode, also known as Long mode. IA32e mode is an extension of Legacy mode, and it supports two sub-modes: Compatibility mode and 64-bit mode. A 32-bit application running on a 64-bit OS will execute in Compatibility mode. A 64-bit application running on a 64-bit OS will execute in 64-bit mode. The 32-bit application will continue to use 32-bit components such as libraries and compilers that are included in the 64-bit OS. Note: The 64-bit OS for EM64T is not the same as an IA-64 OS. EM64T and IA-64 are two entirely different architectures with different instruction sets. A 32-bit application compiled with a 32-bit compiler and a 64- bit application compiled with a 64-bit EM64T compiler can execute simultaneously in a 64-bit OS in their respective modes. These two modes are transparent to the user; the only caveat is that separate binaries must be generated for 32-bit and 64-bit applications using the appropriate compilers. These requirements are summarized in Figure 2, and further detail is provided in the following sections. 4 Legacy mode. Legacy mode includes three sub-modes: Protected mode, Real Address mode, and Virtual 8086 mode. Legacy mode is designed to execute a 32-bit application the same way the application would execute on a pure IA-32 processor. An EM64T processor running a 32-bit OS will operate in Protected mode. The processor will transition to IA32e mode once a bit in the IA-32 Extended Feature Enabled Register (IA32-EFER) is set and the Physical Address Extension (PAE) mode is enabled by the EM64T OS. IA32e mode. Both IA32e sub-modes 64-bit mode and Compatibility mode provide eight floating-point registers (in addition to the original eight in previous-generation Intel Xeon processors). However, 64-bit mode and Compatibility mode provide different numbers of general-purpose registers. 64-bit mode: This mode is enabled when a 64-bit application runs on a 64-bit OS. The 64-bit mode is enabled by the OS on a code-segment-by-code-segment basis and allows 32-bit and 64-bit applications to execute simultaneously. In 64-bit mode, all registers, including the instruction pointer, are 64 bits. This mode has a total of 16 SSE3 128-bit registers, and 13 additional SSE3 instructions. The default operand is 32 bits but can be extended to 64 bits by the compiler using the provided operation code prefixes. The 32-bit operands generate 32-bit results that are zero-extended to 64-bit results in the destination 64-bit registers. The EM64T Intel Xeon processor DP has 36 bits of address bus width designed to support up to 64 GB of physical IA32e mode/long mode IA-64 Legacy Compatibility 64-bit mode architecture Mode mode (bits) mode (bits) (bits) (bits) Default Address 32/16 32/ sizes Operand 32/16 32/ General Purpose Register (GPR) width OS needed Application recompile No No Yes Yes needed Figure 2. Operating modes of an EM64T processor 4 For more information about EM64T, refer to volumes 1 and 2 of the 64-bit Extension Technology Software Developer s Guide (visit 48 POWER SOLUTIONS October 2004

3 address space. In comparison, the EM64T Intel Xeon processor MP is expected to have 40 bits of address bus width designed to support up to 1 TB of physical address space. Both processors are designed to support 48 bits of virtual memory, which is designed to address up to 256 TB of memory. The limit on physical addressing is implementation-specific and not bounded by the technology. In addition, 64-bit mode can allow two kinds of paging mechanisms: three-level paging with 2 MB pages and four-level paging with 4 KB pages. Compatibility mode: Compatibility mode can permit legacy 16-bit and 32-bit applications to execute without recompilation under a 64-bit OS. Like 64-bit mode, Compatibility mode is enabled by the OS on an individual codesegment basis. In Compatibility mode, applications can access only up to the first 4 GB of linear address space, standard IA-32 instruction prefixes, and standard IA-32 registers. Compatibility mode must also use 16-bit and 32-bit address and operand sizes. Compatibility mode, like Protected mode, is designed to allow applications to access up to 64 GB of physical memory using PAE. However, some differences exist between Compatibility mode and Protected mode. Virtual 8086 mode, task switches, and stack parameter copy features are neither available nor supported in Compatibility mode. Thus, legacy applications that run in Virtual 8086 mode or use hardware task management will not work in Compatibility mode. Furthermore, from the viewpoint of the OS, 64-bit mechanisms are used to handle system data structures, address translation, and interrupt and exception handling in Compatibility mode (instead of the 32-bit mechanisms that are used in Protected mode). Advantages of EM64T EM64T is designed to provide investment protection by allowing administrators to revert to a standard IA-32 environment (using Protected mode or Compatibility mode) if their existing applications do not experience any benefit from executing under the 64-bit architecture. EM64T also helps provide a proving ground that enables organizations to develop and test 64-bit versions of applications using 64-bit mode from the vantage point of the familiar IA-32 environment. The main advantage of EM64T is its capability to allow legacy 32-bit applications as well as 64-bit applications to execute on the same platform and even on the same OS. This capability has the potential to lower total cost of ownership (TCO) because both 32-bit and 64-bit applications can coexist on an EM64T-capable processor without the need for extra investments such as servers that have 64-bit processors in addition to servers that have 32-bit processors. Instead, the EM64T architecture enables organizations to use a single server to run both 32-bit and 64-bit applications. A 32-bit application can be migrated as is or recompiled to help leverage architectural enhancements and features such as SSE3, a faster FSB, and Hyper- Threading Technology. Memory-constrained 32-bit applications have the potential to experience immediate performance improvements from the larger addressable memory space. Before organizations can benefit from the full range of EM64T features and functionality, they may be required to invest in porting the application code, recompiling, and testing performance in 64-bit mode. This investment in application porting can be made over a period of time because existing 32-bit applications can be run during the transition. EM64T-capable processors are priced below the IA-64 Intel Itanium processor, to help enable organizations to utilize the power of 64-bit computing at an affordable price. Overview of BLAST The BLAST family of sequence database-search algorithms serves as the foundation for much biological research. BLAST algorithms search for similarities between a short query sequence and a large, infrequently changing database of DNA or amino acid sequences. Newly discovered sequences are commonly searched against a database of known DNA or amino acid sequences. Similarities between the new sequence and a gene of known functions can help identify the function of the new sequence. The scores assigned in a BLAST search have a well-defined statistical interpretation, making real matches easier to distinguish from random background hits. BLAST uses a heuristic algorithm that seeks local as opposed to global alignments. Other uses of BLAST searches include phylogenetic profiling and pairwise genome alignment. NCBI BLAST Established in 1988 as a national resource for molecular biology information, the National Center for Biotechnology Information (NCBI) creates public databases, conducts research in computational biology, develops software tools for analyzing genome data, and disseminates biomedical information. NCBI hosts a public-domain implementation of the BLAST algorithms. This implementation supports uniprocessor and symmetric multiprocessing (SMP) computing systems and is popularly known as NCBI BLAST. 5 NCBI BLAST implements database fragmentation techniques when using multithreading on SMP architecture. 5 For more information about NCBI and NCBI BLAST, visit and refer to Basic local alignment search tool by Stephen F. Altschul, Warren Gish, Webb Miller, Eugene W. Myers, and David J. Lipman (1990) at POWER SOLUTIONS 49

4 Unfortunately, traditional approaches to sequence homology searches using BLAST have proven to be too slow to keep up with the current rate of sequence acquisition. Because BLAST is both computationally intensive and exceedingly parallel, it is an excellent candidate for parallel implementation. Several open source and commercial implementations are available that Because sequence comparison is an exceedingly parallel operation, the BLAST application can be an excellent choice for parallel processing. enable BLAST to execute on a cluster of standards-based servers. An example of one such open source parallel implementation, based on Message Passing Interface (MPI), is mpiblast. 6 BLAST characteristics The BLAST application is memory intensive and highly sensitive to memory bandwidth. BLAST uses the mmap function to map file system objects into virtual memory. The file system objects comprise formatted data sets that are usually very large in size potentially tens of gigabytes. Therefore the universal buffer cache (UBC) is heavily used to keep the data sets cached in memory. If the entire data set cannot be cached in memory because of the OS s limited UBC parameter settings or because of low memory caused either by inadequate physical memory to hold the data sets or by the processor architecture s limited memory-addressing capability the file system objects will be paged. This page swapping can significantly reduce runtime performance. However, the BLAST application does not involve any memory write-back operations on the database which alleviates the burden on CPU and memory and the fetched data can be deleted immediately after use. Hence the memory-to-cpu bandwidth is also an influential factor in the performance of BLAST. Because sequence comparison is an exceedingly parallel operation, the BLAST application can be an excellent choice for parallel processing. BLAST can be run on an SMP system as well as on a cluster of standards-based servers. In either case, a divide-andconquer technique is employed on the database. BLAST uses threads to achieve parallelism on an SMP system. In an SMP system, memory latency can become more important than in a parallel-processing cluster because the memory subsystem must cater to multiple processes. When all the processors share the same memory bus, unless the bus is capable of an adequate data-transfer rate it can become a bottleneck that cripples application performance. BLAST is an integer operation intensive application. This implies that standard benchmarks such as SPECint 7 can be useful in selecting the right hardware platform for processing the BLAST application. In addition, SPECint results can be used to cross-check the results from BLAST. Test bed details Although for pure performance it would be optimal to run BLAST in a cluster, for the purpose of the study conducted in June 2004 by Dell engineers for this article, BLAST was run on a single processor. The aim of the Dell study was to compare and understand the performance of the different modes of operation offered by the EM64T architecture. To understand how the performance of BLAST compares in the three modes of operation Protected mode, Compatibility mode, and 64-bit mode the following three test cases were performed: Test case 1: Compiling and executing BLAST on a 32-bit OS in Legacy mode. Test case 2: Executing the 32-bit binary compiled in test case 1 on a 64-bit EM64T OS in Compatibility mode. Test case 3: Compiling and executing BLAST on a 64-bit EM64T OS in 64-bit mode. For all three test cases, the BLAST database and queries were run on a Dell PowerEdge1850 server configured as detailed in the following sections. BLAST setup Two BLAST algorithms blastp and blastn were used to benchmark BLAST performance in all three test cases in this study. The blastp algorithm performs an amino-acid-versus-amino-acid search against a database of protein sequences, while the blastn algorithm performs a nucleotide-versus-nucleotide search against the database of protein sequences. The following databases were used: Protein: The nr data set comprised 2,715,099 sequences represented by 757,237,578 letters (1 GB). Nucleotide: The nt data set comprised 2,218,584 sequences represented by 10,347,811,295 letters (10 GB). Queries of following word sizes were used: Protein: 153,117; 206,848; 237,455 Nucleotide: 94,799; 143,357; 187,517 6 For more information about mpiblast, see The Design, Implementation, and Evaluation of mpiblast by Aaron Darling, Lucas Carey, and Wu-chun Feng (2003) at For information about MPI and MPICH, a portable implementation of MPI, visit and 7 For more information about SPECint, visit 50 POWER SOLUTIONS October 2004

5 Time (sec) 25,000 23,000 21,000 19,000 17,000 15,000 13,000 11,000 Test case 1: 32-bit OS and 32-bit binary in Legacy mode Test case 2: 64-bit EM64T OS and 32-bit binary in Compatibility mode Test case 3: 64-bit EM64T OS and 64-bit binary in 64-bit mode The servers used the following operating systems: Red Hat Enterprise Linux WS 3 for IA-32 applications Red Hat Enterprise Linux WS 3 (update 2) for EM64T applications The following compilers were used: Intel C and Fortran Compilers 8.0 for IA-32 applications Intel C and Fortran Compilers 8.1 (beta) for EM64T applications 9,000 The follow execution parameters were used for runs in all three test cases: All databases were uncompressed and were in FASTA format. All databases were formatted using the formatdb utility (provided by NCBI BLAST source code). All databases were staged on local disks before executing the query. All time measurements were accomplished using the Linux time command. Server setup 153, , ,455 Query word size Figure 3. Performance of blastp on a PowerEdge 1850 server in different modes of operation The testing environment was based on a single Dell PowerEdge 1850 server with the following specifications: Intel E7520 chip set Two EM64T-based Intel Xeon processors running at 3.6 GHz 1 MB of L2 cache An 800 MHz FSB 8 GB of double data rate 2 (DDR2) RAM at 400 MHz obtained by using four 2 GB dual in-line memory modules (DIMMs) Two integrated Intel 82541GI/PI Gigabit 8 Ethernet controllers Intel Hyper-Threading Technology turned off Note: The E7520 chip set is designed to accommodate up to six registered 400 MHz DDR 2 DIMMs and provide an option for dual 64-bit Peripheral Component Interconnect Extended (PCI-X) slots or dual PCI Express x4 slots for interconnects and appropriate network interface cards (NICs). Results and analysis The two BLAST algorithms blastp and blastn were executed with the relevant databases. The test results for blastp are shown in Figure 3. The results in Figure 3 show that the 64-bit binary executed the blastp algorithm on a 64-bit OS (test case 3) consistently better than a 32-bit binary (test cases 1 and 2). The 64-bit binary in test case 3 showed approximately 5 percent performance improvement (measured by taking the average improvement attained by the 64-bit binary across the three queries) over the 32-bit binary in test case 1, which can be attributed to the larger addressable memory space available to the application in 64-bit mode. In addition to a larger addressable memory space, 64-bit mode provides more registers than Legacy mode and also more registers than are available for 32-bit applications in Compatibility mode. In this study, the increased number of registers in 64-bit mode helped reduce the number of register memory operations, thereby improving performance in test case 3 versus test cases 1 and 2. Performance of a 32-bit binary on a 64-bit OS (test case 2) was comparable to that of a 32-bit binary on a 32-bit OS (test case 1). Similar behavior was observed when testing blastn (see Figure 4). When executing the blastn algorithm, the 64-bit binary on the 64-bit OS (test case 3) outperformed the 32-bit binaries (test cases 1 and 2) by 6 percent (measured by taking the average improvement attained by the 64-bit binary across the three queries) because the application was able to utilize all the available physical memory without being restricted by the 4 GB limit imposed by 32-bit architecture. This advantage is more relevant in the blastn example In this study, the increased number of registers in 64-bit mode helped reduce the number of register memory operations, thereby improving performance. 8 This term does not connote an actual operating speed of 1 Gbps. For high-speed transmission, connection to a Gigabit Ethernet server and network infrastructure is required. POWER SOLUTIONS 51

6 Time (sec) 1,250 1,150 1, Test case 1: 32-bit OS and 32-bit binary in Legacy mode Test case 2: 64-bit EM64T OS and 32-bit binary in Compatibility mode Test case 3: 64-bit EM64T OS and 64-bit binary in 64-bit mode 94, , ,517 Query word size Figure 4. Performance of blastn on a PowerEdge 1850 server in different modes of operation than in the blastp example, because the size of the database being used is more than 4 GB. (The nt data set is 10 GB in size, while the nr data set is only 1 GB.) Additional registers provided by 64-bit mode reduced the number of register memory copies, thus improving the performance of the application. The 32-bit binary on a 32-bit OS (test case 1) performed almost as well as the 32-bit binary on a 64-bit OS (test case 2) when executing both blastp and blastn. This behavior was expected because Compatibility mode helped to ensure that the differences in the platform were transparent to the application when the 32-bit binary was being executed on a 64-bit OS. In fact, there may be a slight degradation in the performance of a 32-bit application on a 64-bit OS because the remaining 32 bits have to be zero-padded that is, filled in with zeros to form 64-bit words for proper execution. However, in this study the degradation observed was not significant. The 5 to 6 percent performance improvement for both blastp and blastn in 64-bit mode (test case 3) was achieved only by recompiling the application code with 64-bit compilers. Porting the code was unnecessary to achieve this gain. An option for transitioning to 64-bit computing The Intel EM64T based processor architecture is designed to run either a 32-bit OS or a 64-bit OS, thereby enabling enterprises to support both 32-bit and 64-bit applications on the same hardware platform. The study discussed in this article showed that the performance of a 32-bit binary on the 64-bit EM64T OS was comparable to that of a 32-bit binary on a 32-bit IA-32 OS. By comparing the performance of BLAST on a PowerEdge 1850 server using three modes of operation Legacy mode, Compatibility mode, and 64-bit mode Dell engineers demonstrated in this study that a 64-bit binary can exploit the advantages of Intel EM64T extended memory and other processor enhancements to achieve higher levels of performance than an IA-32 OS can provide. To take advantage of benefits provided by the Intel EM64T processor s 64-bit mode, organizations need to recompile 32-bit application code with an EM64T compiler; recoding and retesting code should not be necessary in most cases. Re-certification of applications may be required depending on how much nonstandard coding the application uses. 9 By potentially saving organizations the time, effort, and resources required to port code, EM64T can help make the transition to 64-bit computing easy and affordable while providing 32-bit compatibility. Garima Kochhar is a systems engineer in the Scalable Systems Group at Dell. She has a B.S. in Computer Science and Physics from Birla Institute of Technology and Science (BITS) in Pilani, India. She has an M.S. from The Ohio State University, where she worked in the area of job scheduling. Kalyana Chadalavada is a senior engineer with the Dell Enterprise Solutions Engineering (ESE) Group at Bangalore Development Center (BDC). Kalyan has a bachelor of technology degree in Computer Sciences and Engineering from Nagarjuna University, India. His current interests include performance characterizations on HPC clusters and processor architectures. Amina Saify is a member of the Scalable Systems Group at Dell. Amina has a bachelor s degree in Computer Science from Devi Ahilya University (DAVV) in India, and a master s degree in Computer and Information Science from The Ohio State University. Rizwan Ali is a systems engineer in the Scalable Systems Group at Dell. His current research interests are performance benchmarking and high-speed interconnects. Rizwan has a B.S. in Electrical Engineering from the University of Minnesota. FOR MORE INFORMATION Intel EM64T home page: EM64T developer s guide: EM64T Intel Xeon processor MP: part2.html NCBI and NCBI BLAST: mpiblast: 9 For more information about porting 32-bit applications to the EM64T architecture, see 52 POWER SOLUTIONS October 2004

Molecular Dynamics and Quantum Mechanics Applications

Molecular Dynamics and Quantum Mechanics Applications Understanding the Performance of Molecular Dynamics and Quantum Mechanics Applications on Dell HPC Clusters High-performance computing (HPC) clusters are proving to be suitable environments for running

More information

Moving from 32 to 64 bits while maintaining compatibility. Orlando Ricardo Nunes Rocha

Moving from 32 to 64 bits while maintaining compatibility. Orlando Ricardo Nunes Rocha Moving from 32 to 64 bits while maintaining compatibility Orlando Ricardo Nunes Rocha Informatics Department, University of Minho 4710 Braga, Portugal orocha@deb.uminho.pt Abstract. The EM64T is a recent

More information

Maximizing NFS Scalability

Maximizing NFS Scalability Maximizing NFS Scalability on Dell Servers and Storage in High-Performance Computing Environments Popular because of its maturity and ease of use, the Network File System (NFS) can be used in high-performance

More information

Getting the Best Performance from an HPC Cluster: BY BARIS GULER; JENWEI HSIEH, PH.D.; RAJIV KAPOOR; LANCE SHULER; AND JOHN BENNINGHOFF

Getting the Best Performance from an HPC Cluster: BY BARIS GULER; JENWEI HSIEH, PH.D.; RAJIV KAPOOR; LANCE SHULER; AND JOHN BENNINGHOFF Getting the Best Performance from an HPC Cluster: A STAR-CD Case Study High-performance computing (HPC) clusters represent a new era in supercomputing. Because HPC clusters usually comprise standards-based,

More information

Bioinformatics explained: BLAST. March 8, 2007

Bioinformatics explained: BLAST. March 8, 2007 Bioinformatics Explained Bioinformatics explained: BLAST March 8, 2007 CLC bio Gustav Wieds Vej 10 8000 Aarhus C Denmark Telephone: +45 70 22 55 09 Fax: +45 70 22 55 19 www.clcbio.com info@clcbio.com Bioinformatics

More information

Performance Comparisons of Dell PowerEdge Servers with SQL Server 2000 Service Pack 4 Enterprise Product Group (EPG)

Performance Comparisons of Dell PowerEdge Servers with SQL Server 2000 Service Pack 4 Enterprise Product Group (EPG) Performance Comparisons of Dell PowerEdge Servers with SQL Server 2000 Service Pack 4 Enterprise Product Group (EPG) Dell White Paper By Neelima Chinthamani (Enterprise OS Releases) Ravikanth Chaganti

More information

BLAST, Profile, and PSI-BLAST

BLAST, Profile, and PSI-BLAST BLAST, Profile, and PSI-BLAST Jianlin Cheng, PhD School of Electrical Engineering and Computer Science University of Central Florida 26 Free for academic use Copyright @ Jianlin Cheng & original sources

More information

On the Efficacy of Haskell for High Performance Computational Biology

On the Efficacy of Haskell for High Performance Computational Biology On the Efficacy of Haskell for High Performance Computational Biology Jacqueline Addesa Academic Advisors: Jeremy Archuleta, Wu chun Feng 1. Problem and Motivation Biologists can leverage the power of

More information

Understanding Dual-processors, Hyper-Threading Technology, and Multicore Systems

Understanding Dual-processors, Hyper-Threading Technology, and Multicore Systems Understanding Dual-processors, Hyper-Threading Technology, and Multicore Systems This paper will provide you with a basic understanding of the differences among several computer system architectures dual-processor

More information

Performance Scaling. When deciding how to implement a virtualized environment. with Dell PowerEdge 2950 Servers and VMware Virtual Infrastructure 3

Performance Scaling. When deciding how to implement a virtualized environment. with Dell PowerEdge 2950 Servers and VMware Virtual Infrastructure 3 Scaling with Dell PowerEdge 2950 Servers and VMware Virtual Infrastructure 3 To assess virtualization scalability and performance, Dell engineers tested two Dell PowerEdge 2950 servers with dual-core Intel

More information

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief Meet the Increased Demands on Your Infrastructure with Dell and Intel ServerWatchTM Executive Brief a QuinStreet Excutive Brief. 2012 Doing more with less is the mantra that sums up much of the past decade,

More information

OPEN MP-BASED PARALLEL AND SCALABLE GENETIC SEQUENCE ALIGNMENT

OPEN MP-BASED PARALLEL AND SCALABLE GENETIC SEQUENCE ALIGNMENT OPEN MP-BASED PARALLEL AND SCALABLE GENETIC SEQUENCE ALIGNMENT Asif Ali Khan*, Laiq Hassan*, Salim Ullah* ABSTRACT: In bioinformatics, sequence alignment is a common and insistent task. Biologists align

More information

InfoBrief. Dell 2-Node Cluster Achieves Unprecedented Result with Three-tier SAP SD Parallel Standard Application Benchmark on Linux

InfoBrief. Dell 2-Node Cluster Achieves Unprecedented Result with Three-tier SAP SD Parallel Standard Application Benchmark on Linux InfoBrief Dell 2-Node Cluster Achieves Unprecedented Result with Three-tier SAP SD Parallel Standard Application Benchmark on Linux Leveraging Oracle 9i Real Application Clusters (RAC) Technology and Red

More information

24 Grundlagen der Bioinformatik, SS 10, D. Huson, April 26, This lecture is based on the following papers, which are all recommended reading:

24 Grundlagen der Bioinformatik, SS 10, D. Huson, April 26, This lecture is based on the following papers, which are all recommended reading: 24 Grundlagen der Bioinformatik, SS 10, D. Huson, April 26, 2010 3 BLAST and FASTA This lecture is based on the following papers, which are all recommended reading: D.J. Lipman and W.R. Pearson, Rapid

More information

Intel Enterprise Processors Technology

Intel Enterprise Processors Technology Enterprise Processors Technology Kosuke Hirano Enterprise Platforms Group March 20, 2002 1 Agenda Architecture in Enterprise Xeon Processor MP Next Generation Itanium Processor Interconnect Technology

More information

High Performance Computing (HPC) Prepared By: Abdussamad Muntahi Muhammad Rahman

High Performance Computing (HPC) Prepared By: Abdussamad Muntahi Muhammad Rahman High Performance Computing (HPC) Prepared By: Abdussamad Muntahi Muhammad Rahman 1 2 Introduction to High Performance Computing (HPC) Introduction High-speed computing. Originally pertaining only to supercomputers

More information

eslim SV Xeon 2U Server

eslim SV Xeon 2U Server eslim SV7-2250 Xeon 2U Server www.eslim.co.kr Dual and Quad-Core Server Computing Leader!! ELSIM KOREA INC. 1. Overview Hyper-Threading eslim SV7-2250 Server Outstanding computing powered by 64-bit Intel

More information

Parallel & Cluster Computing. cs 6260 professor: elise de doncker by: lina hussein

Parallel & Cluster Computing. cs 6260 professor: elise de doncker by: lina hussein Parallel & Cluster Computing cs 6260 professor: elise de doncker by: lina hussein 1 Topics Covered : Introduction What is cluster computing? Classification of Cluster Computing Technologies: Beowulf cluster

More information

TECHNOLOGY BRIEF. Compaq 8-Way Multiprocessing Architecture EXECUTIVE OVERVIEW CONTENTS

TECHNOLOGY BRIEF. Compaq 8-Way Multiprocessing Architecture EXECUTIVE OVERVIEW CONTENTS TECHNOLOGY BRIEF March 1999 Compaq Computer Corporation ISSD Technology Communications CONTENTS Executive Overview1 Notice2 Introduction 3 8-Way Architecture Overview 3 Processor and I/O Bus Design 4 Processor

More information

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance This Dell EMC technical white paper discusses performance benchmarking results and analysis for Simulia

More information

Use of the Internet SCSI (iscsi) protocol

Use of the Internet SCSI (iscsi) protocol A unified networking approach to iscsi storage with Broadcom controllers By Dhiraj Sehgal, Abhijit Aswath, and Srinivas Thodati In environments based on Internet SCSI (iscsi) and 10 Gigabit Ethernet, deploying

More information

The Optimal CPU and Interconnect for an HPC Cluster

The Optimal CPU and Interconnect for an HPC Cluster 5. LS-DYNA Anwenderforum, Ulm 2006 Cluster / High Performance Computing I The Optimal CPU and Interconnect for an HPC Cluster Andreas Koch Transtec AG, Tübingen, Deutschland F - I - 15 Cluster / High Performance

More information

BlueGene/L. Computer Science, University of Warwick. Source: IBM

BlueGene/L. Computer Science, University of Warwick. Source: IBM BlueGene/L Source: IBM 1 BlueGene/L networking BlueGene system employs various network types. Central is the torus interconnection network: 3D torus with wrap-around. Each node connects to six neighbours

More information

The PowerEdge 6450 systems offer the following major features:

The PowerEdge 6450 systems offer the following major features: The Dell PowerEdge 6450 system, which supports up to four Intel Pentium III Xeon microprocessors (and future generation Intel microprocessors), is a featurerich, enterprise class server that offers the

More information

The Use of Cloud Computing Resources in an HPC Environment

The Use of Cloud Computing Resources in an HPC Environment The Use of Cloud Computing Resources in an HPC Environment Bill, Labate, UCLA Office of Information Technology Prakashan Korambath, UCLA Institute for Digital Research & Education Cloud computing becomes

More information

Performance Benefits of OpenVMS V8.4 Running on BL8x0c i2 Server Blades

Performance Benefits of OpenVMS V8.4 Running on BL8x0c i2 Server Blades Performance Benefits of OpenVMS V8.4 Running on BL8xc i2 Server Blades A detailed review of performance features and test results for OpenVMS V8.4. March 211 211, TechWise Research. All Rights Reserved

More information

Best Practices for Setting BIOS Parameters for Performance

Best Practices for Setting BIOS Parameters for Performance White Paper Best Practices for Setting BIOS Parameters for Performance Cisco UCS E5-based M3 Servers May 2013 2014 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page

More information

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA Pak Lui, Gilad Shainer, Brian Klaff Mellanox Technologies Abstract From concept to

More information

Simultaneous Multithreading on Pentium 4

Simultaneous Multithreading on Pentium 4 Hyper-Threading: Simultaneous Multithreading on Pentium 4 Presented by: Thomas Repantis trep@cs.ucr.edu CS203B-Advanced Computer Architecture, Spring 2004 p.1/32 Overview Multiple threads executing on

More information

PARALIGN: rapid and sensitive sequence similarity searches powered by parallel computing technology

PARALIGN: rapid and sensitive sequence similarity searches powered by parallel computing technology Nucleic Acids Research, 2005, Vol. 33, Web Server issue W535 W539 doi:10.1093/nar/gki423 PARALIGN: rapid and sensitive sequence similarity searches powered by parallel computing technology Per Eystein

More information

Efficient Power Management

Efficient Power Management Efficient Power Management on Dell PowerEdge Servers with AMD Opteron Processors Efficient power management enables enterprises to help reduce overall IT costs by avoiding unnecessary energy use. This

More information

The rcuda middleware and applications

The rcuda middleware and applications The rcuda middleware and applications Will my application work with rcuda? rcuda currently provides binary compatibility with CUDA 5.0, virtualizing the entire Runtime API except for the graphics functions,

More information

Memory Systems IRAM. Principle of IRAM

Memory Systems IRAM. Principle of IRAM Memory Systems 165 other devices of the module will be in the Standby state (which is the primary state of all RDRAM devices) or another state with low-power consumption. The RDRAM devices provide several

More information

HP Z Turbo Drive G2 PCIe SSD

HP Z Turbo Drive G2 PCIe SSD Performance Evaluation of HP Z Turbo Drive G2 PCIe SSD Powered by Samsung NVMe technology Evaluation Conducted Independently by: Hamid Taghavi Senior Technical Consultant August 2015 Sponsored by: P a

More information

InfoBrief. Platform ROCKS Enterprise Edition Dell Cluster Software Offering. Key Points

InfoBrief. Platform ROCKS Enterprise Edition Dell Cluster Software Offering. Key Points InfoBrief Platform ROCKS Enterprise Edition Dell Cluster Software Offering Key Points High Performance Computing Clusters (HPCC) offer a cost effective, scalable solution for demanding, compute intensive

More information

The Effect of Inverse Document Frequency Weights on Indexed Sequence Retrieval. Kevin C. O'Kane. Department of Computer Science

The Effect of Inverse Document Frequency Weights on Indexed Sequence Retrieval. Kevin C. O'Kane. Department of Computer Science The Effect of Inverse Document Frequency Weights on Indexed Sequence Retrieval Kevin C. O'Kane Department of Computer Science The University of Northern Iowa Cedar Falls, Iowa okane@cs.uni.edu http://www.cs.uni.edu/~okane

More information

The Design, Implementation, and Evaluation of

The Design, Implementation, and Evaluation of ClusterWorld Conference & Expo and the 4th International Conference on Linux Clusters: The HPC Revolution 2003. LA-UR 03-2862 The Design, Implementation, and Evaluation of mpiblast Aaron E. Darling 1,3,

More information

Quotations invited. 2. The supplied hardware should have 5 years comprehensive onsite warranty (24 x 7 call logging) from OEM directly.

Quotations invited. 2. The supplied hardware should have 5 years comprehensive onsite warranty (24 x 7 call logging) from OEM directly. Enquiry No: IITK/ME/mkdas/2016/01 May 04, 2016 Quotations invited Sealed quotations are invited for the purchase of an HPC cluster with the specification outlined below. Technical as well as the commercial

More information

Robert Jamieson. Robs Techie PP Everything in this presentation is at your own risk!

Robert Jamieson. Robs Techie PP Everything in this presentation is at your own risk! Robert Jamieson Robs Techie PP Everything in this presentation is at your own risk! PC s Today Basic Setup Hardware pointers PCI Express How will it effect you Basic Machine Setup Set the swap space Min

More information

Four-Socket Server Consolidation Using SQL Server 2008

Four-Socket Server Consolidation Using SQL Server 2008 Four-Socket Server Consolidation Using SQL Server 28 A Dell Technical White Paper Authors Raghunatha M Leena Basanthi K Executive Summary Businesses of all sizes often face challenges with legacy hardware

More information

Microsoft Exchange Server 2010 workload optimization on the new IBM PureFlex System

Microsoft Exchange Server 2010 workload optimization on the new IBM PureFlex System Microsoft Exchange Server 2010 workload optimization on the new IBM PureFlex System Best practices Roland Mueller IBM Systems and Technology Group ISV Enablement April 2012 Copyright IBM Corporation, 2012

More information

Performance comparison between a massive SMP machine and clusters

Performance comparison between a massive SMP machine and clusters Performance comparison between a massive SMP machine and clusters Martin Scarcia, Stefano Alberto Russo Sissa/eLab joint Democritos/Sissa Laboratory for e-science Via Beirut 2/4 34151 Trieste, Italy Stefano

More information

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Family-Based Platforms Executive Summary Complex simulations of structural and systems performance, such as car crash simulations,

More information

Introduction Electrical Considerations Data Transfer Synchronization Bus Arbitration VME Bus Local Buses PCI Bus PCI Bus Variants Serial Buses

Introduction Electrical Considerations Data Transfer Synchronization Bus Arbitration VME Bus Local Buses PCI Bus PCI Bus Variants Serial Buses Introduction Electrical Considerations Data Transfer Synchronization Bus Arbitration VME Bus Local Buses PCI Bus PCI Bus Variants Serial Buses 1 Most of the integrated I/O subsystems are connected to the

More information

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product

More information

The AMD64 Technology for Server and Workstation. Dr. Ulrich Knechtel Enterprise Program Manager EMEA

The AMD64 Technology for Server and Workstation. Dr. Ulrich Knechtel Enterprise Program Manager EMEA The AMD64 Technology for Server and Workstation Dr. Ulrich Knechtel Enterprise Program Manager EMEA Agenda Direct Connect Architecture AMD Opteron TM Processor Roadmap Competition OEM support The AMD64

More information

eslim SV Xeon 1U Server

eslim SV Xeon 1U Server eslim SV7-2100 Xeon 1U Server www.eslim.co.kr Dual and Quad-Core Server Computing Leader!! ELSIM KOREA INC. 1. Overview Hyper-Threading eslim SV7-2100 Server Outstanding computing powered by 64-bit Intel

More information

Understanding the performance of an X user environment

Understanding the performance of an X user environment Understanding the performance of an X550 11-user environment Overview NComputing s desktop virtualization technology enables significantly lower computing costs by letting multiple users share a single

More information

GROMACS Performance Benchmark and Profiling. August 2011

GROMACS Performance Benchmark and Profiling. August 2011 GROMACS Performance Benchmark and Profiling August 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

Application Performance on Dual Processor Cluster Nodes

Application Performance on Dual Processor Cluster Nodes Application Performance on Dual Processor Cluster Nodes by Kent Milfeld milfeld@tacc.utexas.edu edu Avijit Purkayastha, Kent Milfeld, Chona Guiang, Jay Boisseau TEXAS ADVANCED COMPUTING CENTER Thanks Newisys

More information

Technical Brief: Specifying a PC for Mascot

Technical Brief: Specifying a PC for Mascot Technical Brief: Specifying a PC for Mascot Matrix Science 8 Wyndham Place London W1H 1PP United Kingdom Tel: +44 (0)20 7723 2142 Fax: +44 (0)20 7725 9360 info@matrixscience.com http://www.matrixscience.com

More information

Heuristic methods for pairwise alignment:

Heuristic methods for pairwise alignment: Bi03c_1 Unit 03c: Heuristic methods for pairwise alignment: k-tuple-methods k-tuple-methods for alignment of pairs of sequences Bi03c_2 dynamic programming is too slow for large databases Use heuristic

More information

SU Dual and Quad-Core Xeon UP Server

SU Dual and Quad-Core Xeon UP Server SU4-1300 Dual and Quad-Core Xeon UP Server www.eslim.co.kr Dual and Quad-Core Server Computing Leader!! ESLIM KOREA INC. 1. Overview eslim SU4-1300 The ideal entry-level server Intel Xeon processor 3000/3200

More information

2 to 4 Intel Xeon Processor E v3 Family CPUs. Up to 12 SFF Disk Drives for Appliance Model. Up to 6 TB of Main Memory (with GB LRDIMMs)

2 to 4 Intel Xeon Processor E v3 Family CPUs. Up to 12 SFF Disk Drives for Appliance Model. Up to 6 TB of Main Memory (with GB LRDIMMs) Based on Cisco UCS C460 M4 Rack Servers Solution Brief May 2015 With Intelligent Intel Xeon Processors Highlights Integrate with Your Existing Data Center Our SAP HANA appliances help you get up and running

More information

Consolidating OLTP Workloads on Dell PowerEdge R th generation Servers

Consolidating OLTP Workloads on Dell PowerEdge R th generation Servers Consolidating OLTP Workloads on Dell PowerEdge R720 12 th generation Servers B Balamurugan Phani MV Dell Database Solutions Engineering March 2012 This document is for informational purposes only and may

More information

LS-DYNA Productivity and Power-aware Simulations in Cluster Environments

LS-DYNA Productivity and Power-aware Simulations in Cluster Environments LS-DYNA Productivity and Power-aware Simulations in Cluster Environments Gilad Shainer 1, Tong Liu 1, Jacob Liberman 2, Jeff Layton 2 Onur Celebioglu 2, Scot A. Schultz 3, Joshua Mora 3, David Cownie 3,

More information

HP SAS benchmark performance tests

HP SAS benchmark performance tests HP SAS benchmark performance tests technology brief Abstract... 2 Introduction... 2 Test hardware... 2 HP ProLiant DL585 server... 2 HP ProLiant DL380 G4 and G4 SAS servers... 3 HP Smart Array P600 SAS

More information

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure Generational Comparison Study of Microsoft SQL Server Dell Engineering February 2017 Revisions Date Description February 2017 Version 1.0

More information

Exploring the Effects of Hyperthreading on Scientific Applications

Exploring the Effects of Hyperthreading on Scientific Applications Exploring the Effects of Hyperthreading on Scientific Applications by Kent Milfeld milfeld@tacc.utexas.edu edu Kent Milfeld, Chona Guiang, Avijit Purkayastha, Jay Boisseau TEXAS ADVANCED COMPUTING CENTER

More information

Node Hardware. Performance Convergence

Node Hardware. Performance Convergence Node Hardware Improved microprocessor performance means availability of desktop PCs with performance of workstations (and of supercomputers of 10 years ago) at significanty lower cost Parallel supercomputers

More information

IBM InfoSphere Streams v4.0 Performance Best Practices

IBM InfoSphere Streams v4.0 Performance Best Practices Henry May IBM InfoSphere Streams v4.0 Performance Best Practices Abstract Streams v4.0 introduces powerful high availability features. Leveraging these requires careful consideration of performance related

More information

HPC-BLAST Scalable Sequence Analysis for the Intel Many Integrated Core Future

HPC-BLAST Scalable Sequence Analysis for the Intel Many Integrated Core Future HPC-BLAST Scalable Sequence Analysis for the Intel Many Integrated Core Future Dr. R. Glenn Brook & Shane Sawyer Joint Institute For Computational Sciences University of Tennessee, Knoxville Dr. Bhanu

More information

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c White Paper Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c What You Will Learn This document demonstrates the benefits

More information

MICROPROCESSOR TECHNOLOGY

MICROPROCESSOR TECHNOLOGY MICROPROCESSOR TECHNOLOGY Assis. Prof. Hossam El-Din Moustafa Lecture 20 Ch.10 Intel Core Duo Processor Architecture 2-Jun-15 1 Chapter Objectives Understand the concept of dual core technology. Look inside

More information

SQL Server 2005 on a Dell Scalable Enterprise Foundation

SQL Server 2005 on a Dell Scalable Enterprise Foundation on a Dell Scalable Enterprise Foundation Dell s vision for the scalable enterprise is based on the standardization of core elements of the data center to provide superior value, and encompasses the core

More information

FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC

FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC white paper FlashGrid Software Intel SSD DC P3700/P3600/P3500 Topic: Hyper-converged Database/Storage FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC Abstract FlashGrid

More information

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini White Paper Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini June 2016 2016 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 1 of 9 Contents

More information

General Purpose GPU Computing in Partial Wave Analysis

General Purpose GPU Computing in Partial Wave Analysis JLAB at 12 GeV - INT General Purpose GPU Computing in Partial Wave Analysis Hrayr Matevosyan - NTC, Indiana University November 18/2009 COmputationAL Challenges IN PWA Rapid Increase in Available Data

More information

Computing architectures Part 2 TMA4280 Introduction to Supercomputing

Computing architectures Part 2 TMA4280 Introduction to Supercomputing Computing architectures Part 2 TMA4280 Introduction to Supercomputing NTNU, IMF January 16. 2017 1 Supercomputing What is the motivation for Supercomputing? Solve complex problems fast and accurately:

More information

MetaPhyler Usage Manual

MetaPhyler Usage Manual MetaPhyler Usage Manual Bo Liu boliu@umiacs.umd.edu March 13, 2012 Contents 1 What is MetaPhyler 1 2 Installation 1 3 Quick Start 2 3.1 Taxonomic profiling for metagenomic sequences.............. 2 3.2

More information

Processor 2 Quad Core Intel Xeon Processor. E GHz Memory (std/max) 2048MB/48GB Slots and bays. SAS Controller, 0 installed 1.

Processor 2 Quad Core Intel Xeon Processor. E GHz Memory (std/max) 2048MB/48GB Slots and bays. SAS Controller, 0 installed 1. Product Details > Model 7979KHG At a glance... 2 Quad Core Intel Xeon Hard disk type E5335 2.00GHz Memory (std/max) 2048MB/48GB Slots and bays Number of processors std/max Level 2 cache Disk subsystem

More information

Quad-Core Intel Xeon Processor 5400 Series

Quad-Core Intel Xeon Processor 5400 Series Platform Brief Intel Xeon Processor Technology Quad-Core Intel Xeon Processor 5400 Series Boost Performance and Energy Efficiency with New 45nm Multi-core Intel Xeon Processors Now featuring Intel s second-generation

More information

A Scalable Coprocessor for Bioinformatic Sequence Alignments

A Scalable Coprocessor for Bioinformatic Sequence Alignments A Scalable Coprocessor for Bioinformatic Sequence Alignments Scott F. Smith Department of Electrical and Computer Engineering Boise State University Boise, ID, U.S.A. Abstract A hardware coprocessor for

More information

FUSION1200 Scalable x86 SMP System

FUSION1200 Scalable x86 SMP System FUSION1200 Scalable x86 SMP System Introduction Life Sciences Departmental System Manufacturing (CAE) Departmental System Competitive Analysis: IBM x3950 Competitive Analysis: SUN x4600 / SUN x4600 M2

More information

Microsoft. iron Krokhmal et IT /2005

Microsoft. iron Krokhmal et IT /2005 Microsoft 2005 i et IT /2005 Current Features Automated Deployment Services (ADS). - imaging tools to automate the deployment of Microsoft operating systems. Active Directory Application Mode (ADAM). -

More information

INTRODUCTION TO BIOINFORMATICS

INTRODUCTION TO BIOINFORMATICS Molecular Biology-2017 1 INTRODUCTION TO BIOINFORMATICS In this section, we want to provide a simple introduction to using the web site of the National Center for Biotechnology Information NCBI) to obtain

More information

Six-Core AMD Opteron Processor

Six-Core AMD Opteron Processor What s you should know about the Six-Core AMD Opteron Processor (Codenamed Istanbul ) Six-Core AMD Opteron Processor Versatility Six-Core Opteron processors offer an optimal mix of performance, energy

More information

Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture

Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture Sivakumar Harinath 1, Robert L. Grossman 1, K. Bernhard Schiefer 2, Xun Xue 2, and Sadique Syed 2 1 Laboratory of

More information

What Transitioning from 32-bit to 64-bit x86 Computing Means Today

What Transitioning from 32-bit to 64-bit x86 Computing Means Today What Transitioning from 32-bit to 64-bit x86 Computing Means Today Chris Wanner Senior Architect, Industry Standard Servers Hewlett-Packard 2004 Hewlett-Packard Development Company, L.P. The information

More information

InfiniBand SDR, DDR, and QDR Technology Guide

InfiniBand SDR, DDR, and QDR Technology Guide White Paper InfiniBand SDR, DDR, and QDR Technology Guide The InfiniBand standard supports single, double, and quadruple data rate that enables an InfiniBand link to transmit more data. This paper discusses

More information

HPC Solution. Technology for a New Era in Computing

HPC Solution. Technology for a New Era in Computing HPC Solution Technology for a New Era in Computing TEL IN HPC & Storage.. 20 years of changing with Technology Complete Solution Integrators for Select Verticals Mechanical Design & Engineering High Performance

More information

Consulting Solutions WHITE PAPER Citrix XenDesktop XenApp 6.x Planning Guide: Virtualization Best Practices

Consulting Solutions WHITE PAPER Citrix XenDesktop XenApp 6.x Planning Guide: Virtualization Best Practices Consulting Solutions WHITE PAPER Citrix XenDesktop XenApp 6.x Planning Guide: Virtualization Best Practices www.citrix.com Table of Contents Overview... 3 Scalability... 3 Guidelines... 4 Operations...

More information

The Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration

The Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration The Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration Dell IP Video Platform Design and Calibration Lab June 2018 H17415 Reference Architecture Dell EMC Solutions Copyright

More information

Nested Virtualization and Server Consolidation

Nested Virtualization and Server Consolidation Nested Virtualization and Server Consolidation Vara Varavithya Department of Electrical Engineering, KMUTNB varavithya@gmail.com 1 Outline Virtualization & Background Nested Virtualization Hybrid-Nested

More information

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance 11 th International LS-DYNA Users Conference Computing Technology LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance Gilad Shainer 1, Tong Liu 2, Jeff Layton

More information

HP ProLiant BladeSystem Gen9 vs Gen8 and G7 Server Blades on Data Warehouse Workloads

HP ProLiant BladeSystem Gen9 vs Gen8 and G7 Server Blades on Data Warehouse Workloads HP ProLiant BladeSystem Gen9 vs Gen8 and G7 Server Blades on Data Warehouse Workloads Gen9 server blades give more performance per dollar for your investment. Executive Summary Information Technology (IT)

More information

Basic Local Alignment Search Tool (BLAST)

Basic Local Alignment Search Tool (BLAST) BLAST 26.04.2018 Basic Local Alignment Search Tool (BLAST) BLAST (Altshul-1990) is an heuristic Pairwise Alignment composed by six-steps that search for local similarities. The most used access point to

More information

INTRODUCTION TO BIOINFORMATICS

INTRODUCTION TO BIOINFORMATICS Molecular Biology-2019 1 INTRODUCTION TO BIOINFORMATICS In this section, we want to provide a simple introduction to using the web site of the National Center for Biotechnology Information NCBI) to obtain

More information

Virtual Switch Acceleration with OVS-TC

Virtual Switch Acceleration with OVS-TC WHITE PAPER Virtual Switch Acceleration with OVS-TC HARDWARE ACCELERATED OVS-TC PROVIDES BETTER CPU EFFICIENCY, LOWER COMPLEXITY, ENHANCED SCALABILITY AND INCREASED NETWORK PERFORMANCE COMPARED TO KERNEL-

More information

Reduce Costs & Increase Oracle Database OLTP Workload Service Levels:

Reduce Costs & Increase Oracle Database OLTP Workload Service Levels: Reduce Costs & Increase Oracle Database OLTP Workload Service Levels: PowerEdge 2950 Consolidation to PowerEdge 11th Generation A Dell Technical White Paper Dell Database Solutions Engineering Balamurugan

More information

Performance of the AMD Opteron LS21 for IBM BladeCenter

Performance of the AMD Opteron LS21 for IBM BladeCenter August 26 Performance Analysis Performance of the AMD Opteron LS21 for IBM BladeCenter Douglas M. Pase and Matthew A. Eckl IBM Systems and Technology Group Page 2 Abstract In this paper we examine the

More information

Diffusion TM 5.0 Performance Benchmarks

Diffusion TM 5.0 Performance Benchmarks Diffusion TM 5.0 Performance Benchmarks Contents Introduction 3 Benchmark Overview 3 Methodology 4 Results 5 Conclusion 7 Appendix A Environment 8 Diffusion TM 5.0 Performance Benchmarks 2 1 Introduction

More information

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini White Paper Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini February 2015 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 1 of 9 Contents

More information

Implementing Software RAID

Implementing Software RAID Implementing Software RAID on Dell PowerEdge Servers Software RAID is an inexpensive storage method offering fault tolerance and enhanced disk read-write performance. This article defines and compares

More information

The Central Processing Unit

The Central Processing Unit The Central Processing Unit All computers derive from the same basic design, usually referred to as the von Neumann architecture. This concept involves solving a problem by defining a sequence of commands

More information

Computer Architecture

Computer Architecture Computer Architecture Computer Architecture Hardware INFO 2603 Platform Technologies Week 1: 04-Sept-2018 Computer architecture refers to the overall design of the physical parts of a computer. It examines:

More information

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740 Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740 A performance study of 14 th generation Dell EMC PowerEdge servers for Microsoft SQL Server Dell EMC Engineering September

More information

NEC Express5800/ft series

NEC Express5800/ft series Fault Tolerant Server ft series The ultimate choice for business continuity NEC Express5800 fault tolerant servers Fully redundant components are highly resistant to failures. High-availability servers

More information

Intel Solid State Drive Data Center Family for PCIe* in Baidu s Data Center Environment

Intel Solid State Drive Data Center Family for PCIe* in Baidu s Data Center Environment Intel Solid State Drive Data Center Family for PCIe* in Baidu s Data Center Environment Case Study Order Number: 334534-002US Ordering Information Contact your local Intel sales representative for ordering

More information

Network Design Considerations for Grid Computing

Network Design Considerations for Grid Computing Network Design Considerations for Grid Computing Engineering Systems How Bandwidth, Latency, and Packet Size Impact Grid Job Performance by Erik Burrows, Engineering Systems Analyst, Principal, Broadcom

More information