Performance Baseline of Exadata X4-2

1 Performance Baseline of Exadata X4-2 Part I: Processor (CPU) Performance Technical Presentation June 2014

2 Contents
1 Introduction to CPU Performance Tests
2 CPU and Server Configuration
3 CPU Benchmark Results: Basic Arithmetic Operations
4 CPU Benchmark Results: Mixed Operations with SQL Built-in Functions
5 CPU Benchmark Results: Algorithms
6 Reviewing CPU Benchmark Results
copyright 2014 by benchware.ch slide 2

3 Introduction to CPU Performance
Why measure CPU performance?
What is measured?
How is CPU performance measured?
Overview Benchware CPU performance test for Oracle
Monitoring Benchware CPU performance tests
Look at for detailed information

5 CPU Performance: CPU architecture
CPU                      X2-2       X3-2           X4-2
CPU Type                 X5675      E              E V2
CPU Launch date          Q          Q              Q
Intel Generation         Westmere   Sandy Bridge   Ivy Bridge
Clock rate [GHz]
Max number of sockets
#cores per socket
Multithreading           2-way      2-way          2-way
Performance numbers from other benchmarks
SPECint_base2006 (speed)
Oracle CPU speed in sys.aux_stats$
Remark: Oracle keeps an internal estimate of CPU speed in sys.aux_stats$, but no estimate of CPU throughput. This value does not correlate with SPECint_base2006.

6 CPU Performance: Server configuration
Server                   X2-2       X3-2           X4-2
CPU Type                 X5675      E              E V2
CPU Launch date          Q          Q              Q
#sockets
#cores
#threads
Memory capacity [GByte]
Performance numbers from other benchmarks
SPECint_base_rate_2006 (throughput)
Software
Operating System         Oracle Lx  Oracle Lx      Oracle Lx
Oracle Database System
Benchware Performance Suite

8 Speed in CPU Performance
Oracle CPU speed: arithmetic ADD operation, single process
In contrast to SPEC, Benchware measures very similar speeds for these processors when Oracle data types are used.
Chart: series X4-2, X3-2, X; data types SIMPLE_INTEGER, SIMPLE_FLOAT, PLS_INTEGER, NUMBER
Simple operation
Uses hardware arithmetic for SIMPLE_INTEGER, SIMPLE_FLOAT and PLS_INTEGER
Uses a software library for NUMBER
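The difference between hardware arithmetic and a software number library can be illustrated outside Oracle. The following is a minimal Python sketch, not the Benchware test: it times a single-process ADD loop on Python's `float` (backed by hardware double-precision addition) and on `decimal.Decimal` (a software decimal library, used here only as a rough analogue of NUMBER). The helper name `measure_add_mops` and the loop count are assumptions made for illustration, and interpreter overhead dominates the loop, so the absolute figures say nothing about Oracle or Exadata.

```python
import time
from decimal import Decimal


def measure_add_mops(zero, one, n=200_000):
    """Time n additions of `one` onto `zero`; return (result, Mops).

    Hypothetical helper for illustration, not part of any benchmark suite.
    """
    acc = zero
    start = time.perf_counter()
    for _ in range(n):
        acc = acc + one              # the ADD operation under test
    elapsed = time.perf_counter() - start
    mops = n / elapsed / 1e6         # million operations per second
    return acc, mops


if __name__ == "__main__":
    # float: hardware addition; Decimal: software library (analogue of NUMBER)
    for label, zero, one in (("float", 0.0, 1.0),
                             ("Decimal", Decimal(0), Decimal(1))):
        _, mops = measure_add_mops(zero, one)
        print(f"{label:8s} {mops:8.2f} Mops (single process)")
```

Running the loop in one process corresponds to the "speed" view on this slide; the "throughput" view would run many such loops in parallel until the system saturates.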

9 Throughput in CPU Performance
Oracle CPU throughput: arithmetic ADD operation, system saturation
Capacity planning: X2 -> X3: +50%; X3 -> X4: +50%
Chart: series X4-2, X3-2, X; data types SIMPLE_INTEGER, SIMPLE_FLOAT, PLS_INTEGER, NUMBER; surviving data labels: 442, 38 and 35 Mops per core
More throughput from:
More cores per socket
More throughput per core through more efficient multithreading
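For capacity planning, the two generation steps compound. The sketch below assumes that both +50% figures are relative throughput gains from one generation to the next; the function names and the baseline value are hypothetical and not taken from the measurements.

```python
def compound_gain(step_gains):
    """Combine per-generation relative gains (0.5 means +50%) into one factor."""
    factor = 1.0
    for gain in step_gains:
        factor *= 1.0 + gain
    return factor


def scale_throughput(baseline_mops, step_gains):
    """Project a baseline throughput across the given generation steps."""
    return baseline_mops * compound_gain(step_gains)


if __name__ == "__main__":
    gains = [0.5, 0.5]                       # X2 -> X3 and X3 -> X4, from the slide
    print(compound_gain(gains))              # 2.25
    print(scale_throughput(1000.0, gains))   # placeholder baseline, not a measured value
```

Under this reading, a workload sized for X2 throughput would see roughly 2.25 times the capacity on X4, not twice.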

10 Exadata X4-2 FR CPU Performance
Oracle CPU performance: arithmetic ADD operation, data type SIMPLE_INTEGER
Table columns: Run, Tst, Code, #N, #J, #T, CPU busy [%], CPU user [%], CPU sys [%], CPU idle [%], Throughput [ops], Elapsed time [s]
Legend:
Run     benchmark run id
Tst     benchmark test id
Code    benchmark test code
#N      number of RAC nodes
#J      number of load generators (jobs)
#T      number of threads (PX)
[rps]   rows per second
[tps]   transactions per second
[ops]   operations per second
[iops]  I/O operations per second
[dbps]  database blocks per second
[MBps]  megabytes per second
[s]     time in seconds
[ms]    time in milliseconds
[μs]    time in microseconds
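The relationship between the columns of such a result table can be sketched as follows. This is an illustration under stated assumptions, not the Benchware implementation: it assumes CPU busy is the sum of user and system time, that idle is the remainder to 100% (ignoring I/O wait and similar states), and that throughput is total operations divided by elapsed time. The class name, field names and sample values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RunResult:
    """One row of a result table (hypothetical field names)."""
    cpu_user_pct: float   # CPU user [%]
    cpu_sys_pct: float    # CPU sys [%]
    operations: int       # total ADD operations executed in the run
    elapsed_s: float      # elapsed time [s]

    @property
    def cpu_busy_pct(self) -> float:
        # Assumption: busy = user + sys
        return self.cpu_user_pct + self.cpu_sys_pct

    @property
    def cpu_idle_pct(self) -> float:
        # Assumption: everything that is not busy counts as idle
        return 100.0 - self.cpu_busy_pct

    @property
    def throughput_ops(self) -> float:
        # Throughput [ops] = operations per second
        return self.operations / self.elapsed_s


if __name__ == "__main__":
    row = RunResult(cpu_user_pct=90.0, cpu_sys_pct=5.0,
                    operations=1_000_000, elapsed_s=2.0)  # placeholder values
    print(row.cpu_busy_pct, row.cpu_idle_pct, row.throughput_ops)
```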

12 Summary: Reviewing CPU Performance
Metric                   X2-2       X3-2           X4-2
#cores
#threads
Basic arithmetic ADD operation
Metric                   X2-2       X3-2           X4-2
Single thread speed: SIMPLE_INTEGER, SIMPLE_FLOAT, PLS_INTEGER, NUMBER
Throughput: SIMPLE_INTEGER, SIMPLE_FLOAT, PLS_INTEGER, NUMBER
Legend: million operations per second

13 Summary: Reviewing CPU Performance
These three Intel processor generations provide:
Similar speed for numeric operations in Oracle
More throughput due to more cores and more efficient multithreading

14 swiss precision in performance measurement


More information

Advanced and parallel architectures

Advanced and parallel architectures Cognome Nome Advanced and parallel architectures Prof. A. Massini June 11, 2015 Exercise 1a (2 points) Exercise 1b (2 points) Exercise 2 (5 points) Exercise 3 (3 points) Exercise 4a (3 points) Exercise

More information

Lecture: Benchmarks, Pipelining Intro. Topics: Performance equations wrap-up, Intro to pipelining

Lecture: Benchmarks, Pipelining Intro. Topics: Performance equations wrap-up, Intro to pipelining Lecture: Benchmarks, Pipelining Intro Topics: Performance equations wrap-up, Intro to pipelining 1 Measuring Performance Two primary metrics: wall clock time (response time for a program) and throughput

More information

Evaluation of Chelsio Terminator 6 (T6) Unified Wire Adapter iscsi Offload

Evaluation of Chelsio Terminator 6 (T6) Unified Wire Adapter iscsi Offload November 2017 Evaluation of Chelsio Terminator 6 (T6) Unified Wire Adapter iscsi Offload Initiator and target iscsi offload improve performance and reduce processor utilization. Executive Summary The Chelsio

More information

Separating Reality from Hype in Processors' DSP Performance. Evaluating DSP Performance

Separating Reality from Hype in Processors' DSP Performance. Evaluating DSP Performance Separating Reality from Hype in Processors' DSP Performance Berkeley Design Technology, Inc. +1 (51) 665-16 info@bdti.com Copyright 21 Berkeley Design Technology, Inc. 1 Evaluating DSP Performance! Essential

More information

Was ist dran an einer spezialisierten Data Warehousing platform?

Was ist dran an einer spezialisierten Data Warehousing platform? Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction

More information

Benchmark of a Cubieboard cluster

Benchmark of a Cubieboard cluster Benchmark of a Cubieboard cluster M J Schnepf, D Gudu, B Rische, M Fischer, C Jung and M Hardt Steinbuch Centre for Computing, Karlsruhe Institute of Technology, Karlsruhe, Germany E-mail: matthias.schnepf@student.kit.edu,

More information

Adaptec MaxIQ SSD Cache Performance Solution for Web Server Environments Analysis

Adaptec MaxIQ SSD Cache Performance Solution for Web Server Environments Analysis Adaptec MaxIQ SSD Cache Performance Solution for Web Server Environments Analysis September 22, 2009 Page 1 of 7 Introduction Adaptec has requested an evaluation of the performance of the Adaptec MaxIQ

More information

A Cost Model for Data Stream Processing on Modern Hardware Constantin Pohl, Philipp Götze, Kai-Uwe Sattler

A Cost Model for Data Stream Processing on Modern Hardware Constantin Pohl, Philipp Götze, Kai-Uwe Sattler Processing on Modern Hardware Constantin Pohl, Philipp Götze, Kai-Uwe Sattler 31.08.17 Motivation and Introduction Main goals on Data Stream Processing Queries: High throughput & low latency Responsibility:

More information

Tools for Intel Xeon Phi: VTune & Advisor Dr. Fabio Baruffa - LRZ,

Tools for Intel Xeon Phi: VTune & Advisor Dr. Fabio Baruffa - LRZ, Tools for Intel Xeon Phi: VTune & Advisor Dr. Fabio Baruffa - fabio.baruffa@lrz.de LRZ, 27.6.- 29.6.2016 Architecture Overview Intel Xeon Processor Intel Xeon Phi Coprocessor, 1st generation Intel Xeon

More information

Economic Viability of Hardware Overprovisioning in Power- Constrained High Performance Compu>ng

Economic Viability of Hardware Overprovisioning in Power- Constrained High Performance Compu>ng Economic Viability of Hardware Overprovisioning in Power- Constrained High Performance Compu>ng Energy Efficient Supercompu1ng, SC 16 November 14, 2016 This work was performed under the auspices of the U.S.

More information

Outline Marquette University

Outline Marquette University COEN-4710 Computer Hardware Lecture 1 Computer Abstractions and Technology (Ch.1) Cristinel Ababei Department of Electrical and Computer Engineering Credits: Slides adapted primarily from presentations

More information

LS-DYNA Performance Benchmark and Profiling. October 2017

LS-DYNA Performance Benchmark and Profiling. October 2017 LS-DYNA Performance Benchmark and Profiling October 2017 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: LSTC, Huawei, Mellanox Compute resource

More information

Performance of computer systems

Performance of computer systems Performance of computer systems Many different factors among which: Technology Raw speed of the circuits (clock, switching time) Process technology (how many transistors on a chip) Organization What type

More information

The bottom line: Performance. Measuring and Discussing Computer System Performance. Our definition of Performance. How to measure Execution Time?

The bottom line: Performance. Measuring and Discussing Computer System Performance. Our definition of Performance. How to measure Execution Time? The bottom line: Performance Car to Bay Area Speed Passengers Throughput (pmph) Ferrari 3.1 hours 160 mph 2 320 Measuring and Discussing Computer System Performance Greyhound 7.7 hours 65 mph 60 3900 or

More information

Oracle Database 12c: JMS Sharded Queues

Oracle Database 12c: JMS Sharded Queues Oracle Database 12c: JMS Sharded Queues For high performance, scalable Advanced Queuing ORACLE WHITE PAPER MARCH 2015 Table of Contents Introduction 2 Architecture 3 PERFORMANCE OF AQ-JMS QUEUES 4 PERFORMANCE

More information

IBM Power Systems: Open Innovation to put data to work. Juan López-Vidriero Mata Director técnico de ventas de servidores

IBM Power Systems: Open Innovation to put data to work. Juan López-Vidriero Mata Director técnico de ventas de servidores IBM Power Systems: Open Innovation to put data to work Juan López-Vidriero Mata Director técnico de ventas de servidores Openpower Power vs Intel Strength of IBM Vertical Stack: What is it? From Semiconductors

More information

CC312: Computer Organization

CC312: Computer Organization CC312: Computer Organization 1 Chapter 1 Introduction Chapter 1 Objectives Know the difference between computer organization and computer architecture. Understand units of measure common to computer systems.

More information

96D3-8G1333NN-APL. Test Report. AKDC DQA Engineer. Page 1 of 1

96D3-8G1333NN-APL. Test Report. AKDC DQA Engineer. Page 1 of 1 96D3-8G1333NN-APL Test Report Test Requestor Jocy.Lin Job Title Release Date 2013-5-15 Testing Engineer Tenthy.Zhang Job Title AKDC DQA Engineer Revision V1.0 Approved by Sophie. Song Job Title AKDC DQA

More information

Application of the Computer Capacity to the Analysis of Processors Evolution. BORIS RYABKO 1 and ANTON RAKITSKIY 2 April 17, 2018

Application of the Computer Capacity to the Analysis of Processors Evolution. BORIS RYABKO 1 and ANTON RAKITSKIY 2 April 17, 2018 Application of the Computer Capacity to the Analysis of Processors Evolution BORIS RYABKO 1 and ANTON RAKITSKIY 2 April 17, 2018 arxiv:1705.07730v1 [cs.pf] 14 May 2017 Abstract The notion of computer capacity

More information

Diffusion TM 5.0 Performance Benchmarks

Diffusion TM 5.0 Performance Benchmarks Diffusion TM 5.0 Performance Benchmarks Contents Introduction 3 Benchmark Overview 3 Methodology 4 Results 5 Conclusion 7 Appendix A Environment 8 Diffusion TM 5.0 Performance Benchmarks 2 1 Introduction

More information

SPARC T4-4 Server with. Oracle Database 11g Release 2

SPARC T4-4 Server with. Oracle Database 11g Release 2 SPARC T4-4 Server with Oracle Database 11g Release 2 TPC-H Rev. 2.14.2 TPC-Pricing 1.6.0 Report Date: November 30, 2011 Total System Cost Composite Query per Hour Metric Price / Performance $843,656 USD

More information