SPEC* Java Platform Benchmarks and Their Role in the Java Technology Ecosystem. *Other names and brands may be claimed as the property of others.
|
|
- Warren Randall
- 5 years ago
- Views:
Transcription
1 SPEC* Java Platform Benchmarks and Their Role in the Java Technology Ecosystem Anil Kumar Intel Corporation Performance Analyst David Dagastine Sun Microsystems Java Performance Lead
2 Agenda > What are SPEC Java Benchmarks? > Timeline > Introduction > Basic characteristics > Java Ecosystem Roles > H/W evaluation and design > S/W performance and development > JVMs evaluation and optimizations > Summary > Resources Note: This is NOT an official SPEC presentation. All information is based on public data and references. 2
3 SPEC Java Benchmark Timeline > Standard Performance Evaluation Corporation > Industry collaboration of member companies > SPEC Java Benchmarks with timeline Old Retired Currently Active Future SPECjvm98* SPECjvm2008* SPECjbb2000* SPECjbb2005*??? SPECjAppServer2001* 2002 SPECjAppServer2004*??? SPECjms2007* * SPEC, SPECjvm, SPECjbb, SPECjAppServer and SPECjms are registered trademarks of Standard Performance Evaluation Corporation 3
4 Need for Multiple Benchmarks > Can a single benchmark represent all Java application areas? No > Too many application areas muddle the result > A good benchmark should fairly represent its area(s) > SPEC Java Benchmarks covered here > SPECjbb2005 > SPECjvm2008 > SPECjms2007 > SPECjAppServer2004 > Primary goal: characterize benchmarks to > Help to correlate to real world applications > Pros and cons 4
5 Methodology > Hardware setup > Intel Xeon X5570 (Core i7) based server > Characterize performance using > OS Perfmon counters > CPU utilization, context switches, interrupts, I/O etc. H/W counters Cache misses, branches, locks, etc. Use Intel VTune Performance Analyzer JVM profiling GC, object and heap characteristics, Java locks, code and data footprint, etc. Use Oracle JRockit Mission Control Configuration details, number of instances, etc. 5
6 SPECjbb2005 > Emulates a 3-tier warehouse system > Clients: simulated by driver threads > Middle-tier: business logic and object manipulation > Database: stored in binary trees > Exercises the JVM, compiler, GC, threads, locks, etc. > Measures performance of CPUs, cache/memory hierarchy and both horizontal and vertical scalability 6
7 SPECjbb2005 Benchmark metric > Final metrics SPECjbb2005 bops and SPECjbb2005 bops/jvm Arithmetic mean from Expected Peak Warehouse count to 2x that > Multiple instances > SPECjbb2005 bops totaled > Ramp-up warehouses > Rough indicator of thread scaling > Steady state warehouses > Rough indicator of degradation under load Total from 2 JVM instances ** Steady State Ramp-up ** 7
8 SPECjbb2005 JVM profiling > Six different transaction types > Allocation intensive Large dynamic footprint: ~25mb / thread High object allocation: ~5-15KB / bop Increases with increment in warehouses Minimal sharing among threads Small code footprint: ~2MB / JVM instance > Many locks, not highly contended: locks / bop > GC intensive with ~1 sec interval between GCs > ~2-5% CPU cycles spent in GC > CPU and cache/memory intensive > Often saturates CPU 8
9 SPECjbb2005 Hardware event profiling > Events normalized per SPECjbb2005 bop > ~65,000 IA x86 machine instructions > ~20% of instructions are branches > ~1.5% of all branches are mis-predicted > Barely any floating point operations (Don t care) > ~0 ITLB miss with large pages for code (Don t care) > ~125 DTLB misses with large pages for data > ~17k loads and ~7k stores: 70/30 read/write split > ~0.007 MPI (misses/inst.) for 8MB L3 Nehalem-EP > High cache/memory bandwidth utilization 9
10 SPECjbb2005 CPU, Cache/Memory and scaling characteristics > CPU utilization ~1-5% in OS kernel ~5-10% in GC + memory management libs ~85-94% in Java Transactions Top ~20 methods consume ~80% of all CPU cycles > Cache/Memory capacity, bandwidth and latency > 8-16 gb sufficient for 2 socket systems Very sensitive to cache size and latencies ~3 mb/thread is optimal cache size > Scaling Depends whether using single instance (vertical) or multiple instances (horizontal) 10
11 SPECjbb2005 Summary > Pros Easy to set up and run Intensively exercises CPU and memory Lots of published data to compare h/w and s/w > Cons Long single instance runtime for large h/w thread count Minimal disk I/O, no network I/O and no think times High h/w thread count required to scale memory capacity beyond 16 gb > Flexibility Many different use scenarios can be created with simple changes to SPECjbb.props file Runtime, cache/memory pressure, # of threads, etc. 11
12 SPECjvm2008 > Replaces decade old, client only SPECjvm98 Multi-threaded Broad collection of real-world Java applications Covers both client and server apps on single platform > Stresses many aspects of JVM and h/w functionality JIT, GC, threads, locks, etc. CPU, cache/memory, integer and floating-point units > Min. memory for SUT: 512 mb / hardware thread > Only single JVM instance allowed > Base run (simulates out-of-box) required for submission, Peak run optional > Freely downloadable 12
13 SPECjvm2008 Groups and sub-groups > 11 groups with sub-groups Individual scores for each group and sub-groups Overall score computed using nested geo-mean (ops/minute) Score= k n1 X X 1n1... nk Xk1... Xknk > Each workload has unique profile Excellent for specific JVM and h/w area debug and evaluation SPECjvm2008 groups 1) Startup : 17 sub-tests 2) Compiler compiler.compiler compiler.sunflow 3) Compress 4) Crypto crypto.aes crypto.rsa crypto.signverify 5) Derby 6) Mpegaudio 7) Scimark.X.large : 5 sub-tests 8) Scimark.X.small : 5 sub-tests 9) Serial 10) Sunflow 11) XML xml.transform xml.validation Characteristics too diverse to cover in a short time 13
14 SPECjvm2008 Benchmark metric compiler 821 compress 437 crypto 666 derby 645 mpegaudio 259 scimark.large 74 scimark.small 423 serial 377 startup 45 sunflow 165 xml 933 Base ops/m 317 compiler.sunflow 514 compiler.compiler 1313 crypto.signverify 1106 crypto.rsa 906 crypto.aes 295 Scimark fft lu sor sparse monte_carlo large small Cache size impact 17 sub groups xml.validation 1166 xml.transform 748 > Sun Blade X6270 > Intel Xeon X5570 (Nehalem-EP) > Sun HotSpot(TM) 64-Bit Server VM 14
15 SPECjvm2008: Summary > Pros Easy to run, quick feedback, configurable Generic optimizations more effective for Geo-mean Exercises wide range of JVM and H/W functionality Sub-groups can exercise specific sub-components Real-world relevance Single instance challenges scaling > Cons Run length a bit long Minimal disk I/O, no network I/O, no think times Limited scaling for memory capacity (>24GB RAM) Sub-group focus can be too narrow > Flexibility Plug-in analysis framework for heap, power, etc. 15
16 SPECjms2007 > First industry-standard multi-tier benchmark > Focus on message-oriented middleware (MOM) servers based on JMS (Java Message Service) > Exercises JMS server software, JVM, database for message persistence, network and h/w > Models the supply chain of a supermarket company > Scale along 2 dimensions Horizontal Vertical 16
17 SPECjms2007 Benchmark Metric > Two metrics cover horizontal and vertical topology > > Destinations (queues and topics) are increased > While keeping the traffic per Destination constant > > Traffic (message count) to a Destination is increased > While keeping the number of Destinations fixed For characterization data, please refer to // 17
18 SPECjms2007 Summary > Pros Real world 3-tier setup Excellent evaluation tool for JVM and application server > Cons Very complex setup Minimal characterization data > Flexibility Freeform topology gives user complete control over benchmark configuration 18
19 SPECjAppServer2004 > Multi-tier benchmark exercising many Java 2 Enterprise Edition (J2EE) technologies Web and EJB containers, JMS, transaction mgmt, Message Driven Beans, database connectivity, etc. > Emulates an automobile company and dealerships > Dealers interact using web browsers > Manufacturing process is accomplished via RMI > Exercises application s/w, JVM, DB and network > Metric "SPECjAppServer2004 jops" denotes jappserver Operations Per Second Driver(s) + Application server(s) + Database server(s) 19
20 SPECjAppServer2004 JVM profiling > Code footprint 12 mb / JVM instance > ~5000 methods compiled during a benchmark run > Method level CPU utilization profile extremely flat > ~70% User and ~20% Kernel CPU utilization > Moderate object allocation rate GC interval ~10 sec > Reasonable data sharing among threads Many Java locks including contented ones > Requires a finely tuned DB server > Multiple instance performance > single instance 20
21 SPECjAppServer2004 H/W profiling > Events per SPECjAppServer2004 jop > ~6 million IA x86 asm instructions > ~20% of instructions are branches > ~7% of all branches are mis-predicted > Minimal floating point operations (Don t care) > ~30% of instructions are loads and ~17% are stores > ~ 65/35 read/write split > ~5k ITLB misses with large pages for code > ~35k DTLB misses with large pages for data > ~0.005 MPI (misses/inst.) for 8MB L3 Nehalem-EP > Moderate cache/memory bandwidth utilization 21
22 SPECjAppServer2004 Summary > Pros Real-world system characteristics Well known to fairly exercise J2EE technologies Usage response time methodology > Cons Very complex to setup, run and optimize Wide variation in response time at high utilization > Flexibility EAStress benchmark enabled by a special research run mode, for use in research and development 22
23 High level characteristics: Bird s Eye View > Reference platform is 2 chip Intel Xeon X5570 Basic SPECjbb- SPECjvm- SPECjms- SPECjApp- Characteristic Server2004 System Tiers Stand-alone Stand-alone 3-Tier 3-Tier Disk I/O Minimal Minimal Reasonable Reasonable Network I/O None None Reasonable Reasonable Memory Capacity Memory Bandwidth Medium (~16GB) Low- Medium (~8-24GB) High (~24GB) High (>24GB) High Low-High Medium Medium # of Instances Multiple Single Multiple Multiple Ease of Use Easy Medium Very Very Complex Complex 23
24 SPEC Java Benchmarks Java Ecosystem Roles > H/W evaluation and design > S/W performance analysis and development > JVM performance optimization > Will my app run faster? > JVM optimization with focus on multi-core > Session TS7499 Wednesday 4:10PM 24
25 Java Ecosystem Role Hardware evaluation and design > SPEC Java benchmarks are heavily used to > Assess competitive performance across > Machine architecture generations > SKU differentiation > Different machine architectures > Measure H/W feature value add: HT, Turbo, etc. > Measure H/W scaling: CPU, memory system, etc. > Optimize platform configuration: BIOS, memory, etc. > Evaluate future processor and platform design using benchmark instruction traces 25
26 Java Ecosystem Role Hardware evaluation > Hardware and software change assessment SPECjbb2005 k bops Core 2 changes 2.7x 51 P27 Core scaling DC QC 1.8x 138 P27 Cache Scaling 4 6MB 1.2x 252 P27 JVM Optimizations 1.2x 302 P27 Nehalem-EP changes 1.5x 368 P P GHz Irwindale (2MB_L2) 3.00GHz Woodcrest (4MB_L2) 3.00GHz Clovertown (2x4MB_L2) 3.16GHz Harpertown (2x6MB_L2) 3.33GHz Harpertown (2x6MB_L2) 2.93GHz Nehalem-EP (8MB_L3) Data using Oracle JRockit 64-bit JVM P27 or P28 versions 26
27 Java Ecosystem Role S/W performance and development > Application Server performance > SPECjAppServer2004 very helpful in identifying improvement opportunities > Application S/W stack comparison > Power and performance > SPECpower_ssj2008, first industry standard benchmark to measure performance and power > Based on SPECjbb2005 > Plays significant role in measuring power efficiency > Virtualization environment evaluation > SPECjbb2005 and SPECjAppServer2004 play a very constructive role 27
28 Java Ecosystem Role Hardware evaluation > Hardware and software change assessment SPECjAppServer2004 JOPS Nehalem-EP, App Server, JVM changes 1.93x 2056 Oracle Application Server 10G Oracle WebLogic Server Standard Edition Release 10.3 Intel Xeon X5460 (3.16GHz_2x6MB L3) Intel Xeon X5570 (2.93GHz_8MB_L3 with_ht_and_turbo) *SPECjAppServer2004 based publications at SPEC:
29 Java Ecosystem Role Software evaluation SPECjAppServer2004 JOPS OpenSolaris Sun Hot Spot MySQL Oracle Linux Oracle JRockit Oracle DB Sun GlassFish Enterprise Server v2.1 Oracle WebLogic Server Standard Edition Release 10.3 Single node two processor server using Intel Xeon X GHz with Intel Turbo Boost Technology up to 3.20 GHz *SPECjAppServer2004 based publications at SPEC:
30 Java Ecosystem Role JVM performance optimization > SPEC benchmarks provide a platform to highlight JVM competitive performance > SPEC publication makes open source software credibile > SPEC benchmarks are a record of many years of fiery competition among JVM vendors > Active competition has led to many innovative performance optimizations > SPEC run rules ensures optimizations are widely applicable and acceptable 30
31 SPECjbb2005 Performance Sun CMT Systems, 1-4 chips x US-T2 Plus 1.4Ghz, JDK 6_06-P US-T1 1.2Ghz, JDK 5_ US-T1 1.4Ghz, JDK 6_02 US-T1 1.2Ghz, JDK 5_08 US-T2 1.4Ghz, JDK 6_04-P 4 X US-T2 Plus 1.4Ghz, JDK 6_06-P /01/05 12/01/05 06/01/06 06/01/06 06/01/07 10/01/07 10/01/07 04/01/08 10/1/08 31
32 SPECjbb2005 Performance Sun Intel Systems, 1-4 chips x Intel X7460 JDK 6_06-P x Intel X5460 JDK 6_05-P 4 x Intel X7350 JDK 6_06-P 2 x Intel X5570 JDK 6_14-P /01/08 05/01/08 09/01/08 04/01/09 32
33 Java Ecosystem Role JVM Optimization > JDK 5 Update 6 Biased Locking Improves uncontended synchronization An object is "biased" toward the thread which first acquires its monitor Initial BigDecimal optimization Internally represent BigDecimal as a long where possible Platform optimized arraycopy Hand optimized arraycopy > JDK 5 Update 8 Biased Locking improvements Much faster bias revocation Parallel Old Generation GC Parallel collector for full GCs 33
34 Java Ecosystem Role JVM Optimization > JDK 6 Update 2 Biased Locking on by default in Java 6 Vectorization (superword) Load, operate on and store multiple array elements at once with single machine instructions Depth first object promotion order Promote objects from young generation to old generation Depth-first closer to object allocation order than breadth-first JDK 6 Update 4 Performance Release Hashmap hashing algorithm Autobox elision Optimize autoboxing to reduce cache misses BigDecimal optimization Additional changes to reduce cache misses Object zeroing elision Elide object zeroing when fields guaranteed to be initialized 34
35 Java Ecosystem Role JVM Optimization > JDK 6 Update 5 Performance Release TreeMap: point optimizations HashMap: Integer front-cache Escape analysis: Scalar replacement and lock elimination > JDK 6 Update 6 Performance Release Variation on Harmony Wide Node TreeMap Compressed object references 32-bit reference in a 64-bit JVM HashMap: point optimization StringCache: cache commonly used strings 35
36 Java Ecosystem Role JVM Optimization > JDK 6 Update 14 Performance Zero-based compressed object references 64-bit now equals or exceeds 32-bit performance Will be default for servers 36
37 Java Ecosystem Role Will my application run faster? > JDK 6 Includes all performance optimizations in JDK 5 updates > JDK 6 Update 2 Biased Locking plus all prior performance optimizations > JDK 6 Update 14 All prior optimizations default on, except HashMap and TreeMap > Java For Business High Performance JDK 5 and J2SE including the latest HotSpot JVM 37
38 SPECjvm2008 Base Performance Sun Intel Systems, 1-4 chips x Intel X7350 JDK 6_06-P 4 x Intel X7460 JDK 6_06-P 2 x Intel X5570 JDK 6_14-P /01/08 09/01/08 04/01/09 SPECjvm2008 Base == Out of box performance. No JVM performance tuning. 38
39 SPECjbb2005 Performance IBM Power6 Systems, 1-4 chips /01/06 06/01/07 11/01/07 06/01/08 39
40 Java Ecosystem Role Will my application run faster? Java optimizations identified through SPEC Java benchmark analysis yield real improvements running your applications Experimental optimizations eventually become default behavior All production JVMs have shown ongoing performance improvement Active competition on SPEC benchmarks results in innovative performance optimization for the Java Platform 40
41 Summary > Are SPEC Java benchmarks perfect? > No, but highly relevant because > Created by consensus among members playing lead roles in Java ecosystem > Code available as well as characterization data > Many publications and strict fair comparison run rules > Very credible as competitive reference > Updated benchmarks to reflect latest trends > Lead to many optimizations benefiting wide application segments Excellent evaluation tool and more, with proven record of benefitting all of Java eco-system 41
42 Resources > > SPECjbb2005: > SPECjvm2008: > SPECjms > SPECjAppServer
43 Anil Kumar David Dagastine 43
A Preliminary Workload Analysis of SPECjvm2008
A Preliminary Workload Analysis of SPECjvm2008 Hitoshi Oi The University of Aizu January 24, 2009 International Conference on Computer Engineering and Technology 2009 Outline Introduction: SPECjvm2008
More informationSPECjbb2005. Alan Adamson, IBM Canada David Dagastine, Sun Microsystems Stefan Sarne, BEA Systems
SPECjbb2005 Alan Adamson, IBM Canada David Dagastine, Sun Microsystems Stefan Sarne, BEA Systems Topics Benchmarks SPECjbb2000 Impact Reasons to Update SPECjbb2005 Development Execution Benchmarking Uses
More informationSPEC Enterprise Java Benchmarks State of the Art and Future Directions
SPEC Enterprise Java Benchmarks State of the Art and Future Directions Samuel Kounev Release Manager, SPEC Java Subcommittee Chair, SPECjms Working Group Kai Sachs SPECjms2007 Lead Developer Databases
More informationA Comparative Study of JVM Implementations with SPECjvm2008
A Comparative Study of JVM Implementations with SPECjvm008 Hitoshi Oi Department of Computer Science, The University of Aizu Aizu-Wakamatsu, JAPAN Email: oi@oslab.biz Abstract SPECjvm008 is a new benchmark
More informationIntelligent servers- Lower TCO, Rapid ROI and More Performance
Intelligent servers- Lower TCO, Rapid ROI and More Performance Neil Lin Solutions Specialist Enterprise Solutions Sales Intel Xeon Processor 5500 Series: Transforming Computing Intelligent Platform World
More informationA Preliminary Workload Analysis of SPECjvm2008
A Preliminary Workload Analysis of SPECjvm2008 Hitoshi Oi The University of Aizu, Aizu Wakamatsu, JAPAN oi@oslab.biz Abstract SPECjvm2008 is a new benchmark program suite for measuring client-side Java
More information<Insert Picture Here>
The Other HPC: Profiling Enterprise-scale Applications Marty Itzkowitz Senior Principal SW Engineer, Oracle marty.itzkowitz@oracle.com Agenda HPC Applications
More informationReal Time: Understanding the Trade-offs Between Determinism and Throughput
Real Time: Understanding the Trade-offs Between Determinism and Throughput Roland Westrelin, Java Real-Time Engineering, Brian Doherty, Java Performance Engineering, Sun Microsystems, Inc TS-5609 Learn
More informationEfficient Runtime Tracking of Allocation Sites in Java
Efficient Runtime Tracking of Allocation Sites in Java Rei Odaira, Kazunori Ogata, Kiyokuni Kawachiya, Tamiya Onodera, Toshio Nakatani IBM Research - Tokyo Why Do You Need Allocation Site Information?
More informationIdentifying the Sources of Cache Misses in Java Programs Without Relying on Hardware Counters. Hiroshi Inoue and Toshio Nakatani IBM Research - Tokyo
Identifying the Sources of Cache Misses in Java Programs Without Relying on Hardware Counters Hiroshi Inoue and Toshio Nakatani IBM Research - Tokyo June 15, 2012 ISMM 2012 at Beijing, China Motivation
More informationOptimising Multicore JVMs. Khaled Alnowaiser
Optimising Multicore JVMs Khaled Alnowaiser Outline JVM structure and overhead analysis Multithreaded JVM services JVM on multicore An observational study Potential JVM optimisations Basic JVM Services
More informationSPECjAppServer2002 Statistics. Methodology. Agenda. Tuning Philosophy. More Hardware Tuning. Hardware Tuning.
Scaling Up the JBoss Application Server. Peter Johnson JBoss World 2005 March 1, 2005 Conclusion Configuration. 8-CPU ES7000 (32-bit) SPECjAppServer 2002 JBoss Application Server 3.2.6 Unisys JVM 1.4.1_07
More informationOS-caused Long JVM Pauses - Deep Dive and Solutions
OS-caused Long JVM Pauses - Deep Dive and Solutions Zhenyun Zhuang LinkedIn Corp., Mountain View, California, USA https://www.linkedin.com/in/zhenyun Zhenyun@gmail.com 2016-4-21 Outline q Introduction
More informationScaling Up Performance Benchmarking
Scaling Up Performance Benchmarking -with SPECjbb2015 Anil Kumar Runtime Performance Architect @Intel, OSG Java Chair Monica Beckwith Runtime Performance Architect @Arm, Java Champion FaaS Serverless Frameworks
More informationDell Guide to Server Benchmarks
Contents Introduction: Choosing a Benchmark 1 Important System Benchmark Quick Reference Chart by Application 3 4 TPC C 4 TPC H 5 TPC App 6 MMB3 7 SPEC CPU 8 SPECweb 9 SPECjbb 10 SPEC SFS 3.0 11 SPECjAppServer
More informationHigh Performance Java Technology in a Multi-Core World
High Performance Java Technology in a Multi-Core World David Dagastine Paul Hohensee VM Technologies Sun Microsystems, Inc. http://java.sun.com TS-2885 2007 JavaOne SM Conference Session TS-2885 What To
More informationA Comparative Performance Evaluation of Different Application Domains on Server Processor Architectures
A Comparative Performance Evaluation of Different Application Domains on Server Processor Architectures W.M. Roshan Weerasuriya and D.N. Ranasinghe University of Colombo School of Computing A Comparative
More informationNew Compiler Optimizations in the Java HotSpot Virtual Machine
New Compiler Optimizations in the Java HotSpot Virtual Machine Steve Dever Steve Goldman Kenneth Russell Sun Microsystems, Inc. TS-3412 Copyright 2006, Sun Microsystems Inc., All rights reserved. 2006
More informationWHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX920 S2
WHITE PAPER PERFORMANCE REPORT PRIMERGY BX920 S2 WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX920 S2 This document contains a summary of the benchmarks executed for the PRIMERGY BX920
More informationMunara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.
Munara Tolubaeva Technical Consulting Engineer 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. notices and disclaimers Intel technologies features and benefits depend
More informationRun Anywhere. The Hardware Platform Perspective. Ben Pollan, AMD Java Labs October 28, 2008
Run Anywhere The Hardware Platform Perspective Ben Pollan, AMD Java Labs October 28, 2008 Agenda Java Labs Introduction Community Collaboration Performance Optimization Recommendations Leveraging the Latest
More informationArcGIS Enterprise Performance and Scalability Best Practices. Andrew Sakowicz
ArcGIS Enterprise Performance and Scalability Best Practices Andrew Sakowicz Agenda Definitions Design workload separation Provide adequate infrastructure capacity Configure Tune Test Monitor Definitions
More informationPerformance Analysis in the Real World of Online Services
Performance Analysis in the Real World of Online Services Dileep Bhandarkar, Ph. D. Distinguished Engineer 2009 IEEE International Symposium on Performance Analysis of Systems and Software My Background:
More informationPerformance Tools for Technical Computing
Christian Terboven terboven@rz.rwth-aachen.de Center for Computing and Communication RWTH Aachen University Intel Software Conference 2010 April 13th, Barcelona, Spain Agenda o Motivation and Methodology
More informationHardware and Software solutions for scaling highly threaded processors. Denis Sheahan Distinguished Engineer Sun Microsystems Inc.
Hardware and Software solutions for scaling highly threaded processors Denis Sheahan Distinguished Engineer Sun Microsystems Inc. Agenda Chip Multi-threaded concepts Lessons learned from 6 years of CMT
More informationW H I T E P A P E R. Comparison of Storage Protocol Performance in VMware vsphere 4
W H I T E P A P E R Comparison of Storage Protocol Performance in VMware vsphere 4 Table of Contents Introduction................................................................... 3 Executive Summary............................................................
More informationLinux Performance on IBM System z Enterprise
Linux Performance on IBM System z Enterprise Christian Ehrhardt IBM Research and Development Germany 11 th August 2011 Session 10016 Agenda zenterprise 196 design Linux performance comparison z196 and
More informationIntel VTune Amplifier XE
Intel VTune Amplifier XE Vladimir Tsymbal Performance, Analysis and Threading Lab 1 Agenda Intel VTune Amplifier XE Overview Features Data collectors Analysis types Key Concepts Collecting performance
More informationJackson Marusarz Intel Corporation
Jackson Marusarz Intel Corporation Intel VTune Amplifier Quick Introduction Get the Data You Need Hotspot (Statistical call tree), Call counts (Statistical) Thread Profiling Concurrency and Lock & Waits
More informationWhite Paper. Major Performance Tuning Considerations for Weblogic Server
White Paper Major Performance Tuning Considerations for Weblogic Server Table of Contents Introduction and Background Information... 2 Understanding the Performance Objectives... 3 Measuring your Performance
More informationImproving Data Access of J2EE Applications by Exploiting Asynchronous Messaging and Caching Services
Darmstadt University of Technology Databases & Distributed Systems Group Improving Data Access of J2EE Applications by Exploiting Asynchronous Messaging and Caching Services Samuel Kounev and Alex Buchmann
More information2
1 2 3 4 5 6 For more information, see http://www.intel.com/content/www/us/en/processors/core/core-processorfamily.html 7 8 The logic for identifying issues on Intel Microarchitecture Codename Ivy Bridge
More informationContinuous Object Access Profiling and Optimizations to Overcome the Memory Wall and Bloat
Continuous Object Access Profiling and Optimizations to Overcome the Memory Wall and Bloat Rei Odaira, Toshio Nakatani IBM Research Tokyo ASPLOS 2012 March 5, 2012 Many Wasteful Objects Hurt Performance.
More informationNehalem Hochleistungsrechnen für reale Anwendungen
Nehalem Hochleistungsrechnen für reale Anwendungen T-Systems HPCN Workshop DLR Braunschweig May 14-15, 2009 Hans-Joachim Plum Intel GmbH 1 Performance tests and ratings are measured using specific computer
More informationWHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX924 S2
WHITE PAPER PERFORMANCE REPORT PRIMERGY BX924 S2 WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX924 S2 This document contains a summary of the benchmarks executed for the PRIMERGY BX924
More informationPerformance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers
Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers This white paper details the performance improvements of Dell PowerEdge servers with the Intel Xeon Processor Scalable CPU
More informationProfiling: Understand Your Application
Profiling: Understand Your Application Michal Merta michal.merta@vsb.cz 1st of March 2018 Agenda Hardware events based sampling Some fundamental bottlenecks Overview of profiling tools perf tools Intel
More informationUsing Intel VTune Amplifier XE and Inspector XE in.net environment
Using Intel VTune Amplifier XE and Inspector XE in.net environment Levent Akyil Technical Computing, Analyzers and Runtime Software and Services group 1 Refresher - Intel VTune Amplifier XE Intel Inspector
More informationPage 2 of 6 SUT Model Form Factor CPU CPU Characteristics Number of Systems 1 Nodes Per System 1 Chips Per System 2 Hardware hw_1 Cores Per System 44
Page 1 of 6 SPECjbb2015 Copyright 2015-2016 Standard Performance Evaluation Corporation Cisco Systems Cisco UCS C220 M4 Tested by: Cisco Systems SPEC license #: 9019 94667 SPECjbb2015-Multi max-jops 71951
More informationThe Z Garbage Collector Low Latency GC for OpenJDK
The Z Garbage Collector Low Latency GC for OpenJDK Per Lidén & Stefan Karlsson HotSpot Garbage Collection Team Jfokus VM Tech Summit 2018 Safe Harbor Statement The following is intended to outline our
More informationAn Oracle White Paper September Oracle Utilities Meter Data Management Demonstrates Extreme Performance on Oracle Exadata/Exalogic
An Oracle White Paper September 2011 Oracle Utilities Meter Data Management 2.0.1 Demonstrates Extreme Performance on Oracle Exadata/Exalogic Introduction New utilities technologies are bringing with them
More informationChapter 03. Authors: John Hennessy & David Patterson. Copyright 2011, Elsevier Inc. All rights Reserved. 1
Chapter 03 Authors: John Hennessy & David Patterson Copyright 2011, Elsevier Inc. All rights Reserved. 1 Figure 3.3 Comparison of 2-bit predictors. A noncorrelating predictor for 4096 bits is first, followed
More informationWHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY RX600 S6
WHITE PAPER PERFORMANCE REPORT PRIMERGY RX600 S6 WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY RX600 S6 This document contains a summary of the benchmarks executed for the PRIMERGY RX600
More informationJava Performance: The Definitive Guide
Java Performance: The Definitive Guide Scott Oaks Beijing Cambridge Farnham Kbln Sebastopol Tokyo O'REILLY Table of Contents Preface ix 1. Introduction 1 A Brief Outline 2 Platforms and Conventions 2 JVM
More informationA Trace-based Java JIT Compiler Retrofitted from a Method-based Compiler
A Trace-based Java JIT Compiler Retrofitted from a Method-based Compiler Hiroshi Inoue, Hiroshige Hayashizaki, Peng Wu and Toshio Nakatani IBM Research Tokyo IBM Research T.J. Watson Research Center April
More informationPerformance & Scalability Testing in Virtual Environment Hemant Gaidhani, Senior Technical Marketing Manager, VMware
Performance & Scalability Testing in Virtual Environment Hemant Gaidhani, Senior Technical Marketing Manager, VMware 2010 VMware Inc. All rights reserved About the Speaker Hemant Gaidhani Senior Technical
More informationVirtualizing JBoss Enterprise Middleware with Azul
Virtualizing JBoss Enterprise Middleware with Azul Shyam Pillalamarri VP Engineering, Azul Systems Stephen Hess Sr. Director, Product Management, Red Hat June 25, 2010 Agenda Java Virtualization Current
More informationJava Application Performance Tuning for AMD EPYC Processors
Java Application Performance Tuning for AMD EPYC Processors Publication # 56245 Revision: 0.70 Issue Date: January 2018 Advanced Micro Devices 2018 Advanced Micro Devices, Inc. All rights reserved. The
More informationThe Z Garbage Collector An Introduction
The Z Garbage Collector An Introduction Per Lidén & Stefan Karlsson HotSpot Garbage Collection Team FOSDEM 2018 Safe Harbor Statement The following is intended to outline our general product direction.
More informationEvolution of Virtual Machine Technologies for Portability and Application Capture. Bob Vandette Java Hotspot VM Engineering Sept 2004
Evolution of Virtual Machine Technologies for Portability and Application Capture Bob Vandette Java Hotspot VM Engineering Sept 2004 Topics Virtual Machine Evolution Timeline & Products Trends forcing
More informationIBM WebSphere Application Server V4.0. Performance. 10/02/01 Copyright 2001 IBM Corporation WS40ST11.prz Page 248 of of 28
IBM WebSphere Application Server V4.0 Performance Page 248 of 401 1 of 28 Performance Enhancements to WebSphere V4.0 Performance Enhancement Overview Dynamic Caching of Servlets/JSPs Performance Monitoring
More informationTEST REPORT. JUNE 2007 SPECjbb2005 performance and power consumption on Dell and HP blade servers
TEST REPORT JUNE 2007 SPECjbb2005 performance and power Executive summary Dell Inc. (Dell) commissioned Principled Technologies (PT) to measure the SPECjbb2005 performance per watt of the following dual-socket
More informationMethod-Level Phase Behavior in Java Workloads
Method-Level Phase Behavior in Java Workloads Andy Georges, Dries Buytaert, Lieven Eeckhout and Koen De Bosschere Ghent University Presented by Bruno Dufour dufour@cs.rutgers.edu Rutgers University DCS
More informationIntel profiling tools and roofline model. Dr. Luigi Iapichino
Intel profiling tools and roofline model Dr. Luigi Iapichino luigi.iapichino@lrz.de Which tool do I use in my project? A roadmap to optimization (and to the next hour) We will focus on tools developed
More informationORACLE ENTERPRISE MANAGER 10g ORACLE DIAGNOSTICS PACK FOR NON-ORACLE MIDDLEWARE
ORACLE ENTERPRISE MANAGER 10g ORACLE DIAGNOSTICS PACK FOR NON-ORACLE MIDDLEWARE Most application performance problems surface during peak loads. Often times, these problems are time and resource intensive,
More informationHANA Performance. Efficient Speed and Scale-out for Real-time BI
HANA Performance Efficient Speed and Scale-out for Real-time BI 1 HANA Performance: Efficient Speed and Scale-out for Real-time BI Introduction SAP HANA enables organizations to optimize their business
More informationDiffusion TM 5.0 Performance Benchmarks
Diffusion TM 5.0 Performance Benchmarks Contents Introduction 3 Benchmark Overview 3 Methodology 4 Results 5 Conclusion 7 Appendix A Environment 8 Diffusion TM 5.0 Performance Benchmarks 2 1 Introduction
More informationLeading Performance for Oracle Applications? John McAbel Collaborate 2015
Leading Performance for Oracle Applications? John McAbel Collaborate 2015 First Let s Test Your IT Knowledge 1.) According to IDC in 2014, which system vendor is 2 nd in x86 blades market share in worldwide
More informationPAC094 Performance Tips for New Features in Workstation 5. Anne Holler Irfan Ahmad Aravind Pavuluri
PAC094 Performance Tips for New Features in Workstation 5 Anne Holler Irfan Ahmad Aravind Pavuluri Overview of Talk Virtual machine teams 64-bit guests SMP guests e1000 NIC support Fast snapshots Virtual
More informationDeploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c
White Paper Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c What You Will Learn This document demonstrates the benefits
More informationThe Z Garbage Collector Scalable Low-Latency GC in JDK 11
The Z Garbage Collector Scalable Low-Latency GC in JDK 11 Per Lidén (@perliden) Consulting Member of Technical Staff Java Platform Group, Oracle October 24, 2018 Safe Harbor Statement The following is
More informationPerformance Profiling
Performance Profiling Minsoo Ryu Real-Time Computing and Communications Lab. Hanyang University msryu@hanyang.ac.kr Outline History Understanding Profiling Understanding Performance Understanding Performance
More informationAccelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing
Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product
More informationFit for Purpose Platform Positioning and Performance Architecture
Fit for Purpose Platform Positioning and Performance Architecture Joe Temple IBM Monday, February 4, 11AM-12PM Session Number 12927 Insert Custom Session QR if Desired. Fit for Purpose Categorized Workload
More informationPerformance analysis tools: Intel VTuneTM Amplifier and Advisor. Dr. Luigi Iapichino
Performance analysis tools: Intel VTuneTM Amplifier and Advisor Dr. Luigi Iapichino luigi.iapichino@lrz.de Which tool do I use in my project? A roadmap to optimisation After having considered the MPI layer,
More informationJavaOne Topics. Keynotes Talks overview Interoperability, jini Real-time and Embedded XML and Java Cool Recommendations
JavaOne 2000 Topics Keynotes Talks overview Interoperability, jini Real-time and Embedded XML and Java Cool Recommendations First Keynote (McNeally( McNeally,, Jobs) 9 Gbits/s cross the Atlantic (m:m)
More informationHierarchical PLABs, CLABs, TLABs in Hotspot
Hierarchical s, CLABs, s in Hotspot Christoph M. Kirsch ck@cs.uni-salzburg.at Hannes Payer hpayer@cs.uni-salzburg.at Harald Röck hroeck@cs.uni-salzburg.at Abstract Thread-local allocation buffers (s) are
More informationIt Is a Difficult Question! The Goal of This Study. Specification. The Goal of This Study. History. Existing Benchmarks
It Is a Difficult Question! J2EE and.net Reloaded Yet Another Performance Case Study The Middleware Company Case Study Team Presented by Mark Grechanik How to compare two functionally rich platforms? Benchmarks?
More informationSAP ENTERPRISE PORTAL. Scalability Study - Windows
SAP NetWeaver SAP ENTERPRISE PORTAL Scalability Study - Windows ABOUT SAP ENTERPRISE PORTAL ABOUT THIS STUDY SAP Enterprise Portal is a key component of the SAP NetWeaver platform. SAP Enterprise Portal
More informationIBM Emulex 16Gb Fibre Channel HBA Evaluation
IBM Emulex 16Gb Fibre Channel HBA Evaluation Evaluation report prepared under contract with Emulex Executive Summary The computing industry is experiencing an increasing demand for storage performance
More informationDiplomado Certificación
Diplomado Certificación Duración: 250 horas. Horario: Sabatino de 8:00 a 15:00 horas. Incluye: 1. Curso presencial de 250 horas. 2.- Material oficial de Oracle University (e-kit s) de los siguientes cursos:
More information2011 IBM Research Strategic Initiative: Workload Optimized Systems
PIs: Michael Hind, Yuqing Gao Execs: Brent Hailpern, Toshio Nakatani, Kevin Nowka 2011 IBM Research Strategic Initiative: Workload Optimized Systems Yuqing Gao IBM Research 2011 IBM Corporation Motivation
More informationQLIKVIEW SCALABILITY BENCHMARK WHITE PAPER
QLIKVIEW SCALABILITY BENCHMARK WHITE PAPER Hardware Sizing Using Amazon EC2 A QlikView Scalability Center Technical White Paper June 2013 qlikview.com Table of Contents Executive Summary 3 A Challenge
More informationSAP SD Benchmark with DB2 and Red Hat Enterprise Linux 5 on IBM System x3850 M2
SAP SD Benchmark using DB2 and Red Hat Enterprise Linux 5 on IBM System x3850 M2 Version 1.0 November 2008 SAP SD Benchmark with DB2 and Red Hat Enterprise Linux 5 on IBM System x3850 M2 1801 Varsity Drive
More informationAn Empirical Analysis of Java Performance Quality
An Empirical Analysis of Java Performance Quality Simon Chow simonichow15@gmail.com Abstract Computer scientists have consistently searched for ways to optimize and improve Java performance utilizing a
More informationIBM Power Systems Performance Report. POWER9, POWER8 and POWER7 Results
IBM Power Systems Performance Report POWER9, POWER8 and POWER7 Results Feb 27, 2018 Table of Contents Performance of IBM UNIX, IBM i and Linux Operating System Servers... 3 Section 1 - AIX Multiuser SPEC
More informationJava On Steroids: Sun s High-Performance Java Implementation. History
Java On Steroids: Sun s High-Performance Java Implementation Urs Hölzle Lars Bak Steffen Grarup Robert Griesemer Srdjan Mitrovic Sun Microsystems History First Java implementations: interpreters compact
More informationScaling PostgreSQL on SMP Architectures
Scaling PostgreSQL on SMP Architectures Doug Tolbert, David Strong, Johney Tsai {doug.tolbert, david.strong, johney.tsai}@unisys.com PGCon 2007, Ottawa, May 21-24, 2007 Page 1 Performance vs. Scalability
More informationFuture of JRockit & Tools
Future of JRockit & Tools Or finding the right layer to attack Joakim Dahlstedt 15 September 2004 A Short Background on JRockit Server-centric JVM Java compatible (most of the Java libraries are Suns)
More informationTEST REPORT. AUGUST 2006 SPECjbb2005 performance and power consumption on Intel Xeon 51xx processor-based servers
Executive summary Intel Corporation (Intel) commissioned Principled Technologies (PT) to measure the SPECjbb2005 performance of dual-processor s using the following four processors: Xeon processor 5110
More informationA JVM Does What? Eva Andreasson Product Manager, Azul Systems
A JVM Does What? Eva Andreasson Product Manager, Azul Systems Presenter Eva Andreasson Innovator & Problem solver Implemented the Deterministic GC of JRockit Real Time Awarded patents on GC heuristics
More informationGoal. Outline. Outline. J2EE architecture. Enterprise JavaBeans. J2EE Performance Scalability and Clustering Part 1
Emmanuel Cecchet INRIA Rhône-Alpes, ObjectWeb J2EE Performance Scalability and Clustering Part 1 Goal J2EE performance scalability evaluation design patterns communication layers Java Virtual Machine J2EE
More informationDell PowerEdge R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel Xeon E7 Processors and Low-Voltage Memory
Dell PowerEdge R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel Xeon E7 Processors and Low-Voltage Memory A Dell Technical White Paper Dell, Inc. Waseem Raja
More informationPerformance Report PRIMERGY RX100 S6
Performance Report PRIMERGY RX100 S6 Version 2.0 March 2010 Pages 31 Abstract This document contains a summary of the benchmarks executed for the PRIMERGY RX100 S6. The PRIMERGY RX100 S6 performance data
More informationJVM Performance Study Comparing Java HotSpot to Azul Zing Using Red Hat JBoss Data Grid
JVM Performance Study Comparing Java HotSpot to Azul Zing Using Red Hat JBoss Data Grid Legal Notices JBoss, Red Hat and their respective logos are trademarks or registered trademarks of Red Hat, Inc.
More informationCOMPARISON OF ORACLE APPLICATION SERVER, WEBLOGIC AND WEBSPHERE USING PEOPLESOFT ENTERPRISE CAMPUS SOLUTIONS 8.9
O RACLE R EVISION 2.6 COMPARISON OF ORACLE APPLICATION SERVER, WEBLOGIC AND WEBSPHERE USING PEOPLESOFT ENTERPRISE CAMPUS SOLUTIONS 8.9 As a global leader in e-business applications, Oracle is committed
More informationExperiences with Multi-threading and Dynamic Class Loading in a Java Just-In-Time Compiler
, Compilation Technology Experiences with Multi-threading and Dynamic Class Loading in a Java Just-In-Time Compiler Daryl Maier, Pramod Ramarao, Mark Stoodley, Vijay Sundaresan TestaRossa JIT compiler
More informationIntroduction to parallel computers and parallel programming. Introduction to parallel computersand parallel programming p. 1
Introduction to parallel computers and parallel programming Introduction to parallel computersand parallel programming p. 1 Content A quick overview of morden parallel hardware Parallelism within a chip
More informationRed Hat Enterprise Linux on IBM System z Performance Evaluation
Christian Ehrhardt IBM Research and Development Red Hat Enterprise Linux on IBM System z Performance Evaluation 2011 IBM Corporation Agenda Performance Evaluation Results Environment Noteworthy improvements
More informationThe Role of Performance
Orange Coast College Business Division Computer Science Department CS 116- Computer Architecture The Role of Performance What is performance? A set of metrics that allow us to compare two different hardware
More informationVIProf: A Vertically Integrated Full-System Profiler
VIProf: A Vertically Integrated Full-System Profiler NGS Workshop, April 2007 Hussam Mousa Chandra Krintz Lamia Youseff Rich Wolski RACELab Research Dynamic software adaptation As program behavior or resource
More information<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure
MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for
More informationUsing Industry Standards to Exploit the Advantages and Resolve the Challenges of Multicore Technology
Using Industry Standards to Exploit the Advantages and Resolve the Challenges of Multicore Technology September 19, 2007 Markus Levy, EEMBC and Multicore Association Enabling the Multicore Ecosystem Multicore
More informationBorland Optimizeit Enterprise Suite 6
Borland Optimizeit Enterprise Suite 6 Feature Matrix The table below shows which Optimizeit product components are available in Borland Optimizeit Enterprise Suite and which are available in Borland Optimizeit
More informationScheduling the Intel Core i7
Third Year Project Report University of Manchester SCHOOL OF COMPUTER SCIENCE Scheduling the Intel Core i7 Ibrahim Alsuheabani Degree Programme: BSc Software Engineering Supervisor: Prof. Alasdair Rawsthorne
More informationFlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC
white paper FlashGrid Software Intel SSD DC P3700/P3600/P3500 Topic: Hyper-converged Database/Storage FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC Abstract FlashGrid
More informationSUMMARY OF RESULTS BENCHMARK PROFILE. server.
O RACLE E-BUSINESS B ENCHMARK R EV. 1.1 E-BUSINESS SUITE APPLICATIONS R12 (FAMILY PACK C) PRODUCT INFORMATION MANAGEMENT (PIM) (500 MILLION ITEM RECORDS) BENCHMARK - USING ORACLE10g ON IBM System x3850
More informationSAS Enterprise Miner Performance on IBM System p 570. Jan, Hsian-Fen Tsao Brian Porter Harry Seifert. IBM Corporation
SAS Enterprise Miner Performance on IBM System p 570 Jan, 2008 Hsian-Fen Tsao Brian Porter Harry Seifert IBM Corporation Copyright IBM Corporation, 2008. All Rights Reserved. TABLE OF CONTENTS ABSTRACT...3
More informationPerformance and power efficiency of Dell PowerEdge servers with E v2
Performance and power efficiency of Dell PowerEdge servers with E5-2600 v2 This white paper shows the performance and power efficiency improvements of Dell PowerEdge and PowerEdge C servers with the Intel
More informationBill Nesheim Sun Microsystems, Inc. Bob Kasten Intel Corporation
Bill Nesheim Sun Microsystems, Inc. Bob Kasten Intel Corporation 1 Executive Summary Sun and Intel strategic alliance has resulted in powerful innovations for customers The Solaris OS and the Intel Xeon
More information