Improving Performance and Ensuring Scalability of Large SAS Applications and Database Extracts

Size: px
Start display at page:

Download "Improving Performance and Ensuring Scalability of Large SAS Applications and Database Extracts"

Transcription

1 Improving Performance and Ensuring Scalability of Large SAS Applications and Database Extracts Michael Beckerle, ChiefTechnology Officer, Torrent Systems, Inc., Cambridge, MA ABSTRACT Many organizations that have invested heavily in data warehouses, data mining systems, and other large business intelligence systems are now discovering that the sheer volume of data being processed prevents large jobs from completing within the batch window. In some notable cases, long runtimes have been crippling--preventing the data warehouse from achieving its goals. In many cases, though, the long runtimes represent an inefficient use of the data warehouse that exacts other significant costs, for example, preventing the organization from running the job more frequently. Many organizations are therefore looking to cut batch runtimes by using multiprocessor systems with parallel processing capabilities-for example, SMPs, MPPs, and clusters. Parallel databases like DB2 UDB makes excellent use of these machines because they partition data across multiple nodes and process all database functions--including insert, update, and delete--in parallel. However, like many other applications, SAS jobs do not take advantage of the parallel processing capabilities of multiprocessor computers--even when working with a parallel database. Because a SAS process can connect to only one DB2 UDB node at a time, all data must be passed to that node and then funneled through a single cursor to the SAS application. This process creates significant bottlenecks-essentially serializing the parallel database. Eliminating this sequential bottleneck would appear to be a straightforward matter of converting sequential SAS applications to run in parallel. Unfortunately, most users believe that the way to achieve parallel processing is through parallel programming, and they recognize that parallel programming is a complex, costly, and even arcane science. Few organizations regard it as a commercially viable undertaking. New technology now makes it both technically and economically feasible to achieve parallel processing without parallel programming. Torrent Systems in Cambridge, MA, has developed an enviro.nment for building and running high-performance parallel applications which shields programmers from the complexities of parallel programming. Their SAS product-orchestrator for SAS (}--provides facilities to allow developers to easily specify how multistep, parallel operations should be set up; it readily feeds the output of one step as input to another; and it provides facilities for managing parallel execution. THE TESTS To prove the feasibility of using to cut runtime by converting a sequential SAS application to parallel mode, Torrent Systems and IBM developed and ran a set of representative workloads to test the performance impact of applying parallel processing to data extraction and SAS analysis. The test team ran two tests, both on an IBM RS/6000 SP massively parallel system. The first test extracted data from a parallel IBM DB2 UDB database and passed it to a SAS application. The second test used an actual SAS inventory forecasting application, which processed data from flat files. Each test was first run sequentially and then in parallel, incrementally increasing the number of nodes to measure scalability. The tests demonstrated that SAS performance increased dramatically when the extracts and processing were run in parallel instead of sequentially. The tests also showed that parallel processing allowed the applications to process the workload faster by as the number of processors increased. This phenomenon--called scalability-was exhibited consistently throughout the tests. In fact, results showed linear scalability through all 32 processors. (An example of linear scalability is where 12 processors provide 12 times the performance of a single processor). In addition, because the Torrent software enabled and managed the conversion of the application to parallel mode, the developers achieved the full benefits of parallel processing--performance and scalability-without parallel programming. : The Pathway to Parallel SAS Torrent Systems' Orchestrator for the SAS System () enables SAS applications to execute in parallel and to interface in parallel with the DB2 UDB parallel database system on both SMP and MPP architectures. increased SAS performance by allowing the system to: Extract data in parallel from a parallel RDBMS Load the results of SAS programs back into the database in parallel Process parallel data streams with parallel instances of a SAS DATA or PROC step for much higher throughput rates Store large s in parallel, providing faster access and eliminating storage restrictions Stream data between SAS steps without having to write intermediate results to disk. The bottom line is that, with, SAS can process much larger volumes of data. is a commercially available and supported product designed to solve the parallel interface issue with minimal 286

2 user effort, allowing SAS users to insert simple parallel directives into SAS programs. SAS algorithms that can be partitioned into discrete components can be made parallel using a Multiple Processor Independent Data (MPID) parallel paradigm. In fact, nearly any SAS DATA or PROC step with a BY clause could be run in parallel. SQL statements that use a GROUP BY clause illustrate a class of statements that could execute in parallel. Each SAS processor would work on a separate "GROUP" or "PARTITION" of the data, allowing data extracted from a database in parallel to be efficiently processed local to the node containing the extracted data. This requires no processor-to-processor communication to move data. Figure 1 shows a typical sequential SAS/Access database extraction vs. an -enabled SAS/Access parallel database extraction. Figure 1: Sequential vs. Paia/fel Elrtraclion into SequenJial SAS oats Set uential Extraction.. ~.1... ~ ~~- ~node Testing Methodology To ensure that the workloads and configuration were realistic and that the comparisons between sequential and parallel processing were fair and meaningful, testers used real customer applications. One test used an actual SAS inventory forecasting application in use at an IBM customer site. This customer was using an IBM RS/6000 SP system, but running SAS applications on a single node, not taking full advantage of the system's multiple processors. The tests described here were run at the IBM Teraplex Integration Center in Poughkeepsie, NY TEST MACHINE CONFIGURATION Both uniprocessor and SMP nodes were used in the system configuration--a typical configuration in customer environments-which allowed measurement of the performance characteristics of each type of node. The software used on each node varied depending on the test being run. For example, in some tests, SAS applications ran on only one node, whereas in other tests they ran on all nodes. The DB2 UDB database was configured on different numbers of nodes (4,8, 16, or 32), depending on the test being run. Census data was used, and the database was partitioned by serial number for 4, 8, 16, and 32 thin nodes. Test 1: Parallel Extraction from the IBM DB2 UDB Database Extraction of data from a parallel database is typically the first step in setting up SAS processing of data that has been stored in RDBMS. It is also a good example of a process that can benefit from parallelism. Because sequential extraction requires that data distributed across a parallel table of an RDBMS be streamed from the various table nodes to a coordinator node-and then streamed out of the database-the flow of data is bottlenecked by the rate at which the coordinator node within the database can read and then write the data. In the test, a sequential extraction and a parallel extraction were performed on a table spread out over 4, 8, 16, and then 32 nodes ofthe database, with the volume of data extracted from the table held constant throughout all tests. The tests extracted an entire table composed of 5 million rows (records) of census data. Each record was 304 bytes long and composed of 126 fields. Total data volume was 1.52 Gbytes. Results of Parallel Extraction Test In the sequential extraction (represented by the left side of Figure 1), SAS/Access used a single stream to extract the data from the database. The "serialization of the distributed data" occurred when data had to be moved from node to node to then stream into the single coordinator node of the database. The parallel extraction (represented by the right side of Figure 1) eliminated the need to move data from node to node and also streamed the data out of the database in parallel. A sequential step under the control of then merged the data into a single file outside the database. This single then serves as a data mart for further SAS analysis. A key feature of is its ability to land data to disk in parallel. Figure 2 (page 3) shows a test that ran as a variation of the parallel extraction test shown in Figure 1. In this test, developers used to parallelize the extraction of 1.52 gigabytes of data from the database. ran a SAS/Access process on every node, which allowed each process to extract only the data local to its node. In this way, enabled SAS to extract all of the data from the parallel table without a sequential bottleneck at the coordinator node. 287

3 Figure 2: Parallel Extraction into Parallel Data Sets node 1 node 2 DB2UDB.... node3,.... As shown In Figure 3, provided near linear scalability of the data extraction process. (Perfect linear scalability would resuh when doubling the number of processors cuts the runtime exactly in half, increasing the number of processors by four times cuts the runtime to one quarter, and so on.) The black curve represents the actual data points. The heavy gray curve shows the theoretical scalability limit that exists for applications with no sequential bottlenecks. Figure 3: Scalability of Parallel Extraction to Sequential SAS Data Set ~r ~ ~~~~~ U~ ~~c_~~----~ 1 6 l ,:7""?~~~~ " ~c...,.:::;...,.,._--i :.z~ ~~~~--~ ~----~~~ ~ 4~~~ This step replaces the sequential step that merged the data into the single SAS with multiple parallel steps that move the data into parallel s. Table 1 shows the performance results ofthe parallel extraction from the parallel database, both to a single SAS (labeled N-to-1) and to parallel s (labeled N-to-N). The subheading of "high" and "thin" designates the data mart node type. Table 1: Runtimes for Extraction of Data from DBZ UDB Database High Node Thin Node SeauentiaiExUaction 32-wavto 1 3:16:34 2:48:42 Parallel Extraction 4-way to 1 30:12 30:10 S-way to 1 16:01 16:01 18-way to 1 08:35 08:48 32-wayto 1 08:39 05:13 30-way to 30 N//A 04:07 32-wav to 32 04:17 NIA Notice the times for the 32-way-to-1 sequential extraction of the data to a single SAS : 3:16:34 and 2:48:42, for the high and thin nodes, respectively. Compare these times to those for a 32-way-to-1 parallel extraction to a single SAS : 06:39 and 05:13. The difference in these run times is the time it takes to converge a 32-way table to a single file stream inside the database (sequential extraction) vs. outside the database (parallel extraction). These results demonstrate that is extremely efficient at streaming large volumes of data. Next, notice the drop in time in each of the columns as the extractions progress from the "4-way- to-1" extraction to the "32-way-to-1" extraction. In this progression, the same volume of data is extracted from a table that is spread out over 4, 8, 16, and then 32 nodes of the database. As the number of processors used to extract the data increases, performance begins to deviate from 1:1 scalability, and even more so after 16 nodes. This deviation results from the inevitable sequential bottleneck created when the extracted data merges into a single sequential SAS file. With many parallel clients, parallel. extraction is so fast that regardless of whether data streams from 30, 32, or even 128 extraction nodes, a bottleneck occurs when writing data to disk on a single node. In Table 1, the difference in runtime between the "32-way-to-1" extraction time (06:39) and the "32-way-to- 32" extraction time (04:17) is basically the time that it takes to write the data to a single file. By writing to multiple parallel s instead of to a single sequential SAS, increased extnrction performance to provide almost perfect scalability. allows you to avoid sequential bottlenecks in production-mode applications by loading data into parallel s. These parallel s can, in tum, be fed directly to parallel instances of the SAS application. TEST TWO: PARALLEL SAS APPLICATION This second test measured the effects of parallelizing" a large commercial inventory-forecasting model, a SAS program with 526 lines of code. The code covered twenty DATA steps and 33 PROC steps, including linear regression, freq, sort, SQL, summary, transpose, and more. This application currently runs in sequential mode at an IBM customer site. The customer stores the data in a flat file. The volume of data processed was fixed at a constant volume during all phases of the test. The execution of a complex sequential SAS application against muhiple gigabytes of data can be a very timeconsuming operation. Because every record must pass through the same CPU, the resulting bottleneck of 288

4 sequential processing can far exceed even that observed in sequential database extraction. Speed is constrained not only by the rate at which data can stream through a single node, but also by the speed of the processor. Segment 1: Sequential read In the first segment of this test. took the sequential flat file and converted it to a SAS. then converted the SAS into parallel SAS s by partitioning it onto multiple nodes of the system. then measured the difference in performance between writing the s to high nodes vs. writing them to thin nodes. This segment requires a very small fraction of the time needed to run the actual SAS application. The output from this section was used as input to the next step, the execution of the SAS application. Section 2: Parallel execution In the second segment of the test, executed the SAS model in parallel. The more nodes executing the SAS model, the shorter the run time. Because each CPU needed to process only the data local to its node-and the data was divided up equally among N nodes (where N=1, 2, 4, 8, 16, or 32}--each CPU had to process only 1/Nth ofthe total data. Therefore, the whole model executed in 1/Nth the time that had been required to run the SAS model sequentially. After processing, wrote the SAS out to disk in parallel. Section 3: Sequential write In the final segment of the test, the parallel created by the SAS model was read into SAS in parallel, and then merged into a single sequential SAS, the output most easily inspected by the data analyst. Results of Parallel SAS Application Test Table 2 Parallel Application Runtimes N Hiah node Thin node Segment 1: flat file to N-way 1-wav 12:02 10:17 2-wav 07:34 07:43 4wav 06:41 07:07 B-wav 07:15 07:17 16-wav N/A 07:26 32-way MIA 07:00 1-way 3:19:02 1:42:29!Segment 2: parallel SAS application 2-wav 1:10:34 36:36 4wav 31:23 15:56 8-wav 15:23 07:08 16-wav N/A 04:19 32-wav N/A 02:36 Seament 3: N-wav to 1-wav 1-way 01:01 01:19 2-way 01:00 01:01 4wav 01:03 01:02 8-way 01:06 01:04 16-way N/A 01:08 32-wav N/A 01:15 Table 2 shows the results for the parallel SAS application test. The second segment runs the actual SAS application and is therefore the most time-consuming and CPU-intensive. This segment actually scales superlinearly: as the number of processors doubles, the run time is cut by more than a factor of two. Figure 4 shows a composite bar chart for the run time of the entire application as a function of the number of nodes executing the application. Figure 4: SAS Application Runtime vs. Number of Pro<:<1ssors To capture data on the runtimes of each segment of the execution, data was written to disk. The parallel model was run on 1, 2, 4, 8, 16, and 32 thin nodes ofthe RS/6000 SP and on 1, 2, 3, 4, 5, 6, 7, and 8 CPUs ofthe SMP. Running the parallel model on one node is equivalent to running the sequential model. Pipelined Parallelism.. 40, 20 "" 0 32 In a normal SAS application, each SAS step executes until completion, passing all its data at once to the next step, which executes to completion, and so on. This means that at any one time, most segments are "blocked" as they wait for the previous segment to complete. In contrast, applications containing multiple segments-such as the SAS application discussed here-<:an take advantage of "pipe lined parallelism. With pipelined parallelism. data flows into and out of each segment continuously. so that each segment of the application is always processing data-as long as input data is available. A segment will be inactive only if there is no input data available to process. Figure 4 shows that as processors were added, the runtime of segment 2 decreased dramatically, while the runtimes of the sequential segments (1 and 3} remained the same. Programs that have sequential bottlenecks-like those found in segments 1 and ~nnot scale unless the sequential bottleneck from reading from and writing to sequential is removed. can remove the sequential bottleneck by creating parallel s when extracting data from the database. In this way, eliminates all the run time associated with segment 1, making the application as a whole scale even better. 289

5 Figure 5 shows that, up to eight processors, segment 2 exhibited super-linear speedup. Figure 5: Super-Unear Speedup of SAS Forecasting Model ~ ~-- M~ ~~~~~--- 32r ~~~~--~~- ~r ~~~~--~~---- ~U~ ~~--~~~~--- ~ 20 r ~16~----~~C---~~ ~ 12r---~~~~~ ~ ~~--~~~ ~~~ ~----~--~~-=--~--~~=-~=-~ Complex analysis applications run against large volumes of data can be very memory-intensive. When memory limits are exceeded, virtual memory paging and even thrashing can occur. By distributing data in parallel, can provide super-linear speedup because it reduces the memory requirements on any single node below the threshold at which these slow processes occur. Figure 5 provides dramatic illustration of how distributing data over many nodes can lighten the load placed on the memory of each individual node and thereby reduce the overall work performed by the system. CONCLUSION The benchmark performance studies of Torrent System's undertaken at the IBM Teraplex Center have demonstrated scalable performance can be achieved when executing database extracts and SAS applications in the parallel environment on IBM SMP and MPP hardware. Several significant conclusions can be drawn from these results: distributes memory-intensive applications across multiple nodes, thereby optimizing the memory processing. Re-partitioning a parallel database can be done simply and efficiently, but not with traditional parallel programming techniques. In any environment, it is necessary to periodically add nodes or reconfigure a system. Without a tool like, it can be a daunting task to re-partition a parallel database to take advantage of the additional nodes. 's ability to stream large s in parallel into and out of a parallel RDBMS allows redistribution of tables within the database very simply and easily. For example, the database used for this test was re-partitioned repeatedly in order to measure processing scalability with different node configurations. Finally, as the needs of database users evolve, the ability to dynamically re-partition tables "on the fly" for the purpose of executing a particular operation will increase in value. All Parallel, All The Time Organizations performing data extract and refinement, data warehouse and data mart loading, models, analysis, data mining, and other such large-scale batch jobs now have a cost-effective way to handle growth in data volume and application complexity. By implementing end-to-end parallelism with using parallel hardware, parallel databases, parallel applications and the Orchestrate for SAS development and runtime environment from Torrent Systems, users can create an enterprise environment that will scale to many times its sequential capacity. And because Torrenfs software shields programmers from the complexities of parallel programming, end-to-end parallelism-and the enormous benefits it brings-is available to all IT organizations, without the need to invest in parallel programmers. Extracts from a parallel database should be done in parallel: Because sequential extraction requires that data distributed across a parallel table be streamed from the various table nodes to a coordinator node-and then streamed out of the database-the flow of data is bottlenecked by the rate at which the coordinator node within the database can read and then write the data. Extracting in parallel eliminates the need to stream data within the database and allows almost linear scalability. Extracted data should be written to parallel s: Writing parallel data to a single creates a sequential bottleneck at the point of convergence. can write to multiple parallel s instead, increasing extraction performance to provide almost perfect scalability. Executing SAS applications in parallel yields linear and even super-linear scalability: Because distributes processing across all the available processors, overcomes the memory and disk constraints that arise in complex sequential applications run against large volumes of data. Super-linear scalability can result when 290

A SAS/AF Application for Parallel Extraction, Transformation, and Scoring of a Very Large Database

A SAS/AF Application for Parallel Extraction, Transformation, and Scoring of a Very Large Database Paper 11 A SAS/AF Application for Parallel Extraction, Transformation, and Scoring of a Very Large Database Daniel W. Kohn, Ph.D., Torrent Systems Inc., Cambridge, MA David L. Kuhn, Ph.D., Innovative Idea

More information

System Architecture PARALLEL FILE SYSTEMS

System Architecture PARALLEL FILE SYSTEMS Software and the Performance Effects of Parallel Architectures Keith F. Olsen,, Poughkeepsie, NY James T. West,, Austin, TX ABSTRACT There are a number of different parallel architectures: parallel hardware

More information

Performance and Scalability

Performance and Scalability Measuring Critical Manufacturing cmnavigo: Affordable MES Performance and Scalability for Time-Critical Industry Environments Critical Manufacturing, 2015 Summary Proving Performance and Scalability Time-Critical

More information

Netezza The Analytics Appliance

Netezza The Analytics Appliance Software 2011 Netezza The Analytics Appliance Michael Eden Information Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 Information Management 2011IBM Corporation Thought for

More information

Disk to Disk Data File Backup and Restore.

Disk to Disk Data File Backup and Restore. Disk to Disk Data File Backup and Restore. Implementation Variations and Advantages with Tivoli Storage Manager and Tivoli SANergy software Dimitri Chernyshov September 26th, 2001 Data backup procedure.

More information

Advanced Databases: Parallel Databases A.Poulovassilis

Advanced Databases: Parallel Databases A.Poulovassilis 1 Advanced Databases: Parallel Databases A.Poulovassilis 1 Parallel Database Architectures Parallel database systems use parallel processing techniques to achieve faster DBMS performance and handle larger

More information

Best ETL Design Practices. Helpful coding insights in SAS DI studio. Techniques and implementation using the Key transformations in SAS DI studio.

Best ETL Design Practices. Helpful coding insights in SAS DI studio. Techniques and implementation using the Key transformations in SAS DI studio. SESUG Paper SD-185-2017 Guide to ETL Best Practices in SAS Data Integration Studio Sai S Potluri, Synectics for Management Decisions; Ananth Numburi, Synectics for Management Decisions; ABSTRACT This Paper

More information

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples Hadoop Introduction 1 Topics Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples 2 Big Data Analytics What is Big Data?

More information

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda Agenda Oracle9i Warehouse Review Dulcian, Inc. Oracle9i Server OLAP Server Analytical SQL Mining ETL Infrastructure 9i Warehouse Builder Oracle 9i Server Overview E-Business Intelligence Platform 9i Server:

More information

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2 Introduction :- Today single CPU based architecture is not capable enough for the modern database that are required to handle more demanding and complex requirements of the users, for example, high performance,

More information

Crystal Reports. Overview. Contents. How to report off a Teradata Database

Crystal Reports. Overview. Contents. How to report off a Teradata Database Crystal Reports How to report off a Teradata Database Overview What is Teradata? NCR Teradata is a database and data warehouse software developer. This whitepaper will give you some basic information on

More information

HANA Performance. Efficient Speed and Scale-out for Real-time BI

HANA Performance. Efficient Speed and Scale-out for Real-time BI HANA Performance Efficient Speed and Scale-out for Real-time BI 1 HANA Performance: Efficient Speed and Scale-out for Real-time BI Introduction SAP HANA enables organizations to optimize their business

More information

IBM Tivoli OMEGAMON XE for Storage on z/os Version Tuning Guide SC

IBM Tivoli OMEGAMON XE for Storage on z/os Version Tuning Guide SC IBM Tivoli OMEGAMON XE for Storage on z/os Version 5.1.0 Tuning Guide SC27-4380-00 IBM Tivoli OMEGAMON XE for Storage on z/os Version 5.1.0 Tuning Guide SC27-4380-00 Note Before using this information

More information

vsan 6.6 Performance Improvements First Published On: Last Updated On:

vsan 6.6 Performance Improvements First Published On: Last Updated On: vsan 6.6 Performance Improvements First Published On: 07-24-2017 Last Updated On: 07-28-2017 1 Table of Contents 1. Overview 1.1.Executive Summary 1.2.Introduction 2. vsan Testing Configuration and Conditions

More information

TeraData 1. INTRODUCTION

TeraData 1. INTRODUCTION 1. INTRODUCTION Teradata is a relational database management system (RDBMS) that drives a company's data warehouse. Teradata provides the foundation to give a company the power to grow, to compete in today's

More information

CSCE 626 Experimental Evaluation.

CSCE 626 Experimental Evaluation. CSCE 626 Experimental Evaluation http://parasol.tamu.edu Introduction This lecture discusses how to properly design an experimental setup, measure and analyze the performance of parallel algorithms you

More information

Lawson M3 7.1 Large User Scaling on System i

Lawson M3 7.1 Large User Scaling on System i Lawson M3 7.1 Large User Scaling on System i IBM System i Paul Swenson paulswen@us.ibm.com System i ERP, Lawson Team Version Date: November 15 2007 Statement of Approval... 3 Introduction... 4 Benchmark

More information

Intelligent Caching in Data Virtualization Recommended Use of Caching Controls in the Denodo Platform

Intelligent Caching in Data Virtualization Recommended Use of Caching Controls in the Denodo Platform Data Virtualization Intelligent Caching in Data Virtualization Recommended Use of Caching Controls in the Denodo Platform Introduction Caching is one of the most important capabilities of a Data Virtualization

More information

Was ist dran an einer spezialisierten Data Warehousing platform?

Was ist dran an einer spezialisierten Data Warehousing platform? Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction

More information

Incremental Updates VS Full Reload

Incremental Updates VS Full Reload Incremental Updates VS Full Reload Change Data Capture Minutes VS Hours 1 Table of Contents Executive Summary - 3 Accessing Data from a Variety of Data Sources and Platforms - 4 Approaches to Moving Changed

More information

Accelerate your SAS analytics to take the gold

Accelerate your SAS analytics to take the gold Accelerate your SAS analytics to take the gold A White Paper by Fuzzy Logix Whatever the nature of your business s analytics environment we are sure you are under increasing pressure to deliver more: more

More information

Extreme Storage Performance with exflash DIMM and AMPS

Extreme Storage Performance with exflash DIMM and AMPS Extreme Storage Performance with exflash DIMM and AMPS 214 by 6East Technologies, Inc. and Lenovo Corporation All trademarks or registered trademarks mentioned here are the property of their respective

More information

Evolving To The Big Data Warehouse

Evolving To The Big Data Warehouse Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from

More information

IBM Emulex 16Gb Fibre Channel HBA Evaluation

IBM Emulex 16Gb Fibre Channel HBA Evaluation IBM Emulex 16Gb Fibre Channel HBA Evaluation Evaluation report prepared under contract with Emulex Executive Summary The computing industry is experiencing an increasing demand for storage performance

More information

EMC GREENPLUM MANAGEMENT ENABLED BY AGINITY WORKBENCH

EMC GREENPLUM MANAGEMENT ENABLED BY AGINITY WORKBENCH White Paper EMC GREENPLUM MANAGEMENT ENABLED BY AGINITY WORKBENCH A Detailed Review EMC SOLUTIONS GROUP Abstract This white paper discusses the features, benefits, and use of Aginity Workbench for EMC

More information

Performance of relational database management

Performance of relational database management Building a 3-D DRAM Architecture for Optimum Cost/Performance By Gene Bowles and Duke Lambert As systems increase in performance and power, magnetic disk storage speeds have lagged behind. But using solidstate

More information

Oracle s JD Edwards EnterpriseOne IBM POWER7 performance characterization

Oracle s JD Edwards EnterpriseOne IBM POWER7 performance characterization Oracle s JD Edwards EnterpriseOne IBM POWER7 performance characterization Diane Webster IBM Oracle International Competency Center January 2012 Copyright IBM Corporation, 2012. All Rights Reserved. All

More information

EMC VMAX 400K SPC-2 Proven Performance. Silverton Consulting, Inc. StorInt Briefing

EMC VMAX 400K SPC-2 Proven Performance. Silverton Consulting, Inc. StorInt Briefing EMC VMAX 400K SPC-2 Proven Performance Silverton Consulting, Inc. StorInt Briefing EMC VMAX 400K SPC-2 PROVEN PERFORMANCE PAGE 2 OF 10 Introduction In this paper, we analyze all- flash EMC VMAX 400K storage

More information

WebSphere Application Server Base Performance

WebSphere Application Server Base Performance WebSphere Application Server Base Performance ii WebSphere Application Server Base Performance Contents WebSphere Application Server Base Performance............. 1 Introduction to the WebSphere Application

More information

Lotus Sametime 3.x for iseries. Performance and Scaling

Lotus Sametime 3.x for iseries. Performance and Scaling Lotus Sametime 3.x for iseries Performance and Scaling Contents Introduction... 1 Sametime Workloads... 2 Instant messaging and awareness.. 3 emeeting (Data only)... 4 emeeting (Data plus A/V)... 8 Sametime

More information

Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture

Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture Sivakumar Harinath 1, Robert L. Grossman 1, K. Bernhard Schiefer 2, Xun Xue 2, and Sadique Syed 2 1 Laboratory of

More information

JULIA ENABLED COMPUTATION OF MOLECULAR LIBRARY COMPLEXITY IN DNA SEQUENCING

JULIA ENABLED COMPUTATION OF MOLECULAR LIBRARY COMPLEXITY IN DNA SEQUENCING JULIA ENABLED COMPUTATION OF MOLECULAR LIBRARY COMPLEXITY IN DNA SEQUENCING Larson Hogstrom, Mukarram Tahir, Andres Hasfura Massachusetts Institute of Technology, Cambridge, Massachusetts, USA 18.337/6.338

More information

Tuning WebHound 4.0 and SAS 8.2 for Enterprise Windows Systems James R. Lebak, Unisys Corporation, Malvern, PA

Tuning WebHound 4.0 and SAS 8.2 for Enterprise Windows Systems James R. Lebak, Unisys Corporation, Malvern, PA Paper 272-27 Tuning WebHound 4.0 and SAS 8.2 for Enterprise Windows Systems James R. Lebak, Unisys Corporation, Malvern, PA ABSTRACT Windows is SAS largest and fastest growing platform. Windows 2000 Advanced

More information

Performance Report: Multiprotocol Performance Test of VMware ESX 3.5 on NetApp Storage Systems

Performance Report: Multiprotocol Performance Test of VMware ESX 3.5 on NetApp Storage Systems NETAPP TECHNICAL REPORT Performance Report: Multiprotocol Performance Test of VMware ESX 3.5 on NetApp Storage Systems A Performance Comparison Study of FC, iscsi, and NFS Protocols Jack McLeod, NetApp

More information

HYRISE In-Memory Storage Engine

HYRISE In-Memory Storage Engine HYRISE In-Memory Storage Engine Martin Grund 1, Jens Krueger 1, Philippe Cudre-Mauroux 3, Samuel Madden 2 Alexander Zeier 1, Hasso Plattner 1 1 Hasso-Plattner-Institute, Germany 2 MIT CSAIL, USA 3 University

More information

Teradata Analyst Pack More Power to Analyze and Tune Your Data Warehouse for Optimal Performance

Teradata Analyst Pack More Power to Analyze and Tune Your Data Warehouse for Optimal Performance Data Warehousing > Tools & Utilities Teradata Analyst Pack More Power to Analyze and Tune Your Data Warehouse for Optimal Performance By: Rod Vandervort, Jeff Shelton, and Louis Burger Table of Contents

More information

HiTune. Dataflow-Based Performance Analysis for Big Data Cloud

HiTune. Dataflow-Based Performance Analysis for Big Data Cloud HiTune Dataflow-Based Performance Analysis for Big Data Cloud Jinquan (Jason) Dai, Jie Huang, Shengsheng Huang, Bo Huang, Yan Liu Intel Asia-Pacific Research and Development Ltd Shanghai, China, 200241

More information

Oracle #1 RDBMS Vendor

Oracle #1 RDBMS Vendor Oracle #1 RDBMS Vendor IBM 20.7% Microsoft 18.1% Other 12.6% Oracle 48.6% Source: Gartner DataQuest July 2008, based on Total Software Revenue Oracle 2 Continuous Innovation Oracle 11g Exadata Storage

More information

High Performance Analytics with In-Database Processing

High Performance Analytics with In-Database Processing AD-001 High Performance Analytics with In-Database Processing Stephen Brobst, Chief Technology Officer, Teradata Corporation, San Diego, CA Keith Collins, Senior Vice President & Chief Technology Officer,

More information

Hybrid Data Platform

Hybrid Data Platform UniConnect-Powered Data Aggregation Across Enterprise Data Warehouses and Big Data Storage Platforms A Percipient Technology White Paper Author: Ai Meun Lim Chief Product Officer Updated Aug 2017 2017,

More information

Multi-Threaded Reads in SAS/Access for Relational Databases Sarah Whittier, ISO New England, Holyoke, MA

Multi-Threaded Reads in SAS/Access for Relational Databases Sarah Whittier, ISO New England, Holyoke, MA Multi-Threaded Reads in SAS/Access for Relational Databases Sarah Whittier, ISO New England, Holyoke, MA ABSTRACT Multi-threading was implemented in SAS 9. This new feature affects the performance of certain

More information

Frequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management

Frequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management Frequent Item Set using Apriori and Map Reduce algorithm: An Application in Inventory Management Kranti Patil 1, Jayashree Fegade 2, Diksha Chiramade 3, Srujan Patil 4, Pradnya A. Vikhar 5 1,2,3,4,5 KCES

More information

Database Acceleration Solution Using FPGAs and Integrated Flash Storage

Database Acceleration Solution Using FPGAs and Integrated Flash Storage Database Acceleration Solution Using FPGAs and Integrated Flash Storage HK Verma, Xilinx Inc. August 2017 1 FPGA Analytics in Flash Storage System In-memory or Flash storage based DB reduce disk access

More information

Massively Parallel Processing. Big Data Really Fast. A Proven In-Memory Analytical Processing Platform for Big Data

Massively Parallel Processing. Big Data Really Fast. A Proven In-Memory Analytical Processing Platform for Big Data Big Data Really Fast A Proven In-Memory Analytical Processing Platform for Big Data 2 Executive Summary / Overview: Big Data can be a big headache for organizations that have outgrown the practicality

More information

Mark Sandstrom ThroughPuter, Inc.

Mark Sandstrom ThroughPuter, Inc. Hardware Implemented Scheduler, Placer, Inter-Task Communications and IO System Functions for Many Processors Dynamically Shared among Multiple Applications Mark Sandstrom ThroughPuter, Inc mark@throughputercom

More information

Introduction to Big-Data

Introduction to Big-Data Introduction to Big-Data Ms.N.D.Sonwane 1, Mr.S.P.Taley 2 1 Assistant Professor, Computer Science & Engineering, DBACER, Maharashtra, India 2 Assistant Professor, Information Technology, DBACER, Maharashtra,

More information

QLogic TrueScale InfiniBand and Teraflop Simulations

QLogic TrueScale InfiniBand and Teraflop Simulations WHITE Paper QLogic TrueScale InfiniBand and Teraflop Simulations For ANSYS Mechanical v12 High Performance Interconnect for ANSYS Computer Aided Engineering Solutions Executive Summary Today s challenging

More information

6.2 DATA DISTRIBUTION AND EXPERIMENT DETAILS

6.2 DATA DISTRIBUTION AND EXPERIMENT DETAILS Chapter 6 Indexing Results 6. INTRODUCTION The generation of inverted indexes for text databases is a computationally intensive process that requires the exclusive use of processing resources for long

More information

CAPACITY PLANNING FOR THE DATA WAREHOUSE BY W. H. Inmon

CAPACITY PLANNING FOR THE DATA WAREHOUSE BY W. H. Inmon CAPACITY PLANNING FOR THE DATA WAREHOUSE BY W. H. Inmon The data warehouse environment - like all other computer environments - requires hardware resources. Given the volume of data and the type of processing

More information

Optimizing Testing Performance With Data Validation Option

Optimizing Testing Performance With Data Validation Option Optimizing Testing Performance With Data Validation Option 1993-2016 Informatica LLC. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying, recording

More information

Version 8 Base SAS Performance: How Does It Stack-Up? Robert Ray, SAS Institute Inc, Cary, NC

Version 8 Base SAS Performance: How Does It Stack-Up? Robert Ray, SAS Institute Inc, Cary, NC Paper 9-25 Version 8 Base SAS Performance: How Does It Stack-Up? Robert Ray, SAS Institute Inc, Cary, NC ABSTRACT This paper presents the results of a study conducted at SAS Institute Inc to compare the

More information

CA Test Data Manager Key Scenarios

CA Test Data Manager Key Scenarios WHITE PAPER APRIL 2016 CA Test Data Manager Key Scenarios Generate and secure all the data needed for rigorous testing, and provision it to highly distributed teams on demand. Muhammad Arif Application

More information

Configuration changes such as conversion from a single instance to RAC, ASM, etc.

Configuration changes such as conversion from a single instance to RAC, ASM, etc. Today, enterprises have to make sizeable investments in hardware and software to roll out infrastructure changes. For example, a data center may have an initiative to move databases to a low cost computing

More information

Got Isilon? Need IOPS? Get Avere.

Got Isilon? Need IOPS? Get Avere. Got Isilon? Need IOPS? Get Avere. Scalable I/O Performance to Complement Any EMC Isilon Environment By: Jeff Tabor, Director of Product Marketing Achieving Performance Scaling Overcoming Random I/O and

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

Clustering and Reclustering HEP Data in Object Databases

Clustering and Reclustering HEP Data in Object Databases Clustering and Reclustering HEP Data in Object Databases Koen Holtman CERN EP division CH - Geneva 3, Switzerland We formulate principles for the clustering of data, applicable to both sequential HEP applications

More information

Lenovo Database Configuration

Lenovo Database Configuration Lenovo Database Configuration for Microsoft SQL Server Standard Edition DWFT 9TB Reduce time to value with pretested hardware configurations Data Warehouse problem and a solution The rapid growth of technology

More information

Oracle Data Integration and OWB: New for 11gR2

Oracle Data Integration and OWB: New for 11gR2 Oracle Data Integration and OWB: New for 11gR2 C. Antonio Romero, Oracle Corporation, Redwood Shores, US Keywords: data integration, etl, real-time, data warehousing, Oracle Warehouse Builder, Oracle Data

More information

Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55%

Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55% openbench Labs Executive Briefing: May 20, 2013 Condusiv s V-locity Server Boosts Performance of SQL Server 2012 by 55% Optimizing I/O for Increased Throughput and Reduced Latency on Physical Servers 01

More information

Modification and Evaluation of Linux I/O Schedulers

Modification and Evaluation of Linux I/O Schedulers Modification and Evaluation of Linux I/O Schedulers 1 Asad Naweed, Joe Di Natale, and Sarah J Andrabi University of North Carolina at Chapel Hill Abstract In this paper we present three different Linux

More information

Fit for Purpose Platform Positioning and Performance Architecture

Fit for Purpose Platform Positioning and Performance Architecture Fit for Purpose Platform Positioning and Performance Architecture Joe Temple IBM Monday, February 4, 11AM-12PM Session Number 12927 Insert Custom Session QR if Desired. Fit for Purpose Categorized Workload

More information

A Simulation: Improving Throughput and Reducing PCI Bus Traffic by. Caching Server Requests using a Network Processor with Memory

A Simulation: Improving Throughput and Reducing PCI Bus Traffic by. Caching Server Requests using a Network Processor with Memory Shawn Koch Mark Doughty ELEC 525 4/23/02 A Simulation: Improving Throughput and Reducing PCI Bus Traffic by Caching Server Requests using a Network Processor with Memory 1 Motivation and Concept The goal

More information

DATABASE SCALE WITHOUT LIMITS ON AWS

DATABASE SCALE WITHOUT LIMITS ON AWS The move to cloud computing is changing the face of the computer industry, and at the heart of this change is elastic computing. Modern applications now have diverse and demanding requirements that leverage

More information

Table of contents. OpenVMS scalability with Oracle Rdb. Scalability achieved through performance tuning.

Table of contents. OpenVMS scalability with Oracle Rdb. Scalability achieved through performance tuning. OpenVMS scalability with Oracle Rdb Scalability achieved through performance tuning. Table of contents Abstract..........................................................2 From technical achievement to

More information

Oracle Database 12c: JMS Sharded Queues

Oracle Database 12c: JMS Sharded Queues Oracle Database 12c: JMS Sharded Queues For high performance, scalable Advanced Queuing ORACLE WHITE PAPER MARCH 2015 Table of Contents Introduction 2 Architecture 3 PERFORMANCE OF AQ-JMS QUEUES 4 PERFORMANCE

More information

Segregating Data Within Databases for Performance Prepared by Bill Hulsizer

Segregating Data Within Databases for Performance Prepared by Bill Hulsizer Segregating Data Within Databases for Performance Prepared by Bill Hulsizer When designing databases, segregating data within tables is usually important and sometimes very important. The higher the volume

More information

Increasing Performance of Existing Oracle RAC up to 10X

Increasing Performance of Existing Oracle RAC up to 10X Increasing Performance of Existing Oracle RAC up to 10X Prasad Pammidimukkala www.gridironsystems.com 1 The Problem Data can be both Big and Fast Processing large datasets creates high bandwidth demand

More information

QLIKVIEW SCALABILITY BENCHMARK WHITE PAPER

QLIKVIEW SCALABILITY BENCHMARK WHITE PAPER QLIKVIEW SCALABILITY BENCHMARK WHITE PAPER Measuring Business Intelligence Throughput on a Single Server QlikView Scalability Center Technical White Paper December 2012 qlikview.com QLIKVIEW THROUGHPUT

More information

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( ) Guide: CIS 601 Graduate Seminar Presented By: Dr. Sunnie S. Chung Dhruv Patel (2652790) Kalpesh Sharma (2660576) Introduction Background Parallel Data Warehouse (PDW) Hive MongoDB Client-side Shared SQL

More information

Hitachi Converged Platform for Oracle

Hitachi Converged Platform for Oracle Hitachi Converged Platform for Oracle Manfred Drozd, Benchware Ltd. Sponsored by Hitachi Data Systems Corporation Introduction Because of their obvious advantages, engineered platforms are becoming increasingly

More information

The Hadoop Paradigm & the Need for Dataset Management

The Hadoop Paradigm & the Need for Dataset Management The Hadoop Paradigm & the Need for Dataset Management 1. Hadoop Adoption Hadoop is being adopted rapidly by many different types of enterprises and government entities and it is an extraordinarily complex

More information

Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA

Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA Evaluation report prepared under contract with HP Executive Summary The computing industry is experiencing an increasing demand for storage

More information

Multiprocessors and Thread-Level Parallelism. Department of Electrical & Electronics Engineering, Amrita School of Engineering

Multiprocessors and Thread-Level Parallelism. Department of Electrical & Electronics Engineering, Amrita School of Engineering Multiprocessors and Thread-Level Parallelism Multithreading Increasing performance by ILP has the great advantage that it is reasonable transparent to the programmer, ILP can be quite limited or hard to

More information

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Big Data Technology Ecosystem Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara Agenda End-to-End Data Delivery Platform Ecosystem of Data Technologies Mapping an End-to-End Solution Case

More information

Using Alluxio to Improve the Performance and Consistency of HDFS Clusters

Using Alluxio to Improve the Performance and Consistency of HDFS Clusters ARTICLE Using Alluxio to Improve the Performance and Consistency of HDFS Clusters Calvin Jia Software Engineer at Alluxio Learn how Alluxio is used in clusters with co-located compute and storage to improve

More information

The Optimal CPU and Interconnect for an HPC Cluster

The Optimal CPU and Interconnect for an HPC Cluster 5. LS-DYNA Anwenderforum, Ulm 2006 Cluster / High Performance Computing I The Optimal CPU and Interconnect for an HPC Cluster Andreas Koch Transtec AG, Tübingen, Deutschland F - I - 15 Cluster / High Performance

More information

SAS System Powers Web Measurement Solution at U S WEST

SAS System Powers Web Measurement Solution at U S WEST SAS System Powers Web Measurement Solution at U S WEST Bob Romero, U S WEST Communications, Technical Expert - SAS and Data Analysis Dale Hamilton, U S WEST Communications, Capacity Provisioning Process

More information

Embedded Technosolutions

Embedded Technosolutions Hadoop Big Data An Important technology in IT Sector Hadoop - Big Data Oerie 90% of the worlds data was generated in the last few years. Due to the advent of new technologies, devices, and communication

More information

On BigFix Performance: Disk is King. How to get your infrastructure right the first time! Case Study: IBM Cloud Development - WW IT Services

On BigFix Performance: Disk is King. How to get your infrastructure right the first time! Case Study: IBM Cloud Development - WW IT Services On BigFix Performance: Disk is King How to get your infrastructure right the first time! Case Study: IBM Cloud Development - WW IT Services Authors: Shaun T. Kelley, Mark Leitch Abstract: Rolling out large

More information

Scalable Access to SAS Data Billy Clifford, SAS Institute Inc., Austin, TX

Scalable Access to SAS Data Billy Clifford, SAS Institute Inc., Austin, TX Scalable Access to SAS Data Billy Clifford, SAS Institute Inc., Austin, TX ABSTRACT Symmetric multiprocessor (SMP) computers can increase performance by reducing the time required to analyze large volumes

More information

Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator

Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator Large-Scale Network Simulation Scalability and an FPGA-based Network Simulator Stanley Bak Abstract Network algorithms are deployed on large networks, and proper algorithm evaluation is necessary to avoid

More information

Memory Allocation. Copyright : University of Illinois CS 241 Staff 1

Memory Allocation. Copyright : University of Illinois CS 241 Staff 1 Memory Allocation Copyright : University of Illinois CS 241 Staff 1 Allocation of Page Frames Scenario Several physical pages allocated to processes A, B, and C. Process B page faults. Which page should

More information

Lenovo Database Configuration for Microsoft SQL Server TB

Lenovo Database Configuration for Microsoft SQL Server TB Database Lenovo Database Configuration for Microsoft SQL Server 2016 22TB Data Warehouse Fast Track Solution Data Warehouse problem and a solution The rapid growth of technology means that the amount of

More information

Part 1: Indexes for Big Data

Part 1: Indexes for Big Data JethroData Making Interactive BI for Big Data a Reality Technical White Paper This white paper explains how JethroData can help you achieve a truly interactive interactive response time for BI on big data,

More information

Accelerating BI on Hadoop: Full-Scan, Cubes or Indexes?

Accelerating BI on Hadoop: Full-Scan, Cubes or Indexes? White Paper Accelerating BI on Hadoop: Full-Scan, Cubes or Indexes? How to Accelerate BI on Hadoop: Cubes or Indexes? Why not both? 1 +1(844)384-3844 INFO@JETHRO.IO Overview Organizations are storing more

More information

Reducing Network Contention with Mixed Workloads on Modern Multicore Clusters

Reducing Network Contention with Mixed Workloads on Modern Multicore Clusters Reducing Network Contention with Mixed Workloads on Modern Multicore Clusters Matthew Koop 1 Miao Luo D. K. Panda matthew.koop@nasa.gov {luom, panda}@cse.ohio-state.edu 1 NASA Center for Computational

More information

Four Steps to Unleashing The Full Potential of Your Database

Four Steps to Unleashing The Full Potential of Your Database Four Steps to Unleashing The Full Potential of Your Database This insightful technical guide offers recommendations on selecting a platform that helps unleash the performance of your database. What s the

More information

Increasing Performance for PowerCenter Sessions that Use Partitions

Increasing Performance for PowerCenter Sessions that Use Partitions Increasing Performance for PowerCenter Sessions that Use Partitions 1993-2015 Informatica LLC. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying,

More information

Design Studio Data Flow performance optimization

Design Studio Data Flow performance optimization Design Studio Data Flow performance optimization Overview Plan sources Plan sinks Plan sorts Example New Features Summary Exercises Introduction Plan sources Plan sinks Plan sorts Example New Features

More information

Condusiv s V-locity VM Accelerates Exchange 2010 over 60% on Virtual Machines without Additional Hardware

Condusiv s V-locity VM Accelerates Exchange 2010 over 60% on Virtual Machines without Additional Hardware openbench Labs Executive Briefing: March 13, 2013 Condusiv s V-locity VM Accelerates Exchange 2010 over 60% on Virtual Machines without Additional Hardware Optimizing I/O for Increased Throughput and Reduced

More information

IBM InfoSphere Data Replication s Change Data Capture (CDC) for DB2 LUW databases (Version ) Performance Evaluation and Analysis

IBM InfoSphere Data Replication s Change Data Capture (CDC) for DB2 LUW databases (Version ) Performance Evaluation and Analysis Page 1 IBM InfoSphere Data Replication s Change Data Capture (CDC) for DB2 LUW databases (Version 10.2.1) Performance Evaluation and Analysis 2014 Prasa Urithirakodeeswaran Page 2 Contents Introduction...

More information

Key metrics for effective storage performance and capacity reporting

Key metrics for effective storage performance and capacity reporting Key metrics for effective storage performance and capacity reporting Key Metrics for Effective Storage Performance and Capacity Reporting Objectives This white paper will cover the key metrics in storage

More information

Introducing the SAS ODBC Driver

Introducing the SAS ODBC Driver 1 CHAPTER 1 Introducing the SAS ODBC Driver Overview: The SAS ODBC Driver 1 What Is ODBC? 2 What Is the SAS ODBC Driver? 2 Types of Data Accessed with the SAS ODBC Driver 3 Understanding SAS 5 SAS Data

More information

Specifying Storage Servers for IP security applications

Specifying Storage Servers for IP security applications Specifying Storage Servers for IP security applications The migration of security systems from analogue to digital IP based solutions has created a large demand for storage servers high performance PCs

More information

SCASE STUDYS. Migrating from MVS to.net: an Italian Case Study. bizlogica Italy. segui bizlogica

SCASE STUDYS. Migrating from MVS to.net: an Italian Case Study. bizlogica Italy.  segui bizlogica SCASE STUDYS Migrating from MVS to.net: an Italian Case Study bizlogica Italy executive summary This report describes how BIZLOGICA helped a large Corporation to successful reach the objective of saving

More information

White Paper FUJITSU Storage ETERNUS DX S4/S3 series Extreme Cache/Extreme Cache Pool best fit for fast processing of vast amount of data

White Paper FUJITSU Storage ETERNUS DX S4/S3 series Extreme Cache/Extreme Cache Pool best fit for fast processing of vast amount of data White Paper FUJITSU Storage ETERNUS DX S4/S3 series Extreme Cache/Extreme Cache Pool best fit for fast processing of vast amount of data Extreme Cache / Extreme Cache Pool, which expands cache capacity

More information

A Hybrid Recursive Multi-Way Number Partitioning Algorithm

A Hybrid Recursive Multi-Way Number Partitioning Algorithm Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence A Hybrid Recursive Multi-Way Number Partitioning Algorithm Richard E. Korf Computer Science Department University

More information

Enhancing Analysis-Based Design with Quad-Core Intel Xeon Processor-Based Workstations

Enhancing Analysis-Based Design with Quad-Core Intel Xeon Processor-Based Workstations Performance Brief Quad-Core Workstation Enhancing Analysis-Based Design with Quad-Core Intel Xeon Processor-Based Workstations With eight cores and up to 80 GFLOPS of peak performance at your fingertips,

More information

IBM System Storage DCS3700

IBM System Storage DCS3700 IBM System Storage DCS3700 Maximize performance, scalability and storage density at an affordable price Highlights Gain fast, highly dense storage capabilities at an affordable price Deliver simplified

More information

TDT- An Efficient Clustering Algorithm for Large Database Ms. Kritika Maheshwari, Mr. M.Rajsekaran

TDT- An Efficient Clustering Algorithm for Large Database Ms. Kritika Maheshwari, Mr. M.Rajsekaran TDT- An Efficient Clustering Algorithm for Large Database Ms. Kritika Maheshwari, Mr. M.Rajsekaran M-Tech Scholar, Department of Computer Science and Engineering, SRM University, India Assistant Professor,

More information