Demystifying Storage Area Networks. Michael Wells Microsoft Application Solutions Specialist EMC Corporation

Similar documents
Infrastructure Tuning

Storage Update and Storage Best Practices for Microsoft Server Applications. Dennis Martin President, Demartek January 2009 Copyright 2009 Demartek

davidklee.net heraflux.com linkedin.com/in/davidaklee

EMC Tiered Storage for Microsoft SQL Server 2008 Enabled by EMC CLARiiON CX4 and Enterprise Flash Drives

IBM System Storage DS3300 Storage System 1000 Mailbox Clustered Continuous Replication Microsoft Exchange 2007 Storage Solution

Accelerate Applications Using EqualLogic Arrays with directcache

Virtual SQL Servers. Actual Performance. 2016

White Paper. EonStor GS Family Best Practices Guide. Version: 1.1 Updated: Apr., 2018

Dell EMC SCv3020 7,000 Mailbox Exchange 2016 Resiliency Storage Solution using 7.2K drives

3.1. Storage. Direct Attached Storage (DAS)

Dell PowerVault MD Mailbox Single Copy Cluster Microsoft Exchange 2007 Storage Solution

Assessing performance in HP LeftHand SANs

Exam : S Title : Snia Storage Network Management/Administration. Version : Demo

Dell Storage SC Series Arrays and Microsoft SQL Server

Dell PowerVault MD1220 is a SAS based storage enclosure. The major features of the storage system include:

Storage Area Network (SAN) Training Presentation. July 2007 IBM PC CLUB Jose Medeiros Storage Systems Engineer MCP+I, MCSE, NT4 MCT

Dell PowerVault MD3000i 5000 Mailbox Single Copy Cluster Microsoft Exchange 2007 Storage Solution

ESRP Storage Program

Sun ZFS Storage 7120 Appliance 5,000 Mailbox Resiliency Exchange 2010 Storage Solution

InfoSphere Warehouse with Power Systems and EMC CLARiiON Storage: Reference Architecture Summary


Dell Compellent Storage Center 6.5 SC4020 4,500 Mailbox Exchange 2013 Resiliency Storage Solution

ESRP Storage Program EMC CLARiiON CX3-20c (500 User) Storage Solution for Microsoft Exchange Server

White Paper. A System for Archiving, Recovery, and Storage Optimization. Mimosa NearPoint for Microsoft

EMC CLARiiON Database Storage Solutions: Microsoft SQL Server 2000 and 2005

Microsoft SQL Server 2012 Fast Track Reference Configuration Using PowerEdge R720 and EqualLogic PS6110XV Arrays

EMC CLARiiON CX3 Series FCP

S S SNIA Storage Networking Foundations

SQL Server: Practical Troubleshooting. Dmitri Korotkevitch (

Chapter 12: Mass-Storage

DELL TM AX4-5 Application Performance

HP StorageWorks 600 Modular Disk System 4,000 user 3GB Mailbox resiliency Exchange 2010 storage solution

ESRP Storage Program EMC CLARiiON CX3-20c(1000 user) Storage Solution for Microsoft Exchange Server

Field Testing Buffer Pool Extension and In-Memory OLTP Features in SQL Server 2014

Storage Optimization with Oracle Database 11g

1Z0-433

Storage Best Practices for Microsoft Server Applications

EMC VSPEX FOR VIRTUALIZED MICROSOFT EXCHANGE 2013 WITH MICROSOFT HYPER-V

DELL EMC UNITY: BEST PRACTICES GUIDE

This document contains information about the EMC DMX SRDF/A Storage Solution for Microsoft Exchange Server.

Dell Storage Center 6.6 SCv2000 SAS Front-end Arrays and 2,500 Mailbox Exchange 2013 Resiliency Storage Solution

Chapter 10: Mass-Storage Systems

SurFS Product Description

Snia S Storage Networking Management/Administration.

Sun ZFS Storage 7320 Appliance 10,000 Mailbox Resiliency Exchange 2010 Storage Solution

Storage Area Networks SAN. Shane Healy

Chapter 10: Mass-Storage Systems. Operating System Concepts 9 th Edition

Introducing Tegile. Company Overview. Product Overview. Solutions & Use Cases. Partnering with Tegile

IBM Tivoli Storage Manager for Windows Version Installation Guide IBM

IBM Storwize V ,000 mailbox resiliency Microsoft Exchange 2013 storage solution. IBM Systems and Technology Group ISV Enablement March 2014

access addresses/addressing advantages agents allocation analysis

DELL EMC UNITY: HIGH AVAILABILITY

VMware Virtual SAN. Technical Walkthrough. Massimiliano Moschini Brand Specialist VCI - vexpert VMware Inc. All rights reserved.

Dell Fluid Data solutions. Powerful self-optimized enterprise storage. Dell Compellent Storage Center: Designed for business results

Reference Architecture

NST6000 UNIFIED HYBRID STORAGE. Performance, Availability and Scale for Any SAN and NAS Workload in Your Environment

HP 3PAR StoreServ ,000 Mailbox Resiliency Exchange 2010 Storage Solution

PESIT Bangalore South Campus

NAS for Server Virtualization Dennis Chapman Senior Technical Director NetApp

Chapter 6. Storage and Other I/O Topics

EMC CLARiiON CX3 UltraScale Series The Proven Midrange Storage

Isilon Performance. Name

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

Getting the most from your SAN File and Filegroup design patterns. Stephen Archbold

EMC Backup and Recovery for Microsoft Exchange 2007

NetApp E-Series E ,000-Mailbox Microsoft Exchange Server 2013 Mailbox Resiliency Storage Solution

EMC STORAGE FOR MILESTONE XPROTECT CORPORATE

SQL Server 2014 Training. Prepared By: Qasim Nadeem

Hitachi Unified Storage VM Dynamically Provisioned 21,600 Mailbox Exchange 2013 Mailbox Resiliency Storage Solution

EMC VSPEX FOR VIRTUALIZED MICROSOFT SQL SERVER 2012 WITH MICROSOFT HYPER-V

Nexenta Technical Sales Professional (NTSP)

Introduction to NetApp E-Series E2700 with SANtricity 11.10

Dell EMC SC Series SC5020 9,000 Mailbox Exchange 2016 Resiliency Storage Solution using 7.2K Drives

EMC Business Continuity for Microsoft Applications

The Oracle Database Appliance I/O and Performance Architecture

Slide 0 Welcome to this Web Based Training session introducing the ETERNUS DX80 S2, DX90 S2, DX410 S2 and DX440 S2 storage systems from Fujitsu.

EMC CLARiiON AX4-5i (2,000 User) Storage Solution for Microsoft Exchange Server 2007 SP1

Oracle Performance on M5000 with F20 Flash Cache. Benchmark Report September 2011

Microsoft SQL Server 2012 Fast Track Reference Architecture Using PowerEdge R720 and Compellent SC8000

Surveillance Dell EMC Storage with Milestone XProtect Corporate

iscsi Technology Brief Storage Area Network using Gbit Ethernet The iscsi Standard

Data center requirements

Isilon Scale Out NAS. Morten Petersen, Senior Systems Engineer, Isilon Division

Database Solutions Engineering. Best Practices for running Microsoft SQL Server and Microsoft Hyper-V on Dell PowerEdge Servers and Storage

Vblock Architecture. Andrew Smallridge DC Technology Solutions Architect

Synology High Availability (SHA)

IBM EXAM QUESTIONS & ANSWERS

COMP283-Lecture 3 Applied Database Management

Computer Architecture 计算机体系结构. Lecture 6. Data Storage and I/O 第六讲 数据存储和输入输出. Chao Li, PhD. 李超博士

COSC6376 Cloud Computing Lecture 17: Storage Systems

Chapter 10: Mass-Storage Systems

Dell PowerEdge R720xd 6,000 Mailbox Resiliency Microsoft Exchange 2013 Storage Solution. Tested with ESRP Storage Version 4.0 Tested Date: Feb 2014

TECHNICAL SPECIFICATIONS

FAST & Furious mit EMC² VNX

Milestone Solution Partner IT Infrastructure Components Certification Report


UCS Invicta: A New Generation of Storage Performance. Mazen Abou Najm DC Consulting Systems Engineer

New HPE 3PAR StoreServ 8000 and series Optimized for Flash

Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades

STORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp

Transcription:

Demystifying Storage Area Networks Michael Wells Microsoft Application Solutions Specialist EMC Corporation

About Me DBA for 7+ years Developer for 10+ years MCSE: Data Platform MCSE: SQL Server 2012 MCITP: Database Administrator 2008 MCITP: Database Developer 2008 MCTS: SQL Server 2005

Agenda SAN Protocols SAN Storage Types The parts of a SAN Different SAN Architectures Understanding RAID Understanding Multipathing

SAN Protocols Fiber Channel high performance, dedicated fiber network requiring special hardware iscsi sends SCSI commands over traditional Ethernet interfaces (uses TCP/IP) Fiber Channel over Ethernet (FCOE) Fiber Channel commands over a lossless Ethernet network and requires special hardware

SAN Storage Types Block Storage volumes must be mounted to a host File Storage configured at the array with a file system and can be addressed by a UNC path Unified Storage array capable of supporting both storage types

The Main Parts of a SAN (FC) LUN (Logical) HBA (Physical) Fiber Optic Cable (Physical) Fiber Channel Switch (Physical) Storage Processor/Engine (Physical) Cache (Physical) Disk Array Enclosure DAE (Physical) Storage Pool (Logical) RAID Set (Logical) Drives (Physical)

LUN Logical Unit Number Volume mounted to one or more servers and appears in Windows as a drive Can be mounted using a drive letter or a mount point

HBA Host Bus Adapter Physical card that connects the server to the SAN Fabric Rating Net Throughput Efficiency 1Gb 98.44 MBs 77.7% 2Gb 196.9 MBs 77.7% 4Gb 393.8 MBs 77.7% 8Gb 787.6 MBs 77.7% 10Gb 1,181 MBs 94.2% 16Gb 1,575 MBs 94.2%

Fiber Optic Cable Physical medium used for data transmission between components connected to the SAN Fabric

Fiber Channel Switch Networking device that handles traffic between components the heart of the SAN Fabric

Storage Processor/Engine The brains of the SAN there are at least 2 and can be Active/Passive or Active/Active depending on the system

Cache Split into Read Cache and Write Cache Read Cache prevents the need to re-read the data from spinning disk Write Cache allows the write operation to be acknowledged before the data is committed to disk (should have battery backup) Can be memory in the Storage Processor/Engine or it can be Drives in the array (usually SSD)

Disk Array Enclosure (DAE) A piece of hardware that holds hard drives and is connected to the storage processors

Storage Pool One or more RAID sets grouped into a single pool from which LUNs are allocated Usually built using drives of the same size and speed Can be a mixed pool when storage tiering technology is used

RAID Set Redundant Array of Inexpensive Disks - Method of grouping disks together for performance, redundancy, or both RAID 0 Striping Performance with no redundancy RAID 1 Mirroring Redundancy, but not space efficient RAID 5 Striping with Parity Redundancy, with better space efficiency RAID 6 Striping with Parity Redundancy, with better fault tolerance than RAID 5 RAID 1/0 Mirroring + Striping Performance with redundancy

Drives The physical drive that stores the data blocks Enterprise Flash Drives (EFD) 3500 IOPS Single Layer Cells (SLC) low density with the best durability Multi Layer Cells (MLC) higher density at the cost of long-term durability SAS Drives 15k RPM 160 IOPS 10k RPM 140 IOPS Near-Line (NL) SAS Drives 7.2k RPM 90 IOPS SATA Drives SATA2 7.2k 80 IOPS SATA 7.2k 60 IOPS

The Four Storage Architectures Clustered Scale Up & Down TYPE 1 Tightly Coupled Scale Out TYPE 2 Loosely Coupled Scale Out TYPE 3 Distributed Shared Nothing TYPE 4 General Purpose Storage Balance of Perf/Cost/RAS Integrated & Unified Efficient & Simple Transactional Commits Brains Share Memory Distributed Data Data Available all Brains Shared Meta-Data Transactional Commits Independent Brains Inter-Brain Communication Distributed Data Data Available all Brains Transactional Commits Independent Brains Direct Attach Storage Lazy / Forced Data Dist. Distributed Commits Non-Transactional Commits

Key Performance Metrics IOPS I/O Operations per Second Front-End IOPS IO traffic generated at the host and sent to the Storage Processor / Engine Back-End IOPS IO traffic generated at the Storage Processor and sent to the disks Bandwidth the amount of data being sent/recieved Latency the time it takes for an IO request to complete

What Size IO Does SQL Server Use? File type Operation READ pattern WRITE pattern Threads used I/O type Data File Normal Activity 8KiB up to 128KiB 8KiB up to 128KiB Based on MaxDOP Random Checkpoint N/A 64KiB up to 128 KiB # of Sockets in Computer Random LazyWriter N/A 64KiB up to 128 KiB 1 per NUMA Node Random Bulk Insert N/A 8KiB up to 128 KiB Based on MaxDOP Sequential Backup 1 MB 1 MB Based on MaxDOP Sequential Restore 64KiB 64KiB Based on MaxDOP Sequential DBCC Checkdb w/ no repair option 8KiB up to 64KiB N/A Based on MaxDOP Sequential Rebuild Index See Read Ahead 8KiB 128 KiB Based on MaxDOP Sequential ReadAhead Up to 512 KiB N/A Based on MaxDOP Sequential Log File Normal Activity 512 bytes - 64KiB 512 bytes - 64KiB one log writer thread per soft NUMA node with a cap of 4 Sequential

Understanding RAID RAID Protection comes at a cost Small block random writes 1 application write IO RAID 1/0 2 back-end write IO RAID 5 4 back-end IO (2 read IO + 2 write IO) RAID 6 6 back-end IO (3 read IO + 3 write IO)

RAID Recommendations for SQL Always place Log files on RAID 1/0 or RAID 1 Better protection from failure Better write performance (log activity is almost all write) Consider using RAID 5 or RAID 6 for data files Data files are less write intensive than log files and can benefit from the less expensive RAID level Consider using RAID 1/0 for TEMPDB Faster writes will improve performance when temporary objects are used and/or operations spill over to TEMPDB https://technet.microsoft.com/en-us/library/cc966534.aspx

Understanding Multipathing The primary purpose of multipathing is redundancy

Multipathing - Redundancy Storage Processor A Storage Processor B

Understanding Multipathing The primary purpose of multipathing is redundancy The secondary purpose of multipathing is for performance and load balancing

Multipathing - Performance Storage Processor A Storage Processor B

Microsoft Multipath I/O (MPIO) Use more than one path for read and write functions to your storage device Provides redundant failover and load-balancing support for disks or LUNs Supports bandwidth aggregation Distributes I/O transactions across multiple adapters Windows Server feature

The Importance of a Baseline A query takes 30 seconds to run, is that too long? What about a query that runs in 3 seconds? To identify abnormal behavior you must first understand normal

SQL Monitoring Dynamic Management Views (DMVs) SQL Profiler/Trace Extended Events Performance Dashboard Management Data Warehouse (MDW)

DYNAMIC MANAGEMENT VIEWS SYS.DM_IO_VIRTUAL_FILE_STATS SELECT [ReadLatency] = CASE WHEN [num_of_reads] = 0 THEN 0 ELSE ([io_stall_read_ms] / [num_of_reads]) END, [WriteLatency] = CASE WHEN [num_of_writes] = 0 THEN 0 ELSE ([io_stall_write_ms] / [num_of_writes]) END, [Latency] = CASE WHEN ([num_of_reads] = 0 AND [num_of_writes] = 0) THEN 0 ELSE ([io_stall] / ([num_of_reads] + [num_of_writes])) END, [AvgBPerRead] = CASE WHEN [num_of_reads] = 0 THEN 0 ELSE ([num_of_bytes_read] / [num_of_reads]) END, [AvgBPerWrite] = CASE WHEN [num_of_writes] = 0 THEN 0 ELSE ([num_of_bytes_written] / [num_of_writes]) END, [AvgBPerTransfer] = CASE WHEN ([num_of_reads] = 0 AND [num_of_writes] = 0) THEN 0 ELSE (([num_of_bytes_read] + [num_of_bytes_written]) / ([num_of_reads] + [num_of_writes])) END, LEFT ([mf].[physical_name], 2) AS [Drive], DB_NAME ([vfs].[database_id]) AS [DB], [mf].[physical_name] FROM sys.dm_io_virtual_file_stats (NULL,NULL) AS [vfs] JOIN sys.master_files AS [mf] ON [vfs].[database_id] = [mf].[database_id] AND [vfs].[file_id] = [mf].[file_id] -- WHERE [vfs].[file_id] = 2 -- log files -- ORDER BY [Latency] DESC -- ORDER BY [ReadLatency] DESC ORDER BY [WriteLatency] DESC; GO Query Credit Paul Randal and Jimmy May

Dynamic Management Views

SQL Profiler Warning: SQL Trace was deprecated in SQL 2012 No new events or enhancements! Look at ClearTrace for aggregating trace results http://www.scalesql.com/cleartrace/

Extended Events Replaces the functionality of SQL Trace The only way to monitor new SQL Server events like those related to Availability Group Replication Jonathan Kehayias has written a converter to migrate trace definitions to extended events - https://www.sqlskills.com/blogs/jonathan/converting-sql-traceto-extended-events-in-sql-server-2012/

Performance Dashboard

Management Data Warehouse

Windows Monitoring Windows Performance Counters Performance Monitor (PerfMon) System Center Operations Manager (SCOM)

WINDOWS PERFORMANCE COUNTERS PERFORMANCE MONITOR: LOGICAL DISK Latency Avg. Disk sec/transfer Avg. Disk sec/read Avg. Disk sec/write IOPS Disk Transfers/sec Disk Read/sec Disk Writes/sec Throughput Disk Bytes/sec Disk Read Bytes/sec Disk Write Bytes/sec Transfer Size Avg. Disk Bytes/Transfer Avg. Disk Read Bytes/Transfer Avg. Disk Write Bytes/Transfer Disk Queue Length Current Disk Queue Length Avg. Disk Queue Length Avg. Disk Read Queue Length Avg. Disk Write Queue Length Capacity % Free Space Free Megabytes For details on these counters http://blogs.technet.com/b/askcore/archive/2012/03/16/windows-performance-monitor-disk-counters-explained.aspx

Performance Monitor (PerfMon)

System Center Operations MGR

When I Find An Issue Don t immediately blame the I/O Subsystem, using your baseline look for things like: Query plan changes Additional indexes or index changes Access pattern or key changes Adding Change Data Tracking (CDC) or triggers Enabling Snapshot Isolation Decreased server memory Increased session counts

Escalating Issues Provide as much relevant information as possible Show normal/expected performance metrics These must already exist and be easily accessible Show how the current workload compares to normal/expected If the current workload is different, what changed and is it permanent? Focus on solving the problem Don t get bogged down in the blame game