Table 1 The Elastic Stack use cases Use case Industry or vertical market Operational log analytics: Gain real-time operational insight, reduce Mean Ti

Similar documents
Cisco and Cloudera Deliver WorldClass Solutions for Powering the Enterprise Data Hub alerts, etc. Organizations need the right technology and infrastr

2 to 4 Intel Xeon Processor E v3 Family CPUs. Up to 12 SFF Disk Drives for Appliance Model. Up to 6 TB of Main Memory (with GB LRDIMMs)

Service-Level Agreement (SLA) based Reliability, Availability, and Scalability (RAS) for analytics The solution has no single point of failure. The Ve

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

by Cisco Intercloud Fabric and the Cisco

Cisco HyperFlex HX220c M4 Node

Cisco UCS B460 M4 Blade Server

Data Protection for Cisco HyperFlex with Veeam Availability Suite. Solution Overview Cisco Public

Achieve Optimal Network Throughput on the Cisco UCS S3260 Storage Server

Cisco UCS M5 Blade and Rack Servers

Cisco HyperFlex HX220c Edge M5

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

Cisco Unified Computing System Delivering on Cisco's Unified Computing Vision

Cisco UCS C210 M2 General-Purpose Rack-Mount Server

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

Veritas NetBackup on Cisco UCS S3260 Storage Server

Cisco UCS Mini Software-Defined Storage with StorMagic SvSAN for Remote Offices

Cisco UCS B440 M1High-Performance Blade Server

Cisco UCS C210 M1 General-Purpose Rack-Mount Server

Veeam Availability Solution for Cisco UCS: Designed for Virtualized Environments. Solution Overview Cisco Public

Cisco UCS C24 M3 Server

UCS M-Series + Citrix XenApp Optimizing high density XenApp deployment at Scale

DataON and Intel Select Hyper-Converged Infrastructure (HCI) Maximizes IOPS Performance for Windows Server Software-Defined Storage

Supports up to four 3.5-inch SAS/SATA drives. Drive bays 1 and 2 support NVMe SSDs. A size-converter

Cisco UCS C200 M2 High-Density Rack-Mount Server

An Oracle Technical White Paper October Sizing Guide for Single Click Configurations of Oracle s MySQL on Sun Fire x86 Servers

Next Generation Computing Architectures for Cloud Scale Applications

Implementing SQL Server 2016 with Microsoft Storage Spaces Direct on Dell EMC PowerEdge R730xd

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

Flash Storage Complementing a Data Lake for Real-Time Insight

ScaleProtect with Cisco UCS on the Cisco UCS C240 M5 Rack Server

Cisco UCS C240 M3 Server

Cisco UCS C240 M3 Server

Commvault ScaleProtect with Cisco UCS S3260 Storage Server

Cisco UCS B200 M3 Blade Server

Big Data and Object Storage

Datasheet FUJITSU Software ServerView Cloud Monitoring Manager V1.1

Fusion iomemory PCIe Solutions from SanDisk and Sqrll make Accumulo Hypersonic

Cisco UCS C250 M2 Extended-Memory Rack-Mount Server

HPE ProLiant ML350 Gen P 16GB-R E208i-a 8SFF 1x800W RPS Solution Server (P04674-S01)

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances

Oracle Big Data Connectors

Red Hat Ceph Storage and Samsung NVMe SSDs for intensive workloads

Accelerating Enterprise Search with Fusion iomemory PCIe Application Accelerators

Lenovo Database Configuration for Microsoft SQL Server TB

Cisco UCS B230 M2 Blade Server

Microsoft Exchange Server 2010 workload optimization on the new IBM PureFlex System

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

SAP High-Performance Analytic Appliance on the Cisco Unified Computing System

Micron and Hortonworks Power Advanced Big Data Solutions

Cisco UCS. Click to edit Master text styles Second level Third level Fourth level Fifth level

Cisco UCS C250 M2 Extended-Memory Rack-Mount Server

The webinar will start soon... Elasticsearch Performance Optimisation

Vblock Architecture. Andrew Smallridge DC Technology Solutions Architect

Accelerating Workload Performance with Cisco 16Gb Fibre Channel Deployments

HPE ProLiant ML350 Gen10 Server

Deploy a Next-Generation Messaging Platform with Microsoft Exchange Server 2010 on Cisco Unified Computing System Powered by Intel Xeon Processors

Apache Spark Graph Performance with Memory1. February Page 1 of 13

ACCELERATE YOUR ANALYTICS GAME WITH ORACLE SOLUTIONS ON PURE STORAGE

UCS Architecture Overview

FlashStack 70TB Solution with Cisco UCS and Pure Storage FlashArray

Dell PowerEdge R730xd Servers with Samsung SM1715 NVMe Drives Powers the Aerospike Fraud Prevention Benchmark


The FlashStack Data Center

Veeam Availability Suite on Cisco UCS S3260

Massive Scalability With InterSystems IRIS Data Platform

IBM Power Systems HPC Cluster

New Approach to Unstructured Data

HCI: Hyper-Converged Infrastructure

Front-loading drive bays 1 12 support 3.5-inch SAS/SATA drives. Optionally, front-loading drive bays 1 and 2 support 3.5-inch NVMe SSDs.

HPE ProLiant DL360 Gen P 16GB-R P408i-a 8SFF 500W PS Performance Server (P06453-B21)

Release Notes for Cisco UCS C-Series Software, Release 2.0(13)

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

Intel Solid State Drive Data Center Family for PCIe* in Baidu s Data Center Environment

Three Paths to Better Business Decisions

REFERENCE ARCHITECTURE Microsoft SQL Server 2016 Data Warehouse Fast Track. FlashStack 70TB Solution with Cisco UCS and Pure Storage FlashArray//X

Cisco UCS C240 M4 I/O Characterization

Cisco MCS 7825-I1 Unified CallManager Appliance

ADVANCED IN-MEMORY COMPUTING USING SUPERMICRO MEMX SOLUTION

Scality RING on Cisco UCS: Store File, Object, and OpenStack Data at Scale

Evolution of Rack Scale Architecture Storage

Emerging Technologies for HPC Storage

Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO

Sun Fire X4170 M2 Server Frequently Asked Questions

Cisco Content Delivery Engine 285 for Open Media Distribution

The Oracle Database Appliance I/O and Performance Architecture

Rack-Level I/O Consolidation with Cisco Nexus 5000 Series Switches

UCS Architecture Overview

Overview. About the Cisco UCS S3260 System

Density Optimized System Enabling Next-Gen Performance

Cisco UCS Integrated Infrastructure for Big Data and Analytics

SmartPlay Select. Cisco Global Data Center Sales. Haresh Kripalani UCS Product Management. Jonathan Ralph Global Data Center Sales April 6, 2015

Cisco MCS 7845-H1 Unified CallManager Appliance

Cisco Desktop Virtualization Solutions

Release Notes for Cisco Integrated System for Microsoft Azure Stack, Release 1.0. Release Notes for Cisco Integrated System for Microsoft

SUPERMICRO, VEXATA AND INTEL ENABLING NEW LEVELS PERFORMANCE AND EFFICIENCY FOR REAL-TIME DATA ANALYTICS FOR SQL DATA WAREHOUSE DEPLOYMENTS

Maailman paras palvelinjärjestelmä. Tommi Salli Distinguished Engineer

Cisco Connected Safety and Security UCS C220

Data Sheet. DriveScale Overview

Transcription:

Solution Overview Cisco UCS Integrated Infrastructure for Big Data with the Elastic Stack Cisco and Elastic deliver a powerful, scalable, and programmable IT operations and security analytics platform with near-real-time search, visualization, and monitoring capabilities. Organizations today are overwhelmed with data. This data arrives in multiple formats and includes many associated documents, and its huge volume dictates a need for a high-performance analytics platform. Organizations must manage increasing data volume and variety and the need for ever quicker responsiveness to new data with the continuously increasing complexity of the IT infrastructure. IT operations analytics is a set of methodologies and processes for collecting, organizing, and identifying patterns in network, computing, and application data across multiple tiers. It also includes the capability to automate routine analyses and health checks through interactive dashboards. Organizations also are challenged by the need to deploy a fast, reliable, and scalable hardware and powerful and flexible software platform to store and gain actionable insight from large volumes of data. Systems must provide the facility to mine data for valuable information that could augment the business and help make important decisions. The technology used must be able to correlate data and documents from multiple sources to get a full understanding of the health of digital infrastructure and applications. The Elastic Stack (previously known as the ELK Stack) is a combination of powerful, open-source software products for logging, storage, search, data visualization, and monitoring. Although each product can be used separately, together they make a powerful, comprehensive, flexible, and programmable IT operational analytics platform that addresses a variety of use cases, such as those listed in Table 1, and puts an organization s data into action. The Elastic Stack for comprehensive operational analytics

Table 1 The Elastic Stack use cases Use case Industry or vertical market Operational log analytics: Gain real-time operational insight, reduce Mean Time To Resolution (MTTR), improve customer satisfaction and overall experience, and protect sensitive personal information. Telecommunications, retail, finance, gaming, e-commerce, research & development, technology, entertainment, and public sector Security analytics: Protect against cyber threats, secure internal networks and applications, monitor IT activity, empower centralized Security Operations Centers (SOCs) and Security Information and Event Management (SIEM), and detect insider threats and cyber intruders including Advanced Persistent Threats (APTs). Software, finance, telecommunications, and government Business analytics: Improve marketing campaigns, detect customer purchasing patterns, analyze customer sentiment, provide recommendations, get credit scores, and analyze medical information. Healthcare, education, telecommunications, technology, retail, online dating, finance, news, and gaming Metrics analytics: Monitor sensors, improve manufacturing efficiency, and power large-scale data analysis. Technology, research, and gaming Application search: Perform e-commerce searches and media searches, improve research and development, perform real-time plagiarism detection, improve customer conversion and engagement, perform job searches and archive searches, get 360-degree customer views, and perform web searches. E-commerce, news, research, technology, pharmaceutics, healthcare, retail, government, finance, telecommunications, and education Enterprise search: Perform document searches, integrate content repositories, and improve internal knowledge portals. Technology, software, logistics, healthcare, finance, education, and telecommunications The Elastic Stack consists of the following components: Elasticsearch is a distributed search and analytics engine and data store. It can scale from a single node to hundreds of servers and store many terabytes to petabytes of data. It maintains near-real-time query performance by efficiently sharding data and distributing query processing. Its open, JavaScript Object Notation (JSON) based Representational State Transfer (REST) API makes it easy to use in any environment and provides both full text search and analytics capabilities. Queries include simple term searches, wildcard queries, aggregations including cardinality calculations at scale, and predictive analytics. Kibana is an intuitive and visual web-based user interface built on top of the Elasticsearch APIs. As the window into the whole Elastic Stack, it allows organizations to perform impromptu data exploration, build and share visualizations and dashboards, configure and run machine learning jobs, and monitor and manage the full stack. Logstash is the data transformation pipeline. It collects data from a multitude of sources and then transforms and outputs the results to Elasticsearch or other sources. More than 200 plug-ins are available, including 2

input plug-ins for reading data from sources such as text files, message queues, syslog, and various databases; filter plug-ins for field extraction and data enrichment from SQL data sources; and codec plug-ins for formats such as Avro, Comma-Separated Values (CSV), and NetFlow. Beats is a collection of lightweight data shippers or agents. It can collect data from log files, general system metrics, and specific metrics from many open-source applications (for example, databases). It can capture network traffic in real time and dozens of other types of data. It is can be deployed across many hosts to capture and send data to Logstash and Elasticsearch. ES-Hadoop integrates many Apache Hadoop technologies with Elasticsearch, allowing big data developers to combine and process data from Elasticsearch in native Hadoop technologies such as MapReduce, Cascading, Spark, Pig, Hive, and Storm. X-Pack adds critical enterprise features to the Elastic Stack, such as enterprise-class security, unsupervised anomaly detection, alerting to indicate changes in data instantly, constant monitoring, graph-based exploration for behavioral analytics, and PDF reporting. Cisco UCS Integrated Infrastructure for Elastic Cisco UCS Integrated Infrastructure for Elastic is based on the fifth generation of the industryleading converged infrastructure solution: Cisco UCS Integrated Infrastructure for Big Data. Two reference configurations are available: The high-performance configuration is suitable for deployments with short retention periods. It offers fast access to data. See Figure 1 and Table 2. The performance- and capacity-balanced configuration is suitable for deployments that require a tiered storage model with longer data retention. It provides faster access to frequently accessed data. See Figure 2 and Table 3. These configurations can be deployed as is or used as templates for building custom configurations. The solution can be customized based on workload demands, including expansion to thousands of servers through the use of Cisco Nexus 9000 Series Switches. With its extremely fast computing and memory and flexible storage options, this nextgeneration infrastructure can be used to power fast data access to the large storage resources required for modern applications. Figure 1 High-performance configuration High Performance configuration Cisco UCS 6332 Fabric Interconnect (40 Gbps) Data nodes 8 Cisco UCS C220 M5 Rack Servers 3

Table 1 Reference configuration 1: High performance Item Configuration Servers 8 x Cisco UCS C220 M5 Rack Servers, each with: 2 Intel Xeon Processor Scalable Family 6132 CPUs (2 x 14 cores at 2.6 GHz) 192 GB of memory at 2666 MHz (DDR4) 10 x 1.9-TB Enterprise Value SATA Solid-State Disks (SSDs)1 configured as RAID 5 M.2 with 2 x 480-GB SSDs Cisco 12-Gbps SAS Modular RAID Controller with 2-GB Flash-Based Write Cache (FBWC) 40-Gbps card (Cisco UCS Virtual Interface Card [VIC] 1387) Network connectivity Cisco UCS 6332 Fabric Interconnect Storage available2 17 TB per server (136 TB total) Elasticsearch ingest capacity3 More than 2 TB per day More than 100,000 events per second Figure 2 Performance- and capacity-balanced configuration Performance and Capacity balanced configuration Cisco UCS 6332 Fabric Interconnect (40 Gbps) Hot data nodes 5 Cisco UCS C220 M5 Rack Servers Warm data nodes 10 Cisco UCS C240 M5 Rack Servers 4

Table 3 Reference configuration 2: performance and capacity balanced Item Configuration Servers (hot tier) 5 x Cisco UCS C220 M5 Rack Servers, each with: Servers (warm tier) 2 Intel Xeon Processor Scalable Family 6132 CPUs (2 x 14 cores at 2.6 GHz) 192 GB of memory at 2666 MHz (DDR4) 10 x 960-GB Enterprise Value SATA SSDs1 configured as RAID 5 M.2 with 2 x 480-GB SSDs Cisco 12-Gbps SAS Modular RAID Controller with 2-GB FBWC 40-Gbps card (Cisco UCS VIC 1387) 10 x Cisco UCS C240 M5 Rack Servers, each with: 2 Intel Xeon Processor Scalable Family 6132 CPUs (2 x 14 cores at 2.6 GHz) 192 GB of memory at 2666 MHz (DDR4) 24 x 1.8-TB 10,000-rpm Hard-Disk Drives (HDDs)1 configured as RAID 10 M.2 with 2 x 480-GB SSDs Cisco 12-Gbps SAS Modular RAID Controller with 4-GB FBWC 40-Gbps card (Cisco UCS VIC 1387) Network connectivity Cisco UCS 6332 Fabric Interconnect Storage available2 Hot tier: 8 TB per server (40 TB total) Warm tier: 21 TB per server (210 TB total) More than 2 TB per day More than 100,000 events per second Elasticsearch ingest capacity3 Notes: 1. Other SSD and HDD options are available: 1.6- and 3.8-TB SSDs, and 1.8-TB 10,000-rpm SAS HDD. 2. The total storage capacity per server is the unformatted available storage based on the parity used for the RAID group. The actual available storage space varies depending on the file system used. 3. Actual ingest capacity depends on the retention period of the data, the average event size, and other factors. 4. Each server can host several Elasticsearch data nodes with the X-Pack extension. 5. Elasticsearch master nodes can be deployed on the same servers or in a separate virtual environment. 6. Any Elasticsearch ingest nodes as well as Logstash and Kibana services can be deployed on additional Cisco UCS Integrated Infrastructure. 7. For cold storage, any of the Hadoop solutions based on Cisco UCS Integrated Infrastructure for Big Data can be used with the Elastic ES-Hadoop connector. 5

Cisco UCS 6300 Series fabric interconnects Cisco UCS fabric interconnects establish a single point of connectivity and management for the entire system. They provide high-bandwidth, low-latency connectivity for Cisco UCS servers, with integrated, unified management for all connected devices provided by Cisco UCS Manager, which is embedded within each fabric interconnect. Deployed in redundant pairs, Cisco UCS fabric interconnects offer full active-active redundancy, high performance, and the exceptional scalability needed to support the large number of servers that are typical in clusters serving big data applications. Cisco UCS Manager enables rapid and consistent server configuration using Cisco UCS service profiles, advanced health monitoring, and automation of ongoing system maintenance activities across the entire cluster as a single operation. Cisco UCS C240 and C220 M5 Rack Servers The Cisco UCS C240 and C220 M5 Rack Servers are dual-socket servers offering industry-leading performance and expandability for a wide range of storage and I/O-intensive infrastructure workloads, from big data analytics to collaboration. These servers use the new Intel Xeon Scalable processors with up to 28 cores per socket. The servers support up to 24 DDR4 DIMMs for improved performance and lower power consumption. The DIMM slots are also 3D XPoint ready, supporting next-generation nonvolatile memory technology. The 2-rack-unit (2RU) Cisco UCS C240 M5 supports up to 26 Small-Form-Factor (SFF) 2.5-inch drives (with support for up to 10 Non-Volatile Memory Express [NVMe] PCIe SSDs on the NVMe-optimized chassis version) with a Cisco 12-Gbps SAS Module RAID Controller. The 1RU Cisco UCS C220 M5 supports up to 10 SFF 2.5-inch drives (with support for up to 10 NVMe PCIe SSDs on the NVMe-optimized chassis version). Additionally, it has two modular M.2 cards that can be used for boot. A modular LAN-on-motherboard (mlom) slot supports the Cisco UCS VIC 1387 with dual 40-Gbps network connectivity. Conclusion Big data technology has become a valuable and compelling asset for organizations of all sizes. Harnessing the value of big data analytics in real time from a stable, manageable, and scalable infrastructure has been a longstanding challenge. The Elastic Stack on Cisco UCS solution meets this challenge by delivering a real-time data analytics and search platform that can be rapidly deployed and scaled on demand and is easy to use. It also reduces the Total Cost of Ownership (TCO) by requiring fewer infrastructure components and reducing operating expenses associated with staff time. Organizations can rely on this solution to provide real-time data intelligence that can help them make critical business decisions. For more information For more information about Cisco UCS, visit https://www.cisco.com/go/ucs. For more information about Elastic, visit https://www.elastic.co/. For more information about Cisco UCS Big Data Solutions, visit https://www.cisco.com/go/bigdata. For more information about Cisco Validated Designs for big data, visit http://www.cisco.com/go/bigdata_design 2017 Cisco and/or its affiliates. All rights reserved. Cisco and the Cisco logo are trademarks or registered trademarks of Cisco and/or its affiliates in the U.S. and other countries. To view a list of Cisco trademarks, go to this URL: https://www.cisco.com/go/trademarks. Third-party trademarks mentioned are the property of their respective owners. The use of the word partner does not imply a partnership relationship between Cisco and any other company. (1110R) 2017 All Rights Reserved - Elasticsearch BV Elasticsearch is a trademark of Elasticsearch BV, registered in the U.S. and in other countries Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant logo are trademarks of the Apache Software Foundation in the United States and/ or other countries. C22-739529-00 09/17