OpenStack and Hadoop. Achieving near bare-metal performance for big data workloads running in a private cloud ABSTRACT

Similar documents
Dell EMC Ready Solution for VMware vcloud NFV 3.0 OpenStack Edition Platform

Red Hat Cloud Platforms with Dell EMC. Quentin Geldenhuys Emerging Technology Lead

Dell EMC Ready Solution for VMware vcloud NFV 3.0 OpenStack Edition Platform

Deploying and Managing Dell Big Data Clusters with StackIQ Cluster Manager

The FlashStack Data Center

Dell EMC All-Flash solutions are powered by Intel Xeon processors. Learn more at DellEMC.com/All-Flash

Oracle Exadata Statement of Direction NOVEMBER 2017

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

MODERNISE WITH ALL-FLASH. Intel Inside. Powerful Data Centre Outside.

Cloud Confidence: Simple Seamless Secure. Dell EMC Data Protection for VMware Cloud on AWS

ELASTIC DATA PLATFORM

Dell EMC ScaleIO Ready Node

TALK THUNDER SOFTWARE FOR BARE METAL HIGH-PERFORMANCE SOFTWARE FOR THE MODERN DATA CENTER WITH A10 DATASHEET YOUR CHOICE OF HARDWARE

Introduction to Cisco and Intel NFV Quick Start

SECURE, FLEXIBLE ON-PREMISE STORAGE WITH EMC SYNCPLICITY AND EMC ISILON

DriveScale-DellEMC Reference Architecture

Contrail Cloud Platform Architecture

Contrail Cloud Platform Architecture

40,000 TRANSFORM INFRASTRUCTURE AT THE EDGE. Introduction. Exploring the edge. The digital universe is doubling every two years

Copyright 2015 EMC Corporation. All rights reserved. Published in the USA.

Transformation Through Innovation

Vscale: Real-World Deployments of Next-Gen Data Center Architecture

21ST century enterprise. HCL Technologies Presents. Roadmap for Data Center Transformation

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump

Cloud Computing: Making the Right Choice for Your Organization

HPE Strategy for VMware Cloud Foundation

Enterprise Private Cloud. Fully managed private cloud as a service in your data centre or ours.

Dell EMC XC Series: Expanding the XC Series Ecosystem to Empower your Virtualized Applications

With K5 you can. Do incredible things with Fujitsu Cloud Service K5

Get more out of technology starting day one. ProDeploy Enterprise Suite

Dell EMC SAP HANA Appliance Backup and Restore Performance with Dell EMC Data Domain

POWEREDGE RACK SERVERS

IBM POWER SYSTEMS: YOUR UNFAIR ADVANTAGE

PERFORMING MICROSOFT SQL ALWAYSON AVAILABILITY GROUP BACKUPS ON AMAZON WEB SERVER AND MICROSOFT AZURE WITH DELL EMC AVAMAR

White Paper Server. Five Reasons for Choosing SUSE Manager

Storage Key Issues for 2017

Build your own Cloud on Christof Westhues

Minimizing the Risks of OpenStack Adoption

Cisco HyperFlex and the F5 BIG-IP Platform Accelerate Infrastructure and Application Deployments

Fast and Easy Persistent Storage for Docker* Containers with Storidge and Intel

VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS

BUILDING the VIRtUAL enterprise

Hyperconverged Infrastructure: Cost-effectively Simplifying IT to Improve Business Agility at Scale

Linux Automation.

Dell EMC Hyperconverged Portfolio: Solutions that Cover the Use Case Spectrum

Modernizing Virtual Infrastructures Using VxRack FLEX with ScaleIO

Merging Enterprise Applications with Docker* Container Technology

Enabling the software-defined data center

Designing a Reference Architecture for Virtualized Environments Using IBM System Storage N series IBM Redbooks Solution Guide

The Next Evolution of Enterprise Public Cloud. Bring the Oracle Cloud to Your Data Center

Dell OpenManage Cluster Configurator on Dell PowerEdge VRTX

We make hybrid cloud deliver the business outcomes you require

ProDeploy Suite. Accelerate enterprise technology adoption with expert deployment designed for you

VxRack System SDDC Enabling External Services

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

High performance and functionality

Transform Your Business with Hybrid Cloud

Dell EMC Surveillance for Reveal Body- Worn Camera Systems

Red Hat OpenStack Platform 10 Product Guide

Understanding Virtual System Data Protection

Build Converged Infrastructures With True Systems Management

Falling Out of the Clouds: When Your Big Data Needs a New Home

Protect Your Data At Every Point Possible. Reduce risk while controlling costs with Dell EMC and Intel #1 in data protection 1

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

Dell EMC Hyper-Converged Infrastructure

The McAfee MOVE Platform and Virtual Desktop Infrastructure

Virtuozzo Hyperconverged Platform Uses Intel Optane SSDs to Accelerate Performance for Containers and VMs

Going cloud-native with Kubernetes and Pivotal

Modernize Your Infrastructure

Enabling Red Hat Virtualization for the Hybrid Cloud

Open Source Community Extends Virtual Central Office to Mobile Use Case

Move, manage, and run SAP applications in the cloud. SAP-Certified Infrastructure from IBM Cloud

SUSE OpenStack Cloud

EMC XTREMCACHE ACCELERATES MICROSOFT SQL SERVER

Delivering Nexenta Software-Based File Services to Cisco HyperFlex

Run Critical Databases in the Cloud

An Oracle White Paper June Enterprise Database Cloud Deployment with Oracle SuperCluster T5-8

IBM dashdb Local. Using a software-defined environment in a private cloud to enable hybrid data warehousing. Evolving the data warehouse

Oracle Database Appliance X6-2S / X6-2M ORACLE ENGINEERED SYSTEMS NOW WITHIN REACH FOR EVERY ORGANIZATION

5 Things You Need for a True VMware Private Cloud

HPE ProLiant ML350 Gen P 16GB-R E208i-a 8SFF 1x800W RPS Solution Server (P04674-S01)

Adobe Digital Marketing s IT Transformation with OpenStack

Enabling Innovation in the Digital Economy

CLOUD COMPUTING PRIMER

A Better Approach to Leveraging an OpenStack Private Cloud. David Linthicum

Best Practices for Deploying a Mixed 1Gb/10Gb Ethernet SAN using Dell Storage PS Series Arrays

DXC Technology and VMware: Innovation that Transforms

Create a Flexible, Scalable High-Performance Storage Cluster with WekaIO Matrix

Problem Solved. Now Is the Time for 10GbE

DELL EMC TEST DRIVE. Build Confidence and Close More Deals EXPLORE TEST DRIVES BY PRODUCT

VMAX3 AND VMAX ALL FLASH WITH CLOUDARRAY

Wired + Wireless Cloud-managed Campus Reference Architecture

IBM Power Systems solution for SugarCRM

Genomics on Cisco Metacloud + SwiftStack

Vision of the Software Defined Data Center (SDDC)

DevOps and Continuous Delivery USE CASE

Easily Managing Hybrid IT with Transformation Technology

7 Things ISVs Must Know About Virtualization

Oracle s Netra Modular System. A Product Concept Introduction

SUSE OpenStack Cloud. Enabling your SoftwareDefined Data Center. SUSE Expert Days. Nyers Gábor Trainer &

Transcription:

OpenStack and Hadoop Achieving near bare-metal performance for big data workloads running in a private cloud ABSTRACT IT organizations are increasingly turning to the open source Apache Hadoop software framework for distributed processing of big data. Hadoop is typically deployed on dedicated physical servers in on-premises data centers. While Hadoop can be deployed in scalable, cost-effective private clouds, many IT organizations have not taken this step because of concerns about performance limitations and deployment complexity. Today, Dell EMC is changing this equation. Dell EMC streamlines the deployment of Hadoop in OpenStack clouds with Ready Bundles that are optimized to deliver near bare-metal performance for big data workloads running in a cloud environment. October 2017 DELL EMC WHITE PAPER

TABLE OF CONTENTS FINDING THE RIGHT CLOUD SOLUTION FOR A HADOOP DEPLOYMENT 1 THE DELL EMC READY BUNDLE FOR RED HAT OPENSTACK PLATFORM 1 A validated and tested cloud stack................... 1 Tools for rapid deployment and more................. 2 RUNNING HADOOP IN AN OPENSTACK CLOUD 2 Proven performance.......................... 2 KEY TAKEAWAYS 3 The information in this publication is provided as is. Dell Inc. makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose. Use, copying, and distribution of any software described in this publication requires an applicable software license. Copyright 2017 Dell Inc. or its subsidiaries. All Rights Reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its subsidiaries. Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries. Other trademarks may be trademarks of their respective owners. Dell disclaims proprietary interest in the marks and names of others. Published in the USA October 2017, White Paper. Dell EMC believes the information in this document is accurate as of its publication date. The information is subject to change without notice. DELL EMC WHITE PAPER

FINDING THE RIGHT CLOUD SOLUTION FOR A HADOOP DEPLOYMENT In enterprise environments, the benefits of private clouds are now well understood from IT agility and scalability to better utilization of resources. Today, some innovative IT organizations are looking to bring these and other cloud benefits to their big data workloads by deploying the Apache Hadoop software framework in an OpenStack cloud environment. For these organizations, the initial challenge is to find the right private cloud solution. Some organizations have tried to get to the cloud quickly by deploying appliances that package hardware and software in pre-integrated systems. The outcomes of these efforts have been mixed. Many organizations have found that their out-of-the box cloud environments lack the flexibility and extensibility needed for a true enterprise-class cloud. Homegrown cloud solutions, on the other hand, offer flexibility and extensibility but bring their own set of challenges. These solutions, which are typically based on the OpenStack platform, can be difficult to deploy due to the interdependency of software components from multiple open source projects, the rapid evolution of the OpenStack platform and the need for special skillsets. When building an OpenStack cloud from scratch, integrations with hardware, cloud management software, services and enterprise IT systems are needed to package the platform into a solution that delivers real results to the enterprise. This isn t an easy proposition. As just about anyone in the OpenStack user community can tell you, this work requires significant knowledge and substantial investment, and it can lead to a long-running IT initiative before an environment goes into production. Today, Dell EMC, Intel and Red Hat offer a better way forward via a robust Ready Bundle for an agile OpenStack cloud built for data-intensive enterprise-scale workloads. This solution gives you the best of two worlds: the confidence that comes with a proven architecture along with the flexibility and extensibility of a custom cloud solution. THE DELL EMC READY BUNDLE FOR RED HAT OPENSTACK PLATFORM A VALIDATED AND TESTED CLOUD STACK The Dell EMC Ready Bundle for Red Hat OpenStack Platform streamlines your path to a private cloud. It combines a core validated architecture with select extensions to create an adaptive infrastructure that integrates innovations from the OpenStack community with proven technologies from Dell EMC, Intel and Red Hat. This open, secure, reliable and supported solution is built on validated, integrated components, including Dell EMC PowerEdge servers with Intel Xeon processors. The solution features rapid, automated provisioning capabilities to simplify and speed deployment so you can take advantage of cloud benefits sooner. Better still, you can deploy your solution with the confidence that comes with knowing that Dell EMC and Red Hat have put all configurations and deployment options through extensive validation and stress testing. With the Ready Bundle for Red Hat OpenStack Platform, your system administrators can get your private cloud up and running quickly and keep it running reliably with all the ease and elegance of a fully featured, enterprise-grade cloud manager. 1 DELL EMC WHITE PAPER

TOOLS FOR RAPID DEPLOYMENT AND MORE One of the hallmarks of the Dell EMC Ready Bundle for Red Hat OpenStack Platform is a rapid and automated deployment experience. To enable this experience, the Ready Bundle is available with a powerful automation toolkit called JetPack. This toolkit provides a single package for configuring the underlying hardware and the OpenStack software in a fully automated fashion. JetPack includes a large set of automation tools that are used in conjunction with Red Hat OSP Director and OpenStack Ironic bare metal provisioning to provide a completely automated software deployment experience, as well as an automated update and upgrade experience. Whether you choose to have Dell EMC experts deploy your system or to do it yourself, JetPack gives you access to the same tools and validated deployment guides and the same rapid, reliable and automated processes. In addition, when you work with Dell EMC, our engineers help you leverage the OpenStack integration test suite, known as Tempest. This software suite includes tests for OpenStack API validation, scenarios and other specific tests for validating an OpenStack deployment. The use of the Tempest test suite gives you the assurance that your OpenStack solution is ready to go into production. RUNNING HADOOP IN AN OPENSTACK CLOUD While the benefits of the Dell EMC Ready Bundle for Red Hat OpenStack Platform are compelling, the goal is never better infrastructure. It s better performance for workloads. When you move applications to a private cloud, your business units expect to enjoy the same level of performance they get when their workloads run on dedicated hardware. For organizations deploying Hadoop in an OpenStack cloud, Dell EMC provides optimized configurations based on proven architectures. These configurations lessen the need to tune your hardware and cloud software to enable the cloud environment to best meet the demands of your Hadoop workloads. With these optimized configurations for Hadoop workloads running in a Red Hat OpenStack cloud, your organization can achieve near bare-metal performance for Hadoop in a cloud environment an outcome proven in Dell EMC testing. PROVEN PERFORMANCE At the Barcelona OpenStack Summit in October 2016, Dell EMC engineers presented lab testing results showing that Hadoop performance on an OpenStack cloud can almost match that of Hadoop on dedicated hardware. This level of performance was achieved in a proof of concept that leveraged an optimized Red Hat OpenStack configuration with a Hadoop distribution from Cloudera. This configuration included: Leading-edge hardware, including Dell EMC servers with Intel Xeon processors A select version of the OpenStack Nova project for compute provisioning Use of OpenStack block storage (Cinder) volumes using disk pinning on local storage Use of NUMA (Non-Uniform Memory Access) memory nodes To view the full presentation, see Hadoop on OpenStack Cloud: The Elephant Can Fly. 2 DELL EMC WHITE PAPER

READY BUNDLES FOR HADOOP For organizations looking to accelerate the deployment of a Hadoop environment running on dedicated hardware, Dell EMC has an ideal solution: Ready Bundles for Hadoop. These solutions, developed jointly with leading Hadoop distributions, are based on best practices gleaned from Dell EMC s work on Hadoop installations at customer sites around the world. They include all the essential hardware, software, resources and services you need to deploy and manage a robust Hadoop production environment. KEY TAKEAWAYS Dell EMC offers everything you need to bring the combined power of OpenStack and Hadoop to your data center including Intel-based Dell EMC PowerEdge servers, Red Hat OpenStack software and leading distributions of Hadoop. With this combination, you can gain the best of all worlds the agility, scalability and economies of cloud along with near bare-metal performance for your big data workloads running in a cloud environment. Learn more at: dell.com/openstack and dell.com/hadoop 3 DELL EMC WHITE PAPER