When, Where & Why to Use NoSQL?

Similar documents
Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight

Introduction to the Active Everywhere Database

Fast Innovation requires Fast IT

The Data Explosion. A Guide to Oracle s Data-Management Cloud Services

New Approach to Unstructured Data

Discover the all-flash storage company for the on-demand world

SD-WAN. Enabling the Enterprise to Overcome Barriers to Digital Transformation. An IDC InfoBrief Sponsored by Comcast

BUILD BETTER MICROSOFT SQL SERVER SOLUTIONS Sales Conversation Card

A Single Source of Truth

FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION

Progress DataDirect For Business Intelligence And Analytics Vendors

Strategic Briefing Paper Big Data

10 Cloud Myths Demystified

Why Converged Infrastructure?

Top 5 Reasons to Consider

Introduction to K2View Fabric

Massive Scalability With InterSystems IRIS Data Platform

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

Healthcare IT Modernization and the Adoption of Hybrid Cloud

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

Cloud Computing: Making the Right Choice for Your Organization

Embedded Technosolutions

For Healthcare Providers: How All-Flash Storage in EHR and VDI Can Lower Costs and Improve Quality of Care

A Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores

ACCELERATE YOUR ANALYTICS GAME WITH ORACLE SOLUTIONS ON PURE STORAGE

Problem Solved. Now Is the Time for 10GbE

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization

Why Enterprises Need to Optimize Their Data Centers

Was ist dran an einer spezialisierten Data Warehousing platform?

Aerospike Scales with Google Cloud Platform

5/24/ MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992

High Performance with Distributed Caching. Key Requirements For Choosing The Right Solution

Markus Kujala, Systems Engineering Manager

IBM Power Systems: Open innovation to put data to work Dexter Henderson Vice President IBM Power Systems

Migrating Oracle Databases To Cassandra

NoSQL database and its business applications

Why Converged Infrastructure?

WHITE PAPER HYBRID CLOUD: FLEXIBLE, SCALABLE, AND COST-EFFICIENT UK: US: HK:

Government IT Modernization and the Adoption of Hybrid Cloud

New Approaches to Big Data Processing and Analytics

IT Optimization Trends. Summary Results January 2018

DATACENTER SERVICES DATACENTER

DATABASE SCALE WITHOUT LIMITS ON AWS

ScaleArc for SQL Server

VoltDB for Financial Services Technical Overview

5 Fundamental Strategies for Building a Data-centered Data Center

10 Cloud Myths Demystified

SoftNAS Cloud Data Management Products for AWS Add Breakthrough NAS Performance, Protection, Flexibility

That Set the Foundation for the Private Cloud

Solution Brief. A Key Value of the Future: Trillion Operations Technology. 89 Fifth Avenue, 7th Floor. New York, NY

powered by Cloudian and Veritas

2017 Riverbed Technology. All rights reserved.

VEXATA FOR ORACLE. Digital Business Demands Performance and Scale. Solution Brief

Making hybrid IT simple with Capgemini and Microsoft Azure Stack

Stages of Data Processing

Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems

Top Priority for Hybrid IT

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

Modernizing Healthcare IT for the Data-driven Cognitive Era Storage and Software-Defined Infrastructure

I D C T E C H N O L O G Y S P O T L I G H T. V i r t u a l and Cloud D a t a Center Management

Answer: A Reference: df(page 1, first para)

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.

"Software-defined storage Crossing the right bridge"

TECHNOLOGY WHITE PAPER. Java for the Real Time Business

W H I T E P A P E R U n l o c k i n g t h e P o w e r o f F l a s h w i t h t h e M C x - E n a b l e d N e x t - G e n e r a t i o n V N X

Dell PowerEdge R730xd Servers with Samsung SM1715 NVMe Drives Powers the Aerospike Fraud Prevention Benchmark

Active Archive and the State of the Industry

THE COMPLETE GUIDE COUCHBASE BACKUP & RECOVERY

Dell EMC Isilon All-Flash

How Flash Storage is Changing the Economics of SaaS Businesses

THE STATE OF IT TRANSFORMATION FOR TELECOMMUNICATIONS

Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers

Top 4 considerations for choosing a converged infrastructure for private clouds

Enabling Innovation in the Digital Economy

Composable Infrastructure for Public Cloud Service Providers

Performance Innovations with Oracle Database In-Memory

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

Hybrid IT for SMBs. HPE addressing SMB and channel partner Hybrid IT demands ANALYST ANURAG AGRAWAL REPORT : HPE. October 2018

Market Report. Scale-out 2.0: Simple, Scalable, Services- Oriented Storage. Scale-out Storage Meets the Enterprise. June 2010.

Upgrade Your MuleESB with Solace s Messaging Infrastructure

Vision of the Software Defined Data Center (SDDC)

Making the case for SD-WAN

For DBAs and LOB Managers: Using Flash Storage to Drive Performance and Efficiency in Oracle Databases

Four Steps to Unleashing The Full Potential of Your Database

Architecture of a Real-Time Operational DBMS

Cloud Migration Strategies

Nimble Storage vs HPE 3PAR: A Comparison Snapshot

THE STATE OF IT TRANSFORMATION FOR RETAIL

Veeam Availability Solution for Cisco UCS: Designed for Virtualized Environments. Solution Overview Cisco Public

DATACENTER AS A SERVICE. We unburden you at the level you desire

SAP IQ Software16, Edge Edition. The Affordable High Performance Analytical Database Engine

IBM POWER SYSTEMS: YOUR UNFAIR ADVANTAGE

Virtualizing Networks:

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

TITLE. the IT Landscape

Data Analytics at Logitech Snowflake + Tableau = #Winning

Approaching the Petabyte Analytic Database: What I learned

ARC BRIEF. Application Downtime, Your Productivity Killer. Overview. Trends in Plant Application Adoption. By John Blanchard and Greg Gorbach

Software Defined Storage

Transcription:

When, Where & Why to Use NoSQL? 1

Big data is becoming a big challenge for enterprises. Many organizations have built environments for transactional data with Relational Database Management Systems (RDBMS), but are now inundated with enormous amounts of new types of data. These include, but are not limited to, social media data, server logs, clickstream data, machine data, and geolocation data. These new data sources include unstructured and semi-structured data, and they all share the common big data characteristics of volume, velocity, and variety. 2

Big Data, Big Challenges. Modern enterprises collect more data than ever before, from a wide variety of sources, and in a wide variety of formats. In a recent survey conducted by Researchscape, executives surveyed reported that as data volumes increase, they are struggling to glean meaningful insights from their data (60%), and that getting access to and analyzing the right data at the right time is a problem (56%). Only a quarter of executives believed that their data architecture from five years ago will work well five years from now (27%). Which, if any, of the following statements do you agree with? (See all that apply.) Sample size: 250 (97% of Respondents) 2% None of the above 27% My data architecture from five years ago will work well five years from now 60% As data volumes increase, we are struggling to glean meaningful insights from our data 56% Getting access to and analyzing the right data at the right time is a problem for my company 3

Challenges such as the lack of scale to manage large data sets (48%) and the slow speed of the system for modern analytics (45%) concern the executives surveyed and may drive them to change their traditional data management systems. Which of the following statements concern you and may drive you to change your traditional data management systems? (See all that apply.) Sample size: 250 (97% of Respondents) 48% The lack of scale to manage large data sets, types of data and performance requirements 3% None of the above 15% The length of time required to use traditional systems to get insights from data and create business value 45% The speed of the system being too slow for modern analytic needs 37% The shortage of skills to manage and build on current proprietary solutions 34% The cost of managing traditional systems 4

Data from the Researchscape survey also indicates that most professionals surveyed (67%) were satisfied with their current data warehousing vendor, stating that it is working well for existing operational workloads. The biggest complaint, as expected, is that it was not designed for modern analytics on Big Data, the IoT (Internet of Things), and other new types of data (31%). The second-biggest complaint is that it was expensive to maintain as data needs and volumes grow exponentially (29%). Which statement best represents your current relationship with your data warehousing vendor? (See all that apply.) Sample size: 252 (98% of Respondents) 67% I am satisfied with their product, and it is working well for my operational workloads today 1% None of the above 2% I am stuck with them 3% I don t have the authority to change technologies 31% While it works well for our traditional analytics applications, it is not designed for modern analytics on Big Data, lot, and other new types of data; we are planning to reevaluate soon 29% It is expensive to maintain as my data volume and needs are expanding exponentially 20% 11% I have not found a suitable alternative It is not getting the job done, but I am apprehensive to deploy an unproven new technology 5

RDBMS Relational databases came into wide use in the 1980s. These tablebased (column and row) relational databases became popular because they provided many useful features: Reliability ACID Compliance Aggregation Rigorous Schema Complex Queries Normalization Relational databases are used to process and store structured data, and they offer many tools, but also a significant amount of infrastructure and overhead. A relational database simply cannot meet all of the business requirements and opportunities that have arisen with big data. The biggest limitation is that you are bound to a rigid schema ahead of time. 6

Dealing with Big Data Big data is a direct result of both the explosive growth of the Internet, and the proliferation of social and mobile applications. To deal effectively with big data, a database must possess the following characteristics: Scalability The database must be able to scale up to hundreds of terabytes of data, and scale dynamically up to thousands or millions of users. Flexibility The database must be able to deal with schema-less data sets that include structured and semi-structured data. Performance The database must provide fast transaction response times, at scale. Agility The database must provide developers with the ability to rapidly create and adopt applications. 7

There are two ways to scale a database: Scale Up Scale vertically by adding CPU, storage, and memory to individual servers Scale Out Scale horizontally by adding more servers in parallel Legacy relational database systems cannot scale out effectively. An RDBMS server can be scaled up by adding CPU, storage, and memory, but you eventually reach an upper limit that still cannot adequately address the performance requirements of big data. A NoSQL database is designed to scale horizontally, leveraging the processing and storage power of hundreds or even thousands of parallel servers. And instead of requiring expensive, specialized storage and processing hardware, NoSQL takes advantage of low cost commodity servers. In addition, NoSQL is schema-agnostic which accelerates development cycles by providing the freedom to store information without requiring extensive up front schema design. 8

Why Move to NoSQL? Using NoSQL does not necessarily involve scrapping your existing RDBMS and starting from scratch. NoSQL should be thought of as a tool that can be used to solve the new types of challenges associated with big data. There may be business processes that can continue to be addressed effectively with RDBMS. But with the new challenges presented by big data, you will likely face new problems that can be solved more efficiently and more cost-effectively with a NoSQL database. A NoSQL database can be used to solve new problems that require: Scalability A NoSQL database can scale horizontally to the scale required by big data. Applications can run in parallel on a cloud-based cluster comprising of dozens, hundreds, or even thousands of commodity servers. The NoSQL scale-out architecture enables web applications to scale dynamically up to thousands or even millions of users. As the number of concurrent users grows, you can dynamically add more cluster nodes (commodity servers) to process the additional load. Flexibility A schema-less NoSQL database can process and store structured, unstructured and semi-structured data, and enable flexible and rapid development of applications and use cases such as right-time decisioning, recommendations, profile management, bidding, and risk profiling. With RDBMS, you cannot add data (columns) without updating the entire schema, which can take hours or even days, depending on your institutional infrastructure. With NoSQL, data can be added and updated flexibly and efficiently. Speed The parallel processing nature of a NoSQL database, along with caching and aggregation, can provide fast (sub-millisecond) transaction response times at scale. Developer productivity and agility A NoSQL database shortens time to market for new applications and updates because developers don t have to shoehorn data into a fixed schema. With NoSQL, applications can be rapidly prototyped, tested, and deployed into production in a cloud-based cluster. Operational readiness In contrast to RDBMS, which typically require schema migration scripts, significant manual effort, and scheduled downtime for release upgrades, NoSQL schema migrations are relatively easy and have minimal impact on operational readiness. Maintenance windows are compressed, and the performance impact on users is far less noticeable. 9

Why Use Aerospike? Aerospike is a high-performance NoSQL database that enables high throughput with low latency, horizontal scalability, reliability, and operational efficiency. Organizations across a wide range of industries, including ad tech, financial services, ecommerce, online gaming, and telco rely on Aerospike to power revenue-generating, mission-critical systems of engagement. 10

Aerospike has pioneered a hybrid RAM/SSD storage architecture that lets you achieve RAM-like performance at a lower cost by leveraging flash storage (SSDs). This innovative architecture delivers speed at scale, predictable performance, and high availability while dramatically reducing total cost of ownership. Applications that leverage Aerospike typically require as few as one-tenth the number of servers required by most other databases. The NoSQL space is vast and populated by large and well-known companies (e.g., MongoDB, DataStax, and Couchbase) that are investing aggressively to go to market as Oracle replacement technology for non-mission-critical use cases. In contrast, Aerospike is focused on the next phase of the database market, where speed and scale offer companies a competitive advantage. Speed at Scale Low TCO Built for Flash storage Ability to support applications with a very large data set, which increases the accuracy of the decision engine Open source Cross Datacenter Replication (XDR) support Store hundreds of terabytes and access them at speeds measured in milliseconds or sub-milliseconds Predictable performance 11

Next Steps See how much you can save with the Aerospike TCO (Total Cost of Ownership) calculator! Go The Aerospike Quick Start Program is designed to help you fully leverage our fast, reliable, high-performance NoSQL key-value database for real-time workloads. This program allows you to quickly achieve the business benefits that our customers rave about. The Aerospike Quick Start Program includes: Unlimited data usage of the Aerospike Enterprise Edition during a 90-day trial period Virtual Classroom Training: 1 Developer Seat 1 Administration & Operations Seat Cluster sizing and benchmark tool support for optimizing your implementation Learn more 12

2016 Aerospike, Inc. 13