TALK 1: CONVINCE YOUR BOSS: CHOOSE THE "RIGHT" DATABASE. Prof. Dr. Stefan Edlich Beuth University of Technology Berlin (App.Sc.)

Similar documents
Jargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems

CIB Session 12th NoSQL Databases Structures

Realtime visitor analysis with Couchbase and Elasticsearch

NOSQL DATABASE SYSTEMS: DECISION GUIDANCE AND TRENDS. Big Data Technologies: NoSQL DBMS (Decision Guidance) - SoSe

Course Content MongoDB

NoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems

Accelerate MySQL for Demanding OLAP and OLTP Use Case with Apache Ignite December 7, 2016

Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017

<Insert Picture Here> MySQL Cluster What are we working on

Shen PingCAP 2017

DEMYSTIFYING BIG DATA WITH RIAK USE CASES. Martin Schneider Basho Technologies!

Scaling for Humongous amounts of data with MongoDB

CISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL

How do we build TiDB. a Distributed, Consistent, Scalable, SQL Database

MongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

SEARCHING BILLIONS OF PRODUCT LOGS IN REAL TIME. Ryan Tabora - Think Big Analytics NoSQL Search Roadshow - June 6, 2013

GridGain and Apache Ignite In-Memory Performance with Durability of Disk

Percona Live September 21-23, 2015 Mövenpick Hotel Amsterdam

Apache Ignite TM - In- Memory Data Fabric Fast Data Meets Open Source

Using the MySQL Document Store

MySQL & NoSQL: The Best of Both Worlds

MySQL Cluster Web Scalability, % Availability. Andrew

NewSQL Databases. The reference Big Data stack

Advanced Database Technologies NoSQL: Not only SQL

A Global In-memory Data System for MySQL Daniel Austin, PayPal Technical Staff

A Survey Paper on NoSQL Databases: Key-Value Data Stores and Document Stores

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL

DATABASE SCALE WITHOUT LIMITS ON AWS

Introduction to Oracle NoSQL Database

SQL Azure. Abhay Parekh Microsoft Corporation

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo

MySQL for Database Administrators Ed 3.1

Introduction to NoSQL Databases

The course modules of MongoDB developer and administrator online certification training:

Apache Hadoop Goes Realtime at Facebook. Himanshu Sharma

Accelerating NoSQL. Running Voldemort on HailDB. Sunny Gleason March 11, 2011

There And Back Again

The NoSQL Ecosystem. Adam Marcus MIT CSAIL

Introduction to NoSQL by William McKnight

NoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE. Nicolas Buchschacher - University of Geneva - ADASS 2018

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

Scaling Pinterest. Marty Weiner Level 83 Interwebz Geek

Why we re excited about MySQL 8

Conceptual Modeling on Tencent s Distributed Database Systems. Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc.

Scaling DreamFactory

Couchbase Architecture Couchbase Inc. 1

<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store

NosDB vs DocumentDB. Comparison. For.NET and Java Applications. This document compares NosDB and DocumentDB. Read this comparison to:

VOLTDB + HP VERTICA. page

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

Scaling. Marty Weiner Grayskull, Eternia. Yashh Nelapati Gotham City

Oracle NoSQL Database Enterprise Edition, Version 18.1

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver

How to Scale MongoDB. Apr

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

Big Data Infrastructure CS 489/698 Big Data Infrastructure (Winter 2017)

Module - 17 Lecture - 23 SQL and NoSQL systems. (Refer Slide Time: 00:04)

CMU SCS CMU SCS Who: What: When: Where: Why: CMU SCS

Migrating to Cassandra in the Cloud, the Netflix Way

PROFESSIONAL. NoSQL. Shashank Tiwari WILEY. John Wiley & Sons, Inc.

Oral Questions and Answers (DBMS LAB) Questions & Answers- DBMS

Cassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent

A Non-Relational Storage Analysis

Oracle NoSQL Database Enterprise Edition, Version 18.1

Architecture of a Real-Time Operational DBMS

ICALEPS 2013 Exploring No-SQL Alternatives for ALMA Monitoring System ADC

CISC 7610 Lecture 2b The beginnings of NoSQL

Creating Ultra-fast Realtime Apps and Microservices with Java. Markus Kett, CEO Jetstream Technologies

Design Patterns for Large- Scale Data Management. Robert Hodges OSCON 2013

Database Solution in Cloud Computing

NewSQL. Database Landscape From: the 451 group. OLTP Focus. NewSQL: Flying on ACID. Cloud DB, Winter 2014, Lecture 14

NoSQL Databases. Concept, Types & Use-cases.

Building a high-performance, scalable ML & NLP platform with Python. Sheer El Showk CTO, Lore Ai

1 Big Data Hadoop. 1. Introduction About this Course About Big Data Course Logistics Introductions

NewSQL: Flying on ACID

Modern Database Concepts

Database Availability and Integrity in NoSQL. Fahri Firdausillah [M ]

Putting together the platform: Riak, Redis, Solr and Spark. Bryan Hunt

Relational to NoSQL: Getting started from SQL Server. Shane Johnson Sr. Product Marketing Manager Couchbase

MongoDB 2.2 and Big Data

/ Cloud Computing. Recitation 6 October 2 nd, 2018

Scaling. Yashh Nelapati Gotham City. Marty Weiner Krypton. Friday, July 27, 12

1

DATABASE DESIGN II - 1DL400

SCYLLA: NoSQL at Ludicrous Speed. 主讲人 :ScyllaDB 软件工程师贺俊

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES

10/18/2017. Announcements. NoSQL Motivation. NoSQL. Serverless Architecture. What is the Problem? Database Systems CSE 414

MongoDB An Overview. 21-Oct Socrates

App Engine: Datastore Introduction

Data Lake Based Systems that Work

MySQL Database Administrator Training NIIT, Gurgaon India 31 August-10 September 2015

MySQL in the Cloud: Creating a Market-Leading DBaaS

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Introduction to Hadoop. High Availability Scaling Advantages and Challenges. Introduction to Big Data

Rule 14 Use Databases Appropriately

NoSQL Databases Analysis

CompSci 516 Database Systems

Transcription:

TALK 1: CONVINCE YOUR BOSS: CHOOSE THE "RIGHT" DATABASE Prof. Dr. Stefan Edlich Beuth University of Technology Berlin (App.Sc.)

nosqlfrankfurt.de nosql powerdays

2 years of NoSQL Consulting

http://nosql-database.org

1st idea

+ Scaling= new node + Community + API - Replication - SetUp, Optimize, Management + Scaling= new node + Replication + Configuration (r, w) - Documentation - Query - management

2nd idea

HBase Cassand. DynDB Mongo CouchBS Riak Redis ES schema free Chunks TAB TAB JSON JSON JSON K/V JSON realtime performance mass data 100k 100k scaling ring / shardrepl chunks ring invisible ShRe ring ring repl ShRe self tuning In progress SaaS hard prod / tools why? MMS Why? aggregations? DSS queries Hive,Pig CQL? DSS full text s? DSS filemanagem community APIs ++ - ++ REST support why? docs ++ +/- misc Geo, StoredProcs Geo persist config V3 limited sharding persist config

Analyse your business Answer 70 questions in 6 categories Build a spike / cut sth out Best approach

Now it s you! And your company!

1A Data

1B Storage Model http://pragprog.com/magazines/2012-05/beyond-the-bit-bucket model -query trap

1C Data / Type constraints Data Navigation?

1D Persistence Design With input from highscalability.com

Latency & Request behaviour / distribution? Throughput? High Concurrency?

70% KV 20% SingleTable 10% Joins Typical Queries look like? SQL needed? LINQ needed? BI / Analytic-Tools needed? MapReduce needed? Ad-Hoc Queries needed? Background Data Analytics? Secondary Indices? Range queries? Comlpex Aggregations? ColumnDB needed for Analytics? Views needed?

Architecture : local, parallel, distributed / grid, service, cloud, mobile, p2p, Hosted? Cloud? Local? Datacenter? Data Access Patterns read / write distribution? random / sequential access? Access Design Patterns?

Integration / Tools_ Replication needed? = Rubustness Automatic load balancing, partitioning, and repartitioning? Auto-Scaling needed? Text search integration? Lucene / Solr? Real non-functional Requirements: Refactoring Frequency? 24/7 System? Live add and remove? Developer Qualification DB simplicity? (installation, configuration, development, deployment, upgrade) Company restrictions? DB diversity (allowed?) Security? (authentication, authorization, validation?) Licence Model? Vendor trustworthiness? Community support? Documentation? Company and DB dev in the future?

Costs: DB-Support? (responsiveness, SLA) Costs in general, Scaling Costs Sysadmin costs Operational Costs: (noops) Safety / Backup & Restore Crash Resistance, Disaster Management Monitoring

Failed Ops 67 GB Index Data 11 hours + 1 day off

Good Ops db.server.status()!! Chef, Puppet, etc. MaaS: ServerDensity

Better Ops M:[0,5) HASH KNOTEN REPLIKAT 2 M N,O R:[25,30) Q:[20,25) N:[5,10) O:[10,15) 8 N O,P 10 O P,Q 17 P Q,R 22 Q R,M 26 R M,N W = 2*W R= 1*R P:[15,20) fail save easy extensible perfectly distributed / vnodes

No Ops With NoSQL!

Guess the top 3 blockers for NoSQl or Polyglot Persistence?

Type A: The db kongo Mongo Hadoop

Type A: The db monolith cut a stream out

Conclusion #1: Investa few days in db research & build a prototype is mandatory! Conclusion #2: The world is polyglot!

Thanks for your time! Please contact me: edlich@gmail.com edlich.de

TALK 2: NEWSQL! NOSQL UNDER ATTACK Prof. Dr. Stefan Edlich Beuth University of Technology Berlin (App.Sc.)

Here are the six urban myths that Mr. Stonebraker says NoSQL advocates incorrectly perpetuate: Myth #1: SQL is too slow, so use a lower level interface Myth #2: I like a K-V interface, so SQL is a non-starter Myth #3: SQL systems don t a scale Myth #4: There are no open source, scalable SQL engines Myth #5: ACIDis too slow, so avoid using it Myth #6: in CAP, choose AP over CA

strikes back

451 Group Report / 5.4.2011

Overview

Latest News MySQL Cluster ++ PostGres Native JSON + hstore (K/V) support MySQL 5.6 has a Memcache interface / support: direct K/V access Google / Percona / Twitter MySQL patches

Java Stored Procedures!

RAM with 100.000 ops/snode VoltDB claims to be 100 times faster than MySQL, up to 13 times faster than Cassandra, and 45 times faster than Oracle, with near-linear scaling. (highscalability blog) ACID with partitioned tables Nearly SQL 99 and ALTER &DROP schema changes require Shutdown static query parametrization

Quelle: Pecond MySQL Performance Blog

SSD optimized and disk C t: 10-100 TB ok then weaker 10x faster scaling across cores random access read pattern

QPS on SSD 84.426 14.763 5,5 x faster memcached API more soon no structured data horizontal scaling for nodes

"terabytes of data, billions of objects, and 200K plus transactions per second per node, with sub-millisecond latency." e.g. real-time bidding transactions / ACID linear & elastic horizontal scalable flash/ssd support RTA TM data expiration append list

API: C, C#, Java, Ruby, Python & PHP no master node 200k Ops/secNode read 50k Ops/secNode write

Checkhybrid solutions! easier & better then RDBMS + memcache

Problem: privilege checks, cach queries, connection pooling / thread creation, parsing SQL, open, lock, exec plans, concurrency control, unlock, close, fromdual.com

keep tables open & simple protocol QUELLE: YOSHINORI MATSUNOBU

Performance Transactions Concurrent Access No Cache / Crash-Safe nosql but morethen K/V: ranges, LIMIT, multi_get, no Security new API

200 100 percona.com

Conclusion #1

Conclusion #2 There is no one perfect solution Check hybridsolutions and NewSQLDBs too!

main feed activity feed sessions Must have: Quick K/V lookup => hashes Fit in Memory (17-34) Be persistent 12 sharded quadruple extra-large Instances

Thanks! Contact me please! edlich@gmail.com edlich.de