ICALEPS 2013 Exploring No-SQL Alternatives for ALMA Monitoring System ADC

Similar documents
Managing IoT and Time Series Data with Amazon ElastiCache for Redis

Database Solution in Cloud Computing

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

CIB Session 12th NoSQL Databases Structures

NoSQL Databases Analysis

Jargons, Concepts, Scope and Systems. Key Value Stores, Document Stores, Extensible Record Stores. Overview of different scalable relational systems

Cassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent

SCALABLE DATABASES. Sergio Bossa. From Relational Databases To Polyglot Persistence.

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

MongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM

10 Million Smart Meter Data with Apache HBase

Relational databases

Beyond Relational Databases: MongoDB, Redis & ClickHouse. Marcos Albe - Principal Support Percona

PROFESSIONAL. NoSQL. Shashank Tiwari WILEY. John Wiley & Sons, Inc.

Redis to the Rescue? O Reilly MySQL Conference

Chapter 24 NOSQL Databases and Big Data Storage Systems

Course Content MongoDB

GridGain and Apache Ignite In-Memory Performance with Durability of Disk

Hadoop 2.x Core: YARN, Tez, and Spark. Hortonworks Inc All Rights Reserved

Goal of the presentation is to give an introduction of NoSQL databases, why they are there.

Accelerate MySQL for Demanding OLAP and OLTP Use Cases with Apache Ignite. Peter Zaitsev, Denis Magda Santa Clara, California April 25th, 2017

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

Evolution of Database Systems

Introduction to NoSQL Databases

Kim Greene - Introduction

Scalability of web applications

Distributed Databases: SQL vs NoSQL

NoSQL Databases An efficient way to store and query heterogeneous astronomical data in DACE. Nicolas Buchschacher - University of Geneva - ADASS 2018

ApsaraDB for Redis. Product Introduction

Scaling. Marty Weiner Grayskull, Eternia. Yashh Nelapati Gotham City

Conceptual Modeling on Tencent s Distributed Database Systems. Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc.

NOSQL EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

Scaling. Yashh Nelapati Gotham City. Marty Weiner Krypton. Friday, July 27, 12

How do we build TiDB. a Distributed, Consistent, Scalable, SQL Database

CS 655 Advanced Topics in Distributed Systems

High-Level Data Models on RAMCloud

Making MongoDB Accessible to All. Brody Messmer Product Owner DataDirect On-Premise Drivers Progress Software

Migrating Oracle Databases To Cassandra

Architekturen für die Cloud

CISC 7610 Lecture 5 Distributed multimedia databases. Topics: Scaling up vs out Replication Partitioning CAP Theorem NoSQL NewSQL

Final Exam Review 2. Kathleen Durant CS 3200 Northeastern University Lecture 23

8/24/2017 Week 1-B Instructor: Sangmi Lee Pallickara

How to Scale MongoDB. Apr

Introduction to Database Services

MONGODB INTERVIEW QUESTIONS

Integrating Oracle Databases with NoSQL Databases for Linux on IBM LinuxONE and z System Servers

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2015 Lecture 14 NoSQL

Dr. Chuck Cartledge. 3 Dec. 2015

MySQL CLUSTER. Low latency for a real-time user experience; 24 x 7 availability for continuous service uptime;

Challenges for Data Driven Systems

COSC Redis. Paul Moore, Stephen Smithbower, William Lee. March 11, 2013

Extreme Computing. NoSQL.

Performance Comparison of NOSQL Database Cassandra and SQL Server for Large Databases

SQL, NoSQL, MongoDB. CSE-291 (Cloud Computing) Fall 2016 Gregory Kesden

Next-Generation Cloud Platform

Middle East Technical University. Jeren AKHOUNDI ( ) Ipek Deniz Demirtel ( ) Derya Nur Ulus ( ) CENG553 Database Management Systems

CSC 261/461 Database Systems Lecture 20. Spring 2017 MW 3:25 pm 4:40 pm January 18 May 3 Dewey 1101

Overview. * Some History. * What is NoSQL? * Why NoSQL? * RDBMS vs NoSQL. * NoSQL Taxonomy. *TowardsNewSQL

PNUTS: Yahoo! s Hosted Data Serving Platform. Reading Review by: Alex Degtiar (adegtiar) /30/2013

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo

Shen PingCAP 2017

In-Memory Data processing using Redis Database

<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store

Big Data Infrastructure CS 489/698 Big Data Infrastructure (Winter 2017)

A Fast and High Throughput SQL Query System for Big Data

The course modules of MongoDB developer and administrator online certification training:

CompSci 516 Database Systems

Distributed Data Store

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

MongoDB Schema Design

Administration Naive DBMS CMPT 454 Topics. John Edgar 2

VOLTDB + HP VERTICA. page

NoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Rule 14 Use Databases Appropriately

NosDB vs DocumentDB. Comparison. For.NET and Java Applications. This document compares NosDB and DocumentDB. Read this comparison to:

MongoDB Architecture

DISTRIBUTED DATABASE OPTIMIZATIONS WITH NoSQL MEMBERS

Realtime visitor analysis with Couchbase and Elasticsearch

International Journal of Informative & Futuristic Research ISSN:

CISC 7610 Lecture 2b The beginnings of NoSQL

Topics. History. Architecture. MongoDB, Mongoose - RDBMS - SQL. - NoSQL

What s new in Mongo 4.0. Vinicius Grippa Percona

MySQL CLUSTER. Low latency for a real-time user experience. 24 x 7 availability for continuous service uptime

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples

Accelerate Applications Using EqualLogic Arrays with directcache

New Oracle NoSQL Database APIs that Speed Insertion and Retrieval

Scaling for Humongous amounts of data with MongoDB

Achieving Horizontal Scalability. Alain Houf Sales Engineer

4 Myths about in-memory databases busted

Scaling DreamFactory

Design Patterns for Large- Scale Data Management. Robert Hodges OSCON 2013

CS November 2018

CS November 2017

<Insert Picture Here> Value of TimesTen Oracle TimesTen Product Overview

Accelerate MySQL for Demanding OLAP and OLTP Use Case with Apache Ignite December 7, 2016

MongoDB - a No SQL Database What you need to know as an Oracle DBA

Kaladhar Voruganti Senior Technical Director NetApp, CTO Office. 2014, NetApp, All Rights Reserved

MySQL Cluster Web Scalability, % Availability. Andrew

Distributed PostgreSQL with YugaByte DB

Transcription:

ICALEPS 2013 Exploring No-SQL Alternatives for ALMA Monitoring System

Overview The current paradigm (CCL and Relational DataBase) Propose of a new monitor data system using NoSQL Monitoring Storage Requirements The Workaround Data Flow NoSQL Relational DataBase vs NoSQL Monitoring System - High Software Level Design Monitoring System - Used Software Tools RedisIO MongoDB Monitoring System - Real Time Data in Web Graphics Monitoring System - Archive Files in Web Interface Monitoring System - Historical Data in Web Interface Conclusions

The current paradigm (CCL)

The current paradigm (Relational DataBase)

Propose of the a new monitor data system using NoSQL Objectives: 1. Provide Data in Real Time of monitor points of antenna's devices using Web Graphics 2. Provide Text Files of monitor points of antenna's devices using Web Interface 3. Provide Historical Retrieval Data of monitor points of antenna's devices using Web Interface

Monitoring Storage Requirement 66 antennas + CentralLO's + CORR's devices + WeatherStationController + AOSTiming Total date rate: ~6.000 7.000 clobs/s ~82.9 clobs/s/antenna # Monitor Points per antenna type DV/DA: 2.179 CM: 2.438 PM: 2.474 Total monitor points: ~130.000 150.000 Currents Size (MB) of Monitor Data per antenna type (daily) DV: 241 DA: 246 CM: 301 PM: 296 Current daily monitoring data size: ~25-30 GB (in ~120k files)

The Workaround Data Flow

NoSQL A NoSQL database provides a mechanism for storage and retrieval of data that uses looser consistency models rather than traditional relational databases. Motivations for this approach include simplicity of design, horizontal scaling and finer control over availability. NoSQL databases are often highly optimized key value stores intended for simple retrieval and appending operations, with the goal being significant performance benefits in terms of latency and throughput NoSQL database find significant and growing industry use in big data and real-time web applications

Relational DataBase vs NoSQL Relational DataBase Relational database management systems are transaction-based and have ACID (Atomicity-Consistency-Isolation- Durability) rules Data in the relational database management systems is located on fixed tables and columns SQL query is used in Relational database systems Disintegration of data by primary key is not compulsory in relational database management systems NoSQL NoSQL systems do not fully support the ACID (Atomicity-Consistency-Isolation- Durability) rules and there is no transaction concept in many NoSQL systems NoSQL systems are not dependent on fixed tables and columns SQL query is not used in NoSQL systems NoSQL systems access the data over primary keys

Monitoring System High Software Level Design

Monitoring System Used Software Tools Apache ActiveMQ (http://activemq.apache.org/) Redis IO (http://http://redis.io) MongoDB (http://www.mongodb.org) Log4j (http://logging.apache.org/log4j/1.2) Apache Commons Pool (http://commons.apache.org/pool) SpringSource (http://www.springsource.org) Junit (https://github.com/kentbeck/junit/wiki) Pyramide (http://docs.pylonsproject.org/en/latest/index.html) WebSockets (http://www.websocket.org) HighCharts (http://www.highcharts.com)

Redis IO Redis is an open source, BSD licensed, advanced key-value store. It is often referred to as a data structure server since keys can contain strings, hashes, lists, sets, sorted sets and channels. In order to achieve its outstanding performance, Redis works with an in-memory dataset. Depending on your use case, you can persist it either by dumping the dataset to disk every once in a while, or by appending each command to a log. Redis also supports trivial-to-setup master-slave replication, with very fast non-blocking first synchronization, auto-reconnection on net split and so forth. Other features include Transactions, Pub/Sub, Lua scripting, Keys with a limited time-to-live, and configuration settings to make Redis behave like a cache. You can use Redis from most programming languages out there. Redis is written in ANSI C and works in most POSIX systems like Linux, *BSD, OS X without external dependencies. Linux and OSX are the two operating systems where Redis is developed and more tested, and we recommend using Linux for deploying. Redis may work in Solaris-derived systems like SmartOS, but the support is best effort. There is no official support for Windows builds, but Microsoft develops and maintains a Win32-64 experimental version of Redis.

Redis IO - Scalability Writes Master Redis Slave Redis DataBase Tier Client Application Tier Reads D1 D2 D3... DN Slave Redis Provide data service

Redis IO - Used Lists

Redis IO - Used Channels

Mongo DB MongoDB is a document database that provides high performance, high availability, and easy scalability. Document Database Documents (objects) map nicely to programming language data types. Embedded documents and arrays reduce need for joins. Dynamic schema makes polymorphism easier. High Performance Embedding makes reads and writes fast. Indexes can include keys from embedded documents and arrays. Optional streaming writes (no acknowledgments). High Availability Replicated servers with automatic master failover. Easy Scalability Automatic sharding distributes collection data across machines. Eventually-consistent reads can be distributed over replicated servers.

MongoDB - Scalability

MongoDB One monitor point per document

MongoDB A clob per document

MongoDB A monitor point per day per document

Monitoring System Real Time Data in Web Graphics Video is Pending

Monitoring System Archive Files in Web Interface

Monitoring System Historical Data in Web Interface Video is Pending

Conclusions NoSQL is a perfect paradigm for store big and heterogeneous data. such as ALMA monitoring data RedisIO is an appropriate key-value store for cache storage of ALMA monitoring data Redis Lists are well designed for put/get a lot of values of monitoring data in blocks for future processes Redis Channels are well designed for publishers/subscribers of events in Real Time MongoDB is a suitable document oriented alternative for permanent storage of ALMA monitoring data A monitor point per day per document is the best option for extract all needed data in few milliseconds