MSG: An Overview of a Messaging System for the Grid

Similar documents
Real World Messaging With Apache ActiveMQ. Bruce Snyder 7 Nov 2008 New Orleans, Louisiana

Purplefinder Enterprise Platform Messagng with ActiveMQ. Peter Potts 13 th October 2010

The Stream Processor as a Database. Ufuk

Nirvana A Technical Introduction

The ATLAS EventIndex: Full chain deployment and first operation

WebSphere MQ Low Latency Messaging V2.1. High Throughput and Low Latency to Maximize Business Responsiveness IBM Corporation

Introduction to JMS & Apache ActiveMQ

A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System

Frequently asked questions from the previous class survey

CS455: Introduction to Distributed Systems [Spring 2018] Dept. Of Computer Science, Colorado State University

A RESTful Java Framework for Asynchronous High-Speed Ingest

IERG 4080 Building Scalable Internet-based Services

Create High Performance, Massively Scalable Messaging Solutions with Apache ActiveBlaze

JBoss AMQ 7 Technical Deep Dive

Solace JMS Broker Delivers Highest Throughput for Persistent and Non-Persistent Delivery

MOM MESSAGE ORIENTED MIDDLEWARE OVERVIEW OF MESSAGE ORIENTED MIDDLEWARE TECHNOLOGIES AND CONCEPTS. MOM Message Oriented Middleware

Broker Clusters. Cluster Models

CouchDB-based system for data management in a Grid environment Implementation and Experience

IEMS 5722 Mobile Network Programming and Distributed Server Architecture

IBM FileNet Content Manager 5.2. Asynchronous Event Processing Performance Tuning

Microservices, Messaging and Science Gateways. Review microservices for science gateways and then discuss messaging systems.

Monitoring of large-scale federated data storage: XRootD and beyond.

Computer Communication & Networks / Data Communication & Computer Networks Week # 03

Datacenter replication solution with quasardb

Oracle-to-Oracle Data Transfer Options. Sudhanshu Kumar 1 st Feb 2010

Performance and Scalability with Griddable.io

ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective

WLCG Transfers Dashboard: a Unified Monitoring Tool for Heterogeneous Data Transfers.

Improving Data Access of J2EE Applications by Exploiting Asynchronous Messaging and Caching Services

WebLogic JMS System Best Practices Daniel Joray Trivadis AG Bern

TSIN02 - Internetworking

Indirect Communication

<Insert Picture Here> QCon: London 2009 Data Grid Design Patterns

Red Hat AMQ 7.2 Introducing Red Hat AMQ 7

3C05 - Advanced Software Engineering Thursday, April 29, 2004

Building loosely coupled and scalable systems using Event-Driven Architecture. Jonas Bonér Patrik Nordwall Andreas Källberg

<Insert Picture Here> WebLogic JMS Messaging Infrastructure WebLogic Server 11gR1 Labs

<Insert Picture Here> Oracle Coherence & Extreme Transaction Processing (XTP)

Introduction to Protocols for Realtime Data Sharing. Deepti Nagarkar

Data Acquisition. The reference Big Data stack

Overview. Communication types and role of Middleware Remote Procedure Call (RPC) Message Oriented Communication Multicasting 2/36

8/24/2017 Week 1-B Instructor: Sangmi Lee Pallickara

Pimp My Data Grid. Brian Oliver Senior Principal Solutions Architect <Insert Picture Here>

java message service marek konieczny

SJTU 2018 Fall Computer Networking. Wireless Communication

HA solution with PXC-5.7 with ProxySQL. Ramesh Sivaraman Krunal Bauskar

ØMQ and PyØMQ. Simple and Fast Messaging. Brian Granger SciPy 2010

Stanislav Harvan Internet of Things

White Paper. Major Performance Tuning Considerations for Weblogic Server

GoldenGate Zbigniew Baranowski

SPEC Enterprise Java Benchmarks State of the Art and Future Directions

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland

Copyright 2013, Oracle and/or its affiliates. All rights reserved. CON-7777, JMS and WebSocket for Lightweight and Efficient Messaging

TSIN02 - Internetworking

Wasser drauf, umrühren, fertig?

Indirect Communication

TSIN02 - Internetworking

Java Message System. Petr Adámek. April 11 th 2016

Performance Evaluation and Comparison of Distributed Messaging Using Message Oriented Middleware

Data Centers. Tom Anderson

ATLAS Distributed Computing Experience and Performance During the LHC Run-2

0-1 Million in 46 Days Scaling a Facebook Application in Rails

Percona XtraDB Cluster ProxySQL. For your high availability and clustering needs

Transport Layer Protocols TCP

Chapter 4 Communication

Using Apache Beam for Batch, Streaming, and Everything in Between. Dan Halperin Apache Beam PMC Senior Software Engineer, Google

Data Analytics with HPC. Data Streaming

6.033 Lecture 12 3/16/09. Last time: network layer -- how to deliver a packet across a network of multiple links

CS 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring Lecture 21: Network Protocols (and 2 Phase Commit)

A Distributed System Case Study: Apache Kafka. High throughput messaging for diverse consumers

Software Paradigms (Lesson 10) Selected Topics in Software Architecture

Scalable Online Analytics for Monitoring

Last Class: RPCs and RMI. Today: Communication Issues

Java Enterprise Edition

Building LinkedIn s Real-time Data Pipeline. Jay Kreps

Bandwidth Allocation & TCP

Worldwide Production Distributed Data Management at the LHC. Brian Bockelman MSST 2010, 4 May 2010

Quick Start. Scalable Deployers in SDL Web 8.5. Feb 2017 SDL Web. Document owner: Richard Hamlyn

Ultra Messaging Queing Edition (Version ) Guide to Queuing

Data Infrastructure at LinkedIn. Shirshanka Das XLDB 2011

System upgrade and future perspective for the operation of Tokyo Tier2 center. T. Nakamura, T. Mashimo, N. Matsui, H. Sakamoto and I.

High availability with MariaDB TX: The definitive guide

Job Messaging Format (JMF) Tutorial

The Gearman Cookbook OSCON Eric Day Senior Software Rackspace

Interprocess Communication Tanenbaum, van Steen: Ch2 (Ch3) CoDoKi: Ch2, Ch3, Ch5

Outline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems

Vlad Vinogradsky

Putting it together. Data-Parallel Computation. Ex: Word count using partial aggregation. Big Data Processing. COS 418: Distributed Systems Lecture 21

Implementing Replication. Overview of Replication Managing Publications and Subscriptions Configuring Replication in Some Common Scenarios

LabVIEW Communication Techniques for Distributed Applications

CSE 461 Module 10. Introduction to the Transport Layer

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Introduc)on to Apache Ka1a. Jun Rao Co- founder of Confluent

TSIN02 - Internetworking

Architecture and Implementation of a Content-based Data Dissemination System

ICALEPS 2013 Exploring No-SQL Alternatives for ALMA Monitoring System ADC

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES

IBM WebSphere Business Integration Event Broker and Message Broker V5.0

Developing Windows Azure and Web Services

DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN. Chapter 1. Introduction

Transcription:

MSG: An Overview of a Messaging System for the Grid Daniel Rodrigues

Presentation Summary Current Issues Messaging System Testing Test Summary Throughput Message Lag Flow Control Next Steps

Current Issues The current paradigm in the Grid is based on Distributed central services Single points of failure exist within Grid Monitoring Systems (ex: Service Availability Monitoring [SAM]). Reliability on information delivery is often not guaranteed as is not Scalability. Will a Messaging System improve both reliability and scalability?

Messaging System Features of an Messaging System: Flexible architecture: Deliver messages, either in point to point (queue) or multicast mode (topics) Support Synchronous or Asynchronous communication. Reliable delivery of messages: Provide reliability to the senders if required Configurable persistency / Master-Slave. Highly Scalable: Network of Brokers ActiveMQ is an Open Source message broker providing these and many other features.

Testing From ActiveMQ documentation: Evaluated Parameters: Performance 1) Number differs of Producers greatly depending on many different factors: - the network 2) Number topology of Consumers - transport 3) Message protocols Size used - quality 4) Message of service Number - hardware, network, JVM and operating system - number Measurement of producers, of timestamps: number of consumers - distribution 1)Message of messages Sent across destinations along with message size 2)Message on Broker 3)Message Received We believe it does change, but how? Results analysis: 1)Logs containing all information for each message 2)From logs, extract messages/second 3) and messagelag

Test Summary Broker statistics: Running for 6 weeks with no crashes 50 Million messages of various sizes (0 to 10 kb) forwarded to consumers 12 Million incoming messages from producers Up to 40 Producers and 80 Consumers connected at the same time Stable under highly irregular test pattern: Number of clients change Frequent client process kills Daily number of tests vary

Results : Throughput > Consumers > Throughput?? Consumer Bottleneck! With a larger number of producers, even more messages per second saturating the consumer.

Results : Flow Control Effective flow control Flow Control balances load on producers and consumers

Results : Message Lag 3 Consumers Memory Overflow ( Slow consumers ) Test Run batch change Max 180 s!

Next Steps Scalability in a distributed environment Network of Brokers Testing optimized wire protocols (OpenWire) Evaluation under real world use cases SAM 1 Consumer ~ 300 Producers per VO 15 (~2k) messages / second Prototype already in place for OSG Atlas 10 Producers ~ 100 Consumers Streaming of messages with 200 B each Persistence required

Thank you for your attention.

Support Slides

Messages/s / Consumer (1290B) (390B) (290 B) (10290B)

Total Messages/s

Kb/s per consumer

SAM-MSG integration overview ( Piotr Nyczyk ) Test results are published both from the framework and directly from test jobs executing in grid sites MSG-consumer is using transport views in Oracle DB (see later) 16

SAM MSG: Publishing side Firewall and network issues - test jobs running on Worker Nodes solution: using HTTP protocol (REST) with http_proxy if available robust publisher: list of broker URLs (STOMP/REST), the first one that responds is used to publish requirements: message servlet installed on the broker machine Tested with a typical SAM load for 1 VO message rate: 1 to 10 messages/second published from many short-lived producers ~300 machines (producers) publishing at the same time ~15 messages for each producer prototype setup with 1 broker (gridmsg001) Currently used for OSG monitoring integration with SAM 17

SAM MSG: Consumer side Generic consumer written in Python: durable subscription (no data loss in case of producer downtime) message classes based on WLCG MW Probe Format: key-value pairs trivial transformation to SQL inserts: message class - table (name mapping) attribute (key) - column On the Oracle DB side: a view for each message class with exactly the same columns as the attributes PL/SQL code in INSTEAD OF INSERT trigger to do the ID look-ups and actual insert(s) into underlying tables 18