An Agent Based, Dynamic Service System to Monitor, Control and Optimize Distributed Systems. June 2007

Similar documents
Internet2 Meeting September 2005

Grid monitoring with MonAlisa

Monitoring and Control of Large Scale Distributed Systems

MONitoring Agents using a Large Integrated Services Architecture. Iosif Legrand California Institute of Technology

Monitoring the ALICE Grid with MonALISA

MONALISA: A MONITORING FRAMEWORK FOR LARGE SCALE COMPUTING SYSTEMS

Iosif Legrand. California Institute of Technology

ANSE: Advanced Network Services for [LHC] Experiments

Table of Contents. LISA User Guide

DYNES: DYnamic NEtwork System

Adaptive Cluster Computing using JavaSpaces

PROOF-Condor integration for ATLAS

Cisco Configuration Engine 2.0

DIPLOMA PROJECT. Distributed Delay Constrained Multicast Algorithms for the MonALISA Framework

Grid Tutorial Networking

Overview. About CERN 2 / 11

Cluster Setup and Distributed File System

Multi-threaded, discrete event simulation of distributed computing systems

Towards Network Awareness in LHC Computing

Reliable Distributed Messaging with HornetQ

High Throughput WAN Data Transfer with Hadoop-based Storage

ELFms industrialisation plans

Data Transfers Between LHC Grid Sites Dorian Kcira

Functional Requirements for Grid Oriented Optical Networks

Configuring the Oracle Network Environment. Copyright 2009, Oracle. All rights reserved.

BlackBerry Enterprise Server for IBM Lotus Domino Version: 5.0. Administration Guide

Internet data transfer record between CERN and California. Sylvain Ravot (Caltech) Paolo Moroni (CERN)

Scaling Internet TV Content Delivery ALEX GUTARIN DIRECTOR OF ENGINEERING, NETFLIX

Network Design Considerations for Grid Computing

System Requirements. Things to Consider Before You Install Foglight NMS. Host Server Hardware and Software System Requirements

Physicists Set New Record for Network Data Transfer 13 December 2006

Data transfer over the wide area network with a large round trip time

SIMULATION FRAMEWORK FOR MODELING LARGE-SCALE DISTRIBUTED SYSTEMS. Dobre Ciprian Mihai *, Cristea Valentin *, Iosif C. Legrand **

IBM InfoSphere Streams v4.0 Performance Best Practices

Device-Functionality Progression

Chapter 12: I/O Systems. I/O Hardware

Overcoming Obstacles to Petabyte Archives

Global Platform for Rich Media Conferencing and Collaboration

Accelerate Applications Using EqualLogic Arrays with directcache

Measuring the available bandwidth in a network (end to end). Integration in MonALISA

System Description. System Architecture. System Architecture, page 1 Deployment Environment, page 4

IMC Network Traffic Analyzer 7.2 (E0401P04) Copyright 2016 Hewlett Packard Enterprise Development LP

Storage Resource Sharing with CASTOR.

davidklee.net gplus.to/kleegeek linked.com/a/davidaklee

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Scalability Engine Guidelines for SolarWinds Orion Products

Enabling a SuperFacility with Software Defined Networking

LHC and LSST Use Cases

Veeam Cloud Connect. Version 8.0. Administrator Guide

BlackBerry Enterprise Server for Microsoft Office 365. Version: 1.0. Administration Guide

Test Methodology We conducted tests by adding load and measuring the performance of the environment components:

Monitoring ARC services with GangliARC

POLITEHNICA UNIVERSITY OF BUCHAREST THE FACULTY OF COMPUTER SCIENCE AND AUTOMATIC CONTROL DIPLOMA PROJECT DISTRIBUTED ALGORITHMS

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Optimized Distributed Data Sharing Substrate in Multi-Core Commodity Clusters: A Comprehensive Study with Applications

CONTENTS. Cisco Internet Streamer CDS 3.0 Software Configuration Guide iii OL CHAPTER 1 Product Overview 1-1

Xytech MediaPulse Equipment Guidelines (Version 8 and Sky)

Server Specifications

BigData and Map Reduce VITMAC03

Carlos Fernando Gamboa, BNL Andrew Wong, TRIUMF. WLCG Collaboration Workshop, CERN Geneva, April 2008.

FLAT DATACENTER STORAGE. Paper-3 Presenter-Pratik Bhatt fx6568

Chapter 3. Design of Grid Scheduler. 3.1 Introduction

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services

IMC Network Traffic Analyzer 7.3 (E0504) Copyright 2015, 2017 Hewlett Packard Enterprise Development LP

Oracle Enterprise Manager. 1 Before You Install. System Monitoring Plug-in for Oracle Unified Directory User's Guide Release 1.0

A Simulation Model for Large Scale Distributed Systems

Benchmark of a Cubieboard cluster

Xytech MediaPulse Equipment Guidelines (Version 8 and Sky)

Veritas Storage Foundation for Windows by Symantec

How Microsoft Built MySQL, PostgreSQL and MariaDB for the Cloud. Santa Clara, California April 23th 25th, 2018

Status of KISTI Tier2 Center for ALICE

Enterprise print management in VMware Horizon

The EU DataGrid Fabric Management

LUSTRE NETWORKING High-Performance Features and Flexible Support for a Wide Array of Networks White Paper November Abstract

SolarWinds Engineer s Toolset Fast Fixes to Network Issues

Programmable Information Highway (with no Traffic Jams)

High-Energy Physics Data-Storage Challenges

Managing Oracle Real Application Clusters. An Oracle White Paper January 2002

Ivane Javakhishvili Tbilisi State University High Energy Physics Institute HEPI TSU

vcloud Automation Center Reference Architecture vcloud Automation Center 5.2

Symbols INDEX > 12-14

A RESTful Java Framework for Asynchronous High-Speed Ingest

Chapter 13: I/O Systems

CIT 668: System Architecture. Amazon Web Services

SuccessMaker. System Requirements SuccessMaker v1.0

Voldemort. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation

Virtual Security Server

SuccessMaker Learning Management System System Requirements v1.0

Google File System. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google fall DIP Heerak lim, Donghun Koo

Hardware Requirements

Deployment Planning Guide for Cisco Security Manager 4.0

BlackBerry AtHoc Networked Crisis Communication Capacity Planning Guidelines. AtHoc SMS Codes

Globus Online and HPSS. KEK, Tsukuba Japan October 16 20, 2017 Guangwei Che

MapReduce. U of Toronto, 2014

70-414: Implementing an Advanced Server Infrastructure Course 01 - Creating the Virtualization Infrastructure

Developing deterministic networking technology for railway applications using TTEthernet software-based end systems

FuxiSort. Jiamang Wang, Yongjun Wu, Hua Cai, Zhipeng Tang, Zhiqiang Lv, Bin Lu, Yangyu Tao, Chao Li, Jingren Zhou, Hong Tang Alibaba Group Inc

Speeding up Linux TCP/IP with a Fast Packet I/O Framework

Database Services at CERN with Oracle 10g RAC and ASM on Commodity HW

Transcription:

An Agent Based, Dynamic Service System to Monitor, Control and Optimize Distributed Systems June 2007 Iosif Legrand California Institute of Technology 1

The MonALISA Framework An Agent Based, Dynamic Service System to Monitor, Control and Optimize Distributed Systems MonALISA is a Dynamic, Distributed Service System capable to collect any type of information from different systems, to analyze it in near real time and to provide support for automated control decisions and global optimization of workflows in complex grid systems. The MonALISA system is designed as an ensemble of autonomous multi-threaded, self-describing agent-based subsystems which are registered as dynamic services, and are able to collaborate and cooperate in performing a wide range of monitoring tasks. These agents can analyze and process the information, in a distributed way, and to provide optimization decisions in large scale distributed applications. An agent-based architecture provides the ability to invest the system with increasing degrees of intelligence; to reduce complexity and make global systems manageable in real time. For an effective use of distributed resources, these services provide adaptability and self-organization 2

The MonALISA Architecture HL services Regional or Global High Level Services, Repositories & Clients Proxies Agents MonALISA services Network of JINI-Lookup Services Secure & Public Secure and reliable communication Dynamic load balancing Scalability & Replication AAA for Clients Distributed System for gathering and analyzing information based on mobile agents: Customized aggregation, Triggers, Actions Distributed Dynamic Registration and Discovery- based on a lease mechanism and remote events Fully Distributed System with no Single Point of Failure 3

MonALISA service & Data Handling WS Clients and service Web Service WSDL SOAP Data Store Data Cache Service & DB Postgres Registration Lookup Service Data (via ML Proxy) Lookup Service Discovery Applications Collects any type of information Push and Pull Predicates & Agents Configuration Control (SSL) AGENTS FILTERS / TRIGGERS Monitoring Modules Clients or Higher Level Services Dynamic Loading 4

Monitoring Grid sites, Running Jobs, Network Traffic, and Connectivity Running Jobs JOBS TOPOLOGY ACCOUNTING 5

MonALISA CPU Usage (%) 70 60 50 40 30 20 10 0 0 1000 2000 3000 4000 5000 6000 Messages per second ApMon Application Monitoring Lightweight library of APIs (C, C++, Java, Perl, Python) that can be used to send any information to MonALISA Services High comm. performance Flexible Complete Sys. Monitoring No Lost Packages App. Monitoring APPLICATION Time;IP;procID parameter1: value parameter2: value... ApMon App. Monitoring Mbps_out: 0.52 Status: reading MB_inout: 562.4 System Monitoring load1: 0.24 processes: 97 pages_in: 83 MonALISA hosts APPLICATION Config Servlet ApMon ApMon Config UDP/XDR Monitoring Data UDP/XDR Monitoring Data UDP/XDR Monitoring Data dynamic reloading MonALISA Service MonALISA Service ApMon configuration generated automatically by a servlet / CGI script 6

Monitoring the Execution of Jobs and the Time Evolution SPLIT JOBS LIFELINES for JOBS Summit a Job Job1 Job Job Job2 7 DAG Job3 Job 31 Job 32

End User / Client Agent LISA- Localhost Information Service Agent Authorization Service discovery Local detection of the hardware and software configuration Complete end-system monitoring: Per-process load, I/O and network throughputs, End-to-end performance measurements Will act as an active listener for all events related with the requests generated by its local applications. Can execute agents to re-configure the system and rollback 8

LISA Network Monitoring Network monitoring module monitors the network performance of the local workstation (network interfaces, traffic patterns, TCP/IP running stack parameters) and it also able to configure TCP/IP parameters for optimizing the networking performances. It can use IPERF, WEB 100 and other network monitoring tools. 9

10 LISA- Provides an Efficient Integration for Distributed Systems and Applications It is using external services to identify the real IP of the end system, its network ID and AS Discovers MonALISA services and can select, based on service attributes, different applications and their parameters (location, AS, functionality, load ) Based on information such as AS number or location, it determines a list with the best possible services. Registers as a listener for other service attributes (eg. number of connected clients). Continuously monitors the network connection with several selected services and provides the best one to be used from the client s perspective. Measures network quality, detects faults and informs upper layer services to take appropriate decisions MonALISA Application Service MonALISA Application Service Registration MonALISA Application Service Best Service Lookup Service MonALISA Application Service Lookup Service LISA Discovery

Monitoring Internet2 backbone Network Test for a Land Speed Record ~ 7 Gb/s in a single TCP stream from Geneva to Caltech 11

Monitoring USLHCnet Operations & management assisted by agent-based software Used on the new CIENA equipment used for network managment 12

USLHCNet Integrated Traffic 13

The UltraLight Network BNL ESnet IN /OUT 14

Monitoring The GLORIAD Ring 15

Monitoring Network Topology Latency, Routers NETWORKS ROUTERS AS 16

Available Bandwidth Measurements Embedded Pathload module. 17

Monitoring and Controlling Optical Switches Controlling Port power monitoring 18

Monitoring Optical Switches 19

ALICE : Job status & traffic - real-time map 20

Local and Global Decision Framework Based on monitoring information, actions can be taken in ML Service ML Global Services / Repository Actions can be triggered by Values above/below given thresholds Absence/presence of values Correlation between multiple values Decisions types Alerts o e-mail o Instant messaging o RSS Feeds External commands Event logging Global Optimization Services Traffic Connectivity Jobs Hosts Apps Temperature Humidity A/C Power Sensors ML Service Actions based on local information ML Service Local decisions Actions based on global information Global ML Services Global ML Services Global decisions 21

Alerts and actions MySQL daemon is automatically restarted when it runs out of memory Trigger: threshold on VSZ memory usage ALICE Production jobs queue is automatically kept full by the automatic resubmission Trigger: threshold on the number of aliprod waiting jobs Administrators are kept up-to-date on the services status Trigger: presence/absence of monitored information 22

Monitoring Video Conference System: Reflectors and Communication Topology 23

Creating a Dynamic, Global, Minimum Spanning Tree to optimize the connectivity A weighted connected graph G = (V,E) with n vertices and m edges. The quality of connectivity between any two reflectors is measured every 2s. Building in near real time a minimumspanning tree T w ( T ) = ( v, u ) T w (( v, u )) 24

EVO: LISA Detects the Best Reflector for each Client and MonALISA Agents keep the reflectors connected in a MST Dynamic Discovery of Reflectors Creates and maintains, in real-time, the optimal connectivity between reflectors (MST) based on periodic network measurements. Detects and monitor the User configuration, its hardware, the connectivity and its performance. Dynamically connects the client to the best reflector Provides secure administration. It is using alarm triggers to notify unexpected events 25

On-Demand, Dynamic Path Allocation >FDT A/fileX B/path/ OS path available Configuring interfaces Starting Data Transfer CREATES AN END TO END PATH < 1s Internet Real time monitoring APPLICATION Regular IP path MonALISA Distributed Service System DATA A Control Monitor MonALISA Service OS Agent B LISA AGENT LISA sets up - Network Interfaces -TCP stack - Kernel parameters -Routes LISA APPLICATION use eth1.2, 26 TL1 Active light path Optical Switch LISA Agent Detects errors and automatically recreate the path in less than the TCP timeout

FDT Fast Data Transfer FDT is an application for efficient data transfers. Easy to use. Written in java and runs on all major platforms. It is based on an asynchronous, multithreaded system which is using the NIO library and is able to: stream continuously a list of files use independent threads to read and write on each physical device transfer data in parallel on multiple TCP streams, when necessary use appropriate size of buffers for disk IO and networking resume a file transfer session 27

FDT Fast Data Transfer Control connection / authorization Pool of buffers Kernel Space Pool of buffers Kernel Space Data Transfer Sockets / Channels Independent threads per device Restore the files from buffers 28

FDT Memory to Memory Tests in WAN CPUs Dual Core Intel Xenon @ 3.00 GHz, 4 GB RAM, 4 x 320 GB SATA Disks Connected with 10Gb/s Myricom 29

FDT Memory to Memory Tests in WAN C1-NY -> C1-GVA 30

FDT Memory to Memory Tests in WAN C1-NY <-> C1-GVA 31

Memory-to-memory in transfers in WAN in both directions with two pairs of servers: C1-NY -> C1-GVA C2-NY <- C2-GVA 32

Disk -to- Disk transfers in WAN MB/s Page_IN C1-NY -> C1-GVA Read and writes on 4 SATA disks in parallel on each server Mean traffic ~ 210 MB/s ~ 0.75 TB per hour CERN ->CALTECH Read and writes on 2 RAID Controllers in parallel on each server Mean traffic ~ 545 MB/s ~ 2 TB per hour 33

Controlling Optical Planes Automatic Path Recovery USLHCnet CERN Geneva Internet2 Starlight CALTECH Pasadena Manlan FDT Transfer Fiber cut simulations The traffic moves from one transatlantic line to the other one FDT transfer (CERN CALTECH) continues uninterrupted TCP fully recovers in ~ 20s 4 2 3 1 4 Fiber cuts simulations 34

Bandwidth Challenge at SC2005 151 Gbs ~ 500 TB Total in 4h 35

FDT Used at SC 2006 Entire management was done with LISA & MonALISA 36

SC2006 Official BWC Hyper BWC 37

Data Collection and Interfacing with Other Tools MonALISA is interfaced with many monitoring tools and is capable to collect information from different applications: Computing Nodes / Farms (system information, network traffic ) SNMP, Ganglia, dedicated scripts Routers, Switches, Optical Switches SNMP, NetFlow, SFlow, TL1, MRTG, WS End to End Network performance Pathload, IPERF, Pipes, Abing, ABping Batch Queuing Systems LSF, PBS, Condor, NQS, Grid Job Manager Applications Root, Xrootd, CRAB, RRD, VRVS /EVO, 38

Communities using MonALISA Major Communities ALICE OSG CMS STAR VRVS LGC RUSSIA SE Europe GRID APAC Grid UNAM Grid (Mx) ITU ABILENE ULTRALIGHT GLORIAD LHC Net RoEduNET Enlightened 39 MonALISA Today Running 24 X 7 ABILEN at ~340 Sites E ¾Collecting ~ 1 000 000 parameters in near real-time ¾Update rate of 20,000 parameter updates per second ¾Monitoring ¾12,000 computers ¾ > 100 WAN Links ¾Thousands VRVS of Grid jobs running concurrently Iosif Legrand Demonstrated at: VRVS Telecom World ALICE WSIS - 2003 - SC 2004 Internet2 2005 TERENA 2005 OSG IGrid 2005 SC 2005 CHEP 2006 CENIC 2006 Innovation Award for HighJune Performance 2007 Applications

The MonALISA Architecture Provides: Distributed Registration and Discovery for Services and Applications. Monitoring all aspects of complex systems : System information for computer nodes and clusters Network monitoring, topology, end to end performance Monitoring the performance of Applications, Jobs or services The End User Systems, its performance Environment; Video streaming Can interact with any other services to provide in near real-time customized information based on monitoring data http://monalisa.caltech.edu Secure, remote administration for services and applications Agents to supervise applications,, trigger alarms, restart or reconfigure them, and to notify other services when certain conditions are detected. ted. The MonALISA framework is used to develop higher level decision services, implemented as a distributed network of communicating agents, to perform global optimization tasks. Graphical User Interfaces to visualize complex information 40