System for Large Computing Clusters

Size: px
Start display at page:

Download "System for Large Computing Clusters"

Transcription

1 (Towards) A Scalable Monitoring System for Large Computing Clusters Moreno Marzolla marzolla@dsi.unive.it Web: Dip. Informatica, Università di Venezia

2 Talk Outline System Monitoring Definition Tassonomy Distributed Systems Monitoring Case Study: the BaBar Computing Farm ASC: A prototype of an SNMP Based monitoring application Conclusions Moreno Marzolla 2

3 Monitoring A monitor is a tool used to observe the activities on a system Collect performance statistics Analyze the data Display the results. Why? Identify frequently used portions of a program; Measure resource utilization to find performance bottlenecks; Characterize the Workload; Find model parameters, validate models, or develop inputs for a model. Moreno Marzolla 3

4 Monitor Classification Based on the level at which it is implemented Software Hardware Firmware Hybrid Based on the triggering mechanism Event driven Sampling based Based on the display ability On line Batch Moreno Marzolla 4

5 Hw versus Sw Monitors Input Rate Much higher on Hw monitors; Depends on CPU power for Sw ones. Time Resolution Higher on Hw monitors; Overhead None for Hw monitors. Can be an issue for Sw monitors. Portability Architecture and/or OS dependent? Moreno Marzolla 5

6 Event Driven vs Sampling Based Event Driven monitors collect data when triggered by some condition Pros: No polling required. Only "interesting" notifications are sent Cons: What happens when the triggering mechanism crashes? Sampling Based require periodic queries Pros: Can detect crashes of the monitored entity. Sampling rate can be changed "outside" the monitored system. Cons: Can overload the medium used to transport the requests Moreno Marzolla 6

7 Distributed Systems Monitoring To monitor a distributed system, the monitor needs itself to be (at least partly) distributed. Monitor Observer Observer Observer Observer Host 1 Host 2 Host 3 Host 4 Moreno Marzolla 7

8 Layered View of DS monitors Management Console Interpretation Presentation Analysis Collection Observation The entity deciding to change the parameters using the console Provides an interface to change the parameters and states The user who interprets the results User interface: provides reports, displays, alarms... Statistical routines to summarize the characteristics of the data Collects data from various observers Gathers raw data on individual components of the system Moreno Marzolla 8

9 Issues of DS monitoring Scalability Can it monitor 10 hosts? 100 hosts? 1000 hosts?... Latency Long delays make real time, on line monitoring a challenge Consistency Are you getting a consistent view of the system s status, despite notification delays? Moreno Marzolla 9

10 BaBar Case Study BaBar is a High Energy Physics experiment, studying matter antimatter asimmetry. Only a small fraction of events has real information ( ) Large amounts of data needed Large computing facilities are required Moreno Marzolla 10

11 BaBar INFN Padova 150 2xPIII 1.26GHz Machines, 1GB Ram, RH Linux 7.2 Tape Library with a capacity of ~70TB not compressed; Network switch, UPSes, Environmental conditioning system,... Moreno Marzolla 11

12 Monitoring Requirements Hardware Status Machine Crashes, disk crashes, CPU temperatures, disk/partitions overflows... Processes status Environmental conditions Humidity, Temperature, UPS status... System administrators should be notified as soon as a problem occurs Some automatic action should be taken when possibile (eg, shut down the machines if overheated). Moreno Marzolla 12

13 Other requirements The monitoring system should also be: Scalable Efficient (little resources requirement) Flexible and customizable Easy to configure Should be able to operate in batch mode (as a regular UNIX dæmon, no GUI) Should be able to observe different quantities with different granularities eg, CPU utilization sampled every 5 seconds, CPU temperature sampled every minute,... Moreno Marzolla 13

14 Some existing monitoring tools There are many of them: MRTG Ngop isd.fnal.gov/ngop/ Netsaint/Nagios Ganglia RemStats Cricket GxSNMP (add your own here) Moreno Marzolla 14

15 So, what s wrong? We examined some publicly available monitoring tools. Unfortunately, many of them were not suited to us: Not scalable Require their own dæmons running on the monitored hosts Can t install a dæmon on a network switch, or on a tape library Hard to configure In many cases we gave up without trying the program. Poorly implemented "...My God, it s full of scripting languages..." Moreno Marzolla 15

16 Architectural sketch User Interface User Interface User Interface Monitor Collector Collector Observer Observer Observer Observer Observer Hardware Hardware Hardware Hardware Hardware Moreno Marzolla 16

17 The Observer The Observer must be able to collect statistics on any networked equipment We decided to use SNMP (Simple Network Management Protocol) It is a well known protocol It is implemented by virtually every vendor It is reasonably simple yet powerful A very good open source implementation is available on Unix/Linux platforms snmp.sourceforge.net/ Moreno Marzolla 17

18 More on SNMP GetRequest Management Application GetNextRequest SetRequest SNMP UDP IP GetResponse Trap Network Protocol SNMP Messages LAN/WAN Managed Resources SNMP Managed Objects GetRequest GetNextRequest SetRequest GetResponse Trap SNMP UDP IP Network Protocol W. Stallings, SNMP, SNMPv2, SNMPv3 and RMON 1 and 2, 3rd edition, p. 81 Moreno Marzolla 18

19 The Collector/Monitor Is being developed Features: Asynchronous (nonblocking) parallelized SNMP Polling; XML based configuration file; The RRDTool package is used to store data and produce graphs Old data have lower resolution than recent ones. Round Robin Databases have known maximum size. Graphing capabilities are provided by the library. Dynamic generation of HTML pages using XSLT stylesheets. Moreno Marzolla 19

20 Architecture XML Configuration File <?xml version="1.0" Status Stylesheets XSLT Stylesheet 1 HTML Pages HTML Page 1 standalone="no"?> <!DOCTYPE monitor SYSTEM "monitor.dtd"> <monitor>... </monitor> ASC HTTPD XSLT Stylesheet 2 HTML Page 2 Host 1 Host 2 Host n Monitored Hosts Moreno Marzolla 20

21 Example of XML Configuration File <?xml version="1.0" standalone="no"?> <!DOCTYPE monitor SYSTEM "monitor.dtd"> <monitor numconnections="20" asclogfile="/monitor/asc.log" httpdlogfile="/dev/null" rrddir="/monitor" htmldir="/monitor/html" ascverbosity="3" > <host name="localhost"> <description>this machine</description> <miblist> <mib id="cpuuser" name=" " type="counter"> <archives> <rra cf="average" granularity="60" expire="604800"/> </archives> </mib> </miblist> <graphs> <! Graph definitions here > </graphs> </host> </monitor> Moreno Marzolla 21

22 Example of XML Status Dump <?xml version="1.0"?> <hosts> <host name="localhost" status="nr"> <mibs> <mib id="availswap" lastupdated=" "> </mib> <mib id="totalswap" lastupdated=" "> </mib> <mib id="totalmem" lastupdated=" "> </mib> <mib id="cachedmem" lastupdated=" "> </mib> <mib id="buffermem" lastupdated=" "> </mib> <mib id="sharedmem" lastupdated=" "> </mib> <mib id="freemem" lastupdated=" "> </mib> <mib id="cpusystem" lastupdated=" "> </mib> <mib id="cpuuser" lastupdated=" "> </mib> <mib id="tempcpu2" lastupdated=" "> </mib> <mib id="tempcpu1" lastupdated=" "> </mib> <mib id="tempmb" lastupdated=" "> </mib> </mibs> <graphs> <graph id="hourly.png" title="hourly data"/> </graphs> <notifications> <msg ts=" :11: " severity="critical"> Timeout </msg> </notifications> </host> </hosts> Moreno Marzolla 22

23 Sample HTML Output Moreno Marzolla 23

24 What happens if it does not scale? "Hierarchize" ASC? HTTPD Proxy HTTPD ASC HTTPD ASC SNMP Agent SNMP Agent SNMP Agent SNMP Agent Host 1 Host 2 Host 3 Host 4 Moreno Marzolla 24

25 Hierarchical Monitoring A hierarchical configuration of SNMP agents has already been shown to scale Rajesh Subramanyan, José Miguel Alonso and José A. B. Fortes, "A Scalable SNMP Based Distributed Monitoring System for Heterogeneous Network Computing", Proc. SC2000. Dallas, Texas, Nov What about fault tolerance? "Well written applications do not crash" is right, but machines running them crash anyway... Well known leader election techniques could be applied Redundant monitoring? Moreno Marzolla 25

26 Redundancy for fault tolerance Additional collectors could be used to ensure that at least K out of N of them are sufficient to cover the whole system Collector 1 Collector 1 Host 1 Host 2 Host 3 Host 4 Host 5 Host 6 Host 7 Host 8 Host 9 Collector 1 Any 2 collectors monitor all the hosts Moreno Marzolla 26

27 Conclusions Monitoring a large computing cluster is a highly nontrivial task. Many available monitoring tools exist, but many of them are not adequate for large distributed systems. We are trying to build a general purpose SNMP and XML based monitoring tool. A prototype exists and is working well...but see next slide Moreno Marzolla 27

28 Future work Alarms are not currently implemented, but are at the top position of the to do list ASC is running on a small ( 20 machines) test farm, while the bigger farm is being installed. ASC is expected to scale; however, serious scalability experiments will be done as soon as all the machines are up and running. Could it be extended for monitoring Grids? Moreno Marzolla 28

29 Bibliography W. Stallings, SNMP, SNMPv2, SNMPv3 and RMON 1 and 2, third edition, Addison Wesley, 1999 R. Jain, The art of computer systems performance analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling, John Wiley and Sons, 1991 Rajesh Subramanyan, José Miguel Alonso and José A. B. Fortes, A Scalable SNMP Based Distributed Monitoring System for Heterogeneous Network Computing, Proc. SC2000. Dallas, Texas, Nov Grid Performance Working Group, didc.lbl.gov/gridperf/ M. Bearden and R. Bianchini Jr., Efficient and Fault Tolerant Distributed Host Monitoring using System Level Diagnosis, Proceedings of the IFIP/IEEE International Conference on Distributed Platforms, Dresden, Germany, pp , February Flaviu Cristian, Understanding fault tolerant distributed systems, Comm. ACM 34(2), 1991 Moreno Marzolla 29

A Performance Monitoring System for Large Computing Clusters

A Performance Monitoring System for Large Computing Clusters A Performance Monitoring System for Large Computing Clusters Moreno Marzolla marzolla@dsi.unive.it http://www.dsi.unive.it/~marzolla Dip. Informatica, Università Ca' Foscari di Venezia and Istituto Nazionale

More information

A Monitoring System for the BaBar INFN Computing Cluster

A Monitoring System for the BaBar INFN Computing Cluster A Monitoring System for the BaBar INFN Computing Cluster Moreno Marzolla Università Ca Foscari di Venezia, 30172 Mestre, Italy/INFN Padova, 35100 Padova, Italy Valerio Melloni Universitá di Ferrara, 44100

More information

Introduction to Systems and Network Management

Introduction to Systems and Network Management Introduction to Systems and Network Management Shang Juh Kao Dept. of Computer Science and Engineering National Chung Hsing University Tel: 04-2284-0497 x 708 Email: sjkao@cs.nchu.edu.tw 1 This course

More information

Distributed Simulation of Large Computer Systems

Distributed Simulation of Large Computer Systems Distributed Simulation of Large Computer Systems Moreno Marzolla Univ. di Venezia Ca Foscari Dept. of Computer Science and INFN Padova Email: marzolla@dsi.unive.it Web: www.dsi.unive.it/ marzolla Moreno

More information

SNMP Basics BUPT/QMUL

SNMP Basics BUPT/QMUL SNMP Basics BUPT/QMUL 2014-05-12 Agenda Brief introduction to Network Management Brief introduction to SNMP SNMP Network Management Framework RMON New trends of network management Summary 2 Brief Introduction

More information

NET311 Computer Network Management Tools, Systems and Engineering

NET311 Computer Network Management Tools, Systems and Engineering NET311 Computer Network Management Tools, Systems and Engineering Dr. Mostafa H. Dahshan Department of Computer Engineering College of Computer and Information Sciences King Saud University mdahshan@ksu.edu.sa

More information

SNMP Basics BUPT/QMUL

SNMP Basics BUPT/QMUL SNMP Basics BUPT/QMUL 2017-05-22 Agenda Brief introduction to Network Management Brief introduction to SNMP SNMP Network Management Framework RMON New trends of network management Summary 2 Brief Introduction

More information

Network Management Standards Architectures & Applications. Network Management

Network Management Standards Architectures & Applications. Network Management Network Management Standards Architectures & Applications Network Management 1 Lectures Schedule Week Week 1 Topic Computer Networks - Network Management Architectures & Applications Week 2 Network Management

More information

SNMP: Simplified. White Paper by F5

SNMP: Simplified. White Paper by F5 The Simple Network Management Protocol defines a method for managing devices that connect to IP networks. The "simple" in SNMP refers to the requirements for a managed device, not the protocol. This white

More information

Network Management. Stuart Johnston 13 October 2011

Network Management. Stuart Johnston 13 October 2011 Network Management Stuart Johnston stuart.johnston@inmon.com 13 October 2011 Slides from: Computer Networking: A Top Down Approach, 4th edition. Jim Kurose, Keith Ross Addison-Wesley, July 2007 All material

More information

SNMP SIMULATOR. Description

SNMP SIMULATOR. Description SNMP SIMULATOR Overview The SNMP Agent Simulator enables simulation of standalone SNMP agents to test and demonstrate SNMP-based management applications. Its unique ability to create default values from

More information

Chapter 9. introduction to network management. major components. MIB: management information base. SNMP: protocol for network management

Chapter 9. introduction to network management. major components. MIB: management information base. SNMP: protocol for network management Chapter 9 Network Management A note on the use of these ppt slides: We re making these slides freely available to all (faculty, students, readers). They re in PowerPoint form so you can add, modify, and

More information

Network Management. Stuart Johnston 08 November 2010

Network Management. Stuart Johnston 08 November 2010 Network Management Stuart Johnston stuart.johnston@inmon.com 08 November 2010 Slides from: Computer Networking: A Top Down Approach, 4th edition. Jim Kurose, Keith Ross Addison-Wesley, July 2007 All material

More information

PLANEAMENTO E GESTÃO DE REDES INFORMÁTICAS COMPUTER NETWORKS PLANNING AND MANAGEMENT

PLANEAMENTO E GESTÃO DE REDES INFORMÁTICAS COMPUTER NETWORKS PLANNING AND MANAGEMENT Mestrado em Engenharia Informática e de Computadores PLANEAMENTO E GESTÃO DE REDES INFORMÁTICAS COMPUTER NETWORKS PLANNING AND MANAGEMENT 2010-2011 Arquitecturas de Redes 3 Gestão de Redes e Serviços -

More information

access addresses/addressing advantages agents allocation analysis

access addresses/addressing advantages agents allocation analysis INDEX A access control of multipath port fanout, LUN issues, 122 of SAN devices, 154 virtualization server reliance on, 173 DAS characteristics (table), 19 conversion to SAN fabric storage access, 105

More information

A Resource Look up Strategy for Distributed Computing

A Resource Look up Strategy for Distributed Computing A Resource Look up Strategy for Distributed Computing F. AGOSTARO, A. GENCO, S. SORCE DINFO - Dipartimento di Ingegneria Informatica Università degli Studi di Palermo Viale delle Scienze, edificio 6 90128

More information

SOFT 437. Software Performance Analysis. Ch 7&8:Software Measurement and Instrumentation

SOFT 437. Software Performance Analysis. Ch 7&8:Software Measurement and Instrumentation SOFT 437 Software Performance Analysis Ch 7&8: Why do we need data? Data is required to calculate: Software execution model System execution model We assumed that we have required data to calculate these

More information

CS533 Modeling and Performance Evaluation of Network and Computer Systems

CS533 Modeling and Performance Evaluation of Network and Computer Systems CS533 Modeling and Performance Evaluation of Network and Computer Systems Monitors (Chapter 7) 1 Monitors That which is monitored improves. Source unknown A monitor is a tool used to observe system Observe

More information

SNMP. Simple Network Management Protocol Philippines Network Operators Group, March Jonathan Brewer Telco2 Limited New Zealand

SNMP. Simple Network Management Protocol Philippines Network Operators Group, March Jonathan Brewer Telco2 Limited New Zealand SNMP Simple Network Management Protocol Philippines Network Operators Group, March 2018 Jonathan Brewer Telco2 Limited New Zealand Objectives Participants will understand the basics of: SNMP Architecture

More information

Reliable Distribution of Data Using Replicated Web Servers

Reliable Distribution of Data Using Replicated Web Servers Reliable Distribution of Data Using Replicated Web Servers Moreno Marzolla Dipartimento di Informatica Università Ca' Foscari di Venezia via Torino 155, 30172 Mestre (ITALY) marzolla@dsi.unive.it Talk

More information

Cisco Configuration Engine 2.0

Cisco Configuration Engine 2.0 Cisco Configuration Engine 2.0 The Cisco Configuration Engine provides a unified, secure solution for automating the deployment of Cisco customer premises equipment (CPE). This scalable product distributes

More information

Operating Systems. Lecture 09: Input/Output Management. Elvis C. Foster

Operating Systems. Lecture 09: Input/Output Management. Elvis C. Foster Operating Systems 141 Lecture 09: Input/Output Management Despite all the considerations that have discussed so far, the work of an operating system can be summarized in two main activities input/output

More information

Understanding SNMP. Rab Nawaz Jadoon DCS. Assistant Professor COMSATS University, Abbottabad Pakistan. Department of Computer Science

Understanding SNMP. Rab Nawaz Jadoon DCS. Assistant Professor COMSATS University, Abbottabad Pakistan. Department of Computer Science Understanding SNMP Rab Nawaz Jadoon DCS COMSATS Institute of Information Technology Assistant Professor COMSATS University, Abbottabad Pakistan Motivation In small networks with only a few devices confined

More information

Configuring SNMP. Understanding SNMP CHAPTER

Configuring SNMP. Understanding SNMP CHAPTER 22 CHAPTER This chapter describes how to configure the Simple Network Management Protocol (SNMP) on the Catalyst 3750 switch. Unless otherwise noted, the term switch refers to a standalone switch and a

More information

Chapter 5 Network Layer: The Control Plane

Chapter 5 Network Layer: The Control Plane Chapter 5 Network Layer: The Control Plane A note on the use of these Powerpoint slides: We re making these slides freely available to all (faculty, students, readers). They re in PowerPoint form so you

More information

N E T W O R K M A N A G E M E N T P R I N C I P L E S R E V I E W

N E T W O R K M A N A G E M E N T P R I N C I P L E S R E V I E W CS7012 N E T W O R K M A N A G E M E N T P R I N C I P L E S R E V I E W THE MANAGED OBJECT MANAGER / AGENT RELATIONSHIP Standard Interface Local (proprietary) Interface Manager Management Operations Agent

More information

Redesde Computadores(RCOMP)

Redesde Computadores(RCOMP) Redesde Computadores(RCOMP) Lecture 11 2016/2017 Network management. SMTP application protocol. Instituto Superior de Engenharia do Porto Departamento de Engenharia Informática Redes de Computadores (RCOMP)

More information

A Scalable Event Dispatching Library for Linux Network Servers

A Scalable Event Dispatching Library for Linux Network Servers A Scalable Event Dispatching Library for Linux Network Servers Hao-Ran Liu and Tien-Fu Chen Dept. of CSIE National Chung Cheng University Traditional server: Multiple Process (MP) server A dedicated process

More information

Chapter 9 Network Management

Chapter 9 Network Management Chapter 9 Network Management A note on the use of these ppt slides: We re making these slides freely available to all (faculty, students, readers). They re in PowerPoint form so you can add, modify, and

More information

Monitoring. 18 Nov TM and copyright Imagicle spa

Monitoring. 18 Nov TM and copyright Imagicle spa Monitoring 18 Nov 2018 TM and copyright 2010-2018 Imagicle spa Table of Contents Monitoring...1/3 Monitoring service configuration...1/3 Monitoring Monitoring service configuration The Application Suite

More information

Industrial Challenges in Working with Events

Industrial Challenges in Working with Events Industrial Challenges in Working with Events Prof. Dr., Senior Technical Leader, NMTG Manageability Cisco Systems, Inc. pdini@cisco.com petre@iaria.org 1 The Road Ahead Positioning Issues - Event definition

More information

JacobsSNMP. Siarhei Kuryla. May 10, Networks and Distributed Systems seminar

JacobsSNMP. Siarhei Kuryla. May 10, Networks and Distributed Systems seminar JacobsSNMP Siarhei Kuryla Networks and Distributed Systems seminar May 10, 2010 Simple Network Management Protocol protocol for exchange of management information; exposes management data in the form of

More information

Proxy Providers versus Embedded Providers (SMI-S)

Proxy Providers versus Embedded Providers (SMI-S) Proxy Providers versus Embedded Providers (SMI-S) Srinivasa Reddy Gandlaparthi NetApp Overview Embedded Providers Proxy Providers Differences between Embedded and Proxy providers Design Considerations

More information

Valutazione delle prestazioni di Architetture Software con specifica UML tramite modelli di simulazione Moreno Marzolla

Valutazione delle prestazioni di Architetture Software con specifica UML tramite modelli di simulazione Moreno Marzolla Valutazione delle prestazioni di Architetture Software con specifica UML tramite modelli di simulazione Moreno Marzolla Dipartimento di Informatica Università Ca' Foscari di Venezia marzolla@dsi.unive.it

More information

HPE Operations Agent. Concepts Guide. Software Version: For the Windows, HP-UX, Linux, Solaris, and AIX operating systems

HPE Operations Agent. Concepts Guide. Software Version: For the Windows, HP-UX, Linux, Solaris, and AIX operating systems HPE Operations Agent Software Version: 12.02 For the Windows, HP-UX, Linux, Solaris, and AIX operating systems Concepts Guide Document Release Date: December 2016 Software Release Date: December 2016 Legal

More information

RRDTool: A Round Robin Database for Network Monitoring

RRDTool: A Round Robin Database for Network Monitoring Journal of Computer Science Original Research Paper RRDTool: A Round Robin Database for Network Monitoring Sweta Dargad and Manmohan Singh Department of Computer Science Engineering, ITM Universe, Vadodara

More information

CSC 401 Data and Computer Communications Networks

CSC 401 Data and Computer Communications Networks CSC 401 Data and Computer Communications Networks Network Layer ICMP (5.6), Network Management(5.7) & SDN (5.1, 5.5, 4.4) Prof. Lina Battestilli Fall 2017 Outline 5.6 ICMP: The Internet Control Message

More information

Monitoring tools and techniques for ICT4D systems. Stephen Okay

Monitoring tools and techniques for ICT4D systems. Stephen Okay Monitoring tools and techniques for ICT4D systems Stephen Okay Effective Monitoring Why do monitoring? Monitoring tools and Applications Monitoring:What,Where, Why,How, etc. Alerting Off-the-shelf vs.

More information

MONitoring Agents using a Large Integrated Services Architecture. Iosif Legrand California Institute of Technology

MONitoring Agents using a Large Integrated Services Architecture. Iosif Legrand California Institute of Technology MONitoring Agents using a Large Integrated s Architecture California Institute of Technology Distributed Dynamic s Architecture Hierarchical structure of loosely coupled services which are independent

More information

Network Layer: ICMP and Network Management

Network Layer: ICMP and Network Management Network Layer: ICMP and Network Management EECS3214 18-03-15 4-1 Chapter 5: outline 5.1 introduction 5.2 routing protocols link state distance vector 5.3 intra-as routing in the Internet: OSPF 5.4 routing

More information

SNMP Simple Network Management Protocol

SNMP Simple Network Management Protocol SNMP Simple Network Management Protocol Simple Network Management Protocol SNMP is a framework that provides facilities for managing and monitoring network resources on the Internet. Components of SNMP:

More information

Outline. SNMP Simple Network Management Protocol. Before we start on SNMP. Simple Network Management Protocol

Outline. SNMP Simple Network Management Protocol. Before we start on SNMP. Simple Network Management Protocol Outline SNMP Simple Network Management Protocol Several slides are courtesy of the Addison Wesley companion web site for textbook by Liebeherr and El Zarki and others added by M. Veeraraghavan, Univ. of

More information

Service Oriented Performance Analysis

Service Oriented Performance Analysis Service Oriented Performance Analysis Da Qi Ren and Masood Mortazavi US R&D Center Santa Clara, CA, USA www.huawei.com Performance Model for Service in Data Center and Cloud 1. Service Oriented (end to

More information

Accounting management system enhancement supporting automated monitoring and storing facilities

Accounting management system enhancement supporting automated monitoring and storing facilities Accounting management system enhancement supporting automated monitoring and storing facilities Abstract C. Bouras S. Kastaniotis 1 Computer Engineering and Informatics Department University of Patras,

More information

CSCE Introduction to Computer Systems Spring 2019

CSCE Introduction to Computer Systems Spring 2019 CSCE 313-200 Introduction to Computer Systems Spring 2019 Processes Dmitri Loguinov Texas A&M University January 24, 2019 1 Chapter 3: Roadmap 3.1 What is a process? 3.2 Process states 3.3 Process description

More information

Troubleshooting Tools. Tools for Gathering Information

Troubleshooting Tools. Tools for Gathering Information Internetwork Expert s CCNP Bootcamp Troubleshooting Tools http:// Tools for Gathering Information Before implementing a fix, information must be gathered about a problem to eliminate as many variables

More information

Understanding Feature and Network Services in Cisco Unified Serviceability

Understanding Feature and Network Services in Cisco Unified Serviceability CHAPTER 10 Understanding Feature and Network Services in Cisco Unified Serviceability May 19, 2009 Cisco Unified Serviceability service management includes working with feature and network services and

More information

Network control and management

Network control and management Network control and management Network management What is network management?? Why is it needed? Mani Subramanian, Network Management: An introduction to principles and practice, Addison Wesley Longman,

More information

I/O Systems. Amir H. Payberah. Amirkabir University of Technology (Tehran Polytechnic)

I/O Systems. Amir H. Payberah. Amirkabir University of Technology (Tehran Polytechnic) I/O Systems Amir H. Payberah amir@sics.se Amirkabir University of Technology (Tehran Polytechnic) Amir H. Payberah (Tehran Polytechnic) I/O Systems 1393/9/15 1 / 57 Motivation Amir H. Payberah (Tehran

More information

OUTLINE. NSLS-II control system environment Monitoring goals Splunk and Splunk Apps Unix, Nagios, Snort sflow and Cacti Putting it all together

OUTLINE. NSLS-II control system environment Monitoring goals Splunk and Splunk Apps Unix, Nagios, Snort sflow and Cacti Putting it all together OUTLINE NSLS-II control system environment Monitoring goals Splunk and Splunk Apps Unix, Nagios, Snort sflow and Cacti Putting it all together NSLS-II CONTROL SYSTEM ENVIRONMENT Private network no email,

More information

Relational Network Manager for IP Networks

Relational Network Manager for IP Networks Relational Network Manager for IP Networks Yuri Breitbart, Deepakraj Shanthilal Department of Computer Science Kent State University Kent, OH 44242 Abstract This paper deals with an issue of effective

More information

ELFms industrialisation plans

ELFms industrialisation plans ELFms industrialisation plans CERN openlab workshop 13 June 2005 German Cancio CERN IT/FIO http://cern.ch/elfms ELFms industrialisation plans, 13/6/05 Outline Background What is ELFms Collaboration with

More information

SIMPLE NETWORK MANAGEMENT PROTOCOL SATELLAR MANAGEMENT WITH SNMP GET AND SET

SIMPLE NETWORK MANAGEMENT PROTOCOL SATELLAR MANAGEMENT WITH SNMP GET AND SET SIMPLE NETWORK MANAGEMENT PROTOCOL SATELLAR MANAGEMENT WITH SNMP GET AND SET Technical Bulletin 2/14 THE SNMP PROTOCOL The SIMPLE NETWORK MANAGEMENT PROTOCOL, SNMP is a widely used management protocol

More information

Talk Outline. Moreno Marzolla. Motivations. How can performances be evaluated?

Talk Outline. Moreno Marzolla. Motivations. How can performances be evaluated? Talk Outline Moreno Marzolla Motivations and General Principles Contribution Introduction to The The Conclusions Dipartimento di Informatica Università Ca' Foscari di Venezia marzolla@dsi.unive.it M. Marzolla

More information

TSIN02 - Internetworking

TSIN02 - Internetworking TSIN02 - Internetworking Literature: Lecture 11: SNMP and AAA Forouzan, chapter 21 Diameter next generation's AAA protocol by Håkan Ventura, sections 2-3.3.6 RFC2881 (optional extra material) Outline:

More information

Lecture 5: Foundation of Network Management

Lecture 5: Foundation of Network Management Lecture 5: Foundation of Network Management Prof. Shervin Shirmohammadi SITE, University of Ottawa Prof. Shervin Shirmohammadi CEG 4395 5-1 Network Management Standards OSI: Common Management Information

More information

Scaling Without Sharding. Baron Schwartz Percona Inc Surge 2010

Scaling Without Sharding. Baron Schwartz Percona Inc Surge 2010 Scaling Without Sharding Baron Schwartz Percona Inc Surge 2010 Web Scale!!!! http://www.xtranormal.com/watch/6995033/ A Sharding Thought Experiment 64 shards per proxy [1] 1 TB of data storage per node

More information

SilverCreek Compare Versions

SilverCreek Compare Versions Platform Support: Windows Linux Includes all the platfoms listed above T T T x x x x x x Test Coverage: Tests for SNMPv1, v2c, all private and standard MIBs Tests for SNMPv1, v2c, v3, all private and standard

More information

Lite Management Console. design overview Rev. 1.0 December 4, 2003 nextedge Technology, Inc.

Lite Management Console. design overview Rev. 1.0 December 4, 2003 nextedge Technology, Inc. Lite Management Console design overview Rev. 1.0 December 4, 2003 nextedge Technology, Inc. Concepts Light Weight Low resource usage Appropriate for SOHO and Groupware Easy to use Windows GUI, use common

More information

The Fusion Distributed File System

The Fusion Distributed File System Slide 1 / 44 The Fusion Distributed File System Dongfang Zhao February 2015 Slide 2 / 44 Outline Introduction FusionFS System Architecture Metadata Management Data Movement Implementation Details Unique

More information

Simulating storage system performance: a useful approach for SuperB?

Simulating storage system performance: a useful approach for SuperB? Simulating storage system performance: a useful approach for SuperB? Moreno Marzolla Dipartimento di Scienze dell'informazione Università di Bologna marzolla@cs.unibo.it http://www.moreno.marzolla.name/

More information

Real-Time Embedded User Interfaces

Real-Time Embedded User Interfaces Real-Time Embedded User Interfaces Justin Ireland August 2010 Introduction Modern appliances and electronic devices are becoming increasingly sophisticated. Advanced feature sets require advanced configuration

More information

RMON on the Workgroup Catalyst Series

RMON on the Workgroup Catalyst Series RMON on the Workgroup Catalyst Series Document ID: 10675 Contents Introduction General Questions Known Problems and Solutions Error Messages for the TrafficDirector Software Related Information Introduction

More information

Framework Management Layer User's Guide. SNMP Interface

Framework Management Layer User's Guide. SNMP Interface Framework Management Layer User's Guide SNMP Interface 1/21/2018 SNMP Interface Contents 1 SNMP Interface 1.1 Architecture 1.2 How to Activate SNMP Support 1.3 How to Use Contact-Center Graceful Shutdown

More information

Online Help StruxureWare Data Center Expert

Online Help StruxureWare Data Center Expert Online Help StruxureWare Data Center Expert Version 7.2.7 What's New in StruxureWare Data Center Expert 7.2.x Learn more about the new features available in the StruxureWare Data Center Expert 7.2.x release.

More information

02 - Distributed Systems

02 - Distributed Systems 02 - Distributed Systems Definition Coulouris 1 (Dis)advantages Coulouris 2 Challenges Saltzer_84.pdf Models Physical Architectural Fundamental 2/58 Definition Distributed Systems Distributed System is

More information

02 - Distributed Systems

02 - Distributed Systems 02 - Distributed Systems Definition Coulouris 1 (Dis)advantages Coulouris 2 Challenges Saltzer_84.pdf Models Physical Architectural Fundamental 2/60 Definition Distributed Systems Distributed System is

More information

Monitoring Juniper EX Switch

Monitoring Juniper EX Switch Monitoring Juniper EX Switch eg Enterprise v6 Restricted Rights Legend The information contained in this document is confidential and subject to change without notice. No part of this document may be reproduced

More information

Simple Network Management Protocol. Slide Set 8

Simple Network Management Protocol. Slide Set 8 Simple Network Management Protocol Slide Set 8 Network Management Framework Internet network management framework MIB: management information base SMI: data definition language SNMP: protocol for network

More information

Services: Monitoring and Logging. 9/16/2018 IST346: Info Tech Management & Administration 1

Services: Monitoring and Logging. 9/16/2018 IST346: Info Tech Management & Administration 1 Services: Monitoring and Logging 9/16/2018 IST346: Info Tech Management & Administration 1 Recall: Server vs. Service A server is a computer. A service is an offering provided by server(s). HTTP 9/16/2018

More information

Network+ Guide to Networks, Fourth Edition. Chapter 8 Network Operating Systems and Windows Server 2003-Based Networking

Network+ Guide to Networks, Fourth Edition. Chapter 8 Network Operating Systems and Windows Server 2003-Based Networking Network+ Guide to Networks, Fourth Edition Chapter 8 Network Operating Systems and Windows Server 2003-Based Networking Objectives Discuss the functions and features of a network operating system Define

More information

AT76.09 Digital Image Processing in Remote Sensing using C Language

AT76.09 Digital Image Processing in Remote Sensing using C Language AT76.09 Digital Image Processing in Remote Sensing using C Language Dr. HONDA Kiyoshi Associate Professor Space Technology Applications and Research Asian Institute of Technology honda@ait.ac.th 1 1. Introduction

More information

Cisco Prime Collaboration Deployment Configuration and Administration

Cisco Prime Collaboration Deployment Configuration and Administration Cisco Prime Collaboration Deployment Configuration and Administration Services, page 1 Limitations and Restrictions, page 5 Services After the installation of the Cisco Prime Collaboration Deployment platform,

More information

Software-defined Storage: Fast, Safe and Efficient

Software-defined Storage: Fast, Safe and Efficient Software-defined Storage: Fast, Safe and Efficient TRY NOW Thanks to Blockchain and Intel Intelligent Storage Acceleration Library Every piece of data is required to be stored somewhere. We all know about

More information

Lecture 9: MIMD Architectures

Lecture 9: MIMD Architectures Lecture 9: MIMD Architectures Introduction and classification Symmetric multiprocessors NUMA architecture Clusters Zebo Peng, IDA, LiTH 1 Introduction MIMD: a set of general purpose processors is connected

More information

EXAM - VCP5-DCV. VMware Certified Professional 5 Data Center Virtualization (VCP5-DCV) Exam. Buy Full Product.

EXAM - VCP5-DCV. VMware Certified Professional 5 Data Center Virtualization (VCP5-DCV) Exam. Buy Full Product. VMware EXAM - VCP5-DCV VMware Certified Professional 5 Data Center Virtualization (VCP5-DCV) Exam Buy Full Product http://www.examskey.com/vcp5-dcv.html Examskey VMware VCP5-DCV exam demo product is here

More information

Commercial Real-time Operating Systems An Introduction. Swaminathan Sivasubramanian Dependable Computing & Networking Laboratory

Commercial Real-time Operating Systems An Introduction. Swaminathan Sivasubramanian Dependable Computing & Networking Laboratory Commercial Real-time Operating Systems An Introduction Swaminathan Sivasubramanian Dependable Computing & Networking Laboratory swamis@iastate.edu Outline Introduction RTOS Issues and functionalities LynxOS

More information

Fault Tolerance for Highly Available Internet Services: Concept, Approaches, and Issues

Fault Tolerance for Highly Available Internet Services: Concept, Approaches, and Issues Fault Tolerance for Highly Available Internet Services: Concept, Approaches, and Issues By Narjess Ayari, Denis Barbaron, Laurent Lefevre and Pascale primet Presented by Mingyu Liu Outlines 1.Introduction

More information

The Slide does not contain all the information and cannot be treated as a study material for Operating System. Please refer the text book for exams.

The Slide does not contain all the information and cannot be treated as a study material for Operating System. Please refer the text book for exams. The Slide does not contain all the information and cannot be treated as a study material for Operating System. Please refer the text book for exams. Operating System Services User Operating System Interface

More information

Oracle Database 10G. Lindsey M. Pickle, Jr. Senior Solution Specialist Database Technologies Oracle Corporation

Oracle Database 10G. Lindsey M. Pickle, Jr. Senior Solution Specialist Database Technologies Oracle Corporation Oracle 10G Lindsey M. Pickle, Jr. Senior Solution Specialist Technologies Oracle Corporation Oracle 10g Goals Highest Availability, Reliability, Security Highest Performance, Scalability Problem: Islands

More information

1. Introduction. Traditionally, a high bandwidth file system comprises a supercomputer with disks connected

1. Introduction. Traditionally, a high bandwidth file system comprises a supercomputer with disks connected 1. Introduction Traditionally, a high bandwidth file system comprises a supercomputer with disks connected by a high speed backplane bus such as SCSI [3][4] or Fibre Channel [2][67][71]. These systems

More information

Chapter 3. Design of Grid Scheduler. 3.1 Introduction

Chapter 3. Design of Grid Scheduler. 3.1 Introduction Chapter 3 Design of Grid Scheduler The scheduler component of the grid is responsible to prepare the job ques for grid resources. The research in design of grid schedulers has given various topologies

More information

Applications FTP. FTP offers many facilities :

Applications FTP. FTP offers many facilities : Applications FTP Given a reliable end-to-end trasport protocol like TCP, File Transfer might seem trivial. But, the details authorization, representation among heterogeneous machines make the protocol

More information

Exam4Tests. Latest exam questions & answers help you to pass IT exam test easily

Exam4Tests.   Latest exam questions & answers help you to pass IT exam test easily Exam4Tests http://www.exam4tests.com Latest exam questions & answers help you to pass IT exam test easily Exam : VCP510PSE Title : VMware Certified Professional 5 - Data Center Virtualization PSE Vendor

More information

«SAS IT Service Vision and HP Open View Integration (experience of installation and usage in CHERUS Ltd.)»

«SAS IT Service Vision and HP Open View Integration (experience of installation and usage in CHERUS Ltd.)» «SAS IT Service Vision and HP Open View Integration (experience of installation and usage in CHERUS Ltd.)» Dmitri Tseitline, CHERUS Ltd., Moscow, Russia Notes to the presentation The analysis and resource

More information

StorageTek ACSLS Manager Software

StorageTek ACSLS Manager Software StorageTek ACSLS Manager Software Management of distributed tape libraries is both time-consuming and costly involving multiple libraries, multiple backup applications, multiple administrators, and poor

More information

The Lion of storage systems

The Lion of storage systems The Lion of storage systems Rakuten. Inc, Yosuke Hara Mar 21, 2013 1 The Lion of storage systems http://www.leofs.org LeoFS v0.14.0 was released! 2 Table of Contents 1. Motivation 2. Overview & Inside

More information

SNMP and Network Management

SNMP and Network Management Contents SNMP and Network Management Network Management MIB naming tree, MIB-II SNMP protocol SNMP traps SNMP versions Nixu Ltd 2 Network management When you have 100s of computers in a network or are

More information

ONOS: TOWARDS AN OPEN, DISTRIBUTED SDN OS. Chun Yuan Cheng

ONOS: TOWARDS AN OPEN, DISTRIBUTED SDN OS. Chun Yuan Cheng ONOS: TOWARDS AN OPEN, DISTRIBUTED SDN OS Chun Yuan Cheng OUTLINE - Introduction - Two prototypes - Conclusion INTRODUCTION - An open, vendor neutral, control-data plane interface such as OpenFlow allows

More information

Management Software. SmartView TM EMS (Element Management System) Management Software. Management Software SmartView TM EMS. Polled Network Elements

Management Software. SmartView TM EMS (Element Management System) Management Software. Management Software SmartView TM EMS. Polled Network Elements LAN PWR PWR 2 PoE Fault Fiber 00 LAN ON OFF Force Auto 0 00 Half Full LFP Flow Pass SW TX RX Ethernet Media Converter FIBER LAN PWR PWR 2 Fault Fiber 00 LAN ON OFF Force Auto 0 00 Half Full LFP Flow Pass

More information

Performance and Scalability with Griddable.io

Performance and Scalability with Griddable.io Performance and Scalability with Griddable.io Executive summary Griddable.io is an industry-leading timeline-consistent synchronized data integration grid across a range of source and target data systems.

More information

OPTICAL TRANSPORT SYSTEM SOFTWARE RELEASE NOTES

OPTICAL TRANSPORT SYSTEM SOFTWARE RELEASE NOTES OPTICAL TRANSPORT SYSTEM SOFTWARE RELEASE NOTES 4.10-3.9 4.10 April 20, 2018 New Features: CT44/45 High Output CT transmitters added display of laser power in mw in addition to dbm reading. Also now displays

More information

A memcached implementation in Java. Bela Ban JBoss 2340

A memcached implementation in Java. Bela Ban JBoss 2340 A memcached implementation in Java Bela Ban JBoss 2340 AGENDA 2 > Introduction > memcached > memcached in Java > Improving memcached > Infinispan > Demo Introduction 3 > We want to store all of our data

More information

Adaptive Cluster Computing using JavaSpaces

Adaptive Cluster Computing using JavaSpaces Adaptive Cluster Computing using JavaSpaces Jyoti Batheja and Manish Parashar The Applied Software Systems Lab. ECE Department, Rutgers University Outline Background Introduction Related Work Summary of

More information

Virtual Server Agent for VMware VMware VADP Virtualization Architecture

Virtual Server Agent for VMware VMware VADP Virtualization Architecture Virtual Server Agent for VMware VMware VADP Virtualization Architecture Published On: 11/19/2013 V10 Service Pack 4A Page 1 of 18 VMware VADP Virtualization Architecture - Virtual Server Agent for VMware

More information

Abstract. NSWC/NCEE contract NCEE/A303/41E-96.

Abstract. NSWC/NCEE contract NCEE/A303/41E-96. A Distributed Architecture for QoS Management of Dynamic, Scalable, Dependable, Real-Time Systems 1 Lonnie R. Welch and Behrooz A.Shirazi Computer Science and Engineering Dept. The University of Texas

More information

ATLAS NorduGrid related activities

ATLAS NorduGrid related activities Outline: NorduGrid Introduction ATLAS software preparation and distribution Interface between NorduGrid and Condor NGlogger graphical interface On behalf of: Ugur Erkarslan, Samir Ferrag, Morten Hanshaugen

More information

SNMP MIBs and Traps Supported

SNMP MIBs and Traps Supported This section describes the MIBs available on your system. When you access your MIB data you will expose additional MIBs not listed in this section. The additional MIBs you expose through the process are

More information

Configuring SNMP. Understanding SNMP CHAPTER

Configuring SNMP. Understanding SNMP CHAPTER 24 CHAPTER This chapter describes how to configure the the ML1000-2, ML100T-12, ML100X-8, and ML-MR-10 cards for operating with Simple Network Management Protocol (SNMP). Note For complete syntax and usage

More information

SNMP and Network Management

SNMP and Network Management SNMP and Network Management Nixu Ltd Contents Network Management MIB naming tree, MIB-II SNMP protocol SNMP traps SNMP versions 2 Network management When you have 100s of computers in a network or are

More information