TECHNICAL RESEARCH REPORT

Similar documents
different problems from other networks ITU-T specified restricted initial set Limited number of overhead bits ATM forum Traffic Management

What Is Congestion? Computer Networks. Ideal Network Utilization. Interaction of Queues

Congestion in Data Networks. Congestion in Data Networks

Lecture 4 Wide Area Networks - Congestion in Data Networks

ATM. Asynchronous Transfer Mode. (and some SDH) (Synchronous Digital Hierarchy)

Intermediate Traffic Management

Introduction to ATM Traffic Management on the Cisco 7200 Series Routers

ATM Quality of Service (QoS)

Modelling a Video-on-Demand Service over an Interconnected LAN and ATM Networks

QOS in ATM Networks. Traffic control in ATM networks. Layered model. Call level. Pag. 1

Current Issues in ATM Forum Traffic Management Group

Expert Network. Expert. Network

11. Traffic management in ATM. lect11.ppt S Introduction to Teletraffic Theory Spring 2003

Traffic control in ATM networks

Uncontrollable. High Priority. Users. Multiplexer. Server. Low Priority. Controllable. Users. Queue

TECHNICAL RESEARCH REPORT

What Is Congestion? Effects of Congestion. Interaction of Queues. Chapter 12 Congestion in Data Networks. Effect of Congestion Control

Rohit Goyal 1, Raj Jain 1, Sonia Fahmy 1, Shobana Narayanaswamy 2

BROADBAND AND HIGH SPEED NETWORKS

Rohit Goyal 1, Raj Jain 1, Sonia Fahmy 1, Shobana Narayanaswamy 2

Source 1. Destination 1. Bottleneck Link. Destination 2. Source 2. Destination N. Source N

ATM Traffic Control Based on Cell Loss Priority and Performance Analysis

MDP Routing in ATM Networks. Using the Virtual Path Concept 1. Department of Computer Science Department of Computer Science

ETSI ETR 155 TECHNICAL February 1995 REPORT

Credit-Based Fair Queueing (CBFQ) K. T. Chan, B. Bensaou and D.H.K. Tsang. Department of Electrical & Electronic Engineering

Real-Time ABR, MPEG2 Streams over VBR, and Virtual Source/Virtual Destination rt-abr switch

perform well on paths including satellite links. It is important to verify how the two ATM data services perform on satellite links. TCP is the most p

! Cell streams relating to different media types are multiplexed together on a statistical basis for transmission and switching.

Network management and QoS provisioning - QoS in ATM Networks

11. Traffic management in ATM

Understanding the Available Bit Rate (ABR) Service Category for ATM VCs

Traffic Management. Service Categories CHAPTER

ATM Logical Connections: VCC. ATM Logical Connections: VPC

Standardizing Information and Communication Systems

Traffic Management Tools for ATM Networks With Real-Time and Non-Real-Time Services

Headend Station. Headend Station. ATM Network. Headend Station. Station. Fiber Node. Station. Station Trunk Splitter.

Network Working Group Request for Comments: 3134 Category: Informational June 2001

V 1. volume. time. t 1

ACE-2002, ACE-2002E. Multiservice Access Concentrators/ ATM Network Termination Units FEATURES

Performance Analysis & QoS Guarantee in ATM Networks

Traffic Management. Service Categories CHAPTER

Connection Admission Control for Hard Real-Time Communication in ATM Networks

Traffic Management and. QoS Issues for Large High-Speed Networks

SCHEDULING REAL-TIME MESSAGES IN PACKET-SWITCHED NETWORKS IAN RAMSAY PHILP. B.S., University of North Carolina at Chapel Hill, 1988

Performance and Evaluation of Integrated Video Transmission and Quality of Service for internet and Satellite Communication Traffic of ATM Networks

\Classical" RSVP and IP over ATM. Steven Berson. April 10, Abstract

A Survey of Congestion Control Schemes for ABR Services in ATM Networks Hyun mee Choi Ronald J. Vetter Payoff

Asynchronous Transfer Mode (ATM) ATM concepts

What is the role of teletraffic engineering in broadband networks? *

Numerical Evaluation of Hierarchical QoS Routing. Sungjoon Ahn, Gayathri Chittiappa, A. Udaya Shankar. Computer Science Department and UMIACS

cell rate bandwidth exploited by ABR and UBR CBR and VBR time

header information which limits the maximum possible eciency of data transmission, especially on LANs. Further, the loss of one cell results in the lo

Routing Strategies. Fixed Routing. Fixed Flooding Random Adaptive

Bandwidth-on-Demand up to very high speeds. Variety of physical layers using optical fibre, copper, wireless. 3BA33 D.Lewis

ATM Traffic Management

OPTIMISTIC HARD REAL TIME ANALYSIS OF CONNECTIONLESS SELF SIMILAR TRAFFIC IN ATM NETWORKS P. Rajan 1 and Dr. Arul Lawrence Selvakumar 1

General Description. Overview. Introduction CHAPTER

Dynamic Multi-Path Communication for Video Trac. Hao-hua Chu, Klara Nahrstedt. Department of Computer Science. University of Illinois

INTERNATIONAL TELECOMMUNICATION UNION

1 Introduction Virtual private networks (VPNs) are rapidly gaining popularity. A VPN uses the public Internet to transparently connect private network

Enhancing Security In ATM Networks Through Congestion Control Technique

Chapter 10. Circuits Switching and Packet Switching 10-1

Flow Control. Flow control problem. Other considerations. Where?

ATM. Asynchronous Transfer Mode. these slides are based on USP ATM slides from Tereza Carvalho. ATM Networks Outline

Traffic and Congestion Control in ATM Networks Using Neuro-Fuzzy Approach

Master Course Computer Networks IN2097

Toward a Time-Scale Based Framework for ABR Trac Management. Tamer Dag Ioannis Stavrakakis. 409 Dana Research Building, 360 Huntington Avenue

Network Working Group Request for Comments: 2761 Category: Informational February 2000

End-to-End Mechanisms for QoS Support in Wireless Networks

Implementation of ATM Endpoint Congestion Control Protocols. Prashant R. Chandra, Allan L. Fisher, Corey Kosak and Peter A.

TECHNICAL RESEARCH REPORT

Dynamics of an Explicit Rate Allocation. Algorithm for Available Bit-Rate (ABR) Service in ATM Networks. Lampros Kalampoukas, Anujan Varma.

Protocol Architecture (diag) Computer Networks. ATM Connection Relationships. ATM Logical Connections

Stability in ATM Networks. network.

2 CHAPTER 2 LANs. Until the widespread deployment of ABR compatible products, most ATM LANs will probably rely on the UBR service category. To ll the

Methods and Models for In-Service Monitoring and Estimation of Cell Losses in ATM Networks

EC1009 HIGH SPEED NETWORKS (ELECTIVE) (2 marks Questions and answers)

Extensions to RTP to support Mobile Networking: Brown, Singh 2 within the cell. In our proposed architecture [3], we add a third level to this hierarc

William Stallings Data and Computer Communications 7 th Edition. Chapter 11 Asynchronous Transfer Mode

ATG s Communications & Networking Technology Guide Series This guide has been sponsored by

SIMULATION OF PACKET DATA NETWORKS USING OPNET

Improving the Eectiveness of ATM Trac Control over Hybrid. Fiber-Coax Networks. Polytechnic University

Communication Networks

The ERICA ALGORITHM for ABR TRAFFIC in ATM NETWORKS

Performance Characteristics of a Packet-Based Leaky-Bucket Algorithm for ATM Networks

ATM Virtual Private Networks. for the Internet Data Trac. Abstract. The ecient utilization and management of bandwidth in broadband networks

ETSI ETR 123 TECHNICAL February 1994 REPORT

A Distributed Routing Algorithm for Supporting Connection-Oriented Service in Wireless Networks with Time-Varying Connectivity

Multimedia Traffic Management and Congestion Control in Satellite ATM Networks

Estimating End-to-End Cell Delay Variation. in ATM Networks. Ibrahim Korpeoglu Satish K. Tripathi. University of Maryland. College Park, MD 20742

IP SLAs Overview. Finding Feature Information. Information About IP SLAs. IP SLAs Technology Overview

TCP/IP over ATM using ABR, UBR, and GFR Services

which adjusts its allowed cell rate according to that feedback. Figure 1: Forward and backward RM cells Most resource management cells generated by th

Lecture 03 Chapter 11 Asynchronous Transfer Mode

MPLS AToM Overview. Documentation Specifics. Feature Overview

Unit 2 Packet Switching Networks - II

Performance of UMTS Radio Link Control

William Stallings Data and Computer Communications. Chapter 10 Packet Switching

CPEG 514. Lecture 11 Asynchronous Transfer Mode (ATM) CPEG 514

RADview-EMS/NGN. Element Management System for NGN Applications ACE-201/ RAD Data Communications Publication 06/07

Transcription:

TECHNICAL RESEARCH REPORT Performance Monitoring in ATM Networks by A. Arora, J. Baras CSHCN T.R. 98-12 (ISR T.R. 98-22) The Center for Satellite and Hybrid Communication Networks is a NASA-sponsored Commercial Space Center also supported by the Department of Defense (DOD), industry, the State of Maryland, the University of Maryland and the Institute for Systems Research. This document is a technical report in the CSHCN series originating at the University of Maryland. Web site http://www.isr.umd.edu/cshcn/

Performance Monitoring in ATM Networks Anubhav Arora and John S. Baras Center for Satellite and Hybrid Communication Networks and Institute of Systems Research University of Maryland at College Park April 20, 1998 1

Performance Monitoring in ATM Networks 1 Anubhav Arora and John S. Baras Center for Satellite and Hybrid Communication Networks and Institute of Systems Research University of Maryland at College Park Contents 1 Introduction 1 2 The Operations And Maintenance (OAM) Standard for ATM 2 3 Management Architecture 3 3.1 Intelligent Distributed Performance Management.................... 3 3.2 Information Flow Model.................................. 4 4 Measuring key QoS parameters 5 4.1 Delay measurement using management cells....................... 5 4.2 Delay measurement using clock parameter estimation.................. 6 4.3 Cell loss estimation using a fuzzy system......................... 6 4.4 Cell loss estimation using in-service monitoring..................... 7 4.5 Parameters for error performance............................. 8 5 Conclusions and Future work 9 1 Research supported by NASA under cooperative agreement NCC3-528 and by the Army Research Laboratory under cooperative agreement DAAL01-96-2-0002. 2

Abstract ATM networks provide end-to-end QoS guarantees to connections for their lifetime in the form of bounds on delays, errors and losses. Performance management involves measurement of these parameters accurately and taking control measures, if required, to improve performance. This is very important for real time connections in which losses are irrecoverable and delays cause interruptions in service. In this paper, an intelligent management architecture and an information ow model for performance management proposed in the literature is reviewed. A brief description of the related ITU-T standard for performance monitoring is given and several schemes for measurement or estimation of QoS parameters are documented. Future work required in the area is also presented in the last section. 1 Introduction Asynchronous Transfer Mode (ATM) is a connection-oriented fast packet switching protocol. Information is carried in xed size cells along virtual channels, contained in virtual paths. ATM is designed to support various kinds of connections, voice/video/data etc. using statistical multiplexing of sources to achieve ecient resource utilization. Every incoming call is characterized by its type - Constant/Variable/Available/Unspecied Bit Rate (CBR/VBR/ABR/UBR) - and the Quality of Service (QoS) it requires. The QoS of a connection consists of parameters like the Mean Cell Rate (MCR), Peak Cell Rate (PCR), Cell Transfer Delay (CTD), Cell Delay Variance (CDV), Cell Loss Ratio (CLR), Cell Error Ratio (CER) etc. dened for CBR and VBR services. The parameters CTD, CDV, and CLR are most important for real time connections in which delays are interruptions in service and losses maynot be recoverable using retransmissions. Real time connections are usually CBR or VBR depending on the coding of the source. In the connection setup phase a route is determined, with the network guaranteeing (the guarantee is of a statistical nature) the QoS required by the connection along the complete path. The network is obligated to provide this QoS throughout the duration of the connection but is allowed to drop cells if the source violates the QoS contract. To this end, performance management i.e., measurement and control of the QoS being provided to a connection, is necessary for the network. It is also necessary for congestion control, trac management, planning in the network and possibly for billing. Measurement of QoS parameters forachannel (or path) is useful for the connection also to know if the agreed QoS is being provided. 1

Performance management thus encompasses gathering of network performance statistics (in the form of QoS parameters), analyzing its trends and taking proactive or curitive control measures. A few papers on performance management inatm networks are reviewed here. In the next section, the related ATM standard from ITU-T is reviewed regarding its performance monitoring aspects. In section 3, an intelligent management architecture and an information ow model for performance monitoring are reviewed. Section 4 looks at schemes for measuring or estimating the important QoS parameters. In the last section, future work required in the area and conclusions are presented. 2 The Operations And Maintenance (OAM) Standard for ATM The OAM standard for ATM is recommendation I.610 from ITU-T [1]. There are two vertical OAM levels dened for the ATM layer, the virtual path level (F4) and the virtual channel level (F5). OAM ows are provided by cells dedicated to VPCs (or VCCs) and follow the same path as user cells. These ows can be further classied into two horizontal levels, end-to-end ows (from a VP (or VC) endpoint to another VP (or VC) endpoint) and segment ows (dened as single or multiple inter-connected VP (or VC) links). Intermediate nodes can monitor existing ows and insert new OAM cells but cannot terminate ows which are not their own. The OAM functions dened for the ATM layer are fault management, continuity check, loopback and performance monitoring. Performance monitoring is done by inserting end-to-end or segment monitoring cells at F4 or F5 level. These cells monitor a block of user cells (of size 128, 256, 512 or 1024). The main objective of this monitoring is to detect errored blocks and loss/mis-insertion of cells. There are forward monitoring cells to measure and backward reporting cells to report the measured values to the source. The various elds dened for these cells include { a sequence number to identify the cells { the total number of user cells sent (a modulo 64k counter) and total number of user cells with high Cell Loss Priority (CLP = 1) { a block error detection code (even parity Bit Interleaved Parity - 16) { an optional time stamp for delay measurements { for backward monitoring cells - total number of user cells received, total number of CLP=1 user cells and the block error result on the received cells Using these elds, block errors and dierence in the number of transmitted and received cells 2

can be known. However, this does not give precise information about lost and mis-inserted cells. Only round trip delays are accurately measurable using the optional timestamp eld (which is not implemented by most vendors currently) and that rarely provides information about one-way delay or bottle-neck links on the path. It is observable that this provides limited information regarding the delays and losses which are critical to real time services. The standard does not specify how these values are used for proactive or reactive control actions required if the QoS guarantees are not being met. It also does not specify how frequently these measurements should be made; this will depend on the bandwidth of the path, the network state, and user requirements. 3 Management Architecture 3.1 Intelligent Distributed Performance Management In [2], a generic intelligent mechanism for network management based on a distributed multiagent system is dened. The agents are semi-autonomous, making instantaneous local decisions when they have enough knowledge and communicating with other agents when the network is not loaded. The agent is composed of a knowledge module, a control and decision making module, and a communication module. It is attached to a node and may work in isolation when necessary. The agents communicate using a general blackboard that contains at the beginning, the facts of a given problem. The agents read in the current state of the problem and write into it to co-operatively converge to a solution. The blackboard also contains a control system to supervise the actions of the agents. A network performance monitoring architecture is proposed using the above model. The network is divided into domains and dierent levels of problem solving are dened - an agent working alone, agents in a domain working together and agents in multiple domains working together. OAM cells [1] are the carriers of performance information, in the form of various QoS parameters. A congestion control scheme is then proposed. It uses four thresholds to discard CLP=1 (indicating congestion) cells on specic or all ports and to send out Explicit Forward/Backward Congestion Notication (EFCI/EBCI). Upon receiving EBCN, a previous node drops CLP=1 cells on the congested path as it would be dropped on the congested node anyway. Simulation results given for a very simple network with uniform routing policy indicate that 3

tuning of thresholds using the multi-agent paradigm improves the cell loss by an order of magnitude as compared to xed thresholds. In the paper, the complexity of a real time agent is not investigated. The structure of the agent, its use of the QoS parameters and its threshold tuning policy (using predictions from a learning algorithm) are not clearly specied. The simulation assumptions seem too simple to be able to eectively model a real network. Also, the information ow in the simulation is not through OAM cells in the simulation. OAM information may be stale and inaccurate and is also quite limited in scope, which can adversely aect the knowledge of an agent. 3.2 Information Flow Model The paper [3] explores the use of special cells as an information carrier for a wide range of monitoring and control functions. The management cells dened in the paper can conceivably carry measurement data, network state information, control information and administrative information. This can serve as an eective model for the communication of parameters required in the previous intelligent architecture. Use of performance management OAM cells as dened in [1], requires no processing capabilities at intermediate nodes. They can be used to measure the round trip delay, the dierence in the number of cells transmitted and received block errors. However, the exact number of cells lost or misinserted, the nodes experiencing congestion, and the one way delay cannot be accurately measured using the performance management OAM cell. The authors propose the use of management cells for improved monitoring as described in the next section. However, these methods require specic processing capabilities at end nodes and intermediate switches, like timestamping a cell on ingress and egress etc. which are not present in state-of-the-art switches today. Very few commercial switches implement the OAM standard completely. Thus there is a need for monitoring methods which are algorithmically simple, require minimal processing at switches and gather relevant data. The paper also proposes the use of management cells for trac and congestion management using explicit feedback rate control (for ABR and UBR trac), advocating the use of ATM Block Transfers with bandwidth renogotiation for each block using RM cells (all cells in a block are carried or dropped at each node, ensuring high good-put), and dynamic VP bandwidth management (to accomodate new connections in a VP). Routing information can be communicated using management cells. 4

4 Measuring key QoS parameters This section presents several measurement (or estimation) schemes for the QoS parameters, cell delay, cell loss ratio and cell error parameters. 4.1 Delay measurement using management cells To accurately measure the one-way delay, OAM performance management cells require the source and destination clocks to be precisely synchronized to the same time, which is hardly the case. However, this method gives an accurate estimate of the round trip delay as it is the dierence of two times measured from the same clock. For the method proposed in [3], the switches should be capable of collecting information about their own internal performance. Also, ATM switches should be able to modify the management cells in ight. The one-way cell delay can be accurately measured by breaking it into delays experienced at each switch. A switch can time-stamp a management cell when it is received at the input. At the departure of the cell, the cell delay can be computed and added to the delay value already written in the cell (initialised to zero by the source). The processing delay required for this process and the propagation delays are xed and can be precomputed. Thus, a management cell accumulates the delay along the path. The delay eld at the destination gives a sample of the cell transfer delay which does not suer from the clock synchronization problem. An alternative is to have multiple time-stamp elds in the management cell so that the delay at each switch can be recorded separately. This may be useful for diagnostic purposes, for example to determine the bottleneck link. Management cells are required to occupy a small portion of the bandwidth, specially when the network is congested. This criterion dictates the inter-sample time. The authors have briey alluded to this issue and stated some heuristic arguments. This scheme requires new processing capabilities at the switches to modify cells on ingress and egress. No mention is made about the accuracy of such a scheme either (which can possibly justify the additional complexity in the switches). The next section describes a procedure based on statistical analysis of the remote clock which can be used with OAM performance management cells. 5

4.2 Delay measurement using clock parameter estimation This technique proposed by Roppel in [4] relies on estimation of the one-way cell transfer delay by analyzing the properties of the remote clock. Essentially, the remote clock can be modeled, its parameters estimated, and the time-stamp of the destination can be corrected for the oset. In this method, the switches do not need any new processing capabilities. The remote clock can be modeled using a time oset parameter (T o ) and a clock frequency drift parameter (). C(t) =t +T o + (t, t o )+(t) These two parameters can be estimated using a regression model on the delay samples from the backward reporting performance management cells. This method however requires that the minimum delay along both directions to be the same so is not suitable for asymmetric and hybrid networks. The number of samples required to converge to correct clock parameters may be large, thus the time for convergence for low bit rate links can be very high. These disadvantages can prohibit the use of this scheme for a large class of networks and connections. The above scheme can however be modied so that each node in the network estimates the parameters of its neighbors' clocks. This can be done with better accuracy as this involves only one queueing delay in the path, and may be done more simply and accurately using a CBR connection during light loading. Once a node knows the equations of all neighbor clocks, remote clock oset from source can be known by propagating it hop-by-hop along the connection path at the connection setup time. Thus, the source can know the model for the remote clock at connection setup time and use it to correct timestamp values in OAM cells from the destination. This allows for measurement of one-way delay, although bottle-neck links can not be known. This scheme, which uses the performance management OAM cell, neither involves a large time delaytoachieve accuracy nor new processing capabilities at switches and can be more accurate than Roppel's scheme. 4.3 Cell loss estimation using a fuzzy system Estimation of CLR using a fuzzy system and a related CAC algorithm are described in [5]. The fuzzy system used has a singleton fuzzier, a product inference engine, and a center average defuzzier with gaussian membership functions. Such a fuzzy system has universal approximation capabilities, i.e., any real valued continuous function can be approximated using such a system. 6

The CLR (P (m)) is taken as a function of S m (it can be the buer size or service capacity or the number of users) which evolves as S m = m + S 0. The CLR when S m is small is measured and supplied as the initial input to the system. The input to the system are previous M input-output pairs, f[p (k, (i + 2));P(k, (i + 1))]; P (k, i), P (k, (i + 1))g;i =0; :::; M, 2. The output of the system is the dierence between the next and current CLR values. When the number of steps required to predict CLR at target S m value is very large, the accuracy of the system suers. The authors propose using asymptotic information about CLR ^P (m) (assumed to be known) when the S m,! S1 : P improved (m) =[1, (m)]p (m)+(m) ^P (m) where the function (m) gives more weight to the asymptotic behavior as m,! 1. The authors study the behavior of this scheme with S m being the buer size, the service capacity and the number of users. The results give very good approximations to the actual CLR as obtained from exact source models, with the computation time at least an order of magnitude less than any other scheme. All the three parameters can be used one after the other to obtain CLR at the target values. The accuracy of the algorithm however depends on the variance of the gaussian membership function and the asymptotic behavior assumed. The paper presents some heuristic values for the two parameters. If the algorithm can be made less sensitive to the choice of these parameters while keeping the same low computational complexity, it can be eectively used in a real system for CLR measurement. A CAC algorithm that accurately estimates the aggregate required bandwidth by using the measured CLR and the bandwidth required by the incoming call is also described in the paper. 4.4 Cell loss estimation using in-service monitoring In [6], a technique to measure CLR using two dierent trac models is proposed. With a Markovian arrival process, the log of CLR depends linearly on the buer size (B) log(clr),, B and with a long range dependent trac model, log[-log(clr)] depends linearly on the logarithm of the buer size 7

log[,log(clr)] log()+log(b): The technique uses small pseudo-buers to estimate the CLR from a small number of incoming cells, thus decreasing the monitoring time. The algorithm applies linear regression to incoming samples of CLR over all pseudo buers on both models. It keeps two running counters indicating how many times a model was better than the other based on a goodness-of-t R 2 test. After a xed number of samples, it chooses one of the models as the correct one and then extrapolates the CLR to the physical buer size. The mean and the variance of CLR measurement over all pseudo buers can be calculated now and the distribution is assumed to be normal. The paper then proposes a QoS violation algorithm based on the Neyman-Pearson detection rule. Simulations are based on Bellcore video source traces & Bellcore LAN trace data, and the G/G/1/K & aggregated homogenous on/o sources with single server queue queueing models. Three dierent loads, under, critical and overload are simulated. The results are very close to the actual simulation values. Another notable point is that the algorithm chose long-range dependent model for video trace and markovian model for ethernet trace. The tradeo between the observation period and the detection probability is discussed and the decrease in observation interval through the use of pseudo-buers is highlighted (it is about 4-7 orders of magnitude). 4.5 Parameters for error performance In [7], the authors advocate the use of error performance parameters like Errored Second (a second including one or more bit errors), Severely Errored Second (a second in which the BER exceeds 10,3 ) and Unavailable Time (time that begins with 10 consecutive SESs and ends with 10 consecutive seconds not consisting of SESs) rather than just the Bit Error Rate which is felt as inadequate. The paper recommends estimation of channel error parameters using ISM (In-Service Monitoring) information about anomaly and defect events. Two channel performance estimation methods are suggested: 1) using statistical estimation of expectations of ES and SES using LOF (Loss of Frame Alignment) duration and violations of error checking codes measurements and 2) using linear or multivariate regression models denoting the relationship between ISM information gathered at path and the actual error performance of the actual channel. Field experiment results indicate that ISM information should be collected at short intervals and the monitoring path rate should be close to 8

the channel for better estimation of the channel. The authors have mentioned several uses of ISM but have provided no specic example or application where the BER is inadequate and the parameters mentioned above are required. In other words, the authors have given no justication or need for the parameters they propose to estimate. The mechanism proposed in the standard [1] to detect errors in small sized blocks using an even Bit Interleaved Parity (BIP-16) seems adequate to measure the error performance of the channel unless a clear application for the new parameters is specied. 5 Conclusions and Future work Performance management is an important function of ATM networks because the network guarantees to provide a certain end-to-end quality of service to the connections and is obliged to do so as long as the source does not violate its agreed upon peak and mean cell rates. Performance monitoring is useful for trac management, congestion control, network design, planning and possibly billing. In this paper, some literature pertaining to performance monitoring in ATM was reviewed. Operations and Maintenance (OAM) standards for ATM provide basic performance monitoring mechanism to measure errors, losses and possibly delays using OAM cells inserted after a specied block of cells. These measurements are however, quite limited in the information they gather. The standards do not clearly specify the inter-sample time and the possible uses of such measurements. Also, performance management cells are not implemented in most switches at present. Thus, a need is felt for a well dened monitoring architecture including algorithms to measure (or estimate) QoS parameters and the use of the information gathered to improve network performance. Another point to note is that the control algorithms for CAC, routing, UPC, soft-fault management etc. should dictate the performance measurement required. A generic intelligent performance management architecture based on a distributed semiautonomous multi-agent system interacting via a blackboard is reviewed. This architecture is used to implement a congestion management scheme and is shown to improve performance by an order of magnitude. The architecture requires information ow amongst the agents proposed to be done through OAM cells. An information ow model using special cells (called management cells) is reviewed next. However these mechanisms are complicated, require new processing capabilities at switches and their accuracy may not justify the additional cost. 9

Next, a few mechanisms to measure the QoS parameters - delay, loss and errors are reviewed. Two fundamentally dierent schemes of correctly measuring delay using management cells and estimation of remote clock parameters are presented. CLR can be estimated using a fuzzy system (given extremes information about the behavior) and using regression on measurements on several pseudo-buers. It has been proposed to measure error performance using parameters more sophisticated than BER but a proper justication about their applications has not been given. Anumber of issues remain to be addressed in this topic. The rst important requirement is the ability to accurately measure the parameters using a simple, fast, topology independent scheme that requires minimal changes to existing equipment. The ability to measure the bottle-necks, and other relevant diagnostic information should also be incorporated. A simple algorithm that dictates the next optimal time to sample also needs to be found. A constant sampling policy cannot be used due to the heterogeneity in sources, bandwidth constraints and congestion in the network. The second question is about the value of the information gathered. These statistics will be potentially used by the control algorithms - CAC, routing, congestion management, UPC, VP control etc. How they are used, will dene the value of these statistics, which in turn will dictate which parameters to measure, what accuracy is required and how often to measure. In other words, a strong coupling between trac management functions and performance monitoring aspects is required. Another important issue is the use of performance monitoring for proactive measurements and probing. For example, measurements on alternate routes can be made when there is no congestion to aid in routing and CAC. Such measurements will require very dierent schemes which again will be dictated by the potential use of this information. References [1] \B-ISDN Operation and Maintenance Principles and Functions", ITU-T Recommendation I.610, May 1995. [2] D. Gaiti and G. Pujolle, \Performance Management Issues in ATM Networks: Trac and Congestion Control", IEEE/ACM Trans. on Networking, vol. 4., no. 2, Apr 1996, pp.249-257. [3] T. M. Chen et al., \Monitoring and Control of ATM Networks Using Special Cells", IEEE Network, Sept/Oct 1996, pp. 28-38. 10

[4] C. Roppel, \Estimating Cell Transfer Delay in ATM Networks Using In-Service Monitoring Methods", Proc. Globecom '95, pp. 904-908. [5] B. Bensaou, T. C. Lam, H. W. Chu and H. K. Tsang, \Estimation of the Cell Loss Ratio in ATM Networks with a Fuzzy System and Application to Measurement-Based Call Admission Control", IEEE/ACM Trans. on Networking, vol. 5, no. 4, Aug 1997, pp. 572-584. [6] H. Zhu and V. S. Frost, \In-Service Monitoring for Cell Loss Quality of Service Violations in ATM Networks", it IEEE/ACM Trans. on Networking, vol. 4., no. 2, Apr 1996, pp.240-248. [7] N. Sato, K. Asatani, H. Murakami, R. Mallon, S. Hughes and T. L. Gra, \In-Service Monitoring Methods-Better Ways to Assure Quality of Digital Transmission", IEEE JSAC, vol. 12, no. 2, Feb 1994, pp. 355-360. 11