Pricing Network Resources for Adaptive Applications in a Differentiated Services Network

IEEE INFOCOM Prcng Network Resources for Adaptve Applcatons n a Dfferentated Servces Network Xn Wang and Hennng Schulzrnne Columba Unversty Emal: {xnwang, schulzrnne}@cs.columba.edu Abstract The Dfferentated Servces framework (DffServ) [] has been proposed to provde multple Qualty of Servce (QoS) classes over IP networks. A network supportng multple classes of servce also requres a dfferentated prcng structure. In ths work, we propose a prcng scheme n a DffServ envronment based on the cost of provdng dfferent levels of qualty of servce to dfferent classes, and on long-term demand. Prcng of network servces dynamcally based on the level of servce, usage, and congeston allows a more compettve prce to be offered, allows the network to be used more effcently, and provdes a natural and equtable ncentve for applcatons to adapt ther servce contract accordng to network condtons. We develop a DffServ smulaton framework to compare the performance of a network supportng congeston-senstve prcng and adaptve servce negotaton to that of a network wth a statc prcng polcy. Adaptve users adapt to prce changes by adjustng ther sendng rate or selectng a dfferent servce class. We also develop the demand behavor of adaptve users based on a physcally reasonable user utlty functon. Smulaton results show that a congeston-senstve prcng polcy coupled wth user rate adaptaton s able to control congeston and allow a servce class to meet ts performance assurances under large or bursty offered loads, even wthout explct admsson control. Users are able to mantan a stable expendture. Allowng users to mgrate between servce classes n response to prce ncreases further stablzes the ndvdual servce prces. When admsson control s enforced, congeston-senstve prcng stll provdes an advantage n terms of a much lower connecton blockng rate at hgh loads. I. INTRODUCTION The Dfferentated Servces framework (DffServ) [] has been proposed to provde multple Qualty of Servce (QoS) classes over IP networks. Two types of Per-Hop-Behavor (PHB) are proposed: Expedted Forwardng (EF) [] and Assured Forwardng (AF) [3]. The EF PHB s defned as a forwardng treatment where the departure rate of an aggregate s packets from any DffServ node must equal or exceed a confgurable rate. For AF servce, four classes wth three levels of drop precedence n each class are defned for general use. A network supportng multple classes of servce also requres a dfferentated prcng structure, rather than the flatfee prcng model adopted by vrtually all current Internet servces. Whle network tarff structures are often domnated by busness and marketng arguments rather than costs, we beleve t s worthwhle to understand and develop a cost-based prcng structure as a gude for actual prcng. In economcally Ths work was sponsored by NSF CAREER grant. vable models, the dfference n the charge between dfferent servce classes would presumably depend on the dfference n performance between the classes, and should take nto account the average (long-term) demand for each class. In general, the level of forwardng assurance of an IP packet n DffServ depends on the amount of resources allocated to a class the packet belongs to, the current load of the class, and n case of congeston wthn the class, the drop precedence of the packet. Also, when multple servces are avalable at dfferent prces, users should be able to demand partcular servces, sgnal the network to provson accordng to the requested qualty, and generate accountng and bllng records. One of the two man goals of our work s to develop a prcng scheme n a DffServ envronment based on the cost of provdng dfferent levels of qualty of servce to dfferent classes, and on long-term demand. DffServ supports servces whch nvolve a traffc contract or servce level agreement (SLA) between the user and the network. If the agreement, ncludng prce negotaton and resource allocaton are set statcally (before transmsson), prcng, resource allocaton and admsson control polces (f any) have to be conservatve to be able to meet QoS assurances n the presence of network traffc dynamcs. Prcng of network servces dynamcally based on the level of servce, usage, and congeston allows a more compettve prce to be offered, and allows the network to be used more effcently. Dfferentated and congeston-senstve prcng also provdes a natural and equtable ncentve for applcatons to adapt ther servce contract accordng to network condtons. A number of adaptaton schemes have been proposed for multmeda applcatons to dynamcally regulate the source bandwdth accordng to the exstng network condtons (a survey of ths work s gven n [4]). The second man goal of our work s to ntegrate our prcng scheme wth a dynamc prcng and servce negotaton envronment. In ths envronment, servce prces have a congestonsenstve component n addton to the long-term, relatvely statc prce. Some or all users are adaptaton-capable, and adapt to prce changes by adjustng ther sendng rate or selectng a dfferent servce class. Users wth strngent bandwdth and QoS requrements mantan a hgh qualty by payng more, whle adaptaton-ncapable applcatons use servces offerng a statc prce. We develop the demand behavor of adaptve users based on a physcally reasonable user utlty functon. In our smulatons, prces and servces are negotated through a Resource Negotaton and Prcng (RNAP) protocol

IEEE INFOCOM and archtecture, presented n earler work [5]. RNAP enables the user to select from avalable network servces wth dfferent QoS propertes and re-negotate contracted servces, and enables the network to dynamcally formulate servce prces and communcate current prces to the user. In RNAP, resource commtments are typcally made for short negotaton ntervals, nstead of ndefntely, and prces may vary for each nterval. Usng RNAP and an extended verson of an exstng Dff- Serv mplementaton, we develop a smulaton framework to compare the performance of a network supportng congestonsenstve prcng and adaptve servce negotaton to that of a network wth a statc prcng polcy. We also study the stablty of the dynamc prcng and servce negotaton mechansms. We evaluate the system performance and perceved beneft (or value-for-money) under the dynamc and statc systems. We also study the relatve effects on system performance of rate adaptaton, dynamc load balancng between servce classes and admsson. Although the smulaton framework s based on the RNAP model, we try to derve results and conclusons applcable to statc and congeston-drven, dynamc prcng schemes n general. Ths paper s organzed as follows. Secton II develops a physcally realstc user utlty functon to represent user demand behavor n response to prce changes. Secton III dscusses our proposed prcng model n detal. Secton IV summarzes our earler work on RNAP and partcularly how t supports network prcng. In secton V we descrbe our smulaton model, and n secton VI we dscuss smulaton results. We descrbe some related work n secton VII, and summarze our work n secton VIII. II. USER ADAPTATION In a network wth congeston dependent prcng and dynamc resource negotaton (through RNAP or some other sgnalng protocol), adaptve applcatons wth a budget constrant wll adjust ther servce requests n response to prce varatons. In ths secton, we dscuss how a set of user applcatons performng a gven task (for example, a vdeo conference) adapt ther sendng rate and qualty of servce requests to the network n response to changes n servce prces, so as to maxmze the beneft or utlty to the user, subject to the constrant of the user s budget. Although we focus on adaptve applcatons as the ones best suted to a dynamc prcng envronment, the RNAP framework does not requre adaptaton capablty. Applcatons may choose servces that provde a fxed prce and fxed servce parameters durng the duraton of servce. Generally, the longterm average cost for a fxed-prce servce wll be hgher, snce t uses network resources less effcently. Alternatvely, applcatons may use a servce wth usage-senstve prcng, and mantan a hgh QoS level, payng a hgher charge durng congeston. We consder a set of user applcatons, requred to perform ataskormsson. The user would lke to determne a set of transmsson parameters (sendng rate and QoS parameters) from whch t can derve the maxmum beneft, subject to hs budget. We assume that the user defnes quanttatvely, through a utlty functon, the perceved monetary value (say, 5 cents/mnute) provded by the set of transmsson parameters towards completng the msson. Consumers n the real world generally try to obtan the best possble value for the money they pay, subject to ther budget and mnmum qualty requrements; n other words, consumers may prefer lower qualty at a lower prce f they perceve ths as meetng ther requrements and offerng better value. Intutvely, ths seems to be a reasonable model n a network wth QoS support, where the user pays for the level of QoS he receves. In our case, the value for money obtaned by the user corresponds to the surplus between the utlty U( ) wth a partcular set of transmsson parameters (snce ths s the perceved value), and the cost of obtanng that servce. The goal of the adaptaton s to maxmze ths surplus, subject to the budget and the mnmum and maxmum QoS requrements. We now consder the smultaneous adaptaton of transmsson parameters of a set of n applcatons performng a sngle task. The transmsson bandwdth and QoS parameters for each applcaton are selected and adapted so as to maxmze the msson-wde value perceved by the user, as represented by the surplus of the total utlty, Û, over the total cost C. We can thnk of the adaptaton process as the allocaton and dynamc re-allocaton of a fnte amount of resources between the applcatons. In ths paper, we make the smplfyng assumpton that for each applcaton, a utlty functon can be defned as a functon only of the transmsson parameters of that applcaton, ndependent of the transmsson parameters of other applcatons. Snce we consder utlty to be equvalent to a certan monetary value, we can wrte the total utlty as the sum of ndvdual applcaton utltes : Û = [U (x (T spec,r spec)] () where x s the transmsson (T sepc ) and qualty of servce parameter (R spec ) tuple for the th applcaton. The optmzaton of surplus can be wrtten as max [U (x ) C (x )] s. t. C (x ) b, x mn x x max () where x mn and x max represent the mnmum and maxmum transmsson requrements for stream, C s the cost of the type of servce selected for stream at requested transmsson parameter x,andbs the budget of the user. In practce, the applcaton utlty s lkely to be measured by user experments and known at dscrete bandwdths, at one or a few levels of loss and delay, possbly correspondng to a subset of the avalable servces; at the current stage of research, some possble servces are guaranteed [6] and controlled-load servce [7] under the nt-serv model, Expedted Forwardng (EF) [] and Assured Forwardng (AF) [3] under dff-serv. In ths case, t s convenent to represent the utlty as a pecewse lnear functon of bandwdth (or a set of such functons). A smplfed algorthm s proposed n [8] to search for the optmal servce requests n such a framework.

IEEE INFOCOM 3 We can make some general assumptons about the utlty functon as a functon of the bandwdth (can be equvalent bandwdth [9]), at a fxed value of loss and delay. A user applcaton generally has a mnmum bandwdth requrement. It also assocates a certan mnmum value wth a task, whch may be regarded as an opportunty value, and ths s the perceved utlty when the applcaton receves just the mnmum requred bandwdth. The user termnates the applcaton f ts mnmum bandwdth requrement can not be fulflled, or when the prce charged s hgher than the opportunty value derved from keepng the connecton alve. Also, user experments reported n the lterature [][] suggest that utlty functons typcally follow a model of dmnshng returns to scale, that s, the margnal utlty as a functon of bandwdth dmnshes wth ncreasng bandwdth. Hence, a utlty functon can be represented n a general form as a functon of bandwdth as: U(x) =U + w log x (3) x m where x m represents the mnmum bandwdth the applcaton requres, w represents the senstvty of the utlty to bandwdth, and U s the monetary opportunty that the user perceves at the lowest QoS level. The utlty functon s also senstve to network transmsson parameters such as loss and delay. In our work, we rely on the expermental results n [] whch show that users perceved qualty for nteractve audo decreases almost lnearly wth ether delay or loss, wth a mnmum acceptable qualty requrement. More subjectve tests are needed for other applcaton types. Currently, we assume a smlar lnear dependence for all applcatons. Accordngly, we represent the utlty functon as: U(x) =U + w log x k d d k l l, for x x m, (4) x m where k d and k l represent respectvely the user s senstvty to delay and loss. In some cases, the user s perceved senstvty may depend on the bandwdth used. For example, tolerance to delay and loss wll be dfferent for dfferent speech codecs. Snce we are not assumng any partcular applcaton model, we assume users delay and loss senstvty are bandwdth ndependent n our smulatons. A user wth a hgher senstvty to delay or loss wll tend to select a hgher servce class rather than request more bandwdth. If the utltes of all the applcatons are represented n the format of equaton 4, the optmzaton process for a system wth multple applcatons can be represented as: max [U + w log x kdd kll p x ] x m s. t. p x b, x x m,, d D, l L (5) where p s the prce of the servce class selected by the applcaton, D and L are respectvely the loss and delay bound of an applcaton, above whch the applcaton no longer functons usefully. It s possble to represent the above optmzaton problem as a Lagrangan and solve t. However, we assume the avalablty of only a few dfferent loss and delay levels correspondng to dfferent servce classes, and accordngly use a more heurstc method. The optmzaton nvolves assgnng a servce class and a bandwdth to each applcaton. For a partcular assgnment of servce classes to applcatons, f the user can obtan the optmal bandwdth dstrbuton accordng to equaton 5 at a cost below hs budget, then the bandwdth allocaton that maxmzes the perceved surplus for an applcaton can be showntobe: x = w p (6) Hence, w represents the money a user would spend based on ts perceved value for an applcaton.the above bandwdth dstrbuton s consdered for all possble servce class assgnments (constraned by applcaton requrements and budget), and the one gvng the hghest total surplus s used. If there s no set of servce class assgnments for whch the optmal dstrbuton of equaton 6 can be obtaned at a cost below the budget, the total budget s frst dstrbuted to the component applcatons accordng to ther relatve bandwdth senstvty w. That s, each applcaton receves a budget share b such that b = b w (7) k wk Each applcaton s then allocated a servce and bandwdth x = b p whch maxmzes ts ndvdual surplus accordng to equaton 4. The dscusson so far assumes that each prce p s per unt average bandwdth. A prce based on unt equvalent bandwdth [3] may be farer snce t takes nto account the burstness of user traffc. In ths case, the user adaptaton of the source rate s more complcated. If effectve bandwdth s used, a user could calculate a new average bandwdth when the prce ncreases. Alternatvely, t could ntroduce addtonal bufferng at the source to reduce ts burstness, at the cost of a hgher delay, thus reducng the effectve bandwdth. III. PRICING STRATEGIES A few prcng schemes are wdely used n the Internet today [4]: access-rate-dependent charge (AC), volume-dependent charge (V), or the combnaton of the both (AC-V). An AC chargng scheme s usually one of two types: allowng unlmted use, or allowng lmted duraton of connecton, and chargng a per-hour fee for addtonal connecton tme. Smlarly, AC-V chargng schemes normally allow some amount of volume to be transmtted for a fxed access fee, and then mpose a per-volume charge. Although tme-of-day dependent chargng s commonly used n telephone networks, t s not generally used n the current Internet. User experments [5] ndcate that usage-based prcng s a far way to charge people and allocate network resources. Both connecton tme and the transmtted volume reflect the usage of the network. Chargng based on connect-tme only works when resource demands per tme unt are roughly unform. Snce ths s not the case for

IEEE INFOCOM 4 Internet applcatons and across the range of access speeds, we only consder volume-based chargng. In ths paper, we study two knds of volume-based prcng: a fxed-prce (FP) polcy wth a fxed unt volume prce, and a congeston-prce-based adaptve servce (CPA) n whch the unt volume prce has a congeston-senstve component. In the fxed prce model, the network charges the user per volume of data transmtted, ndependent of the congeston state of the network. The per-byte charge can be the same for all servce classes ( flat, FP-FL), depend on the servce class (FP-PR), depend on the tme of day (FP-T) or a combnaton of tmeof-day and servce class (FP-PR-T). If the prce does not depend on the congeston condtons n the network, customers wth less bandwdth-senstve applcatons have no motvaton to reduce ther traffc as network congeston ncreases. As a result, ether the servce request blockng rate wll ncrease at the call admsson control level, or the packet delay and droppng rate wll ncrease at the queue management level. Havng a congeston-dependent component n the servce prce provdes a monetary ncentve for adaptve applcatons to adapt ther servce class and/or sendng rates accordng to network condtons. In perods of resource scarcty, qualty senstve applcatons can mantan ther resource levels by payng more, and relatvely qualtynsenstve applcatons wll reduce ther sendng rates or change to a lower class of servce. The total prce conssts of a congeston-dependent component and a fxed volumebased charge. The fxed volume-based charge has the same 4 chargng modes as n FP, gvng the prcng models CP-FL, CP-PR, CP-T, CP-PR-T. A. Proposed Prcng Scheme We assume that routers support multple servce classes and that each router s parttoned to provde a separate lnk bandwdth and buffer space for each servce, at each port. We use the framework of the compettve market model [6]. The compettve market model defnes two knds of agents: consumers and producers. Consumers seek resources from producers, and producers create or own the resources. The exchange rate of a resource s called ts prce. The routers are consdered the producers and own the lnk bandwdth and buffer space for each output port. The flows (ndvdual flows or aggregate of flows) are consdered consumers who consume resources. The congeston-dependent component of the servce prce s computed perodcally, wth a prce computaton nterval τ. The total demand for lnk bandwdth s based on the aggregate bandwdth reserved on the lnk for a prce computaton nterval, and the total demand for the buffer space at an output port s the average buffer occupancy durng the nterval. The supply bandwdth and buffer space need not be equal to the nstalled capacty; nstead, they are the targeted bandwdth and buffer space utlzaton. The congeston prce wll be leved once demands exceeds a provder-set fracton of the avalable bandwdth or buffer space. We now dscuss the formulaton of the fxed charge, whch we decompose nto holdng charge and usage charge, and the formulaton of the congeston charge. ) Holdng Charge: A servce may enforce admsson control to ensure some level of performance. In ths case, the applcatons admtted nto the network wll mpose some potental cost by deprvng other applcatons the opportunty to be admtted. Hence, t s far to charge the admtted applcatons a holdng prce. The holdng charge can be calculated based on the followng consderaton. If a partcular flow or flow-aggregate does not utlze the resources (buffer space or bandwdth) set asde for t, we assume that the scheduler allows the resources to be used by excess traffc from a lower level of servce. The holdng charge reflects revenue lost by the provder because nstead of sellng the allotted resources at the usage charge of the gven servce level (f all of the reserved resources were consumed) t sells the reserved resources at the usage charge of a lower servce level. The holdng prce (p j h ) of a servce class j s therefore set to be proportonal to the dfference between the usage prce for that class and the usage prce for the next lower servce class. The holdng prce can be represented as: p j h = αj (p j u p j u ), (8) where α j s a scalng factor related to servce class j. The holdng charge c j h (n) when the customer reserves a bandwdth r j (n) from class j s gven by: c j h (n) =pj h rj (n)τ j (9) where τ j s the negotaton perod for class j. r j (n) can be a bandwdth requrement specfed explctly by the customer, or estmated from the traffc specfcaton and servce request of the customer. ) Usage Charge: The usage charge s determned by the actual resources consumed, the average user demand, the level of servce guaranteed to the user, and the elastcty of the traffc. The usage prce (p u ) wll be set such that t allows a retal network to recover the cost of the purchase from the wholesale market, and varous fxed costs assocated wth the servce. In a network supportng multple classes of servce, the dfference n the charge between dfferent servce classes would presumably depend on the dfference n performance between the classes. The model we consder s a network supportng J classes of servces, the servce prce for class j s p j u, the long tme user bandwdth demand s known (e.g., through statstcs) and can be represented as x j (p u,p u,..., p J u), and the cost of havng capacty C durng one unt of tme s f(c). The provder s decson problem s to choose the optmal prces for each class that optmze ts proft: J max[ x j (p u,p u,..., p J u)p j u f(c)], p j u j subject to: r(x j (p u,p u,..., p J u)) R, j J () where r represents the bandwdth requrement for all classes, and R s the total bandwdth avalablty of the network. Assumng users choose servce classes ndependently, the total demand for a class over a long enough tme perod depends only on the prce for that class. If we assume the users have the utlty functons of Secton II, the total demand of servce class j can be represented as a constant elastcty model:

IEEE INFOCOM 5 x j (p j u ) = Aj /p j u, whch vares nversely wth the prce of the servce class. A j reflects the total wllngness to pay of users belongng to servce class j. Servce prcng for dfferentated servce DffServ supports SLA negotaton between the user and the network. An SLA generally ncludes traffc parameters, whch descrbe the user s traffc profle, and performance parameters, whch characterze the level of performance that the network promses to provde to the conformng part of the user s traffc. A wdely used descrptor for a user s traffc profle conssts of a peak rate, a sustanable rate, and a maxmum burst tolerance. The generally consdered QoS parameters are delay and loss. Mechansms, such as weghted far queung (WFQ) and class based queung (CBQ) can be used to provson resources for dfferent servce classes. In general, a class wth lower load leads to lower delay expectaton. A hgher level of servce class s expected to have a lower average load, and hence lower average delay. If we do not consder the dfference n element costs for dfferent classes, chargng servces proportonal to ther ndvdual expected load seems to reasonably reflect the cost of provdng the servces and the dfferences between ther performance. Assumng that unt bandwdth of a servce class would be charged a basc rate p basc f all ts bandwdth were used, and the expected load rato of servce class j s ρ j,the unt bandwdth prce for servce class j can then be estmated as p j u = p basc/ρ j. The effectve bandwdth consumpton of an applcaton wth rate x j can be represented as x j /ρ j. For constant elastcty demand, x j (p j u ) = Aj /p j u, and the effectve bandwdth consumpton s A j /(p j u ρj ). Then the prce optmzaton problem of equaton can be wrtten as: max[ p j u J j A j p j u p j u f(c)], subject to: p j u = p basc ρ j, J j A j p j uρ j C () The Lagrangan for the problem can be represented as: J J max [ A j j + λ(c Aj ) f(c)] () p basc p basc j The optmal soluton s: J j p basc = Aj C, pj u = p J basc j = Aj (3) ρ j Cρ j The bandwdth provsoned for each servce class wll be gven by A j /p basc, and s hence proportonal to total user wllngness to pay for that class. The usage charge c j u (n) for class j over a perod n n whch v j (n) bytes were transmtted s gven by: c j u (n) =p j uv j (n) (4) 3) Congeston Charge: A smple usage-based chargng scheme montors the data volume transmtted and n prncple charges users based on ther average rate. Chargng accordng to the mean rate, though encouragng the user to use network bandwdth more effcently, does not dscourage users from selectng large traffc contracts and sendng the worst-case traffc allowed by ther contract, whch create problems for network traffc management. An approprate prcng scheme should provde users the ncentves to select traffc contracts that reflect ther actual needs. Effectve bandwdth [9][7] and prcng based on effectve bandwdth [3] have been proposed n a multple-servce-class envronment. However, effectve bandwdth normally accounts for the worst case traffc subject to the traffc profle of the SLA. The contract for typcal users has an effectve bandwdth much larger than the mean rate. Provsonng based on equvalent bandwdth s not economcally effcent n a DffServ envronment. Performance guarantees n DffServ are qualtatve and can be very loose. Ths may make t dffcult to evaluate the equvalent bandwdth. Also, DffServ does not allocate resources to applcatons based on ther effectve bandwdth. Therefore, t appears unfar to charge users based on ther profle declaraton only, though the charge should take the profle nto account. To encourage users to reduce ther resource requrements under network resource contenton, we propose an addtonal congestonsenstve prce component under these condtons. The general network resources consdered are bandwdth and buffer space. Two knds of congeston prcng can be consdered: prcng when the expected load bound s exceeded, or prcng when buffer occupancy reaches certan level. In the frst case, when the average demand for a certan class exceeds a threshold, an addtonal congeston prce s charged all users of that class. In the case of prorty droppng for AF class, the droppng precedence s only consdered when the buffer occupancy reaches dfferent thresholds. The same thresholds can be assocated wth dfferent congeston or buffer prces. When each threshold s reached, user packets wth the correspondng precedence level begn to be dropped wth a certan probablty, and users wth hgher precedence levels are charged the addtonal buffer prce. Therefore, the hgher precedence users pay the sum of buffer prces correspondng to all the exceeded thresholds. Durng congeston, lower precedence users wll suffer lost packets, or reduce ther rate, or smoothen ther traffc at the source (at the cost of hgher delay due to bufferng), or change to a hgher precedence and pay a hgher prce. Both knds of congeston prce for a servce class can be calculated as an teratve tâtonnement process [6]: p j c(n) =mn[{p j c(n ) + σ j (D j,s j )(D j S j )/S j, } +,p j max] (5) where D j and S j represent the current total demand and supply respectvely, and σ j s a factor used to adjust the convergence rate. σ j maybeafunctonofd j and S j ;nthat case, t would be hgher when congeston s severe. D j and S j wll be dfferent for bandwdth and buffer space congeston. The router begns to apply the congeston charge only when the total demand exceeds the supply. Even after the congeston s removed, a non-zero, but gradually decreasng congeston charge s appled untl t falls to zero to protect aganst further congeston. In our smulatons, we also used a prce adjustment threshold parameter θ j to lmt the frequency wth whch the prce s updated. The congeston prce s updated f the calculated prce ncrement exceeds θ j p j c (n ). The maxmum

IEEE INFOCOM 6 congeston prce s bounded by the p j max. When a servce class needs admsson control, all new arrvals are rejected when the prce reaches p max j.ifp j c reaches p j max frequently, t ndcates that more resources are needed for the correspondng servce, or usage prce for a class needs to be adjusted to reflect the new demand statstcs. For a perod n, the total congeston charge s gven by c j c (n) =p j c(n)v j (n). (6) Based on the prce formulaton strategy descrbed above, a router arrves at a cost structure for a partcular RNAP flow or flow-aggregate at the end of each prce update nterval. The total charge for a sesson s gven by c j s = N [p j h rj (n)τ j +(p j u + p j c(n))v j (n)] (7) n= where N s the total number of ntervals spanned by a sesson. In some cases, the network may set the usage charge to zero, mposng a holdng charge for reservng resources only, and/or a congeston charge durng resource contenton. Also, the holdng charge would be set to zero for servces wthout explct resource reservaton or admsson control, for example, best effort servce. Snce the re-negotaton of network servces wll generally be drven by prce changes, the stablty of the negotaton process s dscussed n related work wth a greater focus on prcng [8]. IV. RESOURCE NEGOTIATION THROUGH RNAP The prcng algorthms and adaptaton framework presented n ths paper do not depend on any partcular network archtecture or protocol. However n ths paper, we smulated our results n an envronment supportng dynamc servce negotaton through the Resource Negotaton and Prcng protocol (RNAP) [5][8], usng a centralzed (RNAP-C) network management archtecture. We frst brefly revew the RNAP framework, and then descrbe the prcng and charge formulaton process used. In the RNAP framework, we assume that the network makes servces wth certan QoS characterstcs avalable to user applcatons, and charges prces for these servces that, n general, vary wth the avalablty of network resources. Network resources are obtaned by user applcatons through negotaton between the Host Resource Negotator (HRN) on the user sde, and a Network Resource Negotator (NRN) actng on behalf of the network. The HRN negotates on behalf of one or multple applcatons belongng to a multmeda system. In an RNAP sesson, the NRN perodcally provdes the HRN updated prces for a set of servces. Based on ths nformaton and current applcaton requrements, the HRN determnes the current optmal transmsson bandwdth and servce parameters for each applcaton. It re-negotates the contracted servces by sendng a Reserve message to the NRN, and recevng a Commt message as confrmaton or denal. The HRN only nteracts wth the local NRN. If ts applcaton flows traverse multple domans, resource negotatons are extended from end to end by passng RNAP messages hop-by-hop from the frst-hop NRN untl the destnaton COPS messages Table Table Doman Routng Table Resource Table B R R R R B Dest Next Hop Next Hop Next Hop (C, BW, Q, P) (C, BW, Q, P) (C, BW, Q, P) B R R B B, 3, 3, B3 R,, 3, B4 R, 3, 3, Step: determne a path (Table ) Step: accumulate prce along the path (Table ) Step 3: send total prce ($4/Mb) Fg.. B B3 R NRN B4 Prce formulaton n RNAP-C R B C: Servce class BW: average bandwdth (Mb) Q:average queue length P: prce ($/Mb) network NRN, and vce versa. End-to-end prces and charges are computed by accumulatng local prces and charges as Quotaton and Commt messages travel hop-by-hop upstream towards the HRN. The NRN mantans local state nformaton for a doman for chargng and other purposes. It makes the admsson decson and decdes the prce for a servce, based on the servce specfcatons alone, or by also takng nto account routng and confguraton polces, and network load. In the latter case, the NRN sts at a router that belongs to a lnk-state routng doman (for example an OSPF area) and has an dentcal lnk state database as other routers n the doman. Ths allows t to calculate all the routng tables of all other routers n the doman usng Djkstra s algorthm. The NRN mantans a doman routng table whch fnds any flow route that ether ends n ts own doman, or uses ts doman as a transt doman (Fg. ). The doman routng table wll be updated whenever the lnk state database s changed. A NRN also mantans a resource table, whch allows t to keep track of the avalablty and dynamc usage of the resources (bandwdth, buffer space). In general, the resource table stores resource nformaton for each servce provded at a router. The resource table allows the NRN to compute a local prce at each router (for nstance, usng the usage-based prcng strategy descrbed n Secton III). For a partcular servce request, the NRN frst looks up the path on whch resources are requested usng the doman routng table, and then uses the per-router prces to compute the accumulated prce along ths path. The resource table also facltates montorng and provsonng of resources at the routers. To enable the NRN to collect resource nformaton, routers n the doman perodcally report local state nformaton (for nstance, average buffer occupancy and bandwdth utlzaton) to the NRN. In ths paper, we extend COPS [9] for ths purpose. To compute the charge for a flow, ngress routers mantan per-flow (or aggregated flow from neghborng doman) state nformaton about the data volume transmtted durng a negotaton perod. Ths nformaton s perodcally transmtted to the NRN, allowng the NRN to compute the charge for the perod. The NRN uses the computed prce and charge to mantan chargng state nformaton for each RNAP sesson. A network doman manages ts own prcng scheme (whch

IEEE INFOCOM 7 may be congeston senstve or statc) ndependent of other domans, and wll have ts own per unt resource costs for each class. When an user flow traverses multple domans, RNAP messagng collates prcng and bllng nformaton from each doman and determne the total prce/charge for the user. For reducng the overhead due to per-flow RNAP message processng and storage, we consder a snk-tree based aggregaton scheme n [8]. The RNAP messages and state nformaton are aggregated n the core networks, allowng data measurement and chargng to be at much larger granularty. V. SIMULATION MODEL In ths secton, we descrbe our smulaton model for the CPA and FP polces. We smulate a sngle DffServ servce doman, under whch resources are not explctly reserved for each flow. We smulate the servce performance wth or wthout admsson control from the doman. User resource requrements are declared explctly through RNAP, allowng admsson control to be enforced f requred n an experment. The ndvdual and total user resource demands are also obtaned through measurement. Prce and network statstcs are sgnaled to users through RNAP. We used the network smulator [] envronment to smulate two network topologes, shown n Fg. and Fg. 3. Topology contans two backbone nodes, sx access nodes, and twentyfour end nodes. Topology two contans fve backbone nodes, ffteen access nodes, and sxty end nodes. Topology two was also used n []. All lnks are full duplex and pont-to-pont. The lnks connectng the backbone nodes are 3 Mb/s, the lnks connectng the access nodes to the backbone nodes are Mb/s, and the lnks connectng the end nodes to the access nodes are Mb/s. At each end node, there s a fxed number N s of sendng users. We use topology n most of our smulatons to allow congeston to be smulated at a sngle bottleneck node, and use topology to llustrate the CPA performance under a more general network topology [8]. We modfed the DffServ module developed by Sean Murphy to support dynamc SLA negotaton, and montor the user traffc at ngress pont. A Weghted-Round-Robn scheduler s modeled at each node, wth weghts dstrbuted equally among EF, AF, and Best Effort (BE) classes. Although the DffServ proposals menton 4 AF classes wth three levels of drop precedence n each, we only smulated one AF class to make the smulatons less resource-ntensve, snce ths does not affect the general results n any way. Three dfferent buffer management algorthms are used for dfferent DffServ classes - tal-droppng for EF, RED-wth-In-Out [] for AF, and Random Early Detecton [3] for the BE traffc. The default queue length for EF, AF and BE are set respectvely to 5,, packets. Other parameters are set to the default values n the network smulator mplementaton. A combnaton of exponental on-off and Pareto on-off traffc sources are used n the smulaton. Unless otherwse specfed, the traffc conssts of 5% of each for all the servce classes, and the on tme and off tme are both set to.5 seconds. The shape parameter for Pareto sources s set to.5. The mean packet sze s set to bytes. The traffc Senders Mb/s A A A3 B Mb/s 3 Mb/s Fg.. Smulaton network topology Mb/s A A A3 Mb/s B B B A5 A4 A3 3 Mb/s B5 A4 A5 A6 A7 A8 A9 Fg. 3. Smulaton network topology B4 B3 A6 A5 A4 Recevers condtoners are confgured wth one profle for each traffc source, wth peak rate and bucket sze set to the n-off source peak rate and maxmum amount of traffc sent durng an on perod respectvely for both EF and AF classes. We also characterze the system load by burst ndex and offered load. The burst ndex s defned as OffTme/(OnTme +OffTme)for both types of On-Off sources. The offered load for a servce class s defned as the rato between the total user resource requrement for a servce type, and the confgured class capacty at the bottleneck. Under the FP polcy, the total user resource requrement s also the actual resource demand from all the users. Under the CPA polcy, the total user resource requrement s what the total resource demand would be f there were no resource contenton at the bottleneck and the network dd not mpose an addtonal congestondependent prce. User requests are generated accordng to a Posson arrval process and the lfetme of each flow s exponentally dstrbuted wth an average length of mnutes. In topology, users from the sender sde ndependently ntalze undrectonal flows towards randomly selected recever sde end nodes. N s flows wll be ntalzed at one node. At most N s flows (6 sessons wth N s set to 5) can run smultaneously n the whole network. In topology, all the users ntalze undrectonal flows towards randomly selected end nodes. At most 6N s users (36 sessons wth N s set to 6) are allowed to run smultaneously n the whole network. For ease of understandng, all prces n ths secton are gven n terms of prce per mnute of a 64 kb/s transmsson, currently equvalent to a telephone call. The basc prce charged by the FP polcy, and the basc usage prce charged by CPA (p basc ), are both set to $.8/mn. We set the target average load of the EF class at 4%, the AF class at 6%, and the BE class at 9%. Therefore, based on the prcng strategy proposed n Secton III, the usage prce for EF, AF and BE classes are set respectvely as $./mn, $.3/mn, and $.89/mn. When admsson control s enforced, the holdng A A A

IEEE INFOCOM 8 prce for the CPA polcy s correspondngly set to $.67/mn for EF class, and $.44/mn for AF class. Congeston prcng s appled when nstantaneous usage exceeds the target load threshold of each class or when the loss or delay exceeds /3 of the bounds at a node assocated wth the class (delay bound of ms, 5 ms, and ms respectvely for EF, AF, and BE, and loss bounds of 6, 4 and respectvely). The prce adjustment procedure s also controlled by a par of parameters, the prce adjustment step σ from equaton 5 and the prce adjustment threshold parameter θ, defned n Secton III. Unless otherwse specfed, values of σ =.6 and θ =.5 are used. The users are assumed to have the general form of the utlty functon shown n Secton II. At the begnnng of each experment, the user populaton s dvded nto users of the EF, AF and BE classes, although n some experments they are allowed to adapt to prce changes by swtchng to a dfferent class. For EF users, the elastcty factor factor w (whch s also the user s wllngness to pay), s unformly dstrbuted between $.3/mn and $.4/mn for a 64 kb/s bandwdth. For AF and BE users, t s unformly dstrbuted between $.9/mn and $.6/mn, and $.6/mn and $.8/mn respectvely. The mnmum delay and loss requrements for each type of users are set to be the same as the expected performance bound of the correspondng servce class. The opportunty cost parameter U s set to the amount a user s wllng to pay for ts mnmum bandwdth requrement, and s hence gven by U = p hgh x mn,wherep hgh s the maxmum prce the user wll pay before termnatng hs connecton altogether. Users re-negotate ther resource requrements wth a perod of 3 seconds n all the experments. The total smulaton tme for each experment s, seconds. We use a number of engneerng and economc metrcs to evaluate our experments. The engneerng metrcs nclude the average traffc arrval rate at the bottleneck, the average packet delay, the average packet loss rate, and the user request blockng probablty. The averages are computed as exponentally weghted movng averages. The economc performance metrcs nclude the average user beneft (the perceved value obtaned by users based on ther utlty functons), the end-toend prce for each servce class. VI. RESULTS AND DISCUSSION In ths secton, we smulate the FP polcy and CPA polcy under dentcal traffc condtons, and compare the relatve performance. For ease of presentaton, a sngle traffc parameter for the AF class was vared n each experment, and ts effect on CPA and FP polcy performance was studed. We conducted four groups of experments. In the frst and second groups, we vary the load burstness and average load respectvely of the AF class, and evaluate the mprovements gven by CPA over FP. In the thrd experment, ncentve drven traffc mgraton between classes s shown to mprove the overall system performance. In the last experment, we show that access control to a servce class s crtcal n mantanng expected performance (a) (c) (e) Bottleneck traffc arrval rate.9.8.7.6.5.4.3.. Average packet delay (second) average prce prce devaton...3.4.5.6.7.8.9 Burst ndex.9.8.7.6.5.4.3 4.5 4 3.5 3.5.5.5 5 x 3...3.4.5.6.7.8.9 AF burst ndex....3.4.5.6.7.8.9 AF burst ndex (b) (d) (f) Average packet loss rate Average user beneft ($/mn).9.8.7.6.5.4.3.. burst ndex:.4 burst ndex:.6 burst ndex:.8..4.6.8..4.6.8 tme (s) x 4 x 3 8 6 4...3.4.5.6.7.8.9 AF burst ndex 4 6 8 4 6 8...3.4.5.6.7.8.9 AF burst ndex Fg. 4. System dynamcs under CPA wth ncrease n AF traffc burst ndex: (a) prce average and standard devaton of AF class; (b) varaton over tme of AF. Performance metrcs of CPA and FP polces as a functon of burst ndex of AF class: (c) average packet delay; (d) average packet loss; (e) average traffc arrval rate; (f) average user beneft. levels. Combnng access control wth user servce adaptaton effectvely reduces the request blockng rate. A. Effect of Traffc Burstness We frst compare the performance of FP and CPA polces as the burst ndex of AF class ncreases, at a constant average offered load of 6%. Fg. 4 (a) shows that the average AF prce ncreases under CPA due to the ncreasng congeston prce as the burst ndex exceeds.4. In response, the AF traffc backs off. Fg. 4 (a) also shows that the standard devaton n the AF prce ncreases wth the burst ndex, ndcatng greater fluctuatons n the prce. Fg. 4 (b) shows the dynamc varaton of the AF class prce at three dfferent levels of burstness, confrmng ths trend. Fg. 4 (c) and (d) show that under FP polcy the average packet delay and loss of the AF class ncrease sharply as the burst ndex exceeds.4. As a result of the user traffc back-off under CPA the delay and loss of AF class are well controlled below the respectve performance bounds of 5 ms and 4 up to a burst ndex of.8. The average user beneft for CPA (Fg. 4 f) decreases due to the reducton of bandwdth, but remans hgher than that of the FP polcy. There s also a smaller degradaton n the performance of the BE class at hgh burst ndces. Ths appears to be because the BE class operates under a relatvely hgh load, and therefore borrows bandwdth from the AF class when the AF class s lghtly loaded. It can no longer do so when the AF traffc burstness ncreases.

IEEE INFOCOM 9 (a).9.8.7.6.5.4.3.. average prce prce devaton.5.6.7.8.9...3.4.5 (b).9.8.7.6.5.4.3.. offered load:.8 offered load:. offered load:...4.6.8..4.6.8 tme (s) x 4 (a).9.8.7.6.5.4.3.. offered load:.8 offered load:. offered load:...4.6.8..4.6.8 tme (s) x 4 (b) Rato of AF traffc mgratng between classes.9.8.7.6.5.4.3.. EF AF BE.5.6.7.8.9...3.4.5 Average packet delay (second).5..5 Average packet loss.4.35.3.5..5..5 Average packet delay (second) 9 x 3 8 7 6 5 4 3 Average packet loss rate x 3 5 5 (c) (e) Bottleneck traffc arrval rate.5.6.7.8.9...3.4.5.6.4..8.6.4..5.6.7.8.9...3.4.5 (d) (f) Average user beneft ($/mn).5.5.6.7.8.9...3.4.5 3 4 5.5.6.7.8.9...3.4.5 Fg. 5. System dynamcs under CPA wth ncrease n : (a) average and standard devaton of AF class prce; (b) varaton over tme of AF class prce. Performance metrcs of CPA and FP polces as a functon of : (c) average packet delay; (d) average packet loss; (e) average bottleneck traffc arrval rate; (f) average user beneft. The results n ths secton ndcate that the CPA polcy takes advantage of applcaton adaptvty for sgnfcant gans n network performance, and perceved user beneft, relatve to the fxed-prce polcy. The congeston-based prcng s stable and effectve. B. Effect of Traffc Load In ths smulaton, we keep the load and burstness of EF class and BE class and the burst ndex of the AF class at ther default values, and vary the offered load of AF class. The average AF prce under CPA s seen to ncrease wth offered load (Fg. 5 (a)). The standard devaton of the prce shows an ncrease to a certan level and then a decrease. Intally, the prce devaton ncreases due to the more aggressve congeston control. At heavy loads, the ncreased multplexng of user demand smooths the total demand, and therefore reduces fluctuatons n the prce. Fg. 5 (e) shows that the actual arrval rate of AF under CPA backs off as users adapt to the hgher prce. Fgs. 5 (c) and (d) show that the delay and loss of AF class under FP quckly ncreases after the offered load ncreases above.6 and approaches the provsoned capacty. As a result, the performance bounds for AF class can no longer be met. The hgh AF load also degrades BE performance. Ths s apparently because BE operates at a hgh load (.9) and tends to borrow bandwdth from AF and EF when the latter classes are lghtly loaded. Fgs. 5 (c), (d), and (e) show that CPA coupled wth user (c).5.6.7.8.9...3.4.5 (d) 5.5.6.7.8.9...3.4.5 Fg. 6. Performance metrcs of CPA and FP polces wth traffc mgraton between classes: (a) varaton over tme of AF class prce; (b) rato of AF class traffc mgratng through class re-selecton; (c) average packet delay of all classes; (d) average packet loss of all classes; adaptaton s able to control congeston and mantan the total traffc load of a servce class at the targeted level, and hence allows the servce class to meet the expected performance bounds. Smlar to our observaton n Secton VI-A, f the nomnal prce of the system correctly reflect long-term user demand, dynamc prcng drven servce re-negotaton can effectvely lmts short-term fluctuatons n load. Usage prce of a class should be adjusted f persstent hgh user demand exst for a servce. C. Load Balance between Classes As seen from the prevous secton, the performance of a class wll suffer f the load nto that class s too hgh. In general, a user under CPA polcy wll select a servce class whch provdes t the hghest beneft based on the prce and performance parameters of a class as announced by the provders. The performance parameters are generally based on long-term statstcs. In ths secton, we assume that a user can learn from network performance data receved over a short perod, and select the class that would provde the hghest beneft based on the user utlty functon, network performance statstcs and servce prce, as dscussed n Secton II. In ths smulaton, the EF and BE classes are loaded at 3% and 8% respectvely. When the load of AF class ncreases, the performance of AF class degrades and congeston prce s nvoked. In response, some applcatons swtch from the AF class to the EF class, whch provdes better performance guarantee, or BE class, whch allows t more bandwdth at a cheaper prce. As the result of ths re-selecton, the load s better balanced across classes, and overall performance of the system mproves (Fg. 6 (c) and (d)). Fg. 6 (a) shows that wth load balancng n combnaton wth adaptaton wthn a sngle class, the congeston prce needs to be nvoked much less often than wth adaptaton wthn a class only, as n Fg. 5 (b). The proporton of mgratng traffc s shown n Fg. 6 (b). We see even when a small porton of users select other

IEEE INFOCOM (a) (c) Average packet delay (second).9.8.7.6.5.4.3.. AF average prce prce devaton.5.6.7.8.9...3.4.5 4 x 4 3.5 3.5.5.5.6.7.8.9...3.4 (b) (d) Request blockng rate Average packet loss.9.8.7.6.5.4.3...5.6.7.8.9...3.4 4 x 5 3.5 3.5.5.5.5.5.6.7.8.9...3.4 Fg. 7. System dynamcs under CPA wth access control CPA as AF offered load ncreases: (a) average and standard devaton of AF class prce. Performance metrcs of CPA and FP polces wth access control as a functon of : (b) user requests blockng rate; (c) average packet delay; (d) average packet loss. servce classes, the performance of the over-loaded class s greatly mproved. D. Effect of Admsson Control We have seen that the performance of a class can not be expected wthout any access control. In ths secton, we compare the performance of FP and CPA for a network wth admsson control for EF and AF class. The admsson threshold for each class s set to.5 tmes the target load to ncrease the effcency of the network. Wth admsson control, the performance of EF and AF classes are well controlled (Fg. 7 c and d). However, due to the burstness of the traffc, the blockng rate under FP s hgh even at a very small offered load (Fg. 7 b), and ncreases almost lnearly as the offered load ncreases beyond.6. Wth congeston control and servce contract re-negotaton, the blockng rate of CPA s seen to be up to 3 tmes smaller than that under the FP polcy, and actually starts to decrease after reachng a maxmum at offered load.8. Ths s because the prce adjustment step s proportonal to the excess bandwdth above the targeted utlzaton and ncreases progressvely faster wth offered load at hgher loads, and the user bandwdth request decreases proportonally wth the prce accordng to the general utlty functon of Secton II. Compared to Secton VI-B, the average prce under CPA (Fg. 7 a) s bounded to a smaller value at hgh offered loads, and has a smaller fluctuaton. The results ndcate that access control s mportant n mantanng the expected performance of a class. However, admsson control by tself may lead to a hgh blockng rate due to the network dynamcs. By combnng admsson control wth user traffc adaptaton, the network s more effcently used. Wth admsson control, the dynamcs of the network prce can also be better controlled, so that users have a more relable expectaton of the prce. VII. RELATED WORK Mcroeconomc prncples have been appled to varous network traffc management problems. The studes n [4][5][6] are based on a maxmzaton process to determne the optmal resource allocaton such that the utlty (a functon that maps a resource amount to a satsfacton level) of a group of users s maxmzed. In [7][8][6][9], the resources are prced to reflect demand and supply. Some of these methods are lmted by ther relance on a well-defned statstcal model of source traffc, and are generally not ntended to adapt to changng traffc demands. The study n [7] shows that compare to tradtonal flat prcng, servce-class senstve prcng results n hgher network performance. Prcng for DffServ has also been studed n [3] through equvalent bandwdth. As has been ponted out earler, equvalent bandwdth may be too conservatve for resource provsonng n a DffServ envronment, and hence prcng based on equvalent bandwdth may not be far to the users. Also, t s not trval for users to adapt ther requrements dynamcally to meet ther equvalent bandwdth constrants. Although there s some overlap between the cted work and ours, our work s drected to studyng and solvng somewhat dfferent problems - developng a prcng model for DffServ, and studyng DffServ performance n a dynamc servce and prce negotaton envronment. VIII. SUMMARY In ths work, we have developed a reasonably complete DffServ prcng model. We have proposed a prce structure for dfferent servce classes n DffServ based on ther relatve performance, long-term demand, and short-term fluctuatons n demand. We have ntegrated ths prcng model nto a dynamc servce negotaton envronment n whch servce prces ncrease n response to congeston, and users adapt to prce ncreases by adaptng ther sendng rate and/or choce of servce. We have also modeled the demand behavor of adaptve users based on a physcally reasonable user utlty functon. Our smulaton results show that the dfferent DffServ classes provde dfferent levels of servce only when they operate at dfferent target utlzaton. In the absence of explct admsson control, a servce class loaded beyond ts target utlzaton (under ether sustaned or bursty loads) no longer meets ts expected performance levels. Under these condtons, a congeston-senstve prcng polcy (CPA) coupled wth user rate adaptaton s able to control congeston and allow a servce class to meet ts performance assurances under large or bursty offered loads. Users see a reasonably stable servce prce and are able to mantan a very stable expendture. Allowng users to mgrate between servce classes n response to prce ncrease and network performance further stablzes the ndvdual servce prces whle mantanng the system performance. When admsson control s enforced beyond a threshold load for each class, performance bounds can be met wth a fxed servce prce. However, n ths case, the CPA polcy