Audio Engineering Society. Convention Paper. Presented at the 119th Convention 2005 October 7 10 New York, New York USA
|
|
- Laureen Carr
- 5 years ago
- Views:
Transcription
1 Audio Engineering Sociey Convenion Paper Presened a he 119h Convenion 2005 Ocober 7 10 New Yor, New Yor USA This convenion paper has been reproduced from he auhor's advance manuscrip, wihou ediing, correcions, or consideraion by he Review Board. The AES aes no responsibiliy for he conens. Addiional papers may be obained by sending reques and remiance o Audio Engineering Sociey, 60 Eas 42 nd Sree, New Yor, New Yor , USA; also see All righs reserved. Reproducion of his paper, or any porion hereof, is no permied wihou direc permission from he Journal of he Audio Engineering Sociey. Qualiy Enhancemen of Low Bi Rae MPEG1-Layer 3 Audio Based on Audio Resynhesis Demerios Canzos 1 and Chris Kyriaais 1 1 Inegraed Media Sysems Cener (IMSC), Universiy of Souhern California, Los Angeles, CA, , USA ABSTRACT One of he mos popular audio compression formas is indispuably he MPEG1-Layer 3 forma which is based on he idea of low-bi ransparen encoding. As hese ypes of audio signals are saring o migrae from porable players wih inexpensive headphones o higher qualiy home audio sysems, i is becoming eviden ha higher bi raes may be required o mainain ransparency. We propose a novel mehod ha enhances low bi rae MP3 encoded audio segmens by applying mulichannel audio resynhesis mehods in a pos-processing sage or during decoding. Our algorihm employs he highly efficien Generalized Gaussian mixure model which, combined wih cepsral smoohing, leads o very low cepsral reconsrucion errors. In addiion, residual conversion is applied which proves o significanly improve he enhancemen performance. The mehod presened can be easily generalized o include oher audio formas for which sound qualiy is an issue. 1. INTRODUCTION The majoriy of MPEG1-Layer 3 (Mp3) audio encoded a low bi rae does no deliver high qualiy sound. On he oher hand, high bi rae Mp3 segmens, even hough hey deliver sufficien sound qualiy, are oo large o ransmi or sore. Wih he emergence of high qualiy consumer audio sysems and he prevalence of Mp3 as he sandard audio coding scheme, he need for enhancing low bi rae Mp3 audio daa wihou imposing excessive sorage or ransmission requiremens, seems naural. In his wor, we aemp o improve he qualiy of Mp3 encoded audio daa based on a recenly inroduced concep ermed audio resynhesis ([1]). In audio resynhesis, a reference (source) channel is ransmied and hen used o recreae he remaining (arge) channels a he receiving end by deriving a small se of consan parameers. In order o apply his concep o Mp3 audio enhancemen, we replace he source channel wih a low bi rae Mp3 audio music segmen and he arge channel wih he original uncompressed audio segmen of he same music piece. Our main goal is o recreae he high qualiy arge segmen a he receiver end by ransmiing a small se of consan parameers and by using he low qualiy source segmen ha is already sored a he receiver. This scheme is implemened in a pos-processing sage or during decoding and hus boh source and arge are preconvered o he same lossless daa forma (e.g. WAV).
2 Recen wor on audio resynhesis ([1]) has been based on previous specral ransformaion algorihms ([2,3,4]). The basic assumpion made in hese algorihms is ha he specral parameers are of Gaussian naure and hence are modeled by a Gaussian mixure. This grealy faciliaes he Maximum Lielihood (ML) parameers esimaion since he popular Expecaion-Maximizaion (EM) algorihm can be applied. As we show laer, he acual naure of he cepsral coefficiens of an audio signal is no sricly Gaussian and hus he Gaussian mixure model, alhough convenien, is no he bes soluion. We presen a new approach on modeling he cepsral coefficiens by employing he Generalized Gaussian mixure model. This model is very flexible and incorporaes a large number of disribuions including he Gaussian. A new echnique is also inroduced which aes effec during he cepsral conversion sep. Due o he lineariy of he conversion funcion and he abrup changes of he cepsral vecors during shor ime periods, he reconsrucion errors are considerably high. We propose a mehod in which he cepsral vecors are smoohed and he number of mixure componens increases o faciliae he as of he conversion funcion. Finally, a novel echnique relaed o residual processing is implemened. In many cases of low bi rae Mp3 sources, reconsrucion in he cepsral domain is no adequae for disorion-free enhanced audio. For his reason we also apply residual conversion and even hough i is no as accurae as cepsral conversion, i proves o significanly enrich he specral deails of he enhanced Mp3 music piece. 2. STATISTICAL CONVERSION The approach followed is based on previous saisical conversion algorihms relaed o speech synhesis ([2,3,4]). In our applicaion, he shor erm specral parameers are seleced o be he LPC cepsral vecors ([5]). The LPC analysis is carried ou in overlapping frames hrough a sliding window and hence each frame is modeled as an AR filer excied by a residual. We exrac he LPC cepsral vecors of he arge (which is unnown a he receiving end) and source signals. Our goal is o modify he cepsral vecors of he source signal so ha hey would be close in he leas squares sense o he arge cepsral vecors of he same music piece. This is accomplished by deriving a mapping funcion ha will conver each of he source cepsral vecors o he arge cepral vecor of he same ime frame (he wo signals are ime-aligned). The funcion is assumed linear and will be fully deermined by a small se of consan parameers. As shown laer, a similar conversion echnique can be applied o he residual vecors in which he source residual is modified so ha i beer maches he arge residual. In order o implemen he conversion funcion, we assume ha he source cepsral (and residual) vecors are generaed by a probabiliy densiy funcion (pdf). The as of deermining his pdf is effecively he sysem raining. The audio segmen used during raining is chosen so ha i is capable of modeling a large and diverse number of music pieces and is called he raining se. The esing source and esing arge signals are he paricular signal segmens on which we apply he conversion scheme and derive he specific conversion funcion. In he following subsecion we presen he probabilisic model associaed wih he raining as The Generalized Gaussian Mixure Model In he previous saisical conversion algorihms a common assumpion is ha he specral vecors are of Gaussian naure and hence he Gaussian mixure model is employed. The Gaussian mixure model has been reaed in numerous oher applicaions and an algorihm o esimae is parameers (EM) is readily available. However, as we show laer, he cepsral vecors of audio daa are no sricly Gaussian and hus his model is no he bes selecion. A more flexible model is adoped here, which includes he Gaussian mixure as a subcase, and is called he Generalized Gaussian mixure. Is componen pdf, he Generalized Gaussian pdf, is more flexible and adaps o virually any unimodal disribuion. Is analyical form for a random variable z is: aβ ( z µ ) α g( z; µ, σ,a ) = exp[ β ] (1) 2σΓ(1/ a) σ where µ is he mean, σ is he variance, α is he shape parameer, Γ( ) is he Gamma funcion and β is a dependen parameer: 1 / 2 Γ(3 / a) β = (2) Γ(1 / a) AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 2 of 10
3 If α =2.0 we have he Gaussian pdf and if α =1.0 we have he Laplace pdf. When α >>1 he disribuion ends o he uniform pdf and when α < 1 he disribuion becomes impulsive. We consider he raining cepsral vecors (and he esing source vecors) o be generaed by a mixure wih componen pdf as described in equaion (1). The mixure formulaion of he Generalized Gaussian case is shown below: K q G( x ) = p( C ) g( x ; µ, σ, a ) (3) = 1 j= 1 where C denoes he cluser (componen), K is he number of clusers and p(c ) denoes he prior probabiliy ha he cepsral vecor x belongs o cluser. The cepsral vecor is q dimensional where q is he cepsral order and he jh coefficien is denoed by x (j). The vecor coefficiens are considered o be independen and hus he join pdf is he produc of he q coefficien pdf s. This diagonal formulaion is favorable since i decreases he compuaional complexiy during implemenaion Mixure Parameers Esimaion and Clusering The inclusion of a hird independen parameer (he shape parameer α) incurs addiional complexiy when i comes o ML (Maximum Lielihood) esimaion of he pdf parameers. This becomes more apparen in a mixure pdf where i is obvious ha he model is considerably more difficul o manipulae han he Gaussian mixure and he EM algorihm canno be applied easily because he Expecaion sep is very hard o compue. Also, even hough he EM algorihm is guaraneed o approach a local maximum, i is uncerain how fas his can be reached. We decide o follow a differen pah han he one used in he convenional mixure esimaion mehods by clusering he vecors and focusing on each cluser separaely. This will divide he parameers esimaion as ino K simpler ass. In order o perform his decomposiion we employ fuzzy clusering echniques hrough he c-means algorihm ([6]) and cluser he raining vecors ino K groups. The c-means is nown o avoid local minima beer han he -means and i also provides a fuzziness opion ha regulaes he occurrence of ouliers. The nex sep is o perform ML esimaion on each cluser. The esimaion is now sraighforward because he mean for each componen is nown (i is he cluser cener). We also compue p(c ) as he number of vecors ha belong o cluser divided by he oal number of vecors. The ML esimaor for he shape parameer a (j) of cluser and coordinae j is given by ([7]): ψ(1/ a ( + 1) + log( a j) 2 a a x µ : x C x : x C a ) 1 + a log( x µ 2 a 1 log( n µ ) x : x C = 0 µ a ) (4) where n is he number of vecors ha belong o class and ψ( ) is a funcion given by: 1 τ 1 1 ψ ( τ ) = (1 )(1 ) d (5) 0 The expression in (4) is solved by ieraive mehods. (j) The variance parameer σ of he h cluser and jh coordinae is hen esimaed as follows ([7]): 1 / a a a a β x µ : x C σ = (6) n Noe ha he zero h cepsral coefficiens (energy coefficiens) are discarded because hey inroduce srong bias during parameers esimaion. Besides, he frame energy informaion (relaive o he oher frames) is already conained in he residual Conversion Funcion The conversion funcion F( ) acs on he vecor sequence [x 1,...,x n ] and produces a vecor sequence close in he leas squares sense o he sequence [y 1,...,y n ]. Since we have seleced a diagonal implemenaion, his funcion will ac on he individual vecor componens and minimize he error: n q E = y = 1 j = 1 ( j ) ( j ) 2 F ( x ) (7) AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 3 of 10
4 as in [2]. This problem becomes possible o solve under he consrain ha F is piecewise linear, i.e. K u F ( x ) = P( C x )[ v + ( x µ )] (8) = 1 σ for =1,..,n and j =1,..,q. The condiional probabiliy ha a given vecor belongs o cluser, P(C x ), is given by: p( C ) g( x ; µ, σ j = 1 P(C x ) = G( x ) q,a ) (9) The unnown parameers se [v,u] can be found by minimizing (7) which reduces o solving a ypical se of q independen leas-squares equaions ([2]) and hence he linear conversion funcion F is fully deermined Conversion Opimizaion hrough Cepsral Smoohing and Daa Overfiing The cepsral conversion funcion will generally no provide he accuracy in resuls ha is needed for audio reproducion. The cepsral vecors vary rapidly from frame o frame and many spies occur. The conversion funcion, due o is linear form, canno follow hese abrup changes and fails o produce he desired vecors. A new echnique is inroduced here ha improves he cepsral conversion performance. In essence, we smooh ou he cepsral vecors o reduce he spies by increasing he LPC analysis frame slide and lengh and a he same ime increase he mixure groups number so ha he conversion funcion has more componens available. The frame slide and lengh increase is applied only on he esing source and arge signals and no on he raining signal. If we apply he frame slide and lengh increase on he raining vecors oo hen heir number will decrease considerably and he ML esimaion will fail for a mixure of many componens. The number of groups is around hree imes larger han he number deermined by he MDL informaion-heoreic crierion ([8]) and hus he raining daa is overfied. This overfiing does no affec he conversion sage since any unnecessary clusers are filered ou by he conversion funcion. This echnique is proved o be exremely favorable since accurae reconsrucion of he cepsral vecors is achieved Residual Modeling and Conversion In many cases, an accurae cepsral reconsrucion is no sufficien for acousically undisored enhanced Mp3 segmens. Especially in he case of a very low bi rae source (e.g. 64Kbps), many audible arifacs are presen because he source and arge esing signals are simply oo differen. Insrumens ha are inaudible in he source signal will usually appear in he enhanced signal as disorions since he LPC coefficiens alone fail o reproduce hem. In such cases, he signal differences lie mainly in he residuals and herefore some residual processing is essenial for beer enhancemen resuls. We adop he assumpion ha he residual vecors are correlaed wih heir corresponding cepsral vecors ([9]) and hus share similar saisical properies. Therefore, we can apply he saisical conversion described in he previous secions o he residual vecors also. The probabilisic model used here is he same used for cepsral conversion (i.e. i is derived from he raining cepsral vecors). However, he dimensionaliy of he residual vecors is much higher han ha of he raining cepsral vecors and herefore we have o divide hem in subvecors of dimensionaliy equal o ha of he raining cepsral vecors. For insance, in he case of 30 raining cepsral coordinaes and 840 residual coordinaes, we would divide he residual vecors in subvecors of 30 coordinaes each and apply saisical conversion in each of he 28 subvecors ses separaely. Clearly, we do no expec a residual reconsrucion wih accuracy similar o ha of he cepsral reconsrucion because he residuals are oo spiy. Furhermore, we have no derived a raining se or a probabilisic model specifically for he residual vecors since he exremely high residual vecor dimensionaliy would mae his impracical. Besides, we would have o design a global mixure pdf ha could efficienly model any se of esing residual vecors even hough hese are highly diverse and conain he fine deails of he signal. Using he mixure pdf derived from he raining cepsral vecors shows ha he convered residuals are much closer o he arge residuals (han he source residuals are) and a large amoun of informaion is conveyed o he enhanced Mp3 segmen hrough his process. I was also observed ha a high raining cepsral order led o smaller residual reconsrucion errors. Therefore we selec a cepsral order for he raining vecors ha is higher han he cepsral order of he esing vecors. AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 4 of 10
5 3. IMPLEMENTATION The algorihm described previously was applied and esed on a randomly seleced music piece. The general scenario involves enhancing a 32sec long, 64Kbps Mp3 segmen. This is he esing source signal. The esing arge signal is he uncompressed WAV file of he same music piece. These wo segmens are ime-aligned and since he algorihm is applied in a pos-processing sage, he Mp3 source is also convered o a WAV forma. Careful consideraion has been aen o reduce he residual conversion parameers size as much as possible. As shown laer in his secion, he acual size of he conversion funcion is less han he size of he Mp3 source and much less han he size of he uncompressed, arge file. Some objecive enhancemen resuls are also provided which prove he validiy of his scheme Wavele-Based Subband Coding Due o he higher sampling frequency and richer conen of an audio signal (compared o a speech signal) we follow a subband analysis. The subband separaion is performed wih waveles ([10]) and in his case he Daubechies filer of order 40 was a good choice since no audible aliasing effecs were observed. Several differrren wavele ree srucures were esed (e.g. equidisan subbands) bu he mos efficien srucure proved o be one ha emulaes he criical bands of he human hearing sysem as in [11]. This choice is furher jusified by he fac ha he Mp3 encoded source segmen has passed hrough a criical filerban also ([12]). The high number of subbands seleced allows us, as we show laer, o ae advanage of he iner-band redundancy and also o process heavier he subbands ha are he mos significan (i.e. he ones ha are more degraded or carry he audible pars of he signal). The acual wavele filerban is shown in Fig. 1 and is applied o boh esing source and esing arge signals leading o 17 esing subbands Training Model Derivaion A crucial par of he algorihm is o derive a Generalized Gaussian mixure pdf ha does no have o adjus o he paricular esing music piece. This probabilisic model should be global in he sense ha i will include he saisical properies of all possible music segmens and boh ransmiing and receiving ends will have access o i (e.g. pre-sored in boh sides). [0, 22] 13 [11, 13.8] [13.8, 16.5] [16.5, 19.3] [6.9, 8.3] [8.3, 9.6] [9.6, 11] [0, 0.7] [0.7, 1.4] [1.4, 2.1] [2.1, 2.8] [2.8, 3.4] [3.4, 4.1] [4.1, 4.8] [4.8, 5.5] [5.5, 6.2] [6.2, 6.9] Figure 1: Wavele ree srucure used for subband analysis of he esing source and esing arge signals (Numbers in braces indicae he frequency region in Hz in each subband. Numbers on leafs indicae he subband index from 1 o 17) Several candidae raining ses were processed o produce a mixure pdf among which were he mulichannel raining se of [1],a whie noise raining se, a Brownian noise raining se and a pin noise raining se. Pin noise proved o be he mos suiable raining se and produced smaller cepsral reconsrucion errors (up o 5% less in all subbands compared o he oher ses). In order o reduce he raining model size and allow for he daa diversiy needed in he case of many mixure componens ML esimaion, we divide he raining daa se ino 4 large equidisan subbands (insead of he 17 subbands shown in Fig. 1) covering he frequency range 0-22Hz (0-5.5Hz,5.5-11Hz, Hz, Hz) and each subband consiss of 12,000 cepsral vecors of cepsral order 30. Each of he 17 analysis subbands of he esing source and esing arge signals acquires he raining model parameers from one of he 4 larger subbands ha i is par of. During cepsral conversion, he cepsral order of he raining model is runcaed appropriaely for each esing subband o adjus o he lower cepsral order of he paricular esing source and esing arge cepsral vecors. During residual conversion, he large raining cepsral dimensionaliy allows for more efficien division of he esing residual vecors ino subvecors, as explained in secion [19.3, 22] AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 5 of 10
6 3.3. Cepsral Conversion Resuls The cepsral conversion algorihm described in secion 2 is implemened according o he experimenal condiions of Table 1. Analysis Frame Slide/Lengh Cepsral Order Subbands Train(ms) Tes(ms) Train Tes /15 50/ /15 50/ Figure 2: Hisogram of shape parameers for he frequency band 0-5.5Hz of he pin noise raining se Fig. 2 shows he disribuion of he mixure pdf shape parameers for all groups and vecor coordinaes of he firs (0-5.5Hz) of he 4 raining subbands. I is clear ha he shape parameers, alhough srongly peaed a a =2.0, have he majoriy of heir values in he inervals (subgaussian) and (supergaussian) which jusifies he use of he Generalized Gaussian mixure as a more accurae model. Pin noise is random daa raher han acual audio daa bu a similar hisogram is obained from he audio daa se used in [1]. Figure 3: Fiing of mixure pdf (120 groups) o he normalized hisogram of he firs cepsral coefficiens of he band 0-5.5Hz of he pin noise raining se In Fig. 3 he validiy of he esimaion algorihm, as described in secion 2.2, is shown. Even hough a mixure model of 40 groups would be sufficien (as deermined by he MDL crierion), we increase his number o 120 and overfi he model for all 4 raining subbands as explained in secion 2.4. The fiing of he mixure pdf o he hisogram is sill very accurae which is aribued o he high modeling flexibiliy of he Generalized Gaussian pdf. Table 1: Experimenal parameers. The frequency regions of each of he analysis subbands in he lef-mos able column can be found in Fig.1. The frame slide and lengh are differen for he raining and esing segmens as explained in secion 2.4. We now show he necessiy of he conversion opimizaion scheme of secion 2.4 by esing wo scenarios where cepsral smoohing and daa overfiing are no applied a he same ime and which lead o increased cepsral reconsrucion errors. In case A, resynhesis is applied wih cepsral smoohing bu no daa overfiing (i.e. we derive a mixure pdf of 40 groups insead of 120), while in case B resynhesis is applied wih overfiing (120 groups) bu no smoohing (i.e. he raining and esing recordings have boh frame slide 10ms and frame lengh 15ms). The resuls are shown in Table 2. Average Quadraic Cepsral Disance Beween Targe-Source (frame slide/lengh 50ms/75ms) Targe-Resynhesis case A (no overfiing) Targe/Resynhesis case B (no smoohing) Targe-Resynhesis (smoohing+overfiing) Band 15 Band 16 Band Table 2: Two poor cepsral reconsrucion scenarios A,B for subbands and he case where cepsral smoohing and daa overfiing are applied ogeher. The conversion resuls for he remaining subbands (1-14) are shown in Table 3. I is clear ha he error reducion due o resynhesis varies across he subbands. However, he average cepsral disance beween he AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 6 of 10
7 esing arge and resynhesized segmens is of he same order of magniude for mos of he subbands which means ha he cepsral conversion echnique has finie accuracy. By decreasing he duraion of he esing segmens and hus he number of cepsral vecors, he Analysis Subband Cepsral Disance Targe-Source E Cepsral Disance Targe-Resynhesis E Table 3: Average quadraic cepsral conversion resuls for subbands accuracy would increase bu so would he conversion parameers overhead since more conversion parameers would have o be ransmied per uni lengh of esing segmen. Figure 4: Cepsral reconsrucion of he firs coordinae for subband 17 ( Hz) In Fig. 4, an example of cepsral conversion for subband 17 is shown. I is clear ha he resynhesized firs cepsral coefficiens follow he corresponding arge coefficiens closely. Subbands 1-8 and 12 do no show observable errors since he iniial disance beween he source and arge cepsral coefficiens are small. Finally, from Tables 2 and 3 we observe ha he cepsral disance beween he source and arge signals grealy increases for subbands 9-17 (excep subband 12). This is direcly relaed o he fac ha he 64Kbps Mp3 coding scheme severely degrades he signal conen around he frequency region Hz while i reains he lower subbands. This will be aen ino accoun during he residual conversion implemenaion presened in he nex secion Residual Conversion Resuls and Redundancy The residual conversion scheme described in secion 2.6 is implemened. We exrac he residual vecors according o he 17 subbands analysis and apply he same 4 subbands raining model used for cepsral conversion. The high cepsral order of he model (30) allows for he inclusion of low-valued vecor coefficiens which are necessary for modeling he residual valleys. Low cepsral orders were also esed and led o larger residual reconsrucion errors. Therefore, he selecion of a high raining cepsral order is favorable. As menioned, he esing source and arge residual vecors acquire he model parameers according o one of he 4 raining subbands he paricular esing subband belongs o Residual Inra-Band Redundancy The residual conversion scheme as described previously requires a large amoun of conversion parameers o be creaed. For a full reconsrucion of all he residual vecors of a paricular subband, he size of he conversion parameers would be as large as 60% of he size of he arge (uncompressed) signal and several imes larger han he source Mp3 signal. For his reason, we decide o downsample he esing source and esing arge residual vecors before conversion. We esed downsampling facors of 2, 4 and 8 and he bes combinaion in erms of conversion parameers size and reconsrucion accuracy proved o be a downsampling facor of 4. Afer conversion, he reconsruced residual is resampled o he original rae by using he previous wo samples a each ime insance. Under his scheme, he audio qualiy does no decrease noiceably compared o a full reconsrucion and he size of he residual conversion funcion becomes four imes smaller. AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 7 of 10
8 Residual Iner-Band Redundancy In Fig. 4, he average quadraic residual disances beween source and arge residuals for all subbands are ploed. Analysis Subband Residual Average Quadraic Disance Targe-Source Residual Average Quadraic Disance Targe-Reconsruc Table 4: Average quadraic residual conversion resuls for subbands when using he reconsruced residual of subband 16 for subbands Figure 4: Average quadraic residual errors beween source and arge for all subbands. I is clear ha no all source subbands are heavily disored. Subbands 1-8, 12 and 13 show small residual differences beween he esing source and esing arge segmens. This means ha we can apply residual processing o seleced subbands only. Applying residual conversion o subbands 9-11 and produced audible enhancemen wihou deriving many conversion parameers or performing excessive compuaions. Processing he remaining subbands did no provide significanly beer resuls ha could jusify he large amoun of he resuling conversion parameers. A furher reducion in parameers is achieved by observing ha he 4 highes esing subbands (14-17) show many residual similariies. By reconsrucing only one of hese residual signals and replacing all 4 residual signals wih he paricular reconsruced residual signal, a grea reducion in he average quadraic residual disances for all 4 subbands is achieved. This is also aribued o he fac ha he paricular subbands conen is no highly audible and he residual disances beween source and arge signals in hese subbands are large. Thus, even a less accuraely reconsruced residual is beer han he original source residuals. This is shown below in Table 4 where he reconsruced residual is derived for subband 16 only and i is used for all 4 subbands. The residual conversion resuls for he remaining subbands are shown in Table 5. Each of hese subbands has is own reconsruced residual since he lower subbands are very differen o each oher. Analysis Subband Residual Average Quadraic Disance Targe-Source Table 5: Average quadraic residual conversion resuls for subbands 9-11 when using he corresponding reconsruced residuals. The resuls of Tables 4 and 5 prove he validiy of he residual conversion scheme. Subbands 9-11 and 16 have reduced heir original residual errors more han 50%. Subbands 14, 15 and 17 have reduced heir original residual errors around 45% bu his reducion could be even more if each subband had is own reconsruced residual insead of sharing he residual derived from subband 16. Achieving an error reducion of 50% or more for hese subbands does no acually provide any acousical improvemen of he enhanced waveform since, as menioned, hey do no conain he highly audible pars of he signal Overall Performance Residual Average Quadraic Disance Targe-Reconsruc Several objecive similariy measures were esed among which he Muual Informaion in he ime domain proved o be he mos suiable. Fig. 5 illusraes he effeciveness of he seleced wavele srucure agains wavele rees of 2, 4 and 8 equidisan subbands. These cases are furher subdivided in cases of cepsral reconsrucion only and cepsral reconsrucion wih residual reconsrucion. In he case of 2 subbands, residual conversion is applied in boh subbands. In he case of 4 subbands, residual conversion is applied in he upper 3 subbands and in he case of 8 subbands residual conversion is applied in he upper 6 subbands. AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 8 of 10
9 Figure 5: Muual Informaion beween esing arge and resynhesized signals for various wavele srucures wih and wihou residual conversion. From Fig. 5 i is clear ha audio enhancemen is more efficien -in erms of conversion parameers size and qualiy improvemen- when applying 17 bands wavele separaion wih residual reconsrucion. Even hough residual processing does no increase dramaically he Muual Informaion meric, he differences acousically are very sharp and he resynhesized segmen wihou residual conversion conains many periodic and random disorions. In conras, audio enhancemen wih residual conversion does no cause any audible disorions as preliminary subjecive ess show. The audio qualiy increase in he enhanced segmen compared o he source segmen is also easily percepible. To furher illusrae his we provide some ime domain waveform resuls of seleced subbands when applying residual conversion and cepsral conversion under he 17 subband analysis. I is obvious from Fig. 6 ha some subbands are severely degraded because he source waveform is almos non exisen. The resynhesized signal follows much closer he arge signal bu as menioned before here sill exis residual differences beween he arge and resynhesized segmens (see Tables 4 and 5) and herefore he wo signals canno be idenical for subbands Subbands 1-8 are no degraded enough (see Table 2 and Fig. 4) o show noiceable differences beween he source and arge waveforms and hence are no illusraed. Table 6 shows he ransmission requiremens of our scheme when ransmiing he cepsral conversion and residual conversion parameers under he 17 subbands separaion. No arihmeic coding is applied o compress he conversion parameers se and herefore i is possible ha he ransmission size can be furher reduced. Some Figure 6: Time domain resynhesis resuls for subbands 11 (upper plo) and 17 (lower plo). of he lower subbands can also no be processed a all (no cepsral conversion) since for hese he source and arge cepsral differences are very small. Mp3 Source 64Kbps size (byes) Conversion Parameers size (byes) Targe WAV size (byes) Table 6: Amoun of ransmied conversion parameers compared o he source and arge segmen sizes. As shown in Table 6, he conversion funcion size is smaller han he Mp3 source signal (77% of he source size) and much smaller han he arge segmen size. If we do no apply cepsral conversion for subbands 1-8 hen he parameers size would be 155Byes (61% of he source size). AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 9 of 10
10 4. CONCLUSIONS AND FUTURE RESEARCH We presened a novel echnique on audio qualiy enhancemen of low bi rae Mp3 signals. Subjecive ess are currenly underway bu he qualiy improvemen is paricularly audible since he source segmen is Mp3 encoded in very low bi rae and herefore i is severely degraded. We have shown hrough objecive means ha he resynhesized signal is closer o he arge (han he source is) in erms of cepsral and residual disances and also in he ime domain by illusraing some subband waveforms. The selecion of subbands ha need residual or cepsral conversion can be deermined robusly by processing only he subbands ha conain he highes residual or cepsral errors, respecively. Furher invesigaion is needed on deermining he opimal number of subbands since i is clear ha a high number of subbands improves he enhancemen performance and can also allow for deecing more redundancies (e.g. source subbands ha are no degraded). The residual conversion scheme could be possibly furher improved by selecing a higher cepsral order for he raining model. Finally, if we apply he resynhesis scheme o a 128bps Mp3 source (which has double he size of he currenly used source) he relaive reducion in conversion parameers would be double he curren one (38% of he source size) or more since i is possible ha fewer subbands would need residual (or cepsral) conversion. Higher bi rae Mp3 source segmens are currenly being esed and naurally he algorihm performance is beer since he overall differences beween he source and arge audio segmens are smaller. 5. ACKNOWLEDGEMENTS Research presened in his paper was funded in par by he Inegraed Media Sysems Cener, a Naional Science Foundaion Engineering Research Cener, Cooperaive Agreemen No. EEC and in par by he US Army Research, Developmen, and Engineering Command (RDECOM). Saemens and opinions expressed do no necessarily reflec he posiion or policy of he Naional Science Foundaion or he US Governmen and no official endorsemen should be inferred. 6. REFERENCES [1] A. Moucharis, S. S. Narayanan and C. Kyriaais, Muli-resoluion specral conversion for mulichannel audio resynhesis, IEEE Proc. In. Conf. Mulimedia and Expo (ICME), vol.2, (Lausanne, Swizerland), pp , Augus [2] Y. Sylianou, O. Cappe and E. Moulines, Coninuous probabilisic ransform for voice conversion, IEEE Trans. Speech and Audio Processing, vol.6, no.2, pp , March [3] A. Kain and M. W. Macon, Specral voice conversion for ex-o-speech synhesis, IEEE Proc. In. Conf. Acousics, Speech and Signal Processing (ICASSP), Seale, WA, May 1998, pp [4] D. Reynolds and R. Rose, Robus ex-independen speaer idenificaion using Gaussian mixure speaer models, IEEE Trans. Speech and Audio Processing, vol.3, no.1, pp.72-83, January [5] L. Rabiner and B. H. Juang, Fundamenals of Speech Recogniion, Prenice Hall, Englewood Cliffs, NJ, [6] J. C. Bezde, Paern Recogniion wih Fuzzy Objecive Funcion Algorihms, Plenum Press, New Yor, NY, [7] F. Muller, Disribuion shape of wo-dimensional DCT coefficiens of naural images, Elecronics Leers, vol.29, no.22, pp , 1993 [8] J. Rissanen, Modeling by shores daa descripion, Auomaica, vol.14, pp , [9] B. Gille and S. King, Transforming Voice Qualiy, Eurospeech, pp , [10] G, Srang and T. Nguyen, Waveles and Filer Bans, Wellesley-Cambridge, [11] D. Sinha and A.H. Tewfi, Low Bi Rae Transparen Audio Compression using Adaped Waveles, IEEE Trans. Signal Processing, vol.41, pp , December [12] P. Noll, MPEG Digial Audio Coding Sandards, CRC Press LLC, AES 119h Convenion, New Yor, New Yor, 2005 Ocober 7 10 Page 10 of 10
Implementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report)
Implemening Ray Casing in Terahedral Meshes wih Programmable Graphics Hardware (Technical Repor) Marin Kraus, Thomas Erl March 28, 2002 1 Inroducion Alhough cell-projecion, e.g., [3, 2], and resampling,
More informationImage segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding
Moivaion Image segmenaion Which pixels belong o he same objec in an image/video sequence? (spaial segmenaion) Which frames belong o he same video sho? (emporal segmenaion) Which frames belong o he same
More informationA Matching Algorithm for Content-Based Image Retrieval
A Maching Algorihm for Conen-Based Image Rerieval Sue J. Cho Deparmen of Compuer Science Seoul Naional Universiy Seoul, Korea Absrac Conen-based image rerieval sysem rerieves an image from a daabase using
More informationMORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES
MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES B. MARCOTEGUI and F. MEYER Ecole des Mines de Paris, Cenre de Morphologie Mahémaique, 35, rue Sain-Honoré, F 77305 Fonainebleau Cedex, France Absrac. In image
More informationSTEREO PLANE MATCHING TECHNIQUE
STEREO PLANE MATCHING TECHNIQUE Commission III KEY WORDS: Sereo Maching, Surface Modeling, Projecive Transformaion, Homography ABSTRACT: This paper presens a new ype of sereo maching algorihm called Sereo
More informationAn Adaptive Spatial Depth Filter for 3D Rendering IP
JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, VOL.3, NO. 4, DECEMBER, 23 175 An Adapive Spaial Deph Filer for 3D Rendering IP Chang-Hyo Yu and Lee-Sup Kim Absrac In his paper, we presen a new mehod
More informationCAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL
CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL Klečka Jan Docoral Degree Programme (1), FEEC BUT E-mail: xkleck01@sud.feec.vubr.cz Supervised by: Horák Karel E-mail: horak@feec.vubr.cz
More informationFIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS
FIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS Mohammed A. Aseeri and M. I. Sobhy Deparmen of Elecronics, The Universiy of Ken a Canerbury Canerbury, Ken, CT2
More informationReal Time Integral-Based Structural Health Monitoring
Real Time Inegral-Based Srucural Healh Monioring The nd Inernaional Conference on Sensing Technology ICST 7 J. G. Chase, I. Singh-Leve, C. E. Hann, X. Chen Deparmen of Mechanical Engineering, Universiy
More informationLandmarks: A New Model for Similarity-Based Pattern Querying in Time Series Databases
Lmarks: A New Model for Similariy-Based Paern Querying in Time Series Daabases Chang-Shing Perng Haixun Wang Sylvia R. Zhang D. So Parker perng@cs.ucla.edu hxwang@cs.ucla.edu Sylvia Zhang@cle.com so@cs.ucla.edu
More informationCoded Caching with Multiple File Requests
Coded Caching wih Muliple File Requess Yi-Peng Wei Sennur Ulukus Deparmen of Elecrical and Compuer Engineering Universiy of Maryland College Park, MD 20742 ypwei@umd.edu ulukus@umd.edu Absrac We sudy a
More informationMichiel Helder and Marielle C.T.A Geurts. Hoofdkantoor PTT Post / Dutch Postal Services Headquarters
SHORT TERM PREDICTIONS A MONITORING SYSTEM by Michiel Helder and Marielle C.T.A Geurs Hoofdkanoor PTT Pos / Duch Posal Services Headquarers Keywords macro ime series shor erm predicions ARIMA-models faciliy
More informationCOSC 3213: Computer Networks I Chapter 6 Handout # 7
COSC 3213: Compuer Neworks I Chaper 6 Handou # 7 Insrucor: Dr. Marvin Mandelbaum Deparmen of Compuer Science York Universiy F05 Secion A Medium Access Conrol (MAC) Topics: 1. Muliple Access Communicaions:
More informationA Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions
A Hierarchical Objec Recogniion Sysem Based on Muli-scale Principal Curvaure Regions Wei Zhang, Hongli Deng, Thomas G Dieerich and Eric N Morensen School of Elecrical Engineering and Compuer Science Oregon
More informationEvaluation and Improvement of Region-based Motion Segmentation
Evaluaion and Improvemen of Region-based Moion Segmenaion Mark Ross Universiy Koblenz-Landau, Insiue of Compuaional Visualisics, Universiässraße 1, 56070 Koblenz, Germany Email: ross@uni-koblenz.de Absrac
More informationThe Impact of Product Development on the Lifecycle of Defects
The Impac of Produc Developmen on he Lifecycle of Rudolf Ramler Sofware Compeence Cener Hagenberg Sofware Park 21 A-4232 Hagenberg, Ausria +43 7236 3343 872 rudolf.ramler@scch.a ABSTRACT This paper invesigaes
More informationA Fast Stereo-Based Multi-Person Tracking using an Approximated Likelihood Map for Overlapping Silhouette Templates
A Fas Sereo-Based Muli-Person Tracking using an Approximaed Likelihood Map for Overlapping Silhouee Templaes Junji Saake Jun Miura Deparmen of Compuer Science and Engineering Toyohashi Universiy of Technology
More information4.1 3D GEOMETRIC TRANSFORMATIONS
MODULE IV MCA - 3 COMPUTER GRAPHICS ADMN 29- Dep. of Compuer Science And Applicaions, SJCET, Palai 94 4. 3D GEOMETRIC TRANSFORMATIONS Mehods for geomeric ransformaions and objec modeling in hree dimensions
More informationMATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008
MATH 5 - Differenial Equaions Sepember 15, 8 Projec 1, Fall 8 Due: Sepember 4, 8 Lab 1.3 - Logisics Populaion Models wih Harvesing For his projec we consider lab 1.3 of Differenial Equaions pages 146 o
More informationOpen Access Research on an Improved Medical Image Enhancement Algorithm Based on P-M Model. Luo Aijing 1 and Yin Jin 2,* u = div( c u ) u
Send Orders for Reprins o reprins@benhamscience.ae The Open Biomedical Engineering Journal, 5, 9, 9-3 9 Open Access Research on an Improved Medical Image Enhancemen Algorihm Based on P-M Model Luo Aijing
More informationAn Improved Square-Root Nyquist Shaping Filter
An Improved Square-Roo Nyquis Shaping Filer fred harris San Diego Sae Universiy fred.harris@sdsu.edu Sridhar Seshagiri San Diego Sae Universiy Seshigar.@engineering.sdsu.edu Chris Dick Xilinx Corp. chris.dick@xilinx.com
More informationVideo Content Description Using Fuzzy Spatio-Temporal Relations
Proceedings of he 4s Hawaii Inernaional Conference on Sysem Sciences - 008 Video Conen Descripion Using Fuzzy Spaio-Temporal Relaions rchana M. Rajurkar *, R.C. Joshi and Sananu Chaudhary 3 Dep of Compuer
More informationEECS 487: Interactive Computer Graphics
EECS 487: Ineracive Compuer Graphics Lecure 7: B-splines curves Raional Bézier and NURBS Cubic Splines A represenaion of cubic spline consiss of: four conrol poins (why four?) hese are compleely user specified
More informationACQUIRING high-quality and well-defined depth data. Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure
SUBMITTED TO TRANSACTION ON IMAGE PROCESSING 1 Online Temporally Consisen Indoor Deph Video Enhancemen via Saic Srucure Lu Sheng, Suden Member, IEEE, King Ngi Ngan, Fellow, IEEE, Chern-Loon Lim and Songnan
More informationQuantitative macro models feature an infinite number of periods A more realistic (?) view of time
INFINIE-HORIZON CONSUMPION-SAVINGS MODEL SEPEMBER, Inroducion BASICS Quaniaive macro models feaure an infinie number of periods A more realisic (?) view of ime Infinie number of periods A meaphor for many
More informationSTRING DESCRIPTIONS OF DATA FOR DISPLAY*
SLAC-PUB-383 January 1968 STRING DESCRIPTIONS OF DATA FOR DISPLAY* J. E. George and W. F. Miller Compuer Science Deparmen and Sanford Linear Acceleraor Cener Sanford Universiy Sanford, California Absrac
More informationImage Content Representation
Image Conen Represenaion Represenaion for curves and shapes regions relaionships beween regions E.G.M. Perakis Image Represenaion & Recogniion 1 Reliable Represenaion Uniqueness: mus uniquely specify an
More informationRao-Blackwellized Particle Filtering for Probing-Based 6-DOF Localization in Robotic Assembly
MITSUBISHI ELECTRIC RESEARCH LABORATORIES hp://www.merl.com Rao-Blackwellized Paricle Filering for Probing-Based 6-DOF Localizaion in Roboic Assembly Yuichi Taguchi, Tim Marks, Haruhisa Okuda TR1-8 June
More informationAUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION
Chaper 3 AUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION A. Koschan, V. R. Ayyagari, F. Boughorbel, and M. A. Abidi Imaging, Roboics, and Inelligen Sysems Laboraory, The Universiy of Tennessee, 334
More informationProbabilistic Detection and Tracking of Motion Discontinuities
Probabilisic Deecion and Tracking of Moion Disconinuiies Michael J. Black David J. Flee Xerox Palo Alo Research Cener 3333 Coyoe Hill Road Palo Alo, CA 94304 fblack,fleeg@parc.xerox.com hp://www.parc.xerox.com/fblack,fleeg/
More informationVisual Perception as Bayesian Inference. David J Fleet. University of Toronto
Visual Percepion as Bayesian Inference David J Flee Universiy of Torono Basic rules of probabiliy sum rule (for muually exclusive a ): produc rule (condiioning): independence (def n ): Bayes rule: marginalizaion:
More informationLearning in Games via Opponent Strategy Estimation and Policy Search
Learning in Games via Opponen Sraegy Esimaion and Policy Search Yavar Naddaf Deparmen of Compuer Science Universiy of Briish Columbia Vancouver, BC yavar@naddaf.name Nando de Freias (Supervisor) Deparmen
More informationDesign Alternatives for a Thin Lens Spatial Integrator Array
Egyp. J. Solids, Vol. (7), No. (), (004) 75 Design Alernaives for a Thin Lens Spaial Inegraor Array Hala Kamal *, Daniel V azquez and Javier Alda and E. Bernabeu Opics Deparmen. Universiy Compluense of
More information4 Error Control. 4.1 Issues with Reliable Protocols
4 Error Conrol Jus abou all communicaion sysems aemp o ensure ha he daa ges o he oher end of he link wihou errors. Since i s impossible o build an error-free physical layer (alhough some shor links can
More informationAnalysis of Various Types of Bugs in the Object Oriented Java Script Language Coding
Indian Journal of Science and Technology, Vol 8(21), DOI: 10.17485/ijs/2015/v8i21/69958, Sepember 2015 ISSN (Prin) : 0974-6846 ISSN (Online) : 0974-5645 Analysis of Various Types of Bugs in he Objec Oriened
More information4. Minimax and planning problems
CS/ECE/ISyE 524 Inroducion o Opimizaion Spring 2017 18 4. Minima and planning problems ˆ Opimizing piecewise linear funcions ˆ Minima problems ˆ Eample: Chebyshev cener ˆ Muli-period planning problems
More informationDefinition and examples of time series
Definiion and examples of ime series A ime series is a sequence of daa poins being recorded a specific imes. Formally, le,,p be a probabiliy space, and T an index se. A real valued sochasic process is
More informationSam knows that his MP3 player has 40% of its battery life left and that the battery charges by an additional 12 percentage points every 15 minutes.
8.F Baery Charging Task Sam wans o ake his MP3 player and his video game player on a car rip. An hour before hey plan o leave, he realized ha he forgo o charge he baeries las nigh. A ha poin, he plugged
More informationScheduling. Scheduling. EDA421/DIT171 - Parallel and Distributed Real-Time Systems, Chalmers/GU, 2011/2012 Lecture #4 Updated March 16, 2012
EDA421/DIT171 - Parallel and Disribued Real-Time Sysems, Chalmers/GU, 2011/2012 Lecure #4 Updaed March 16, 2012 Aemps o mee applicaion consrains should be done in a proacive way hrough scheduling. Schedule
More informationNEWTON S SECOND LAW OF MOTION
Course and Secion Dae Names NEWTON S SECOND LAW OF MOTION The acceleraion of an objec is defined as he rae of change of elociy. If he elociy changes by an amoun in a ime, hen he aerage acceleraion during
More informationImproving the Efficiency of Dynamic Service Provisioning in Transport Networks with Scheduled Services
Improving he Efficiency of Dynamic Service Provisioning in Transpor Neworks wih Scheduled Services Ralf Hülsermann, Monika Jäger and Andreas Gladisch Technologiezenrum, T-Sysems, Goslarer Ufer 35, D-1585
More informationWeighted Voting in 3D Random Forest Segmentation
Weighed Voing in 3D Random Fores Segmenaion M. Yaqub,, P. Mahon 3, M. K. Javaid, C. Cooper, J. A. Noble NDORMS, Universiy of Oxford, IBME, Deparmen of Engineering Science, Universiy of Oxford, 3 MRC Epidemiology
More informationVideo-Based Face Recognition Using Probabilistic Appearance Manifolds
Video-Based Face Recogniion Using Probabilisic Appearance Manifolds Kuang-Chih Lee Jeffrey Ho Ming-Hsuan Yang David Kriegman klee10@uiuc.edu jho@cs.ucsd.edu myang@honda-ri.com kriegman@cs.ucsd.edu Compuer
More informationNetwork management and QoS provisioning - QoS in Frame Relay. . packet switching with virtual circuit service (virtual circuits are bidirectional);
QoS in Frame Relay Frame relay characerisics are:. packe swiching wih virual circui service (virual circuis are bidirecional);. labels are called DLCI (Daa Link Connecion Idenifier);. for connecion is
More informationsource managemen, naming, proecion, and service provisions. This paper concenraes on he basic processor scheduling aspecs of resource managemen. 2 The
Virual Compuers A New Paradigm for Disribued Operaing Sysems Banu Ozden y Aaron J. Goldberg Avi Silberschaz z 600 Mounain Ave. AT&T Bell Laboraories Murray Hill, NJ 07974 Absrac The virual compuers (VC)
More informationA Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes
A Bayesian Approach o Video Objec Segmenaion via Merging 3D Waershed Volumes Yu-Pao Tsai 1,3, Chih-Chuan Lai 1,2, Yi-Ping Hung 1,2, and Zen-Chung Shih 3 1 Insiue of Informaion Science, Academia Sinica,
More informationResearch Article Auto Coloring with Enhanced Character Registration
Compuer Games Technology Volume 2008, Aricle ID 35398, 7 pages doi:0.55/2008/35398 Research Aricle Auo Coloring wih Enhanced Characer Regisraion Jie Qiu, Hock Soon Seah, Feng Tian, Quan Chen, Zhongke Wu,
More informationIntentSearch:Capturing User Intention for One-Click Internet Image Search
JOURNAL OF L A T E X CLASS FILES, VOL. 6, NO. 1, JANUARY 2010 1 InenSearch:Capuring User Inenion for One-Click Inerne Image Search Xiaoou Tang, Fellow, IEEE, Ke Liu, Jingyu Cui, Suden Member, IEEE, Fang
More informationNonparametric CUSUM Charts for Process Variability
Journal of Academia and Indusrial Research (JAIR) Volume 3, Issue June 4 53 REEARCH ARTICLE IN: 78-53 Nonparameric CUUM Chars for Process Variabiliy D.M. Zombade and V.B. Ghue * Dep. of aisics, Walchand
More informationImproved TLD Algorithm for Face Tracking
Absrac Improved TLD Algorihm for Face Tracking Huimin Li a, Chaojing Yu b and Jing Chen c Chongqing Universiy of Poss and Telecommunicaions, Chongqing 400065, China a li.huimin666@163.com, b 15023299065@163.com,
More informationRecovering Joint and Individual Components in Facial Data
JOURNAL OF L A E X CLASS FILES, VOL. 14, NO. 8, AUGUS 2015 1 Recovering Join and Individual Componens in Facial Daa Chrisos Sagonas, Evangelos Ververas, Yannis Panagakis, and Sefanos Zafeiriou, Member,
More informationMoving Object Detection Using MRF Model and Entropy based Adaptive Thresholding
Moving Objec Deecion Using MRF Model and Enropy based Adapive Thresholding Badri Narayan Subudhi, Pradipa Kumar Nanda and Ashish Ghosh Machine Inelligence Uni, Indian Saisical Insiue, Kolkaa, 700108, India,
More informationRelevance Ranking using Kernels
Relevance Ranking using Kernels Jun Xu 1, Hang Li 1, and Chaoliang Zhong 2 1 Microsof Research Asia, 4F Sigma Cener, No. 49 Zhichun Road, Beijing, China 100190 2 Beijing Universiy of Poss and Telecommunicaions,
More informationReal-Time Non-Rigid Multi-Frame Depth Video Super-Resolution
Real-Time Non-Rigid Muli-Frame Deph Video Super-Resoluion Kassem Al Ismaeil 1, Djamila Aouada 1, Thomas Solignac 2, Bruno Mirbach 2, Björn Oersen 1 1 Inerdisciplinary Cenre for Securiy, Reliabiliy, and
More informationLow-Cost WLAN based. Dr. Christian Hoene. Computer Science Department, University of Tübingen, Germany
Low-Cos WLAN based Time-of-fligh fligh Trilaeraion Precision Indoor Personnel Locaion and Tracking for Emergency Responders Third Annual Technology Workshop, Augus 5, 2008 Worceser Polyechnic Insiue, Worceser,
More informationAML710 CAD LECTURE 11 SPACE CURVES. Space Curves Intrinsic properties Synthetic curves
AML7 CAD LECTURE Space Curves Inrinsic properies Synheic curves A curve which may pass hrough any region of hreedimensional space, as conrased o a plane curve which mus lie on a single plane. Space curves
More informationPART 1 REFERENCE INFORMATION CONTROL DATA 6400 SYSTEMS CENTRAL PROCESSOR MONITOR
. ~ PART 1 c 0 \,).,,.,, REFERENCE NFORMATON CONTROL DATA 6400 SYSTEMS CENTRAL PROCESSOR MONTOR n CONTROL DATA 6400 Compuer Sysems, sysem funcions are normally handled by he Monior locaed in a Peripheral
More informationRobot localization under perceptual aliasing conditions based on laser reflectivity using particle filter
Robo localizaion under percepual aliasing condiions based on laser refleciviy using paricle filer DongXiang Zhang, Ryo Kurazume, Yumi Iwashia, Tsuomu Hasegawa Absrac Global localizaion, which deermines
More informationReinforcement Learning by Policy Improvement. Making Use of Experiences of The Other Tasks. Hajime Kimura and Shigenobu Kobayashi
Reinforcemen Learning by Policy Improvemen Making Use of Experiences of The Oher Tasks Hajime Kimura and Shigenobu Kobayashi Tokyo Insiue of Technology, JAPAN genfe.dis.iech.ac.jp, kobayasidis.iech.ac.jp
More informationFACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS. Soumya Hamlaoui & Franck Davoine
FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS Soumya Hamlaoui & Franck Davoine HEUDIASYC Mixed Research Uni, CNRS / Compiègne Universiy of Technology BP 20529, 60205 Compiègne
More informationM(t)/M/1 Queueing System with Sinusoidal Arrival Rate
20 TUTA/IOE/PCU Journal of he Insiue of Engineering, 205, (): 20-27 TUTA/IOE/PCU Prined in Nepal M()/M/ Queueing Sysem wih Sinusoidal Arrival Rae A.P. Pan, R.P. Ghimire 2 Deparmen of Mahemaics, Tri-Chandra
More informationOptimal Crane Scheduling
Opimal Crane Scheduling Samid Hoda, John Hooker Laife Genc Kaya, Ben Peerson Carnegie Mellon Universiy Iiro Harjunkoski ABB Corporae Research EWO - 13 November 2007 1/16 Problem Track-mouned cranes move
More informationIn fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magnetic Field Maps
In fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magneic Field Maps A. D. Hahn 1, A. S. Nencka 1 and D. B. Rowe 2,1 1 Medical College of Wisconsin, Milwaukee, WI, Unied
More informationHyelim Oh. School of Computing, National University of Singapore, 13 Computing Drive, Singapore SINGAPORE
RESEARCH ARTICLE FREE VERSUS FOR-A-FEE: THE IMPACT OF A PAYWALL ON THE PATTERN AND EFFECTIVENESS OF WORD-OF-MOUTH VIA SOCIAL MEDIA Hyelim Oh School of Compuing, Naional Universiy of Singapore, 13 Compuing
More informationIAJIT First Online Publication
An Improved Feaure Exracion and Combinaion of Muliple Classifiers for Query-by- ming Naha Phiwma and Parinya Sanguansa 2 Deparmen of Compuer Science, Suan Dusi Rajabha Universiy, Thailand 2 Faculy of Engineering
More informationImage Based Computer-Aided Manufacturing Technology
Sensors & Transducers 03 by IFSA hp://www.sensorsporal.com Image Based Compuer-Aided Manufacuring Technology Zhanqi HU Xiaoqin ZHANG Jinze LI Wei LI College of Mechanical Engineering Yanshan Universiy
More informationCENG 477 Introduction to Computer Graphics. Modeling Transformations
CENG 477 Inroducion o Compuer Graphics Modeling Transformaions Modeling Transformaions Model coordinaes o World coordinaes: Model coordinaes: All shapes wih heir local coordinaes and sies. world World
More informationRule-Based Multi-Query Optimization
Rule-Based Muli-Query Opimizaion Mingsheng Hong Dep. of Compuer cience Cornell Universiy mshong@cs.cornell.edu Johannes Gehrke Dep. of Compuer cience Cornell Universiy johannes@cs.cornell.edu Mirek Riedewald
More informationA Review on Block Matching Motion Estimation and Automata Theory based Approaches for Fractal Coding
Regular Issue A Review on Block Maching Moion Esimaion and Auomaa Theory based Approaches for Fracal Coding Shailesh D Kamble 1, Nileshsingh V Thakur 2, and Preei R Bajaj 3 1 Compuer Science & Engineering,
More informationPerformance Evaluation of Implementing Calls Prioritization with Different Queuing Disciplines in Mobile Wireless Networks
Journal of Compuer Science 2 (5): 466-472, 2006 ISSN 1549-3636 2006 Science Publicaions Performance Evaluaion of Implemening Calls Prioriizaion wih Differen Queuing Disciplines in Mobile Wireless Neworks
More informationLAMP: 3D Layered, Adaptive-resolution and Multiperspective Panorama - a New Scene Representation
Submission o Special Issue of CVIU on Model-based and Image-based 3D Scene Represenaion for Ineracive Visualizaion LAMP: 3D Layered, Adapive-resoluion and Muliperspecive Panorama - a New Scene Represenaion
More informationRestorable Dynamic Quality of Service Routing
QOS ROUTING Resorable Dynamic Qualiy of Service Rouing Murali Kodialam and T. V. Lakshman, Lucen Technologies ABSTRACT The focus of qualiy-of-service rouing has been on he rouing of a single pah saisfying
More informationChapter 3 MEDIA ACCESS CONTROL
Chaper 3 MEDIA ACCESS CONTROL Overview Moivaion SDMA, FDMA, TDMA Aloha Adapive Aloha Backoff proocols Reservaion schemes Polling Disribued Compuing Group Mobile Compuing Summer 2003 Disribued Compuing
More informationComputer representations of piecewise
Edior: Gabriel Taubin Inroducion o Geomeric Processing hrough Opimizaion Gabriel Taubin Brown Universiy Compuer represenaions o piecewise smooh suraces have become vial echnologies in areas ranging rom
More informationDetection and segmentation of moving objects in highly dynamic scenes
Deecion and segmenaion of moving objecs in highly dynamic scenes Aurélie Bugeau Parick Pérez INRIA, Cenre Rennes - Breagne Alanique Universié de Rennes, Campus de Beaulieu, 35 042 Rennes Cedex, France
More informationAssignment 2. Due Monday Feb. 12, 10:00pm.
Faculy of rs and Science Universiy of Torono CSC 358 - Inroducion o Compuer Neworks, Winer 218, LEC11 ssignmen 2 Due Monday Feb. 12, 1:pm. 1 Quesion 1 (2 Poins): Go-ack n RQ In his quesion, we review how
More informationWe are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors
We are InechOpen, he world s leading publisher of Open Access books Buil by scieniss, for scieniss 4,000 116,000 120M Open access books available Inernaional auhors and ediors Downloads Our auhors are
More informationDetection of salient objects with focused attention based on spatial and temporal coherence
ricle Informaion Processing Technology pril 2011 Vol.56 No.10: 1055 1062 doi: 10.1007/s11434-010-4387-1 SPECIL TOPICS: Deecion of salien objecs wih focused aenion based on spaial and emporal coherence
More informationA Fast Non-Uniform Knots Placement Method for B-Spline Fitting
2015 IEEE Inernaional Conference on Advanced Inelligen Mecharonics (AIM) July 7-11, 2015. Busan, Korea A Fas Non-Uniform Knos Placemen Mehod for B-Spline Fiing T. Tjahjowidodo, VT. Dung, and ML. Han Absrac
More informationCONTEXT MODELS FOR CRF-BASED CLASSIFICATION OF MULTITEMPORAL REMOTE SENSING DATA
ISPRS Annals of he Phoogrammery, Remoe Sensing and Spaial Informaion Sciences, Volume I-7, 2012 XXII ISPRS Congress, 25 Augus 01 Sepember 2012, Melbourne, Ausralia CONTEXT MODELS FOR CRF-BASED CLASSIFICATION
More informationIt is easier to visualize plotting the curves of cos x and e x separately: > plot({cos(x),exp(x)},x = -5*Pi..Pi,y = );
Mah 467 Homework Se : some soluions > wih(deools): wih(plos): Warning, he name changecoords has been redefined Problem :..7 Find he fixed poins, deermine heir sabiliy, for x( ) = cos x e x > plo(cos(x)
More informationOcclusion-Free Hand Motion Tracking by Multiple Cameras and Particle Filtering with Prediction
58 IJCSNS Inernaional Journal of Compuer Science and Nework Securiy, VOL.6 No.10, Ocober 006 Occlusion-Free Hand Moion Tracking by Muliple Cameras and Paricle Filering wih Predicion Makoo Kao, and Gang
More informationMotion Level-of-Detail: A Simplification Method on Crowd Scene
Moion Level-of-Deail: A Simplificaion Mehod on Crowd Scene Absrac Junghyun Ahn VR lab, EECS, KAIST ChocChoggi@vr.kais.ac.kr hp://vr.kais.ac.kr/~zhaoyue Recen echnological improvemen in characer animaion
More informationVisual Indoor Localization with a Floor-Plan Map
Visual Indoor Localizaion wih a Floor-Plan Map Hang Chu Dep. of ECE Cornell Universiy Ihaca, NY 14850 hc772@cornell.edu Absrac In his repor, a indoor localizaion mehod is presened. The mehod akes firsperson
More informationAn efficient approach to improve throughput for TCP vegas in ad hoc network
Inernaional Research Journal of Engineering and Technology (IRJET) e-issn: 395-0056 Volume: 0 Issue: 03 June-05 www.irje.ne p-issn: 395-007 An efficien approach o improve hroughpu for TCP vegas in ad hoc
More informationSENSING using 3D technologies, structured light cameras
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 39, NO. 10, OCTOBER 2017 2045 Real-Time Enhancemen of Dynamic Deph Videos wih Non-Rigid Deformaions Kassem Al Ismaeil, Suden Member,
More information1.4 Application Separable Equations and the Logistic Equation
1.4 Applicaion Separable Equaions and he Logisic Equaion If a separable differenial equaion is wrien in he form f ( y) dy= g( x) dx, hen is general soluion can be wrien in he form f ( y ) dy = g ( x )
More informationDAGM 2011 Tutorial on Convex Optimization for Computer Vision
DAGM 2011 Tuorial on Convex Opimizaion for Compuer Vision Par 3: Convex Soluions for Sereo and Opical Flow Daniel Cremers Compuer Vision Group Technical Universiy of Munich Graz Universiy of Technology
More informationPrecise Voronoi Cell Extraction of Free-form Rational Planar Closed Curves
Precise Voronoi Cell Exracion of Free-form Raional Planar Closed Curves Iddo Hanniel, Ramanahan Muhuganapahy, Gershon Elber Deparmen of Compuer Science Technion, Israel Insiue of Technology Haifa 32000,
More informationEfficient Multi-view Video Coding using 3D Motion Estimation and Virtual Frame
Neurocompuing journal homepage: www. elsevi er.c om Efficien Muli-view Video Coding using 3D Moion Esimaion and Virual Frame Manoranjan Paul* Cenre for Research in Complex Sysems, School of Compuing &
More informationDetection Tracking and Recognition of Human Poses for a Real Time Spatial Game
Deecion Tracking and Recogniion of Human Poses for a Real Time Spaial Game Feifei Huo, Emile A. Hendriks, A.H.J. Oomes Delf Universiy of Technology The Neherlands f.huo@udelf.nl Pascal van Beek, Remco
More informationA new algorithm for small object tracking based on super-resolution technique
A new algorihm for small objec racking based on super-resoluion echnique Yabunayya Habibi, Dwi Rana Sulisyaningrum, and Budi Seiyono Ciaion: AIP Conference Proceedings 1867, 020024 (2017); doi: 10.1063/1.4994427
More informationLess Pessimistic Worst-Case Delay Analysis for Packet-Switched Networks
Less Pessimisic Wors-Case Delay Analysis for Packe-Swiched Neworks Maias Wecksén Cenre for Research on Embedded Sysems P O Box 823 SE-31 18 Halmsad maias.wecksen@hh.se Magnus Jonsson Cenre for Research
More informationIn Proceedings of CVPR '96. Structure and Motion of Curved 3D Objects from. using these methods [12].
In Proceedings of CVPR '96 Srucure and Moion of Curved 3D Objecs from Monocular Silhouees B Vijayakumar David J Kriegman Dep of Elecrical Engineering Yale Universiy New Haven, CT 652-8267 Jean Ponce Compuer
More informationLearning nonlinear appearance manifolds for robot localization
Learning nonlinear appearance manifolds for robo localizaion Jihun Hamm, Yuanqing Lin, and Daniel. D. Lee GRAS Lab, Deparmen of Elecrical and Sysems Engineering Universiy of ennsylvania, hiladelphia, A
More informationAlgorithm for image reconstruction in multi-slice helical CT
Algorihm for image reconsrucion in muli-slice helical CT Kasuyuki Taguchi a) and Hiroshi Aradae Medical Engineering Laboraory, Toshiba Corporaion, 1385 Shimoishigami, Oawara, Tochigi 324-855, Japan Received
More informationLecture 18: Mix net Voting Systems
6.897: Advanced Topics in Crypography Apr 9, 2004 Lecure 18: Mix ne Voing Sysems Scribed by: Yael Tauman Kalai 1 Inroducion In he previous lecure, we defined he noion of an elecronic voing sysem, and specified
More informationViewpoint Invariant 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors
Viewpoin Invarian 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors Chaohui Wang 1,2, Yun Zeng 3, Loic Simon 1, Ioannis Kakadiaris 4, Dimiris Samaras 3, Nikos Paragios 1,2
More informationPartition-based document identifier assignment (PBDIA) algorithm. (long queries)
( ) Pariion-based documen idenifier assignmen (PBDIA) algorihm PBDIA (long queries) (parallel IR) :,,,, d-gap Compressing an invered file can grealy improve query performance of an informaion rerieval
More informationReal-time 2D Video/3D LiDAR Registration
Real-ime 2D Video/3D LiDAR Regisraion C. Bodenseiner Fraunhofer IOSB chrisoph.bodenseiner@iosb.fraunhofer.de M. Arens Fraunhofer IOSB michael.arens@iosb.fraunhofer.de Absrac Progress in LiDAR scanning
More information