Feature Enhancement with a Reservoir-based Denoising Auto Encoder

Size: px
Start display at page:

Download "Feature Enhancement with a Reservoir-based Denoising Auto Encoder"

Transcription

1 Featue Enhancement with a Resevoi-based Denoising Auto Encode Azaakhsh Jalalvand, Kis Demuynck, Jean-Piee Matens Multimedia Lab, ELIS, Ghent Univesity/iMinds, Sint-Pietesnieuwstaat 41, B-9000, Ghent, Belgium Azaakhsh.Jalalvand@ugent.be Abstact Recently, automatic speech ecognition has advanced significantly by the intoduction of deep neual netwoks fo acoustic modeling. Howeve, thee is no clea evidence yet that this does not come at the pice of less genealization to conditions that wee not pesent duing taining. On the othe hand, acoustic modeling with Resevoi Computing (RC) did not offe impoved clean speech ecognition but it leads to good obustness against noise and channel distotions. In this pape, the aim is to establish whethe adding featue denoising in the font-end can futhe impove the obustness of an RC-based ecognize, and if so, whethe one can devise an RC-based Denoising Auto Encode that outpefoms a taditional denoise like the ETSI Advanced Font-End. In ode to answe these questions, expeiments ae conducted on the Auoa-2 benchmak. Keywods ecuent neual netwoks, esevoi computing, denoising auto encode, obust speech ecognition I. INTRODUCTION If one wants to apply automatic speech ecognition (ASR) on mobile devices, an ASR system is needed that is accuate and obust against the pesence of noise and channel distotions. Despite many yeas of wok, achieving obust ecognition emains a big challenge. Since an ASR is composed of a font-end (fo the extacting acoustic featue vectos) and a back-end (fo decoding these featue vectos) one can envision two types of techniques to tackle the poblem. One is to enhance (denoise) the featue vectos in the font-end [1], [2], the othe is to take account of the noise duing featue decoding in the back-end [3], [4]. On top of that, one can develop acoustic model types that ae intinsicaally moe esistant to noise [5] [8]. Typical featue enhancement adopts signal pocessing techniques such as Wiene filteing (single-channel) and beamfoming (multi-channel). They gave ise to an Advanced Fontend (AFE) [1] and the SPLICE [9] algoithms, espectively. Less typical is to adopt a neual netwok to convet the noisy featue vectos to clean vectos [10]. Such a netwok is called a Denoising Auto Encode (DAE). Back-end techniques ae usually esticted to GMM-based acoustic modeling (GMM = Gaussian Mixtue Model) because fo this type of model it is easy to undestand how to include the vaiations due to the noise into the decode. In pevious wok we investigated Resevoi Computing (RC) [11], [12] as a neual-based appoach to acoustic modeling. The agument was that RC-netwoks ae dynamical systems which ae able to focus on meaningful speech dynamics which ae bound to diffe fom the dynamics induced by the noise. We achieved quite competitive esults fo noiseobust continuous digit ecognition (CDR) on Auoa-2 [7], [13]. A deep RC-HMM hybid (HMM = Hidden Makov Model) can appaently compete with a taditional GMM- HMM system in clean conditions and clealy outpefom it in noisy envionments. Moe ecently, we also discoveed that supplying AFE featues to the RC-HMM system can futhe inceases its noise obustness [8]. In this wok we futhe investigate the impact of the fontend in an RC-HMM system. In paticula, we popose to apply an RC-netwok fo featue denoising befoe ecognition. We ague that such complex nonlinea dynamical system is bound to captue bette the complex elationships between noisy and clean utteances than the taditional signal pocessing methods that have only a vey shot memoy. The est of the pape is oganized as follows: Section II povides a concise outline of the basic pinciples of RC, Section III descibes ways of integating esevoi netwoks in an RC-HMM hybid speech ecognize, Section IV eviews the RC-based denoising of the acoustic featues we popose and Sections V and VI summaize the expeimental famewok (Auoa-2) and the esults obtained within this famewok. The pape ends with conclusions and futue wok. II. RESERVOIR COMPUTING (RC) The basic pinciple of RC is that one can etieve infomation fom sequential inputs by means of a two-laye RNN with the following chaacteistics (see Fig. 1). The inputs ae U t Input laye W in R t W ec Resevoi W out Y t Readout laye U t W in R t Y t andom input weights andom ecuent weights tained output weights input featue vecto input weight matix esevoi state vecto W ec ecuent weight matix W out output weight matix eadout vecto Fig. 1. Speech A basic RC Font-end system consists of a esevoi and RC a eadout laye. The esevoi is composed of inteconnected Mean & non-linea neuons with andomly fixed weights. The Featue eadout laye Resevoi Vaiance consists of linea neuons Resevoi with tained weights. extaction Nomalization spasely Intemediate-pocessing connected to a pool of ecuently Classical GMM-HMM inteconnected nonlinea neuons foming so-called esevoi and the outputs Decoelation GMMs Input laye Decode Readouts Digits Speech Font-end

2 ae obtained as a linea combinations of the esevoi outputs. The esevoi neuons constitute a hidden laye that, at time t, is diven by inputs U t and delayed hidden laye outputs R t 1. Impotant is that (1) the weights of the hidden neuons ae fixed, (2) the output neuons ae linea, and consequently, (3) the output weights can be optimized by means of a least squaed linea egession. The entie system is called a esevoi netwok. Its outputs Y t ae usually called eadouts [11] so as to diffeentiate them unambiguously fom the esevoi outputs R t. In ode to become moe esistant to andom intefame changes in the inputs (e.g. changes due to the spectal analysis o the ambient noise), one can ceate a esevoi of Leaky Integato Neuons [14]). A netwok containing such a esevoi is govened by the following equations: R t = (1 λ)r t 1 + λ f es (W in U t + W ec R t 1 ) (1) Y t = W out R t (2) with a leak ate λ between 0 and 1, with W in, W ec containing the input and ecuent weights to the esevoi neuons, and with W out containing the output weights. The weights of the hidden neuons ae fixed by means of a andom pocess chaacteized by fou contol paametes (see [15] fo moe details): (1) α U, the maximal absolute eigenvalue of the input weight matix W in, (2) ρ, the maximal absolute eigenvalue of the ecuent weight matix W ec, (3) K in, the numbe of inputs diving each esevoi neuon and (4) K ec, the numbe of delayed esevoi outputs diving each esevoi neuon. The fist two paametes contol the stengths of the input and the ecuent stimulations of a esevoi neuon, wheeas the latte two contol the spasity of the input and ecuent weight matices. Togethe with λ they constitute the esevoi contol paametes which have to be set popely in ode to assue the esevoi is well behaved. Any effective esevoi should at least have the so-called echo state popety, stating that with time, the esevoi should foget about the initial state it was in. That is also why a esevoi netwok was oiginally called an Echo State Netwok [11]. It was shown in [11] that the echo state popety holds if ρ called the spectal adius is smalle than 1. The esevoi can be envisioned as a pedefined but complex non-linea dynamical system that pefoms a tempoal analysis of the input steam. Ou claim is that such a system can extact featues which ae esistant to the pesence of noise whose dynamics diffe fom the speech dynamics. The output weights ae detemined so that they minimize the mean squaed eo between the eadouts Y t and the desied eadouts D t ove the taining examples [7]. As a consequence, they follow fom a set of linea equations. If a esevoi netwok is tained fo ecognition, the desied output D t is a unit vecto with one non-zeo enty encoding the desied HMM-state at time t. If it is tained fo denoising, D t is the desied clean speech featue vecto at time t. III. A HYBRID RC-HMM FOR SPEECH RECOGNITION An RC-HMM hybid woks with an HMM that epesents the task and a neual netwok that is supposed to convet the inputs U t into HMM state likelihoods. The seach fo the best path though the HMM is found using a Vitebi seach. In the case of an RC-HMM hybid, the eadouts y t,i (with i indexing the eadouts) ae assumed to esemble the posteio pobabilities P (q t = i U1). t This means that z t,i = y t,i /P (q t = i) is a scaled likelihood and consequently, that the best state sequence follows fom T ˆq = ag max P (q, y) = ag max z t,qt P (q t q t 1 ), q q t=1 Fig. 2 shows the achitectue fo the case of continuous digit ecognition (CDR) and a multi-stage RC netwok in which each netwok output is supplied to the next stage [8]. The tansition pobability P 0 which is added to the digit loop contols the balance between deletions and insetions. Since y t,i is not confined to [0,1] it is fist mapped to that inteval befoe computing z t,i. The mapping is achieved by a simple clip-and-scale appoach, as descibed in [8]. The diffeent stages of the RC-netwok ae tained independently, one afte the othe. As in [8], [16] we use a bi-diectional RC-netwok with one esevoi pocessing the fames fom left-to-ight and anothe one pocessing them fom ight-to-left. The eadouts at time t ae then computed as a linea function of the two esevoi outputs at time t. IV. AN RC-BASED DENOISING AUTOENCODER A system that aims at econstucting clean featue vectos fom noisy featue vectos is called a denoising autoencode (DAE) [17]. A taditional example of a DAE is spectal subtaction: it is able to emove pat of the noise by woking fame-by-fame (= memoy-less). One can ague that a complex dynamical system with memoy should be able to do a bette job. Theefoe, we popose to ceate a DEA by means of an RC-netwok incopoating one unidiectional esevoi because such a system has memoy and it pemits to pefom a non-linea tansfomation of the noisy featues to clean featues. The back-side of this appoach is of couse that the DAE has to be tained and that duing this taining one needs the clean vesion of the noisy speech featue vectos. Since we use MFCCs (Mels-scale Fequency Cepstal Coefiificients) as the featue vectos, and as the DAE has to denoise these MFCCs, one can adopt diffeent stategies fo achieving this. In fact, the MFCC-computation includes a chain of pocessing stages (See Fig. 3), and the denoising RCnetwok could be inseted at diffeent positions in this chain. Taditional speech enhancement fo instance often woks in the Discete-time Fouie Tansfom (DFT) domain [18], [19] (at position P1), wheeas most wok on obust ASR applies denoising in the log mel-fequency domain [20] (at position P2) and in the MFCC-domain [21], [22] (at position P3). Note that although the high dimensionality of the input nomally leads to a highe computational cost [23], this may not necessaily be the case hee due to the spase inteconnection between inputs and esevoi neuons, and also due to employing linea nodes as the eadouts. Theefoe, we conside a RC-netwok at all positions. Futhemoe, each RC-netwok is supposed to have access to static as well as the dynamic featues, but the netwok is only expected to poduce denoised static featues. The dynamic paametes of the input to the ASR ae then deived fom these denoised static featues.

3 Featue extaction Mean & Vaiance Nomalization ut laye Resevoi Resevoi eadouts Intemediate-pocessing Inputs Laye 1 Decoelation GMMs U Y 1 Classical GMM-HMM Digits Intemediate layes Decode Y k Laye L Y L P(U q) I1 D 1 D Speech Featue extaction Font-end Mean & Vaiance Nomalization GMMs Posteio pobabilities I2 # Hybid RC-HMM F Readouts to Digits Resevoi Fig. 2. Achitectue Resevoi of an RC-HMM Log-Likelihood hybid compising Decode a multi-laye esevoi netwok fo CDR. The HMM has two initial states (I1 and I2), one final state (F) and it compises 11 multi-state digit convete models (D1... D11) and a single state silence model (#) Input laye Readouts P o Speech U t± n / Windowing Deivatives Calculato DFT P1 Log Melfilte P2 P3 DCT bank Input laye 3n n Resevoi / Resevoi / Û t Readouts TABLE I. NOISE TYPES AND FILTERS USED IN AURORA-2 DATASET Tain & Test A Test B Test C N1: subway N1: estauant N1: subway Noise N2: babble N2: steet N2: steet types N3: ca noise N3: aipot N4: exhibition hall N4: tain station Filte G712 G712 MIRS Fig. 3. Possible points to apply the denoising system (top) and thei stuctue in moe details (down) V. EXPERIMENTAL SETUP In this section we pesent the expeimental famewok that was adopted to investigate the potential of the diffeent system configuations pesented in the pevious sections. A. Speech copus: Auoa-2 The Auoa-2 copus consists of clean and noise coupted digit sequences counting 1 to 7 digits pe utteance. Each utteance is passed though a G712 o a MIRS filte [24], and then sampled at 8 khz. Since thee ae two vaiants of 0 in Ameican English, namely zeo and oh, the vocabulay is composed of 11 digits. The data is divided into a taining pat and an evaluation pat. The famewok suppots two types of expeiments: clean taining expeiments in which systems ae developed on 8440 clean taining utteances fom 110 adults and multi-style taining expeiments in which systems ae developed on 8440 noise coupted vesions of the same utteances. The couption is andomly chosen out of fou noise types and five SNRs ( (clean), 20, 15, 10 and 5 db). The evaluation utteances come fom speakes that ae not pesent in the taining data. They ae divided into thee tests. Tests A and B each contain 28,028 utteances coveing 4004 diffeent digit sequences, 4 noise types and 7 SNRs ( (clean), 20, 15, 10, 5, 0, and -5 db). The noise types occuing in Test B do not occu in the multistyle taining data, while those of Test A do. Test C contains 14,014 utteances coveing 2002 diffeent digit sequences, 2 noise types (one matched and one mismatched) and 7 SNRs. Unlike all othe utteances they passed though a MIRS instead of a G712 filte (see Table I). B. Evaluation esults We epot aveage Wod Eo Rates (WERs) on tests A-C fo all SNRs, and we conside both clean speech taining and multi-style taining. In multi-style taining, clean utteances and utteances with SNRs between 20 and 5 db ae used fo taining. In the final evaluation phase both the acoustic models and the DAE ae tained on the complete taining set, but using the contol paametes that wee found optimal in a development phase duing which two thids of the taining set ae used fo taining and the emaining thid fo contol paamete optimization (e.g. the Vitebi decode penalty, P 0 of the ecognize). In this pape, we only epot the esults of the final evaluation phase fo each expeiment. C. Refeence systems In ode to set a efeence, we fist epot some state-ofthe-at system pefomances (see Table II). In paticula, we conside the ML-based GMM systems using AFE-featues poposed in [25], the ML-based and MCE-based GMM systems poposed in [2] and [26], two GMM systems embedding moe sophisticated back-ends based on joint uncetainty decoding (JUD) and Vecto Tylo Seies (VTS) espectively [4] and the tandem system embedding deep belief netwoks and GMMs (T-DBN-GMM), epoted in [5]. The figues show that advanced back-end techniques (JUD and VTS) lead to a lage gain in noise obustness than advanced font-end techniques, but it is not clea fom the papes how much degadation they induce fo clean speech ecognition. D. Font-end setups We will investigate thee diffeent acoustic featue sets: MFCCs (log enegy and c 1... c 12 ), Mel filtebank featues (MelFB) (24 channel log enegies), and the AFE featues (denoised c 0... c 12 without dopping non-speech fames) [1]. In all cases, the analysis is pefomed on 30 ms Hammingwindowed fames and the hop size between fames is τ f =

4 TABLE II. COMPARING AVERAGE WERS (IN %) PER CONDITION FOR TEST SETS A-C OF AURORA-2 USING A 3-LAYER HYBRID RC-HMM FOR BOTH CLEAN AND MULTI-STYLE TRAINING. Clean Multi System Clean dB Clean dB GMM (AFE) [25] GMM (MFCC) [2] GMM (SPLICE) [27] GMM (MFCC-MCE) [26] GMM (VTS) [4] GMM (JUD) [4] T-DBN-GMM [5] RC (MelFB) RC (MFCC) RC (AFE) Seen Unseen Ref P1 P2 P3 P3 P Coss coelation Fig. 4. The coelation between the output of the DAE and the clean nomalized featues, by denoising the data steam in diffeent points. 10 ms. Each featue set is supplemented with and featues. Befoe supplying the featue vectos to the ASR, an utteance-wise nomalization that ceates zeo-mean and unitvaiance inputs pe featue is pefomed. E. RC-HMM hybid setup Following ou pevious wok, the esevoi contol paametes of each esevoi ae detemined in the same way.. Defining τ λ = τf / ln(1 λ) as the leaky integation time constant and T as the expected state duation, we select (ρ, τ λ, K in, K ec ) = (0.8, T, 10, 10) and we chose α U so that the aveage vaiance of the eseevoi outputs eaches a cetain level [15]. In this wok we utilize a 3-laye RC-HMM system and since layes 2 and 3 see basically the same inputs, α U is taken the same fo both layes. The size of the esevoi defined as the numbe of neuons it contains (N es ) is consideed to be an independent vaiable. Note that since K in and K ec ae kept fixed to 10, the CPU-time needed fo calculating the eadouts scales linealy with the size of the esevoi. The esevoi netwoks ae tained by means of a Tikhonov egession [28] and each digit is modeled by a 7-state left-toight HMM whilst the silence is modeled by a single state. The taget outputs encode the visited HMM state at time t. F. RC-based DAE setup In line with [15], we contemplate that ρ and λ ae the only task-dependent contol paametes of a esevoi. Consequently, even though denoising is a completely diffeent task than digit state ecognition, we keep Kin = K ec = 10 and we maintain the same method as befoe fo computing α U. Futhemoe, since leaky integation is a fom of smoothing coesponding to a some low-pass filte, we ague that no leaky integation should be applied hee as the spectum of the denoised inputs is not expected to be naowe than that of the noisy inputs. Consequently, thee is only one paamete left to optimiza, namely ρ. In all expeiments, iespective of the chosen featue set, we have used a 2-laye RC-netwok with one unidiectional esevoi of 1K neuons pe laye. VI. EXPERIMENTAL RESULTS In this section we eview the expeiments we conducted to assess the capacity of an RC-netwok to denoise the featues and the capacity it has to futhe aise the obustness of the CDR system. A. Denoising capacity of an RC-netwok Duing assessment of the denoising capacity of the RCnetwoks we adopt the aveage coelations between the 39 denoised and the clean MFCCs featues as the quality citeion. Howeve, since the ASR will always wok with utteance based mean and vaiance nomalized featues, the coelations ae computed afte this nomalization step. We conducted two expeiments: one in which the test samples come fom Test A and epesent conditions that ae pesent duing taining (= Seen ) and one in which the test samples come fom Test C and epesent anothe channel and unseen SNRs (= Unseen ). In each expeiment, the DAE is imputed at all thee positions P1, P2 and P3 (see Section IV). The esults ae depicted in Fig. 4. Ref denotes to the coelation existing between the aw (noisy) and the clean featues. The data show that denoising in the DFT-domain is not woking well but denoising in the two othe domains does. In fact, iespective of the expeiment, denoising the MelFB and the MFCC featues seem to be equally effective. We will theefoe evaluate them both in combination with CDR. In an additional expeiment we constucted RC-netwoks that wee tained to denoise the dynamic as well as the static MFCCs instead of just the static featues. This appoach lead to the aveage coelation maked by P3. Thee seems to be no benefit in woking this way. Finally, we wanted to assess the limits of the RC-based DAE by aising the esevoi size fom 1K to 4K neuons. The esults obtained with the lage esevois (maked as P3 ) ae only slightly bette than the ones obtained with the smalle esevois. In ode to illustate the effect of the DAE, we have depicted on Fig. 5 the MelFB spectogams fo a noisy speech sample (SNR = 5dB) befoe and afte denoising togethe with the clean speech spectogam. It is especially notewothy that the DAE does an excellent job in the silence pats. This is patly due to the lage numbe of non-speech (silent) fames in the taining data.

5 TABLE III. COMPARING AVERAGE WERS (IN %) ON TEST SETS A-C PER CONFIGURATION: (1) THE BASELINE RC-HMM RECOGNIZER TRAINED ON MFCC AND AFE, (2) THE BASELINE RECOGNIZER TRAINED ON MFCC AND AFE BUT TESTED ON DENOISED FEATURES, AND (3) THE RETRAINED RECOGNIZER TESTED ON DENOISED FEATURES. (a) Noisy MelFB featues Baseline Clean Multi Clean dB Clean dB MFCC AFE Test on MFCC Baseline AFE Reain & MFCC Test AFE (b) Clean copy (c) Denoised MelFB Fig. 5. Denoising MelFB featues of a sample with steet noise of SNR 5 db fom Test C. B. Need fo a denoising font-end in an RC-based CDR In a fist ecognition expeiment we test the thee acoustic featue sets descibed in the pevious section in combination with a thee-laye bi-diectional RC-HMM hybid with 8K neuons pe laye. The esults of this expeiment ae listed in the bottom section of Table II. Appaently, in the clean speech taining expeiment thee is only a little diffeence between the featue sets in the matched condition (clean). In the mismatched conditions howeve, the AFE does lead to moe obustness, be it that the gain is vey much less impessive than it is fo GMM-based systems. This finding seems to confim the hypothesis that a esevoi can filte out a lage pat of the noise without having been confonted with noise duing taining. In the multi-style taining expeiment, whee the mismatch between the test and the taining emains much smalle, the positive effect of the AFE is also much smalle, but nevetheless emaining to some extent. Consequently, if an RC-based DAE could outpefom the AFE it could lead to inceased the obustness. C. Can RC-based DAE outpefom the AFE? In ode to answe the question, we conducted expeiments with a font-end incopoating an RC-based DAE at position P3. We consideed thee configuations: (1) the baseline featues and acoustic models ae used (= Baseline), (2) the denoised featues ae used but the acoustic models ae left unchanged (= Test on Baseline) and (3) the denoised featues ae used in combination with acoustic models that ae etained on these featues (= Retain & Test). In the case of etaining, we maintained the distinction between clean speech taining and multi-style taining, but of couse, the clean speech taining esults have to be intepeted with cae because afte all, the DAE has aleady seen the noisy taining examples that belong to the multi-style taining copus. The esults listed in Table III show that the RC-based denoise does not outpefom the AFE as the pefomance of the baseline system with AFE featues is slightly bette than that of a system woking with plain MFCCs and an RC-based DAE on all conditions. In spite of this, compaing the AFE ows of the baseline and the new system, demonstates that the RC-based DAE does lead to a small exta gain in stongly mismatched conditions (clean speech taining and noisy test samples) but that this comes at the expense of a significant loss of the pefomance fo matched conditions (clean test samples). In the multi-style taining expeiment whee the mismatch emains modeate, the RC-based denoise even has a (small) detimental effect in all conditions. This can only mean that the DAE emoves infomation fom the featue steam that is othewise exploitable by the RC back-end. As could be expected, the Test on Baseline configuation is chaacteized by a stong detimental effect in matched conditions because the featues used duing taining and test have become diffeent even in these conditions. In a contol expeiment, we also tained and tested the RCbased denoising of MFCCs by inseting a DAE at position P2. In line with the coelations measued befoe, the emeging systems achieved vey simila esult showing no pefeence fo positions P2 and P3. VII. CONCLUSION AND FUTURE WORK Recently, we wee able to show that esevoi computing (RC) is a viable acoustic modeling technique that can lead to moe obust automatic speech ecognition (ASR). Expeiments on the Auoa-2 benchmak demonstated that ou RC-based systems aleady outpefom the most complex GMM-HMM based systems on the task of continuous digit ecognition. The main objectives of the pesent pape wee: (1) to establish whethe esevoi netwoks can be tained to denoise the featue vectos, (2) to investigate how much benefit can be attained by applying denoising in the font-end of the RC-ecognize, (3) to establish whethe RC-based denoising can outpefom taditional signal pocessing based techniques (e.g. like in the ETSI advanced font-end (AFE)), and (4)

6 to find out whethe the two denoising techniques ae maybe complementay. Fist of all, we could show that an RC-based denoising auto encode (DAE) imputed at a suitable position in the MFCC font-end can lead to inceasing mean coelations between the featues of the noisy and the clean speech utteances. Next, we could demonstate that adopting featue denoising in the font-end of an RC-based ASR is beneficial, but not as much as it is in the case of a taditional GMM-based ASR. A somewhat disappointing esult is that an RC-based DAE fails to induce as much impovement as the AFE. Howeve, by combining the two tecniques, a small gain is possible to achieve in seveely mismatched conditions, be it at the expense of a small degadation in the matched condition. In the nea futue we will compae the RC-based DAE with the AFE in situations whee the noise is moe non-stationay. Futhemoe, we hope to apply some of the ideas which lead to uncetainty decoding in an RC-backend. Finally, we intend to extend ou eseach to noise obust lage vocabulay speech ecognition (e.g. Auoa-4). ACKNOWLEDGEMENT The eseach leading to the esults pesented hee has eceived funding fom Flemish Science Foundation (FWO) unde gant ageement G N (RECAP). REFERENCES [1] ETSI, Speech pocessing, tansmission and quality aspects STQ; distibuted speech ecognition; advanced font-end featue extaction algoithm; compession algoithms, ES , Tech. Rep., [2] C.-P. Chen and J. A. Bilmes, MVA pocessing of speech featues, IEEE Tans. Audio, Speech and Language Pocessing, vol. 15, no. 1, pp , jan [3] M. Van Segboeck and H. Van Hamme, Advances in missing featue techniques fo obust lage vocabulay continuous speech ecognition, IEEE Tans. Audio, Speech and Language Pocessing, vol. 19, no. 1, pp , [4] H. Xu, M. Gales, and K. Chin, Joint uncetainty decoding with pedictive methods fo noise obust speech ecognition, IEEE Tans. Audio, Speech and Language Pocessing, vol. 19, no. 6, pp , [5] O. Vinyals and S. Ravui, Compaing multilaye pecepton to deep belief netwok tandem featues fo obust ASR, in Poc. ICASSP, 2011, pp [6] S.-X. Zhang and M. Gales, Stuctued suppot vecto machines fo noise obust continuous speech ecognition, in Poc. Intespeech, 2011, pp [7] A. Jalalvand, F. Tiefenbach, and J.-P. Matens, Continuous digit ecognition in noise: Resevois can do an excellent job! in Poc. Intespeech, 2012, p. ID:644. [8] A. Jalalvand, K. Demuynck, and J.-P. Matens, Noise obust continuous digit ecognition with esevoi-based acoustic models, in Poc. ISPACS, 2013, p. ID:99. [9] L. Deng, A. Aceo, L. Jiang, J. Doppo, and X. Huang, Highpefomance obust speech ecognition using steeo taining data, in Poc. ICASSP, vol. 1, 2001, pp [10] S. Tamua and A. Waibel, Noise eduction using connectionist models, in Poc. ICASSP, ap 1988, pp [11] H. Jaege, The echo state appoach to analysing and taining ecuent neual netwoks - with an eatum note, GMD Repot 148, Geman National Reseach Cente fo Infomation Technology, Tech. Rep., [12] D. Vestaeten, B. Schauwen, M. D Haene, and D. Stoobandt, An expeimental unification of esevoi computing methods, IEEE Tans. Neual Netwoks, vol. 20, pp , [13] A. Jalalvand, F. Tiefenbach, D. Vestaeten, and J.-P. Matens, Connected digit ecognition by means of esevoi computing, in Poc. Intespeech, 2011, pp [14] H. Jaege, M. Lukoševičius, D. Popovici, and U. Siewet, Optimization and applications of echo state netwoks with leaky-integato neuons, Neual Netwoks, vol. 20, no. 3, pp , Ap [15] M. Lukoševičius, A pactical guide to applying echo state netwoks, in Neual Netwoks: Ticks of the Tade, se. Lectue Notes in Compute Science. Spinge Belin Heidelbeg, 2012, vol. 7700, pp [16] F. Tiefenbach, A. Jalalvand, K. Demuynck, and J.-P. Matens, Acoustic modeling with hieachical esevois, IEEE Tans. Audio, Speech and Language Pocessing, vol. PP, no. 99, [17] P. Vincent, H. Laochelle, Y. Bengio, and P.-A. Manzagol, Extacting and composing obust featues with denoising autoencodes, in Poc. ICML, 2008, pp [18] Y. Ephaim and D. Malah, Speech enhancement using a minimummean squae eo shot-time spectal amplitude estimato, IEEE Tans. Acoustics,Speech and Signal Pocessing, vol. 32, no. 6, pp , [19] P. J. Wolfe and S. J. Godsill, Efficient altenatives to the ephaim and malah suppession ule fo audio signal enhancement, in special issue) EURASIP JASP on Digital Audio fo Multimedia Communications, 2003, pp [20] D. Yu, L. Deng, J. Doppo, J. Wu, Y. Gong, and A. Aceo, Robust speech ecognition using a cepstal minimum-mean-squae-eomotivated noise suppesso, IEEE Tans. Audio, Speech and Language Pocessing, vol. 16, no. 5, pp , [21] K. Indebo, R. Povinelli, and M. Johnson, Minimum mean-squaed eo estimation of mel-fequency cepstal coefficients using a novel distotion model, IEEE Tans. Audio, Speech and Language Pocessing, vol. 16, no. 8, pp , [22] L. Deng, J. Doppo, and A. Aceo, Estimating cepstum of speech unde the pesence of noise using a joint pio of static and dynamic featues. IEEE Tans. Speech Audio Pocessing, vol. 12, no. 3, pp , [23] R. Rotili, E. Pincipi, S. Cifani, F. Piazza, and S. Squatini, Multichannel featue enhancement fo obust speech ecognition, Speech technologies. InTech, ISBN, pp , [24] H.-G. Hisch and D. Peace, The AURORA expeimental famewok fo the pefomance evaluation of speech ecognition systems unde noise conditions, in Automatic Speech Recognition: Challenges fo the Next Millennium. ISCA ITRW, 2000, pp [25] H. G. Hisch and D. Peace, Applying the advanced ETSI fontend to the Auoa-2 task, vesion 1.1, Tech. Rep., [26] X. Xiao, J. Li, E.-S. Chng, H. Li, and C.-H. Lee, A study on the genealization capability of acoustic models fo obust speech ecognition, IEEE Tans. Audio, Speech and Language Pocessing, vol. 18, no. 6, pp , [27] T. Kai, M. Suzuki, and K. Chijiiwa, Combination of SPLICE and featue nomalization fo noise obust speech ecognition, Intl. Wokshop on Nonlinea Cicuits, Communications and Signal Pocessing, vol. 16, no. 4, pp , [28] C. M. Bishop, Taining with noise is equivalent to Tikhonov egulaization, Neual Computation, vol. 7, pp , 1994.

Controlled Information Maximization for SOM Knowledge Induced Learning

Controlled Information Maximization for SOM Knowledge Induced Learning 3 Int'l Conf. Atificial Intelligence ICAI'5 Contolled Infomation Maximization fo SOM Knowledge Induced Leaning Ryotao Kamimua IT Education Cente and Gaduate School of Science and Technology, Tokai Univeisity

More information

Journal of World s Electrical Engineering and Technology J. World. Elect. Eng. Tech. 1(1): 12-16, 2012

Journal of World s Electrical Engineering and Technology J. World. Elect. Eng. Tech. 1(1): 12-16, 2012 2011, Scienceline Publication www.science-line.com Jounal of Wold s Electical Engineeing and Technology J. Wold. Elect. Eng. Tech. 1(1): 12-16, 2012 JWEET An Efficient Algoithm fo Lip Segmentation in Colo

More information

A modal estimation based multitype sensor placement method

A modal estimation based multitype sensor placement method A modal estimation based multitype senso placement method *Xue-Yang Pei 1), Ting-Hua Yi 2) and Hong-Nan Li 3) 1),)2),3) School of Civil Engineeing, Dalian Univesity of Technology, Dalian 116023, China;

More information

Detection and Recognition of Alert Traffic Signs

Detection and Recognition of Alert Traffic Signs Detection and Recognition of Alet Taffic Signs Chia-Hsiung Chen, Macus Chen, and Tianshi Gao 1 Stanfod Univesity Stanfod, CA 9305 {echchen, macuscc, tianshig}@stanfod.edu Abstact Taffic signs povide dives

More information

Effects of Model Complexity on Generalization Performance of Convolutional Neural Networks

Effects of Model Complexity on Generalization Performance of Convolutional Neural Networks Effects of Model Complexity on Genealization Pefomance of Convolutional Neual Netwoks Tae-Jun Kim 1, Dongsu Zhang 2, and Joon Shik Kim 3 1 Seoul National Univesity, Seoul 151-742, Koea, E-mail: tjkim@bi.snu.ac.k

More information

An Unsupervised Segmentation Framework For Texture Image Queries

An Unsupervised Segmentation Framework For Texture Image Queries An Unsupevised Segmentation Famewok Fo Textue Image Queies Shu-Ching Chen Distibuted Multimedia Infomation System Laboatoy School of Compute Science Floida Intenational Univesity Miami, FL 33199, USA chens@cs.fiu.edu

More information

Image Enhancement in the Spatial Domain. Spatial Domain

Image Enhancement in the Spatial Domain. Spatial Domain 8-- Spatial Domain Image Enhancement in the Spatial Domain What is spatial domain The space whee all pixels fom an image In spatial domain we can epesent an image by f( whee x and y ae coodinates along

More information

Slotted Random Access Protocol with Dynamic Transmission Probability Control in CDMA System

Slotted Random Access Protocol with Dynamic Transmission Probability Control in CDMA System Slotted Random Access Potocol with Dynamic Tansmission Pobability Contol in CDMA System Intaek Lim 1 1 Depatment of Embedded Softwae, Busan Univesity of Foeign Studies, itlim@bufs.ac.k Abstact In packet

More information

Spiral Recognition Methodology and Its Application for Recognition of Chinese Bank Checks

Spiral Recognition Methodology and Its Application for Recognition of Chinese Bank Checks Spial Recognition Methodology and Its Application fo Recognition of Chinese Bank Checks Hanshen Tang 1, Emmanuel Augustin 2, Ching Y. Suen 1, Olivie Baet 2, Mohamed Cheiet 3 1 Cente fo Patten Recognition

More information

Topic -3 Image Enhancement

Topic -3 Image Enhancement Topic -3 Image Enhancement (Pat 1) DIP: Details Digital Image Pocessing Digital Image Chaacteistics Spatial Spectal Gay-level Histogam DFT DCT Pe-Pocessing Enhancement Restoation Point Pocessing Masking

More information

Segmentation of Casting Defects in X-Ray Images Based on Fractal Dimension

Segmentation of Casting Defects in X-Ray Images Based on Fractal Dimension 17th Wold Confeence on Nondestuctive Testing, 25-28 Oct 2008, Shanghai, China Segmentation of Casting Defects in X-Ray Images Based on Factal Dimension Jue WANG 1, Xiaoqin HOU 2, Yufang CAI 3 ICT Reseach

More information

IP Network Design by Modified Branch Exchange Method

IP Network Design by Modified Branch Exchange Method Received: June 7, 207 98 IP Netwok Design by Modified Banch Method Kaiat Jaoenat Natchamol Sichumoenattana 2* Faculty of Engineeing at Kamphaeng Saen, Kasetsat Univesity, Thailand 2 Faculty of Management

More information

Point-Biserial Correlation Analysis of Fuzzy Attributes

Point-Biserial Correlation Analysis of Fuzzy Attributes Appl Math Inf Sci 6 No S pp 439S-444S (0 Applied Mathematics & Infomation Sciences An Intenational Jounal @ 0 NSP Natual Sciences Publishing o Point-iseial oelation Analysis of Fuzzy Attibutes Hao-En hueh

More information

Illumination methods for optical wear detection

Illumination methods for optical wear detection Illumination methods fo optical wea detection 1 J. Zhang, 2 P.P.L.Regtien 1 VIMEC Applied Vision Technology, Coy 43, 5653 LC Eindhoven, The Nethelands Email: jianbo.zhang@gmail.com 2 Faculty Electical

More information

Optical Flow for Large Motion Using Gradient Technique

Optical Flow for Large Motion Using Gradient Technique SERBIAN JOURNAL OF ELECTRICAL ENGINEERING Vol. 3, No. 1, June 2006, 103-113 Optical Flow fo Lage Motion Using Gadient Technique Md. Moshaof Hossain Sake 1, Kamal Bechkoum 2, K.K. Islam 1 Abstact: In this

More information

3D Hand Trajectory Segmentation by Curvatures and Hand Orientation for Classification through a Probabilistic Approach

3D Hand Trajectory Segmentation by Curvatures and Hand Orientation for Classification through a Probabilistic Approach 3D Hand Tajectoy Segmentation by Cuvatues and Hand Oientation fo Classification though a Pobabilistic Appoach Diego R. Faia and Joge Dias Abstact In this wok we pesent the segmentation and classification

More information

RANDOM IRREGULAR BLOCK-HIERARCHICAL NETWORKS: ALGORITHMS FOR COMPUTATION OF MAIN PROPERTIES

RANDOM IRREGULAR BLOCK-HIERARCHICAL NETWORKS: ALGORITHMS FOR COMPUTATION OF MAIN PROPERTIES RANDOM IRREGULAR BLOCK-HIERARCHICAL NETWORKS: ALGORITHMS FOR COMPUTATION OF MAIN PROPERTIES Svetlana Avetisyan Mikayel Samvelyan* Matun Kaapetyan Yeevan State Univesity Abstact In this pape, the class

More information

A Recommender System for Online Personalization in the WUM Applications

A Recommender System for Online Personalization in the WUM Applications A Recommende System fo Online Pesonalization in the WUM Applications Mehdad Jalali 1, Nowati Mustapha 2, Ali Mamat 2, Md. Nasi B Sulaiman 2 Abstact foeseeing of use futue movements and intentions based

More information

Input Layer f = 2 f = 0 f = f = 3 1,16 1,1 1,2 1,3 2, ,2 3,3 3,16. f = 1. f = Output Layer

Input Layer f = 2 f = 0 f = f = 3 1,16 1,1 1,2 1,3 2, ,2 3,3 3,16. f = 1. f = Output Layer Using the Gow-And-Pune Netwok to Solve Poblems of Lage Dimensionality B.J. Biedis and T.D. Gedeon School of Compute Science & Engineeing The Univesity of New South Wales Sydney NSW 2052 AUSTRALIA bbiedis@cse.unsw.edu.au

More information

Module 6 STILL IMAGE COMPRESSION STANDARDS

Module 6 STILL IMAGE COMPRESSION STANDARDS Module 6 STILL IMAE COMPRESSION STANDARDS Lesson 17 JPE-2000 Achitectue and Featues Instuctional Objectives At the end of this lesson, the students should be able to: 1. State the shotcomings of JPE standad.

More information

Frequency Domain Approach for Face Recognition Using Optical Vanderlugt Filters

Frequency Domain Approach for Face Recognition Using Optical Vanderlugt Filters Optics and Photonics Jounal, 016, 6, 94-100 Published Online August 016 in SciRes. http://www.scip.og/jounal/opj http://dx.doi.og/10.436/opj.016.68b016 Fequency Domain Appoach fo Face Recognition Using

More information

Color Correction Using 3D Multiview Geometry

Color Correction Using 3D Multiview Geometry Colo Coection Using 3D Multiview Geomety Dong-Won Shin and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST) 13 Cheomdan-gwagio, Buk-ku, Gwangju 500-71, Republic of Koea ABSTRACT Recently,

More information

ANALYTIC PERFORMANCE MODELS FOR SINGLE CLASS AND MULTIPLE CLASS MULTITHREADED SOFTWARE SERVERS

ANALYTIC PERFORMANCE MODELS FOR SINGLE CLASS AND MULTIPLE CLASS MULTITHREADED SOFTWARE SERVERS ANALYTIC PERFORMANCE MODELS FOR SINGLE CLASS AND MULTIPLE CLASS MULTITHREADED SOFTWARE SERVERS Daniel A Menascé Mohamed N Bennani Dept of Compute Science Oacle, Inc Geoge Mason Univesity 1211 SW Fifth

More information

Methods for history matching under geological constraints Jef Caers Stanford University, Petroleum Engineering, Stanford CA , USA

Methods for history matching under geological constraints Jef Caers Stanford University, Petroleum Engineering, Stanford CA , USA Methods fo histoy matching unde geological constaints Jef Caes Stanfod Univesity, Petoleum Engineeing, Stanfod CA 9435-222, USA Abstact Two geostatistical methods fo histoy matching ae pesented. Both ely

More information

MULTI-TEMPORAL AND MULTI-SENSOR IMAGE MATCHING BASED ON LOCAL FREQUENCY INFORMATION

MULTI-TEMPORAL AND MULTI-SENSOR IMAGE MATCHING BASED ON LOCAL FREQUENCY INFORMATION Intenational Achives of the Photogammety Remote Sensing and Spatial Infomation Sciences Volume XXXIX-B3 2012 XXII ISPRS Congess 25 August 01 Septembe 2012 Melboune Austalia MULTI-TEMPORAL AND MULTI-SENSOR

More information

Towards Adaptive Information Merging Using Selected XML Fragments

Towards Adaptive Information Merging Using Selected XML Fragments Towads Adaptive Infomation Meging Using Selected XML Fagments Ho-Lam Lau and Wilfed Ng Depatment of Compute Science and Engineeing, The Hong Kong Univesity of Science and Technology, Hong Kong {lauhl,

More information

A ROI Focusing Mechanism for Digital Cameras

A ROI Focusing Mechanism for Digital Cameras A ROI Focusing Mechanism fo Digital Cameas Chu-Hui Lee, Meng-Feng Lin, Chun-Ming Huang, and Chun-Wei Hsu Abstact With the development and application of digital technologies, the digital camea is moe popula

More information

Real-Time Speech-Driven Face Animation. Pengyu Hong, Zhen Wen, Tom Huang. Beckman Institute for Advanced Science and Technology

Real-Time Speech-Driven Face Animation. Pengyu Hong, Zhen Wen, Tom Huang. Beckman Institute for Advanced Science and Technology Real-Time Speech-Diven Face Animation Pengyu Hong, Zhen Wen, Tom Huang Beckman Institute fo Advanced Science and Technology Univesity of Illinois at Ubana-Champaign, Ubana, IL 61801, USA Abstact This chapte

More information

AUTOMATED LOCATION OF ICE REGIONS IN RADARSAT SAR IMAGERY

AUTOMATED LOCATION OF ICE REGIONS IN RADARSAT SAR IMAGERY AUTOMATED LOCATION OF ICE REGIONS IN RADARSAT SAR IMAGERY Chistophe Waceman (1), William G. Pichel (2), Pablo Clement-Colón (2) (1) Geneal Dynamics Advanced Infomation Systems, P.O. Box 134008 Ann Abo

More information

Accurate Diffraction Efficiency Control for Multiplexed Volume Holographic Gratings. Xuliang Han, Gicherl Kim, and Ray T. Chen

Accurate Diffraction Efficiency Control for Multiplexed Volume Holographic Gratings. Xuliang Han, Gicherl Kim, and Ray T. Chen Accuate Diffaction Efficiency Contol fo Multiplexed Volume Hologaphic Gatings Xuliang Han, Gichel Kim, and Ray T. Chen Micoelectonic Reseach Cente Depatment of Electical and Compute Engineeing Univesity

More information

A Neural Network Model for Storing and Retrieving 2D Images of Rotated 3D Object Using Principal Components

A Neural Network Model for Storing and Retrieving 2D Images of Rotated 3D Object Using Principal Components A Neual Netwok Model fo Stong and Reteving 2D Images of Rotated 3D Object Using Pncipal Components Tsukasa AMANO, Shuichi KUROGI, Ayako EGUCHI, Takeshi NISHIDA, Yasuhio FUCHIKAWA Depatment of Contol Engineeng,

More information

Effective Missing Data Prediction for Collaborative Filtering

Effective Missing Data Prediction for Collaborative Filtering Effective Missing Data Pediction fo Collaboative Filteing Hao Ma, Iwin King and Michael R. Lyu Dept. of Compute Science and Engineeing The Chinese Univesity of Hong Kong Shatin, N.T., Hong Kong { hma,

More information

A Two-stage and Parameter-free Binarization Method for Degraded Document Images

A Two-stage and Parameter-free Binarization Method for Degraded Document Images A Two-stage and Paamete-fee Binaization Method fo Degaded Document Images Yung-Hsiang Chiu 1, Kuo-Liang Chung 1, Yong-Huai Huang 2, Wei-Ning Yang 3, Chi-Huang Liao 4 1 Depatment of Compute Science and

More information

= dv 3V (r + a 1) 3 r 3 f(r) = 1. = ( (r + r 2

= dv 3V (r + a 1) 3 r 3 f(r) = 1. = ( (r + r 2 Random Waypoint Model in n-dimensional Space Esa Hyytiä and Joma Vitamo Netwoking Laboatoy, Helsinki Univesity of Technology, Finland Abstact The andom waypoint model (RWP) is one of the most widely used

More information

Conversion Functions for Symmetric Key Ciphers

Conversion Functions for Symmetric Key Ciphers Jounal of Infomation Assuance and Secuity 2 (2006) 41 50 Convesion Functions fo Symmetic Key Ciphes Deba L. Cook and Angelos D. Keomytis Depatment of Compute Science Columbia Univesity, mail code 0401

More information

A Minutiae-based Fingerprint Matching Algorithm Using Phase Correlation

A Minutiae-based Fingerprint Matching Algorithm Using Phase Correlation A Minutiae-based Fingepint Matching Algoithm Using Phase Coelation Autho Chen, Weiping, Gao, Yongsheng Published 2007 Confeence Title Digital Image Computing: Techniques and Applications DOI https://doi.og/10.1109/dicta.2007.4426801

More information

An Extension to the Local Binary Patterns for Image Retrieval

An Extension to the Local Binary Patterns for Image Retrieval , pp.81-85 http://x.oi.og/10.14257/astl.2014.45.16 An Extension to the Local Binay Pattens fo Image Retieval Zhize Wu, Yu Xia, Shouhong Wan School of Compute Science an Technology, Univesity of Science

More information

Communication vs Distributed Computation: an alternative trade-off curve

Communication vs Distributed Computation: an alternative trade-off curve Communication vs Distibuted Computation: an altenative tade-off cuve Yahya H. Ezzeldin, Mohammed amoose, Chistina Fagouli Univesity of Califonia, Los Angeles, CA 90095, USA, Email: {yahya.ezzeldin, mkamoose,

More information

AN ANALYSIS OF COORDINATED AND NON-COORDINATED MEDIUM ACCESS CONTROL PROTOCOLS UNDER CHANNEL NOISE

AN ANALYSIS OF COORDINATED AND NON-COORDINATED MEDIUM ACCESS CONTROL PROTOCOLS UNDER CHANNEL NOISE AN ANALYSIS OF COORDINATED AND NON-COORDINATED MEDIUM ACCESS CONTROL PROTOCOLS UNDER CHANNEL NOISE Tolga Numanoglu, Bulent Tavli, and Wendi Heinzelman Depatment of Electical and Compute Engineeing Univesity

More information

Extract Object Boundaries in Noisy Images using Level Set. Final Report

Extract Object Boundaries in Noisy Images using Level Set. Final Report Extact Object Boundaies in Noisy Images using Level Set by: Quming Zhou Final Repot Submitted to Pofesso Bian Evans EE381K Multidimensional Digital Signal Pocessing May 10, 003 Abstact Finding object contous

More information

A Mathematical Implementation of a Global Human Walking Model with Real-Time Kinematic Personification by Boulic, Thalmann and Thalmann.

A Mathematical Implementation of a Global Human Walking Model with Real-Time Kinematic Personification by Boulic, Thalmann and Thalmann. A Mathematical Implementation of a Global Human Walking Model with Real-Time Kinematic Pesonification by Boulic, Thalmann and Thalmann. Mashall Badley National Cente fo Physical Acoustics Univesity of

More information

ANN Models for Coplanar Strip Line Analysis and Synthesis

ANN Models for Coplanar Strip Line Analysis and Synthesis 200 IJCSNS Intenational Jounal of Compute Science and Netwok Secuity, VOL.8 No.10, Octobe 2008 Models fo Coplana Stip Line Analysis and J.Lakshmi Naayana D.K.Si Rama Kishna D.L.Patap Reddy Chalapathi Institute

More information

A Memory Efficient Array Architecture for Real-Time Motion Estimation

A Memory Efficient Array Architecture for Real-Time Motion Estimation A Memoy Efficient Aay Achitectue fo Real-Time Motion Estimation Vasily G. Moshnyaga and Keikichi Tamau Depatment of Electonics & Communication, Kyoto Univesity Sakyo-ku, Yoshida-Honmachi, Kyoto 66-1, JAPAN

More information

POMDP: Introduction to Partially Observable Markov Decision Processes Hossein Kamalzadeh, Michael Hahsler

POMDP: Introduction to Partially Observable Markov Decision Processes Hossein Kamalzadeh, Michael Hahsler POMDP: Intoduction to Patially Obsevable Makov Decision Pocesses Hossein Kamalzadeh, Michael Hahsle 2019-01-02 The R package pomdp povides an inteface to pomdp-solve, a solve (witten in C) fo Patially

More information

COLOR EDGE DETECTION IN RGB USING JOINTLY EUCLIDEAN DISTANCE AND VECTOR ANGLE

COLOR EDGE DETECTION IN RGB USING JOINTLY EUCLIDEAN DISTANCE AND VECTOR ANGLE COLOR EDGE DETECTION IN RGB USING JOINTLY EUCLIDEAN DISTANCE AND VECTOR ANGLE Slawo Wesolkowski Systems Design Engineeing Univesity of Wateloo Wateloo (Ont.), Canada, NL 3G s.wesolkowski@ieee.og Ed Jenigan

More information

Assessment of Track Sequence Optimization based on Recorded Field Operations

Assessment of Track Sequence Optimization based on Recorded Field Operations Assessment of Tack Sequence Optimization based on Recoded Field Opeations Matin A. F. Jensen 1,2,*, Claus G. Søensen 1, Dionysis Bochtis 1 1 Aahus Univesity, Faculty of Science and Technology, Depatment

More information

Topological Characteristic of Wireless Network

Topological Characteristic of Wireless Network Topological Chaacteistic of Wieless Netwok Its Application to Node Placement Algoithm Husnu Sane Naman 1 Outline Backgound Motivation Papes and Contibutions Fist Pape Second Pape Thid Pape Futue Woks Refeences

More information

HISTOGRAMS are an important statistic reflecting the

HISTOGRAMS are an important statistic reflecting the JOURNAL OF L A T E X CLASS FILES, VOL. 14, NO. 8, AUGUST 2015 1 D 2 HistoSketch: Disciminative and Dynamic Similaity-Peseving Sketching of Steaming Histogams Dingqi Yang, Bin Li, Laua Rettig, and Philippe

More information

An Improved Resource Reservation Protocol

An Improved Resource Reservation Protocol Jounal of Compute Science 3 (8: 658-665, 2007 SSN 549-3636 2007 Science Publications An mpoved Resouce Resevation Potocol Desie Oulai, Steven Chambeland and Samuel Piee Depatment of Compute Engineeing

More information

Lecture # 04. Image Enhancement in Spatial Domain

Lecture # 04. Image Enhancement in Spatial Domain Digital Image Pocessing CP-7008 Lectue # 04 Image Enhancement in Spatial Domain Fall 2011 2 domains Spatial Domain : (image plane) Techniques ae based on diect manipulation of pixels in an image Fequency

More information

ADDING REALISM TO SOURCE CHARACTERIZATION USING A GENETIC ALGORITHM

ADDING REALISM TO SOURCE CHARACTERIZATION USING A GENETIC ALGORITHM ADDING REALISM TO SOURCE CHARACTERIZATION USING A GENETIC ALGORITHM Luna M. Rodiguez*, Sue Ellen Haupt, and Geoge S. Young Depatment of Meteoology and Applied Reseach Laboatoy The Pennsylvania State Univesity,

More information

INFORMATION DISSEMINATION DELAY IN VEHICLE-TO-VEHICLE COMMUNICATION NETWORKS IN A TRAFFIC STREAM

INFORMATION DISSEMINATION DELAY IN VEHICLE-TO-VEHICLE COMMUNICATION NETWORKS IN A TRAFFIC STREAM INFORMATION DISSEMINATION DELAY IN VEHICLE-TO-VEHICLE COMMUNICATION NETWORKS IN A TRAFFIC STREAM LiLi Du Depatment of Civil, Achitectual, and Envionmental Engineeing Illinois Institute of Technology 3300

More information

arxiv: v2 [physics.soc-ph] 30 Nov 2016

arxiv: v2 [physics.soc-ph] 30 Nov 2016 Tanspotation dynamics on coupled netwoks with limited bandwidth Ming Li 1,*, Mao-Bin Hu 1, and Bing-Hong Wang 2, axiv:1607.05382v2 [physics.soc-ph] 30 Nov 2016 1 School of Engineeing Science, Univesity

More information

A Shape-preserving Affine Takagi-Sugeno Model Based on a Piecewise Constant Nonuniform Fuzzification Transform

A Shape-preserving Affine Takagi-Sugeno Model Based on a Piecewise Constant Nonuniform Fuzzification Transform A Shape-peseving Affine Takagi-Sugeno Model Based on a Piecewise Constant Nonunifom Fuzzification Tansfom Felipe Fenández, Julio Gutiéez, Juan Calos Cespo and Gacián Tiviño Dep. Tecnología Fotónica, Facultad

More information

A Novel Automatic White Balance Method For Digital Still Cameras

A Novel Automatic White Balance Method For Digital Still Cameras A Novel Automatic White Balance Method Fo Digital Still Cameas Ching-Chih Weng 1, Home Chen 1,2, and Chiou-Shann Fuh 3 Depatment of Electical Engineeing, 2 3 Gaduate Institute of Communication Engineeing

More information

On Error Estimation in Runge-Kutta Methods

On Error Estimation in Runge-Kutta Methods Leonado Jounal of Sciences ISSN 1583-0233 Issue 18, Januay-June 2011 p. 1-10 On Eo Estimation in Runge-Kutta Methods Ochoche ABRAHAM 1,*, Gbolahan BOLARIN 2 1 Depatment of Infomation Technology, 2 Depatment

More information

Modelling, simulation, and performance analysis of a CAN FD system with SAE benchmark based message set

Modelling, simulation, and performance analysis of a CAN FD system with SAE benchmark based message set Modelling, simulation, and pefomance analysis of a CAN FD system with SAE benchmak based message set Mahmut Tenuh, Panagiotis Oikonomidis, Peiklis Chachalakis, Elias Stipidis Mugla S. K. Univesity, TR;

More information

Multi-azimuth Prestack Time Migration for General Anisotropic, Weakly Heterogeneous Media - Field Data Examples

Multi-azimuth Prestack Time Migration for General Anisotropic, Weakly Heterogeneous Media - Field Data Examples Multi-azimuth Pestack Time Migation fo Geneal Anisotopic, Weakly Heteogeneous Media - Field Data Examples S. Beaumont* (EOST/PGS) & W. Söllne (PGS) SUMMARY Multi-azimuth data acquisition has shown benefits

More information

Improvement of First-order Takagi-Sugeno Models Using Local Uniform B-splines 1

Improvement of First-order Takagi-Sugeno Models Using Local Uniform B-splines 1 Impovement of Fist-ode Takagi-Sugeno Models Using Local Unifom B-splines Felipe Fenández, Julio Gutiéez, Gacián Tiviño and Juan Calos Cespo Dep. Tecnología Fotónica, Facultad de Infomática Univesidad Politécnica

More information

Multiview plus depth video coding with temporal prediction view synthesis

Multiview plus depth video coding with temporal prediction view synthesis 1 Multiview plus depth video coding with tempoal pediction view synthesis Andei I. Puica, Elie G. Moa, Beatice Pesquet-Popescu, Fellow, IEEE, Maco Cagnazzo, Senio Membe, IEEE and Bogdan Ionescu, Senio

More information

A Hybrid DWT-SVD Image-Coding System (HDWTSVD) for Color Images

A Hybrid DWT-SVD Image-Coding System (HDWTSVD) for Color Images A Hybid DWT-SVD Image-Coding System (HDWTSVD) fo Colo Images Humbeto Ochoa a,b, K.R. Rao a a Univesity of Texas at Alington, Box 906 46 ates Steet, Neddeman Hall, Rm 530 Alington, TX 7609 hdo594@exchange.uta.edu

More information

On using circuit-switched networks for file transfers

On using circuit-switched networks for file transfers On using cicuit-switched netwoks fo file tansfes Xiuduan Fang, Malathi Veeaaghavan Univesity of Viginia Email: {xf4c, mv5g}@viginia.edu Abstact High-speed optical cicuit-switched netwoks ae being deployed

More information

CLUSTERED BASED TAKAGI-SUGENO NEURO-FUZZY MODELING OF A MULTIVARIABLE NONLINEAR DYNAMIC SYSTEM

CLUSTERED BASED TAKAGI-SUGENO NEURO-FUZZY MODELING OF A MULTIVARIABLE NONLINEAR DYNAMIC SYSTEM Asian Jounal of Contol, Vol. 7, No., pp. 63-76, June 5 63 -Bief Pape- CLUSTERED BASED TAKAGI-SUGENO NEURO-FUZZY MODELING OF A MULTIVARIABLE NONLINEAR DYNAMIC SYSTEM E. A. Al-Gallaf ABSTRACT This eseach

More information

Tissue Classification Based on 3D Local Intensity Structures for Volume Rendering

Tissue Classification Based on 3D Local Intensity Structures for Volume Rendering 160 IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 6, NO., APRIL-JUNE 000 Tissue Classification Based on 3D Local Intensity Stuctues fo Volume Rendeing Yoshinobu Sato, Membe, IEEE, Cal-Fedik

More information

Cryptanalysis of Hwang-Chang s a Time-Stamp Protocol for Digital Watermarking

Cryptanalysis of Hwang-Chang s a Time-Stamp Protocol for Digital Watermarking Cyptanalysis of Hwang-Chang s a Time-Stamp Potocol fo Digital Watemaking *Jue-Sam Chou, Yalin Chen 2, Chung-Ju Chan 3 Depatment of Infomation Management, Nanhua Univesity Chiayi 622 Taiwan, R.O.C *: coesponding

More information

Generalized Grey Target Decision Method Based on Decision Makers Indifference Attribute Value Preferences

Generalized Grey Target Decision Method Based on Decision Makers Indifference Attribute Value Preferences Ameican Jounal of ata ining and Knowledge iscovey 27; 2(4): 2-8 http://www.sciencepublishinggoup.com//admkd doi:.648/.admkd.2724.2 Genealized Gey Taget ecision ethod Based on ecision akes Indiffeence Attibute

More information

Fifth Wheel Modelling and Testing

Fifth Wheel Modelling and Testing Fifth heel Modelling and Testing en Masoy Mechanical Engineeing Depatment Floida Atlantic Univesity Boca aton, FL 4 Lois Malaptias IFMA Institut Fancais De Mechanique Advancee ampus De lemont Feand Les

More information

Obstacle Avoidance of Autonomous Mobile Robot using Stereo Vision Sensor

Obstacle Avoidance of Autonomous Mobile Robot using Stereo Vision Sensor Obstacle Avoidance of Autonomous Mobile Robot using Steeo Vision Senso Masako Kumano Akihisa Ohya Shin ichi Yuta Intelligent Robot Laboatoy Univesity of Tsukuba, Ibaaki, 35-8573 Japan E-mail: {masako,

More information

Voting-Based Grouping and Interpretation of Visual Motion

Voting-Based Grouping and Interpretation of Visual Motion Voting-Based Gouping and Intepetation of Visual Motion Micea Nicolescu Depatment of Compute Science Univesity of Nevada, Reno Reno, NV 89557 micea@cs.un.edu Géad Medioni Integated Media Systems Cente Univesity

More information

Computational and Theoretical Analysis of Null Space and Orthogonal Linear Discriminant Analysis

Computational and Theoretical Analysis of Null Space and Orthogonal Linear Discriminant Analysis Jounal of Machine Leaning Reseach 7 2006) 1183 1204 Submitted 12/05; Revised 3/06; Published 7/06 Computational and Theoetical Analysis of Null Space and Othogonal Linea Disciminant Analysis Jieping Ye

More information

a Not yet implemented in current version SPARK: Research Kit Pointer Analysis Parameters Soot Pointer analysis. Objectives

a Not yet implemented in current version SPARK: Research Kit Pointer Analysis Parameters Soot Pointer analysis. Objectives SPARK: Soot Reseach Kit Ondřej Lhoták Objectives Spak is a modula toolkit fo flow-insensitive may points-to analyses fo Java, which enables expeimentation with: vaious paametes of pointe analyses which

More information

Any modern computer system will incorporate (at least) two levels of storage:

Any modern computer system will incorporate (at least) two levels of storage: 1 Any moden compute system will incopoate (at least) two levels of stoage: pimay stoage: andom access memoy (RAM) typical capacity 32MB to 1GB cost pe MB $3. typical access time 5ns to 6ns bust tansfe

More information

Mobility Pattern Recognition in Mobile Ad-Hoc Networks

Mobility Pattern Recognition in Mobile Ad-Hoc Networks Mobility Patten Recognition in Mobile Ad-Hoc Netwoks S. M. Mousavi Depatment of Compute Engineeing, Shaif Univesity of Technology sm_mousavi@ce.shaif.edu H. R. Rabiee Depatment of Compute Engineeing, Shaif

More information

Gravitational Shift for Beginners

Gravitational Shift for Beginners Gavitational Shift fo Beginnes This pape, which I wote in 26, fomulates the equations fo gavitational shifts fom the elativistic famewok of special elativity. Fist I deive the fomulas fo the gavitational

More information

Optimal Adaptive Learning for Image Retrieval

Optimal Adaptive Learning for Image Retrieval Optimal Adaptive Leaning fo Image Retieval ao Wang Dept of Compute Sci and ech singhua Univesity Beijing 00084, P. R. China Wangtao7@63.net Yong Rui Micosoft Reseach One Micosoft Way Redmond, WA 9805,

More information

A VECTOR PERTURBATION APPROACH TO THE GENERALIZED AIRCRAFT SPARE PARTS GROUPING PROBLEM

A VECTOR PERTURBATION APPROACH TO THE GENERALIZED AIRCRAFT SPARE PARTS GROUPING PROBLEM Accepted fo publication Intenational Jounal of Flexible Automation and Integated Manufactuing. A VECTOR PERTURBATION APPROACH TO THE GENERALIZED AIRCRAFT SPARE PARTS GROUPING PROBLEM Nagiza F. Samatova,

More information

Scaling Location-based Services with Dynamically Composed Location Index

Scaling Location-based Services with Dynamically Composed Location Index Scaling Location-based Sevices with Dynamically Composed Location Index Bhuvan Bamba, Sangeetha Seshadi and Ling Liu Distibuted Data Intensive Systems Laboatoy (DiSL) College of Computing, Geogia Institute

More information

5 4 THE BERNOULLI EQUATION

5 4 THE BERNOULLI EQUATION 185 CHATER 5 the suounding ai). The fictional wok tem w fiction is often expessed as e loss to epesent the loss (convesion) of mechanical into themal. Fo the idealied case of fictionless motion, the last

More information

vaiation than the fome. Howeve, these methods also beak down as shadowing becomes vey signicant. As we will see, the pesented algoithm based on the il

vaiation than the fome. Howeve, these methods also beak down as shadowing becomes vey signicant. As we will see, the pesented algoithm based on the il IEEE Conf. on Compute Vision and Patten Recognition, 1998. To appea. Illumination Cones fo Recognition Unde Vaiable Lighting: Faces Athinodoos S. Geoghiades David J. Kiegman Pete N. Belhumeu Cente fo Computational

More information

Modeling Spatially Correlated Data in Sensor Networks

Modeling Spatially Correlated Data in Sensor Networks Modeling Spatially Coelated Data in Senso Netwoks Apoova Jindal and Konstantinos Psounis Univesity of Southen Califonia The physical phenomena monitoed by senso netwoks, e.g. foest tempeatue, wate contamination,

More information

FINITE ELEMENT MODEL UPDATING OF AN EXPERIMENTAL VEHICLE MODEL USING MEASURED MODAL CHARACTERISTICS

FINITE ELEMENT MODEL UPDATING OF AN EXPERIMENTAL VEHICLE MODEL USING MEASURED MODAL CHARACTERISTICS COMPDYN 009 ECCOMAS Thematic Confeence on Computational Methods in Stuctual Dynamics and Eathquake Engineeing M. Papadakakis, N.D. Lagaos, M. Fagiadakis (eds.) Rhodes, Geece, 4 June 009 FINITE ELEMENT

More information

Visual Servoing from Deep Neural Networks

Visual Servoing from Deep Neural Networks Visual Sevoing fom Deep Neual Netwoks Quentin Bateux 1, Eic Machand 1, Jügen Leitne 2, Fançois Chaumette 3, Pete Coke 2 Abstact We pesent a deep neual netwok-based method to pefom high-pecision, obust

More information

Hierarchical Region Mean-Based Image Segmentation

Hierarchical Region Mean-Based Image Segmentation Hieachical Region Mean-Based Image Segmentation Slawo Wesolkowski and Paul Fieguth Systems Design Engineeing Univesity of Wateloo Wateloo, Ontaio, Canada, N2L-3G1 s.wesolkowski@ieee.og, pfieguth@uwateloo.ca

More information

Positioning of a robot based on binocular vision for hand / foot fusion Long Han

Positioning of a robot based on binocular vision for hand / foot fusion Long Han 2nd Intenational Confeence on Advances in Mechanical Engineeing and Industial Infomatics (AMEII 26) Positioning of a obot based on binocula vision fo hand / foot fusion Long Han Compute Science and Technology,

More information

Cardiac C-Arm CT. SNR Enhancement by Combining Multiple Retrospectively Motion Corrected FDK-Like Reconstructions

Cardiac C-Arm CT. SNR Enhancement by Combining Multiple Retrospectively Motion Corrected FDK-Like Reconstructions Cadiac C-Am CT SNR Enhancement by Combining Multiple Retospectively Motion Coected FDK-Like Reconstuctions M. Pümme 1, L. Wigstöm 2,3, R. Fahig 2, G. Lauitsch 4, J. Honegge 1 1 Institute of Patten Recognition,

More information

A New Finite Word-length Optimization Method Design for LDPC Decoder

A New Finite Word-length Optimization Method Design for LDPC Decoder A New Finite Wod-length Optimization Method Design fo LDPC Decode Jinlei Chen, Yan Zhang and Xu Wang Key Laboatoy of Netwok Oiented Intelligent Computation Shenzhen Gaduate School, Habin Institute of Technology

More information

High performance CUDA based CNN image processor

High performance CUDA based CNN image processor High pefomance UDA based NN image pocesso GEORGE VALENTIN STOIA, RADU DOGARU, ELENA RISTINA STOIA Depatment of Applied Electonics and Infomation Engineeing Univesity Politehnica of Buchaest -3, Iuliu Maniu

More information

UCLA Papers. Title. Permalink. Authors. Publication Date. Localized Edge Detection in Sensor Fields. https://escholarship.org/uc/item/3fj6g58j

UCLA Papers. Title. Permalink. Authors. Publication Date. Localized Edge Detection in Sensor Fields. https://escholarship.org/uc/item/3fj6g58j UCLA Papes Title Localized Edge Detection in Senso Fields Pemalink https://escholashipog/uc/item/3fj6g58j Authos K Chintalapudi Govindan Publication Date 3-- Pee eviewed escholashipog Poweed by the Califonia

More information

Improved Fourier-transform profilometry

Improved Fourier-transform profilometry Impoved Fouie-tansfom pofilomety Xianfu Mao, Wenjing Chen, and Xianyu Su An impoved optical geomety of the pojected-finge pofilomety technique, in which the exit pupil of the pojecting lens and the entance

More information

Transmission Lines Modeling Based on Vector Fitting Algorithm and RLC Active/Passive Filter Design

Transmission Lines Modeling Based on Vector Fitting Algorithm and RLC Active/Passive Filter Design Tansmission Lines Modeling Based on Vecto Fitting Algoithm and RLC Active/Passive Filte Design Ahmed Qasim Tuki a,*, Nashien Fazilah Mailah b, Mohammad Lutfi Othman c, Ahmad H. Saby d Cente fo Advanced

More information

STUDIES ON IMPROVING TEXTURE SEGMENTATION PERFORMANCE USING GENERALIZED GAUSSIAN MIXTURE MODEL INTEGRATING DCT AND LBP

STUDIES ON IMPROVING TEXTURE SEGMENTATION PERFORMANCE USING GENERALIZED GAUSSIAN MIXTURE MODEL INTEGRATING DCT AND LBP Jounal of Theoetical and Applied Infomation Technology 3 st Octobe 206. Vol.92. No.2 2005-206 JATIT & LLS. All ights eseved. ISSN: 992-8645 www.jatit.og E-ISSN: 87-395 STUDIES ON IMPROVING TEXTURE SEGMENTATION

More information

dc - Linux Command Dc may be invoked with the following command-line options: -V --version Print out the version of dc

dc - Linux Command Dc may be invoked with the following command-line options: -V --version Print out the version of dc - CentOS 5.2 - Linux Uses Guide - Linux Command SYNOPSIS [-V] [--vesion] [-h] [--help] [-e sciptexpession] [--expession=sciptexpession] [-f sciptfile] [--file=sciptfile] [file...] DESCRIPTION is a evese-polish

More information

A Full-mode FME VLSI Architecture Based on 8x8/4x4 Adaptive Hadamard Transform For QFHD H.264/AVC Encoder

A Full-mode FME VLSI Architecture Based on 8x8/4x4 Adaptive Hadamard Transform For QFHD H.264/AVC Encoder 20 IEEE/IFIP 9th Intenational Confeence on VLSI and System-on-Chip A Full-mode FME VLSI Achitectue Based on 8x8/ Adaptive Hadamad Tansfom Fo QFHD H264/AVC Encode Jialiang Liu, Xinhua Chen College of Infomation

More information

SCALABLE ENERGY EFFICIENT AD-HOC ON DEMAND DISTANCE VECTOR (SEE-AODV) ROUTING PROTOCOL IN WIRELESS MESH NETWORKS

SCALABLE ENERGY EFFICIENT AD-HOC ON DEMAND DISTANCE VECTOR (SEE-AODV) ROUTING PROTOCOL IN WIRELESS MESH NETWORKS SCALABL NRGY FFICINT AD-HOC ON DMAND DISTANC VCTOR (S-AODV) ROUTING PROTOCOL IN WIRLSS MSH NTWORKS Sikande Singh Reseach Schola, Depatment of Compute Science & ngineeing, Punjab ngineeing College (PC),

More information

Modeling spatially-correlated data of sensor networks with irregular topologies

Modeling spatially-correlated data of sensor networks with irregular topologies This full text pape was pee eviewed at the diection of IEEE Communications Society subject matte expets fo publication in the IEEE SECON 25 poceedings Modeling spatially-coelated data of senso netwoks

More information

Erasure-Coding Based Routing for Opportunistic Networks

Erasure-Coding Based Routing for Opportunistic Networks Easue-Coding Based Routing fo Oppotunistic Netwoks Yong Wang, Sushant Jain, Magaet Matonosi, Kevin Fall Pinceton Univesity, Univesity of Washington, Intel Reseach Bekeley ABSTRACT Routing in Delay Toleant

More information

A Generalized Profile Function Model Based on Artificial Intelligence

A Generalized Profile Function Model Based on Artificial Intelligence Manuscipt eceived Jul., 28; evised Sep. 7, 28 A Genealized Pofile Function Model Based on Atificial Intelligence PERO RADONJA, SRDJAN STANKOVIC 2, and DRAGANA DRAZIC Institute of Foesty, 3 Kneza Viseslava

More information

DISTRIBUTION MIXTURES

DISTRIBUTION MIXTURES Application Example 7 DISTRIBUTION MIXTURES One fequently deals with andom vaiables the distibution of which depends on vaious factos. One example is the distibution of atmospheic paametes such as wind

More information

Combinatorial Mobile IP: A New Efficient Mobility Management Using Minimized Paging and Local Registration in Mobile IP Environments

Combinatorial Mobile IP: A New Efficient Mobility Management Using Minimized Paging and Local Registration in Mobile IP Environments Wieless Netwoks 0, 3 32, 200 200 Kluwe Academic Publishes. Manufactued in The Nethelands. Combinatoial Mobile IP: A New Efficient Mobility Management Using Minimized Paging and Local Registation in Mobile

More information

Image Registration among UAV Image Sequence and Google Satellite Image Under Quality Mismatch

Image Registration among UAV Image Sequence and Google Satellite Image Under Quality Mismatch 0 th Intenational Confeence on ITS Telecommunications Image Registation among UAV Image Sequence and Google Satellite Image Unde Quality Mismatch Shih-Ming Huang and Ching-Chun Huang Depatment of Electical

More information