DETECTING ERRORS AND IMPUTING MISSING DATA FOR SINGLE LOOP SURVEILLANCE SYSTEMS

Size: px
Start display at page:

Download "DETECTING ERRORS AND IMPUTING MISSING DATA FOR SINGLE LOOP SURVEILLANCE SYSTEMS"

Transcription

1 DETECTING ERRORS AND IMPUTING MISSING DATA FOR SINGLE LOOP SURVEILLANCE SYSTEMS Chao Chen Department of Electrcal Engneerng and Computer Scence Unversty of Calforna, Berkeley CA Tel: (510) Jamyoung Kwon Department of Statstcs Unversty of Calforna, Berkeley CA Tel: (510) , Fax: (510) John Rce Department of Statstcs Unversty of Calforna, Berkeley CA Tel: (510) , Fax: (510) Alexander Skabardons Insttute of Transportaton Studes Unversty of Calforna, Berkeley CA Tel: (510) , Fax: (510) Pravn Varaya Department of Electrcal Engneerng and Computer Scence Unversty of Calforna, Berkeley CA Tel: (510) For Presentaton and Publcaton 82 nd Annual Meetng Transportaton Research Board January 2003 Washngton, D.C. November 15, 2002

2 Chen/Kwon/Skabardons/Varaya 1 No WORDS: 4627 Plus 7 Fgures (1750) Plus 4 Tables (1000) TOTAL: 7377 Correspondng Author

3 Chen/Kwon/Skabardons/Varaya 2 ABSTRACT Sngle loop detectors provde the most abundant source of traffc data n Calforna, but loop data samples are often mssng or nvald. We descrbe a method that detects bad data samples and mputes mssng or bad samples to form a complete grd of clean data, n real tme. The dagnostcs algorthm and the mputaton algorthm that mplement ths method are operatonal on 14,871 loops n sx Dstrcts of the Calforna Department of Transportaton. The dagnostcs algorthm detects bad (malfunctonng) sngle loop detectors from ther volume and occupancy measurements. Its novelty s ts use of tme seres of many samples, rather than basng decsons on sngle samples, as n prevous approaches. The mputaton algorthm models the relatonshp between neghborng loops as lnear, and uses lnear regresson to estmate the value of mssng or bad samples. Ths gves a better estmate than prevous methods because t uses hstorcal data to learn how pars of neghborng loops behave. Detecton of bad loops and mputaton of loop data are mportant because they allow algorthms that use loop data to perform analyss wthout requrng them to compensate for mssng or ncorrect data samples.

4 Chen/Kwon/Skabardons/Varaya 3 INTRODUCTION Loop detectors are the best source of real tme freeway traffc data today. In Calforna, these detectors cover most urban freeways. Loop data provde a powerful means to study and montor traffc (2). But the data contan many holes (mssng values) or bad (ncorrec values and requre careful cleanng to produce relable results. Bad or mssng samples present problems for any algorthm that uses the data for analyss. Therefore, we need both to detect when data are bad and throw them out, and to fll holes n the data wth mputed values. The goal s to produce a complete grd of relable data. We can trust analyses that use such a complete data set. We need to detect bad data from the measurements themselves. The problem was studed by the FHWA, Washngton DOT, and others. Exstng algorthms usually work on the raw 20-second or 30- second data, and produce a dagnoss for each sample. But t s very hard to tell f a sngle 20-second sample s good or bad unless t s very abnormal. Fortunately, loop detectors don t just gve random errors some loops produce reasonable data all the tme, whle others produce suspect data all the tme. By examnng a tme seres of measurements one can readly dstngush bad behavor from good. Our dagnostcs algorthm examnes a day s worth of samples together, producng convncng results. Once bad samples are thrown out, the resultng holes n the data must be flled wth mputed values. Imputaton usng tme seres analyss has been suggested before, but these mputatons are only effectve for short perods of mssng data; lnear nterpolaton and neghborhood averages are natural mputaton methods, but they don t use all the relevant data that are avalable. Our mputaton algorthm estmates values at a detector usng data from ts neghbors. The algorthm models each par of neghbors lnearly, and fts ts parameters on hstorcal data. It s robust, and performs better than other methods. We frst descrbe the data and types of errors that are observed. We then survey current methods of error detecton, whch operate on sngle 20-second samples. Then we present our dagnostc algorthm, and show that t performs better. We then present our mputaton algorthm, and show that ths method s better than other mputaton methods such as lnear nterpolaton. DESCRIPTION OF DATA The freeway Performance Measurement System (PeMS) (1,2) collects, stores, and analyzes data from thousands of loop detectors n sx dstrcts of the Calforna Department of Transportaton (Caltrans). The PeMS database currently has 1 terabyte of data onlne, and collects more than 1GB per day. PeMS uses the data to compute freeway usage and congeston delays, measure and predct travel tme, evaluate ramp-meterng methods, and valdate traffc theores. There are 14,871 man lne (ML) loops n the PeMS database from sx Caltrans dstrcts. The results presented here are for man lne loops. Each loop reports the volume q( the number of vehcles crossng the loop detector durng a 30-second tme nterval t, and occupancy k( the fracton of ths nterval durng whch there s a vehcle above the loop. We call each par of volume and occupancy observatons a sample. The number of total possble samples n one day from ML loops n PeMS s therefore (14871 loops) x (2880 sample per loop per day) = 42 mllon samples. In realty, however, PeMS never receves all the samples. For example, Los Angeles has a mssng sample rate of about 15%. Whle t s clear when we mss samples, t s harder to tell when a receved sample s bad or ncorrect. A dagnostcs test needs to accept or reject samples based on our assumpton of what good and bad samples look lke.

5 Chen/Kwon/Skabardons/Varaya 4 EXISTING DATA RELIABILITY TESTS Loop data error has plagued ther effectve use for a long tme. In 1976, Payne (3) dentfed fve types of detector errors and presented several methods to detect them from 20-second and 5-mnute volume and occupancy measurements. These methods place thresholds on mnmum and maxmum flow, densty, and speed, and declare a sample to be nvald f they fal any of the tests. Later, Jacobsen and Nhan at the Unversty of Washngton defned an acceptable regon n the k-q plane, and declared samples to be good only f they fell nsde the regon (4). We call ths the Washngton Algorthm. The boundares of the acceptable regon are defned by a set of parameters, whch are calbrated from hstorcal data, or derved from traffc theory. Exstng detecton algorthms (3,4,5) try to catch the errors descrbed n (3). For example, chatterng and pulse break up cause q to be hgh, so a threshold on q can catch these errors. But some errors cannot be caught ths way, such as a detector stuck n the off (q=0, k=0) poston. Payne s algorthm would dentfy ths as a bad pont, but good detectors wll also report (0,0) when there are no vehcles n the detecton perod. Elmnatng all (0,0) ponts ntroduces a postve bas n the data. On the other hand, the Washngton Algorthm accepts the (0,0) pont, but dong so makes t unable to detect the stuck type of error. A threshold on occupancy s smlarly hard to set. An occupancy value of 0.5 for one 30-second perod should not ndcate an error, but a large number of 30-second samples wth occupances of 0.5, especally durng non-rush hours, ponts to a malfuncton. We mplemented the Washngton Algorthm n Matlab and tested t on 30-second data from 2 loops n Los Angeles, for one day. The acceptable regon s taken from (4). The data and ther dagnoses are shown n Fgure 1. Vsually, loop 1 looks good (Fgure 1b), and loop 2 looks bad (Fgure 1d). Loop 2 looks bad because there are many samples wth k=70% and q=0, as well as many samples wth occupances that appear too hgh, even durng non-rush hours, and when loop 1 shows low occupancy. The Washngton Algorthm, however, does not make the correct dagnoss. Out of 2875 samples, t declared 1138 samples to be bad for loop 1 and 883 bad for loop 2. In both loops, there were many false alarms. Ths s because the maxmum acceptable slope of q/k was exceeded by many samples n free flow. Ths suggests that the algorthm s very senstve to thresholds and needs to be calbrated for Calforna. Calbraton s mpractcal because each loop wll need a separate acceptable regon, and ground truth would be dffcult to get. There are also false negatves many samples from loop 2 appear to be bad because they have hgh occupances durng off peak tmes, but they were not detected by the Washngton Algorthm. Ths llustrates a dffculty wth the threshold method the acceptable regon has to be very large, because there are many possble traffc states wthn a 30-second perod. On the other hand, a lot more nformaton can be ganed by lookng at how a detector behaves over many sample tmes. Ths s why we easly recognze loop 1 to be good and loop 2 to be bad by lookng at ther k( plots, and ths s a key nsght that led to our dagnostcs algorthm. PROPOSED DETECTOR DIAGNOSTICS ALGORITHM Desgn The algorthm for loop error detecton uses the tme seres of flow and occupancy measurements, nstead of makng a decson based on an ndvdual sample. It s based on the emprcal observaton that good and bad detectors behave very dfferently over tme. For example, at any gven nstant, the flow and occupancy at a detector locaton can have a wde range of values, and one cannot rule most

6 Chen/Kwon/Skabardons/Varaya 5 of them out; but over a day, most detectors show a smlar pattern flow and occupancy are hgh n the rush hours and low late at nght. Fgure 2a and 2b show typcal 30-second flow and occupancy measurements. Most loops have outputs that look lke ths, but some loops behave very dfferently. Fgure 2c and 2d show an example of a bad loop. Ths loop has zero flow and an occupancy value of 0.7 for several hours durng the evenng rush hour clearly, these values must be ncorrect. We found 4 types of abnormal tme seres behavor, and lst them n Table 1. Types 1 and 4 are selfexplanatory; types 2 and 3 are llustrated n Fgure 2c, 2d, and Fgure 1b. The errors n Table 1 are not mutually exclusve. For example, a loop wth all zero occupancy values exhbts both type 1 and type 4 errors. A loop s declared bad f t s n any of these categores. We dd not fnd a sgnfcant number of loops that have chatter or pulse break up, whch would produce abnormally hgh volumes. Therefore the current form of the detecton algorthm does not check for ths condton. However, a ffth error type and error check can easly be added to the algorthm to flag loops wth consstently hgh counts. We developed the Daly Statstcs Algorthm (DSA) to recognze error types 1-4 above. The nput to the algorthm s the tme seres of 30-second measurements q(d, and k(d,, where d s the ndex of the day, and t=0,1,2,...,2879 s the 30-second sample number; the output s the dagnoss (d) for the dth day: (d)=0 f the loop s good, and (d)=1 f the loop s bad. In contrast to exstng algorthms that operate on each sample, the DSA produces one dagnoss for all the samples of a loop on each day. We use only samples between 5am and 10pm to do the dagnostcs, because outsde of ths perod, t s more dffcult to tell the dfference between good and bad loops. There are second samples n ths perod, therefore the algorthm s a functon of 2041x2=4082 varables. Thus the dagnostc (d) on day d s a functon, (d) = f(q(d,a), q(d,a+1),..., q(d,b), k(d,a), k(d,a+1),..., k(d,b)), where a=5120=600 s the sample number at 5am, and b=22120=2640 s the last sample number, at 10pm. To deal wth the large number of varables, we frst reduce them to four statstcs, S 1,...,S 4, whch are approprate summares of the tme seres. Ther defntons are gven n Table 2, where S j (,d) s the jth statstc computed for the th loop on the dth day. The decson becomes a functon of these four varables. For the th loop and dth day, the decson whether the loop s bad or good s determned accordng to the rule where s j ( d ) = 1 0 f S1 (, d ) > s1 or S 2 (, d ) > s 2 or S 3 (, d ) > s3 or S 4 (, d ) < s 4 otherwse are thresholds on each statstc. These four statstcs summarze the daly measurements well because they are good ndcators of the four types of loop falure lsted n Table 1. Ths s seen n the hstogram of each of statstc dsplayed n Fgure 3. The data are collected from Los Angeles on 4/24/2002. The dstrbuton of each statstc shows two dstnct populatons. In S 1, for example, there are two peaks at 0 and Ths shows that there are two groups of loops one group of about 4700 loops have very few samples that report zero occupancy, and another group of about 300 that report almost all zeros. The second group s bad, because they have type 1 error. Snce all four 1

7 Chen/Kwon/Skabardons/Varaya 6 dstrbutons are strongly b-modal, Equaton 1 s not very senstve to the thresholds s j whch just have to be able to separate the two peaks n the four hstograms n Fgure 3. The default thresholds are gven n Table 2. The only other parameters of ths model are the tme ranges, and the defnton of S 3, where an occupancy threshold of 0.35 s specfed. The DSA uses a total of 7 parameters, lsted n Table 3. They work well n all 6 Caltrans dstrcts. Performance The DSA algorthm s mplemented and run on PeMS data. The last column n Table 1 shows the dstrbuton of the 4 types of errors n Dstrct 12 (Orange County) for 31 days n October, Because we don t have the ground truth of whch detectors are actually bad, we must verfy the performance of ths algorthm vsually. Fortunately, ths s easy for n most cases, because the tme seres show dstnctly dfferent patterns for good and bad detectors. A vsual test was performed on loops n Los Angeles, on data from 8/7/2001. There are 662 loops on Interstate 5 and Interstate 210, out of whch 142 (21%) were declared to be bad by the algorthm. We then manually checked the plots of occupancy to verfy these results. We found 14 loops that were declared good, but ther plots suggested they could be bad. Ths suggests a false negatve rate of 14/( ) = 2.7%. There were no false postves. Ths suggests that the algorthm performs very well. Real-Tme Operaton The detecton algorthm descrbed above gves a dagnoss on samples from an entre day. But we are also nterested n real-tme detecton the valdty of each sample as t s receved. Therefore what we want s a decson ˆ ( d,, where d s the current day, and t s the current sample tme. We use the smple approxmaton: ˆ ( d, = ( d 1) 2 where s defned n Equaton 1. Equaton 2 has two consequences. Frst, a loop s declared good or bad for an entre day. As a result, we lose some flexblty because we may be throwng away good data from a partally bad loop ths pont s dscussed n the concluson secton. Second, there s a one-day lag n the dagnoss, whch ntroduces a small error. We estmated the probablty of loop falure gven the loop status on the prevous day, and found Equaton 2 to be true for 98% of the tme. Therefore, t s a good approxmaton. IMPUTATION OF MISSING AND BAD SAMPLES The Need for Imputaton We model the measurement of each detector as ether the actual value or an error value, dependng on the status : qmeas, ( d, = qreal, ( d, (1 ( d)) + ε ( d, ( d), 3 k ( d, = k ( d, (1 ( d)) + φ ( d, ( d), 0 t 2879 meas, real, where q meas, and k meas, are the measured values, q real, and k real, are the true values, and ε and φ are error values that are ndependent of q real, and k real,. We obtaned an estmate of the loop status n Equaton 2. It tells us to dscard the samples from detectors that are declared bad. Ths leaves holes n the data, n addton to the orgnally mssng samples. Ths s a common problem at each sample

8 Chen/Kwon/Skabardons/Varaya 7 tme, the user must determne whether t s a good sample or not. An applcaton that analyzes the data must deal wth both possbltes. One approach to mssng data s to predct them usng tme seres analyss. Nhan modeled occupancy and flow tme seres as ARMA processes and predcted values n the near future (6); Daley presented a method of predcton from neghbor loops usng a Kalman flter (9). In our case, the errors do not occur randomly, but persst for many hours and days. Tme seres predctons become nvald very quckly and are napproprate n such stuatons. We developed an mputaton scheme that uses nformaton from good neghbor loops at only the current sample tme. Ths s a natural way of dealng wth mssng data and s used by tradtonal mputaton methods. For example, to fnd the total volume of a freeway locaton wth 4 lanes and only 3 workng loops, one may reasonably use the average of the 3 lanes and multply t by 4. Ths mputes the mssng value usng the average of ts neghbors. Lnear nterpolaton s another example. Suppose detector s bad, and s located between detectors j and k whch are good. Let x, x j, x k be ther locatons, and x j < x < x k, then ( x x ) q ( + ( x x ) q ( j k k j qˆ ( = 8 xk x j s the lnear nterpolaton mputaton. Whle these tradtonal mputaton methods are ntutve, they make nave assumptons about the data. Our proposed algorthm, on the other hand, models the behavor of neghbor loops better because t uses hstorcal data. Lnear Model Of Neghbor Detectors We propose a lnear regresson algorthm for mputaton that models the behavor of neghbor loops usng hstorcal data. We fnd that occupances and volumes of detectors n nearby locatons are hghly correlated. Therefore, measurements from one locaton can be used to estmate quanttes at other locatons, and a more accurate estmate can be formed f all the neghborng loops are used n the estmaton. We defne two loops to be neghbors f they are n the same locaton n dfferent lanes, or f they are n adjacent locatons. Fgure 4 shows a typcal neghborhood. We fnd that both volume and occupancy from neghborng locatons are strongly correlated. Fgure 5 shows two pars of neghbors wth lnearly related flow and occupances. Fgure 6 plots the dstrbuton of the correlaton coeffcents between all neghbors n Los Angeles. It shows that most neghbor pars have hgh correlatons n both flow and occupancy. The hgh correlaton among neghbor loop measurements means that lnear regresson s a good way to predct one from the other. It s also easy to mplement and fast to run. We use the followng parwse lnear model to relate the measurements from neghbor loops: q ( = α (, j) + α (, j) q ( + nose k ( = β (, j) + β (, j) k ( + nose For each par of neghbors (,j), the parameters α 0 (,j), α 1 (,j), β 0 (,j), β 1 (,j) are estmated usng fve days of hstorcal data. Let q (,q j (, t=1,2,...,n be the hstorcal measurements of volume, then j j 1 n 2 α 0 (, j), α1 (, j) = arg max [ q ( α0 α1 q j ( ] 10 α ', α ' n t =

9 Chen/Kwon/Skabardons/Varaya 8 The parameters for densty are ftted the same way. We can fnd parameters for all pars of loops that report data n our hstorcal database, but some loops never report any data. For them, we use a set of global parameters α 0 (δ,l 1,l 2 ), α 1 (δ,l 1,l 2 ), β 0 (δ,l 1,l 2 ), β 1 (δ,l 1,l 2 ) that generalze the relatonshp between pars of loop n dfferent confguratons. For each combnaton of (relatve locaton, lane of loop 1, lane of loop 2), we have a lnear model as follows. q ( = α 0 ( δ, l, l j ) + α 0 ( δ, l, l j ) q j ( + nose 11 k ( = β ( δ, l, l ) + β ( δ, l, l ) k ( + nose where 0 δ = 0 f and j are n the same locaton on the freeway, 1 otherwse l = lane number of loop l j = lane number of loop j l,l j = 1,2,3,...,8 j 1 The global parameters are ftted to data smlar to the local parameters. In Los Angeles, there are 60,760 pars of neghbors (,j) for 5377 loops; n San Bernardno, there are 3,896 pars for 466 loops. The four parameters for each par are computed for these two dstrcts and stored n database tables. When mputng values for loop usng ts neghbors, each neghbor provdes an estmate, and the fnal estmate s taken as the medan of the par wse estmates. Both volume and occupancy mputaton are performed the same way. The mputaton for volume s j j qˆ j 0 1 ( d, = α (, j) + α (, j) q qˆ ( d, = medan{ qˆ j j ( d, ( d,, j : neghbor of, ˆ j ( d, = 0} 12 Here ˆ j ( d, obtaned from Equaton 2 s the dagnoss of the jth loop only estmates from good neghbors are used n the mputaton. Equaton 12 s a way to combne nformaton from multple neghbors. Whle ths method s suboptmal compared to those wth jont probablty models, such as multple regresson, t s more robust. Multple regresson models all neghbors jontly, as opposed to the par-wse model adopted here. Daley also presented an estmaton method based on all neghbors jontly usng a Kalman flter (9). But we chose the par-wse model for ts robustness--t generates an estmate as long as there s one good neghbor. In contrast, multple regresson needs values at each sample tme from all the neghbors. Robustness s also ncreased by use of the medan of q s nstead of the mean, whch s affected by outlers and errors n j. After one teraton, the mputaton algorthm generates estmates for all the bad loops that have at least one good neghbor. We stll need to do somethng for the bad loops that don t have good neghbors. We have not decded on a scheme for how to do ths, but there are several alternatves. The current mplementaton smply terates the mputaton process. After the frst teraton, a subset of the bad loops s flled wth mputed values these are the loops wth good neghbors. In the second teraton, the set of good loops grows to nclude those that have been mputed n the prevous teraton, so some of the remanng bad loops now have good neghbors. Ths process contnues untl all loops are flled, or untl all the remanng bad loops don t have any good neghbors. The problem wth ths method s that the mputaton becomes less accurate wth each succeedng teraton. Fortunately, most of the bad loops are flled n the frst teraton. In Dstrct 7 on 4/24/2002, for example, the percentages of flled loops n the frst 4 teratons are 90%, 5%, 1%, 1%; the entre grd s flled after 8 teratons. Another alternatve s to use the current mputaton only for the frst n mputatons. After that, f there are stll loops wthout values, we can use another method such as ˆj

10 Chen/Kwon/Skabardons/Varaya 9 hstorcal mean. In any case, an alternatve mputaton scheme s requred for sample tmes when there are no good data for any loop. Performance We evaluate the performance of ths algorthm on data from 4/24/2002. To run ths test, we found 189 loops that are themselves good and also had good neghbors. From each loop, we collected the measured flows and occupances q ( and k (; we then ran the algorthm to compute the estmated values qˆ ( and kˆ ( based on neghbors. From these, we found the root mean squared errors for each loop, see Table 4. Ths table shows that the estmates are unbased as they should be. The standard devaton of mputaton error s small compared to the mean and standard devaton of the measurements. Fgure 7 compares the estmated and orgnal values for one loop. They show good agreement. We also compared the performance of our algorthm aganst that of lnear nterpolaton. Ffteen trplets of good loops were chosen for ths test. Ten of the trplets are loops n the same lane, dfferent locatons, whle 5 other trplets have ther loops n the same locaton, across 3 lanes. In each trplet, we used two loops to predct the volume and occupancy of the thrd loop usng lnear nterpolaton. In every case, the neghborhood method produced a lower error n occupancy estmates; t produced smaller errors n flow estmates n 10 of 15 locatons. Overall, the neghborhood method performed better n the mean and medan, as expected. CONCLUSION We presented algorthms to detect bad loop detectors from ther outputs, and to mpute mssng data from neghborng good loops. Exstng methods of detecton evaluate each 20-second sample to determne f t represents a plausble traffc state, but we found that there s much more nformaton n how detectors behave over tme. Our algorthm makes dagnoses based on the sequence of measurements from each detector over a whole day. Vsually, bad data s much easer to detect when vewed as a tme seres. We found that our algorthm found almost all of the bad detectors that could be found vsually. Our mputaton algorthm estmates the true values at locatons wth bad or mssng data. Ths s an mportant functonalty, because almost any algorthm that uses the data needs a complete grd of data. Tradtonally, the way to deal wth mssng data s to nterpolate from near-by loops. Our algorthm performs better than nterpolaton because t uses hstorcal nformaton on how the measurements from neghbor detectors are related. We model the volume and occupancy between neghbor loops lnearly, and fnd the lnear regresson coeffcents of each neghbor par from hstorcal data. Ths algorthm s smple and robust. There reman many possbltes for mprovements to the algorthms descrbed here. The detecton algorthm descrbed here has a tme lag. To address ths, we are developng a truly real tme detecton algorthm that ncorporates neghbor loop measurements as well as the past day s statstcs. Whle the lnear model descrbes most neghbor pars, some pars have non-lnear relatonshps, so a more general model may be better. Another area of mprovement s the handlng of entre blocks of mssng data. The current mputaton algorthm needs a large number of good loops to mpute the rest, but t doesn t work f most or all the loops are bad for a sample tme. We need a method to handle ths stuaton.

11 Chen/Kwon/Skabardons/Varaya 10 Sngle loop data dagnostcs s an mportant area of research. Whle loop detectors are the most abundant source of traffc nformaton, the data are sometmes bad or mssng. The algorthms we presented construct a complete grd of clean data n real tme. They smplfy the desgn of upper level algorthms and mprove the accuracy of analyss based on loop data.

12 Chen/Kwon/Skabardons/Varaya 11 ACKNOWLEDGEMENTS Ths study s part of the PeMS project, whch s supported by grants from Caltrans to the Calforna PATH Program. We are very grateful to engneers from Caltrans Dstrcts 3, 4, 7, 8 and 12 and Headquarters for ther encouragement, understandng, and patence. They contnue to shape the evoluton of the PeMS vson, to montor ts progress, to champon PeMS wthn Caltrans. Wthout ther generous support ths project would not have reached such a mature stage. The contents of ths paper reflect the vews of the authors who are responsble for the facts and the accuracy of the data presented heren. The contents do not necessarly reflect the offcal vews of or polcy of the Calforna Department of Transportaton. Ths paper does not consttute a standard, specfcaton or regulaton.

13 Chen/Kwon/Skabardons/Varaya 12 REFERENCES Chen, C., K. Petty, A. Skabardons, and P. Varaya. Freeway Performance Measurement System: Mnng Loop Detector Data. Transportaton Research Record No.1748, Transportaton Research Board, Washngton, D.C., 2001, pp Payne, H.J., E.D. Helfenben and H.C. Knobel. Development and testng of ncdent detecton algorthms, FHWA-RD-76-20, Federal Hghway Admnstraton, Washngton DC Jacobson, L., Nhan, N., and J. Bender, Detectng Erroneous Loop Detector Data n a Freeway Traffc Management System. Transportaton Research Record 1287, Transportaton Research Board, Washngton, D.C., 1990, pp Cleghorn, D., F. Hall, and D. Garbuo. Improved Data Screenng Technques for Freeway Traffc Management Systems. Transportaton Research Record 1320, Transportaton Research Board, Washngton, D.C., 1991, pp Nhan, N. (1997) Ad to Determnng Freeway Meterng Rates and Detectng Loop Errors. Journal of Transportaton Engneerng, Vol 123, No 6, November/December 1997, pp Turochy, R.E. and B.L. Smth. A New Procedure for Detector Data Screenng n Traffc Management Systems. Transportaton Research Record 1727, Transportaton Research Board, Washngton, D.C., 2000, pp Daley, D.J. Improved error detecton for nductve loop sensors, WA-RD 3001 Washngton State Dept. of Transportaton, May Davs, G. and N. Nhan. Usng Tme-Seres Desgns to Estmate Changes n Freeway Level of Servce, Despte Mssng Data. Transportaton Research. Part A, Vol 18A, no. 5/6, Oct/Dec 1984, pp

14 Chen/Kwon/Skabardons/Varaya 13 LIST OF TABLES Table 1 Error Types...14 Table 2 Statstcs for dagnostcs...15 Table 3 Parameters of the Daly Statstcs Algorthm, and ther default settngs...16 Table 4 Performance of mputaton...17 LIST OF FIGURES Fgure 1 The Washngton Algorthm on two loops. Loop 1 and 2 are n Los Angeles, I-5 North, postmle 7.8, lanes 1 and 2; data collected on 8/7/ Fgure 2 Typcal and abnormal 30-sec flow (lef and occupancy measurements Fgure 3 Hstograms of S 1 - S Fgure 4 Example of neghborng loops Fgure 5 Scatter plot of occupances and flows from two pars of neghbors Fgure 6 Cumulatve dstrbuton of the correlaton coeffcents between neghbors...23 Fgure 7 Orgnal and estmated occupances and flows for a good loop...24

15 Chen/Kwon/Skabardons/Varaya 14 Table 1 Error Types. Error Type Descrpton Lkely Cause Fracton of loops n Dstrct 12 1 Occupancy and flow are mostly zero Stuck off 5.6% 2 Non-zero occupancy and zero flow, see Hangng on 5.5% Fgure 2c and 2d. 3 Very hgh occupancy, see Fgure 1d Hangng on 9.6% 4 Constant occupancy and flow Stuck on or 11.2% off All Errors 16%

16 Chen/Kwon/Skabardons/Varaya 15 Table 2 Statstcs for dagnostcs. Name Defnton Descrpton S 1 (,d) 1 ( k ( d, = 0) number of samples that have occupancy = 0. S 2 (,d) S 3 (,d: S 4 (,d) a t b a t b a t b ( 1) 1 ( k ( d, > 0)1( q ( d, = 0) number of samples that have occupancy>0 and flow=0 1( k ( d, > k ), k = number of samples that have occupancy > k (=0.35) pˆ( x) = x: p( x) > 0 pˆ( x) log( pˆ( x)), a t b 1( k ( d, = x) 1 a t b entropy of occupancy samples a well-known measure of the randomness of a random varable. If k (d, s constant n t, for example, ts entropy s zero.

17 Chen/Kwon/Skabardons/Varaya 16 Table 3 Parameters of the Daly Statstcs Algorthm, and ther default settngs Parameter Value k 0.35 s s 2 50 s s 4 4 a 5am b 10pm

18 Chen/Kwon/Skabardons/Varaya 17 Quantty Mean Standard Devaton Table 4 Performance of mputaton. Mean Absolute Error Standard Devaton of Error Mean Error Occupancy Volume (vph)

19 Chen/Kwon/Skabardons/Varaya 18 a b c d Fgure 1 The Washngton Algorthm on two loops. Loop 1 and 2 are n Los Angeles, I-5 North, postmle 7.8, lanes 1 and 2; data collected on 8/7/2001. Occupancy s n percent.

20 Chen/Kwon/Skabardons/Varaya 19 a b c d Fgure 2 Typcal and abnormal 30-sec flow (lef and occupancy measurements.

21 Chen/Kwon/Skabardons/Varaya 20 Fgure 3 Hstograms of S 1 - S 4.

22 Chen/Kwon/Skabardons/Varaya 21 Fgure 4 Example of neghborng loops.

23 Chen/Kwon/Skabardons/Varaya 22 Fgure 5 Scatter plot of occupances and flows from two pars of neghbors.

24 Chen/Kwon/Skabardons/Varaya 23 Fgure 6 Cumulatve dstrbuton of the correlaton coeffcents between neghbors.

25 Chen/Kwon/Skabardons/Varaya 24 Fgure 7 Orgnal and estmated occupances and flows for a good loop.

Detecting Errors and Imputing Missing Data for Single-Loop Surveillance Systems

Detecting Errors and Imputing Missing Data for Single-Loop Surveillance Systems 16 Transportaton Research Record 1855 Paper No. 3-357 Detectng Errors and Imputng Mssng Data for Sngle-Loop Survellance Systems Chao Chen, Jamyoung Kwon, John Rce, Alexander Skabardons, and Pravn Varaya

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated. Some Advanced SP Tools 1. umulatve Sum ontrol (usum) hart For the data shown n Table 9-1, the x chart can be generated. However, the shft taken place at sample #21 s not apparent. 92 For ths set samples,

More information

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following. Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal

More information

Why visualisation? IRDS: Visualization. Univariate data. Visualisations that we won t be interested in. Graphics provide little additional information

Why visualisation? IRDS: Visualization. Univariate data. Visualisations that we won t be interested in. Graphics provide little additional information Why vsualsaton? IRDS: Vsualzaton Charles Sutton Unversty of Ednburgh Goal : Have a data set that I want to understand. Ths s called exploratory data analyss. Today s lecture. Goal II: Want to dsplay data

More information

Solutions to Programming Assignment Five Interpolation and Numerical Differentiation

Solutions to Programming Assignment Five Interpolation and Numerical Differentiation College of Engneerng and Coputer Scence Mechancal Engneerng Departent Mechancal Engneerng 309 Nuercal Analyss of Engneerng Systes Sprng 04 Nuber: 537 Instructor: Larry Caretto Solutons to Prograng Assgnent

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

USING GRAPHING SKILLS

USING GRAPHING SKILLS Name: BOLOGY: Date: _ Class: USNG GRAPHNG SKLLS NTRODUCTON: Recorded data can be plotted on a graph. A graph s a pctoral representaton of nformaton recorded n a data table. t s used to show a relatonshp

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

A Semi-parametric Regression Model to Estimate Variability of NO 2

A Semi-parametric Regression Model to Estimate Variability of NO 2 Envronment and Polluton; Vol. 2, No. 1; 2013 ISSN 1927-0909 E-ISSN 1927-0917 Publshed by Canadan Center of Scence and Educaton A Sem-parametrc Regresson Model to Estmate Varablty of NO 2 Meczysław Szyszkowcz

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Data Mining: Model Evaluation

Data Mining: Model Evaluation Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

SVM-based Learning for Multiple Model Estimation

SVM-based Learning for Multiple Model Estimation SVM-based Learnng for Multple Model Estmaton Vladmr Cherkassky and Yunqan Ma Department of Electrcal and Computer Engneerng Unversty of Mnnesota Mnneapols, MN 55455 {cherkass,myq}@ece.umn.edu Abstract:

More information

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap Int. Journal of Math. Analyss, Vol. 8, 4, no. 5, 7-7 HIKARI Ltd, www.m-hkar.com http://dx.do.org/.988/jma.4.494 Emprcal Dstrbutons of Parameter Estmates n Bnary Logstc Regresson Usng Bootstrap Anwar Ftranto*

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007 Syntheszer 1.0 A Varyng Coeffcent Meta Meta-Analytc nalytc Tool Employng Mcrosoft Excel 007.38.17.5 User s Gude Z. Krzan 009 Table of Contents 1. Introducton and Acknowledgments 3. Operatonal Functons

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

Econometrics 2. Panel Data Methods. Advanced Panel Data Methods I

Econometrics 2. Panel Data Methods. Advanced Panel Data Methods I Panel Data Methods Econometrcs 2 Advanced Panel Data Methods I Last tme: Panel data concepts and the two-perod case (13.3-4) Unobserved effects model: Tme-nvarant and dosyncratc effects Omted varables

More information

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp Lfe Tables (Tmes) Summary... 1 Data Input... 2 Analyss Summary... 3 Survval Functon... 5 Log Survval Functon... 6 Cumulatve Hazard Functon... 7 Percentles... 7 Group Comparsons... 8 Summary The Lfe Tables

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Parameter estimation for incomplete bivariate longitudinal data in clinical trials

Parameter estimation for incomplete bivariate longitudinal data in clinical trials Parameter estmaton for ncomplete bvarate longtudnal data n clncal trals Naum M. Khutoryansky Novo Nordsk Pharmaceutcals, Inc., Prnceton, NJ ABSTRACT Bvarate models are useful when analyzng longtudnal data

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Chapter 9. Model Calibration. John Hourdakis Center for Transportation Studies, U of Mn

Chapter 9. Model Calibration. John Hourdakis Center for Transportation Studies, U of Mn Chapter 9 Model Calbraton John Hourdaks Center for Transportaton Studes, U of Mn Hourd00@tc.umn.edu Why Calbrate? Computers Cannot Magcally Replcate Realty! Smulaton Models Are Desgned to be General Drver

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

3D vector computer graphics

3D vector computer graphics 3D vector computer graphcs Paolo Varagnolo: freelance engneer Padova Aprl 2016 Prvate Practce ----------------------------------- 1. Introducton Vector 3D model representaton n computer graphcs requres

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

A Simple and Efficient Goal Programming Model for Computing of Fuzzy Linear Regression Parameters with Considering Outliers

A Simple and Efficient Goal Programming Model for Computing of Fuzzy Linear Regression Parameters with Considering Outliers 62626262621 Journal of Uncertan Systems Vol.5, No.1, pp.62-71, 211 Onlne at: www.us.org.u A Smple and Effcent Goal Programmng Model for Computng of Fuzzy Lnear Regresson Parameters wth Consderng Outlers

More information

ECONOMICS 452* -- Stata 11 Tutorial 6. Stata 11 Tutorial 6. TOPIC: Representing Multi-Category Categorical Variables with Dummy Variable Regressors

ECONOMICS 452* -- Stata 11 Tutorial 6. Stata 11 Tutorial 6. TOPIC: Representing Multi-Category Categorical Variables with Dummy Variable Regressors ECONOMICS * -- Stata 11 Tutoral Stata 11 Tutoral TOPIC: Representng Mult-Category Categorcal Varables wth Dummy Varable Regressors DATA: wage1_econ.dta (a Stata-format dataset) TASKS: Stata 11 Tutoral

More information

Brave New World Pseudocode Reference

Brave New World Pseudocode Reference Brave New World Pseudocode Reference Pseudocode s a way to descrbe how to accomplsh tasks usng basc steps lke those a computer mght perform. In ths week s lab, you'll see how a form of pseudocode can be

More information

ECONOMICS 452* -- Stata 12 Tutorial 6. Stata 12 Tutorial 6. TOPIC: Representing Multi-Category Categorical Variables with Dummy Variable Regressors

ECONOMICS 452* -- Stata 12 Tutorial 6. Stata 12 Tutorial 6. TOPIC: Representing Multi-Category Categorical Variables with Dummy Variable Regressors ECONOMICS 45* -- Stata 1 Tutoral 6 Stata 1 Tutoral 6 TOPIC: Representng Mult-Category Categorcal Varables wth Dummy Varable Regressors DATA: wage1_econ45.dta (a Stata-format dataset) TASKS: Stata 1 Tutoral

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

Outlier Detection based on Robust Parameter Estimates

Outlier Detection based on Robust Parameter Estimates Outler Detecton based on Robust Parameter Estmates Nor Azlda Aleng 1, Ny Ny Nang, Norzan Mohamed 3 and Kasyp Mokhtar 4 1,3 School of Informatcs and Appled Mathematcs, Unverst Malaysa Terengganu, 1030 Kuala

More information

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory Background EECS. Operatng System Fundamentals No. Vrtual Memory Prof. Hu Jang Department of Electrcal Engneerng and Computer Scence, York Unversty Memory-management methods normally requres the entre process

More information

Angle-Independent 3D Reconstruction. Ji Zhang Mireille Boutin Daniel Aliaga

Angle-Independent 3D Reconstruction. Ji Zhang Mireille Boutin Daniel Aliaga Angle-Independent 3D Reconstructon J Zhang Mrelle Boutn Danel Alaga Goal: Structure from Moton To reconstruct the 3D geometry of a scene from a set of pctures (e.g. a move of the scene pont reconstructon

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

A Similarity-Based Prognostics Approach for Remaining Useful Life Estimation of Engineered Systems

A Similarity-Based Prognostics Approach for Remaining Useful Life Estimation of Engineered Systems 2008 INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT A Smlarty-Based Prognostcs Approach for Remanng Useful Lfe Estmaton of Engneered Systems Tany Wang, Janbo Yu, Davd Segel, and Jay Lee

More information

REFRACTIVE INDEX SELECTION FOR POWDER MIXTURES

REFRACTIVE INDEX SELECTION FOR POWDER MIXTURES REFRACTIVE INDEX SELECTION FOR POWDER MIXTURES Laser dffracton s one of the most wdely used methods for partcle sze analyss of mcron and submcron sze powders and dspersons. It s quck and easy and provdes

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics

This module is part of the. Memobust Handbook. on Methodology of Modern Business Statistics Ths module s part of the Memobust Handbook on Methodology of Modern Busness Statstcs 26 March 2014 Theme: Donor Imputaton Contents General secton... 3 1. Summary... 3 2. General descrpton... 3 2.1 Introducton

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

MATHEMATICS FORM ONE SCHEME OF WORK 2004

MATHEMATICS FORM ONE SCHEME OF WORK 2004 MATHEMATICS FORM ONE SCHEME OF WORK 2004 WEEK TOPICS/SUBTOPICS LEARNING OBJECTIVES LEARNING OUTCOMES VALUES CREATIVE & CRITICAL THINKING 1 WHOLE NUMBER Students wll be able to: GENERICS 1 1.1 Concept of

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

ELEC 377 Operating Systems. Week 6 Class 3

ELEC 377 Operating Systems. Week 6 Class 3 ELEC 377 Operatng Systems Week 6 Class 3 Last Class Memory Management Memory Pagng Pagng Structure ELEC 377 Operatng Systems Today Pagng Szes Vrtual Memory Concept Demand Pagng ELEC 377 Operatng Systems

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Intro. Iterators. 1. Access

Intro. Iterators. 1. Access Intro Ths mornng I d lke to talk a lttle bt about s and s. We wll start out wth smlartes and dfferences, then we wll see how to draw them n envronment dagrams, and we wll fnsh wth some examples. Happy

More information

Pictures at an Exhibition

Pictures at an Exhibition 1 Pctures at an Exhbton Stephane Kwan and Karen Zhu Department of Electrcal Engneerng Stanford Unversty, Stanford, CA 9405 Emal: {skwan1, kyzhu}@stanford.edu Abstract An mage processng algorthm s desgned

More information

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Parallel Numerics. 1 Preconditioning & Iterative Solvers (From 2016)

Parallel Numerics. 1 Preconditioning & Iterative Solvers (From 2016) Technsche Unverstät München WSe 6/7 Insttut für Informatk Prof. Dr. Thomas Huckle Dpl.-Math. Benjamn Uekermann Parallel Numercs Exercse : Prevous Exam Questons Precondtonng & Iteratve Solvers (From 6)

More information

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation Intellgent Informaton Management, 013, 5, 191-195 Publshed Onlne November 013 (http://www.scrp.org/journal/m) http://dx.do.org/10.36/m.013.5601 Qualty Improvement Algorthm for Tetrahedral Mesh Based on

More information

Analysis of Collaborative Distributed Admission Control in x Networks

Analysis of Collaborative Distributed Admission Control in x Networks 1 Analyss of Collaboratve Dstrbuted Admsson Control n 82.11x Networks Thnh Nguyen, Member, IEEE, Ken Nguyen, Member, IEEE, Lnha He, Member, IEEE, Abstract Wth the recent surge of wreless home networks,

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

Anonymisation of Public Use Data Sets

Anonymisation of Public Use Data Sets Anonymsaton of Publc Use Data Sets Methods for Reducng Dsclosure Rsk and the Analyss of Perturbed Data Harvey Goldsten Unversty of Brstol and Unversty College London and Natale Shlomo Unversty of Manchester

More information

A Robust Method for Estimating the Fundamental Matrix

A Robust Method for Estimating the Fundamental Matrix Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.

More information

The Man-hour Estimation Models & Its Comparison of Interim Products Assembly for Shipbuilding

The Man-hour Estimation Models & Its Comparison of Interim Products Assembly for Shipbuilding Internatonal Journal of Operatons Research Internatonal Journal of Operatons Research Vol., No., 9 4 (005) The Man-hour Estmaton Models & Its Comparson of Interm Products Assembly for Shpbuldng Bn Lu and

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

Biostatistics 615/815

Biostatistics 615/815 The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

THE MAP MATCHING ALGORITHM OF GPS DATA WITH RELATIVELY LONG POLLING TIME INTERVALS

THE MAP MATCHING ALGORITHM OF GPS DATA WITH RELATIVELY LONG POLLING TIME INTERVALS THE MA MATCHING ALGORITHM OF GS DATA WITH RELATIVELY LONG OLLING TIME INTERVALS Jae-seok YANG Graduate Student Graduate School of Engneerng Seoul Natonal Unversty San56-, Shllm-dong, Gwanak-gu, Seoul,

More information

Air Transport Demand. Ta-Hui Yang Associate Professor Department of Logistics Management National Kaohsiung First Univ. of Sci. & Tech.

Air Transport Demand. Ta-Hui Yang Associate Professor Department of Logistics Management National Kaohsiung First Univ. of Sci. & Tech. Ar Transport Demand Ta-Hu Yang Assocate Professor Department of Logstcs Management Natonal Kaohsung Frst Unv. of Sc. & Tech. 1 Ar Transport Demand Demand for ar transport between two ctes or two regons

More information

Circuit Analysis I (ENGR 2405) Chapter 3 Method of Analysis Nodal(KCL) and Mesh(KVL)

Circuit Analysis I (ENGR 2405) Chapter 3 Method of Analysis Nodal(KCL) and Mesh(KVL) Crcut Analyss I (ENG 405) Chapter Method of Analyss Nodal(KCL) and Mesh(KVL) Nodal Analyss If nstead of focusng on the oltages of the crcut elements, one looks at the oltages at the nodes of the crcut,

More information

5.0 Quality Assurance

5.0 Quality Assurance 5.0 Dr. Fred Omega Garces Analytcal Chemstry 25 Natural Scence, Mramar College Bascs of s what we do to get the rght answer for our purpose QA s planned and refers to planned and systematc producton processes

More information

Three supervised learning methods on pen digits character recognition dataset

Three supervised learning methods on pen digits character recognition dataset Three supervsed learnng methods on pen dgts character recognton dataset Chrs Flezach Department of Computer Scence and Engneerng Unversty of Calforna, San Dego San Dego, CA 92093 cflezac@cs.ucsd.edu Satoru

More information

CS 268: Lecture 8 Router Support for Congestion Control

CS 268: Lecture 8 Router Support for Congestion Control CS 268: Lecture 8 Router Support for Congeston Control Ion Stoca Computer Scence Dvson Department of Electrcal Engneerng and Computer Scences Unversty of Calforna, Berkeley Berkeley, CA 9472-1776 Router

More information

Design of a Real Time FPGA-based Three Dimensional Positioning Algorithm

Design of a Real Time FPGA-based Three Dimensional Positioning Algorithm Desgn of a Real Tme FPGA-based Three Dmensonal Postonng Algorthm Nathan G. Johnson-Wllams, Student Member IEEE, Robert S. Myaoka, Member IEEE, Xaol L, Student Member IEEE, Tom K. Lewellen, Fellow IEEE,

More information