Reinforcement Learning Based on Active Learning Method
|
|
- Clinton Ford
- 5 years ago
- Views:
Transcription
1 Second Internatonal Symposum on Intellgent Informaton Technology Applcaton Renforcement Learnng Based on Actve Learnng Method Hesam Sagha 1, Saeed Bagher Shourak 2, Hosen Khasteh 1, Al Akbar Kae 1 1 ACECR, Nasr Branch, Tehran, Iran 2 Department of Electrcal Engneerng Sharf Unversty of Technology,Tehran, Iran sagha@ce.sharf.r, bagher-s@sharf.edu, h_khasteh@ce.sharf.r, kae@ce.sharf.edu Abstract In ths paper, a new renforcement learnng approach s proposed whch s based on a powerful concept named Actve Learnng Method (ALM) n modelng. ALM expresses any mult-nput-sngle-output system as a fuzzy combnaton of some sngle-nput-sngleoutput systems. The proposed method s an actor-crtc system smlar to Generalzed Approxmate Reasonng based Intellgent Control (GARIC) structure to adapt the ALM by delayed renforcement sgnals. Our system uses Temporal Dfference (TD) learnng to model the behavor of useful actons of a control system. The goodness of an acton s modeled on Reward-Penalty-Plane. IDS planes wll be updated accordng to ths plane. It s shown that the system can learn wth a predefned fuzzy system or wthout t (through random actons). 1. Introducton ALM [1,2,3,4] s a recursve fuzzy algorthm, whch expresses a mult-nput-sngle-output system as a fuzzy combnaton of several sngle-nput-sngle-output systems. It models the nput-output relatons for each nput and combnes these models to fnd out the overall system model. ALM starts wth gatherng data and projectng them on dfferent data planes. The horzontal axs of each data plane s one of the nputs and the vertcal axs s the output. IDS processng engne wll look for a behavor curve, hereafter narrow lne, on each data plane. If the spread of data over the narrow lne s more than a threshold, data domans wll be dvded and the algorthm runs agan. The heart of ths learnng algorthm s a fuzzy nterpolaton method whch s used to derve a smooth curve among data ponts. It s done by applyng a three-dmensonal membershp functon to each data pont, whch expresses the belef for the data pont and ts neghbors. Each data pont s consdered as a source of lght, whch has a pyramd-shape llumnaton pattern. As the vertcal dstance from ths source of lght ncreases, ts llumnatng pattern wll nterfere wth ts neghbors formng new brght areas. The projecton of the process on the plane s called IDS. As t s shown n Fg 2, we can use a pyramd as a three dmensonal fuzzy membershp functon of a data pont and ts neghborng ponts. By applyng IDS method to each data plane, two dfferent types of nformaton wll be extracted. One s the narrow path and the other s the devaton of the data ponts around each narrow path. Each narrow path shows the behavor of output relatve to an nput; and spread of the data ponts around ths path shows the mportance degree of that nput n overall system behavor. Less devaton of data ponts around the path represents a hgher degree of mportance and vce versa. Sagha et al [5] proposed a method whch combnes genetc algorthm and IDS to obtan better parttons over nput varables. Ther method s called GIDS (Genetc IDS). Shahd et al [6] proposed RIDS method that replaces each two consequent ponts wth ther mdpont nstead of applyng a 2-d fuzzy membershp functon on each data. RIDS converges to the center of gravty of data and ncreases the number of ponts n order to keep data expanson n plane. In addton, they proposed another method called (Modfed RIDS) MRIDS that support negatve data ponts. In MRIDS, f two consequent ponts are postve, the result s smlar to that of RIDS and the replacng pont s ther mdpont. Nevertheless, f one of the ponts s negatve, then the replacng pont s a pont located near the postve pont on the lne whch connects two ponts; so negatve pont has an effect of devatng center of gravty from postve ponts. MRIDS consders that the rewards and punshments are accessble after each acton, but when they are delayed and ths delay s not determned, t wll not converge correctly. Here we used another method called Renforcement ALM (RALM), to add renforcement capablty to the algorthm. We used the concepts of Acton Selecton /08 $ IEEE DOI /IITA
2 Weght update r AEN r' F SAM ' ASN F State x Falure Renforcement Sgnal Network (ASN), Acton Evaluaton Network (AEN), and Stochastc Acton Modfer (SAM) that are proposed n GARIC [7] as an actor-crtc algorthm. GARIC: The archtecture of GARIC s schematcally shown n Fg 3. ASN maps a state vector nto a recommended acton, F, usng fuzzy nference. AEN maps a state vector and a falure sgnal nto a scalar score that ndcates state goodness. Ths s also used to produce nternal renforcement, r'. AEN can be a neural network structure or a fuzzy system [8] or alke. SAM uses both F and r' to produce an acton F', whch wll be appled to the plant. Learnng occurs by fne-tunng the parameters n the two networks: n the AEN, the weghts or fuzzy parameters are adjusted; n the ASN, the parameters descrbng the fuzzy membershp functons are changed. These are done by gradent descent approach. AEN parameters are updated va Temporal Dfference Learnng method. Temporal Dfference Learnng: s a predcton method. It approxmates ts current estmate based on prevously learned estmates by assumng subsequent predctons are often correlated n some sense. A predcton s made, and when the observaron s avalabe the predcton s adjusted to better match the observaton. If each state s t has the predcton value v(s t ) that denotes the goodness of done actons n that state, then the updatng formula s: v( st) = v( st) + αt( Rt + 1+ δv( st+ 1) v( st)) (1) where α t s learnng rate and δ s a constant n the range of [0,1] and R t+1 s the receved reward at tme t+1.[9] 2. Proposed Method Physcal System Fgure 3. The artchecture of GARIC Fgure 1. Flowchart of ALM Fgure 2. IDS method and Fuzzy membershp functon In our method we used a smlar structure to GARIC. ASN s an IDS fuzzy system. AEN s made up of a plane called Reward-Penalty-Plane (RPP). On ths plane s the nformaton of how much the done acton n a specfc state s good. From control vewpont, ths plane can be called Error-Change n Error-Plane because one axs Fgure 4. Intal Reward-Penalty Plane for an nverse pendulum system. Mddle ponts denote the desred states (-0.012R< θ <0.012 R and -0.05R/s < Δ θ < 0.05R/s) and have the maxmum value (1), margn ponts denote penalty areas and have the negatve mnmum value (less than zero, more than -1), and other ponts are n the play area. 599
3 denotes error and the other denotes changng error, and the value of each pont n ths plane shows that how much we can trust the selected actonn of ASN n that specfc state. SAM changes the value of fuzzy system output by consderng the output of AEN. More relevant an acton s SAM changes t less. RPP s a surface made up of the desred varable to control. At frst we have three regons on RPP plane, ) reward area: s the de- state nto that. sred area we lke the controller takes the Ths area has the fxed value of one. ) Penalty area: conssts of states that are not desrable n system and makes t unstable. The value of ths area s stuck to the lowest negatve value. ) Play area: ths area s the rest of the surface that has the value of zero ntally. An ntal plane s shown n fgure 4, for a system we lke to stable the varable angle. Durng the run when data s avalable, the value of RPP n the prevous tme step and tss neghbors wll approach to the value of RPP n the current tme step and ts neghbors as same as TD(0). delta = λwn( RPP ( e( t ), ce ( t )) (2) RPP ( e ( t 1), ce ( t 1) ))) RPP ( e( t 1), e( t 1)) = (3) RPP ( ce ( t 1), ce ( t 1)) + delta where, delta s the value of changng RPP, e(t) s the error of control varable at tme t, ce s changng n error and wn s the IDS wndow shows how much the neghbors must be effected and t cann be a pyramd, Gaussan wndow wth the center valuee of one or alke. When an acton s rewarded the rate of update, λ, s hgh, but when an acton s penalzed, λ s very low. It s beactons s much cause we assumed the number of false more than true actons for a system. After updatng the RPP plane, we must update IDS planes for the prevous acton and ts neghbors. In ths case, we reward the acton f t goes too better state and punsh t when t goes to worse state. The total goodness of an acton wll be obtaned by averagng over delta values: IDS ( n ( t 1), out ( t 1)) = (4) IDS ( n ( t 1), out ( t 1)) + delta where n s the th nput varable. Fuzzy system can be adapted onlne. In ths case, after spreadng each datum and neglectng data n neg- atve areas of IDS planes, narrow lnes of a predefned fuzzy nference system are updated. To select the next acton (step tme t) after fuzzy nference procedure n ASN, for exploraton and explotaton n the space, we change the obtaned value by followng formula: Ft () = ASN() t + N(0, Var) (5) (exp( α RPP( e( t), ce( t))) exp( α )) where N(0,Var) s a normally dstrbuted random varconstant. When RPP able wth varance Var and α s a gves the best score for an acton (.e. 1), the selected acton of ASN wll be appled wthout manpulaton. For offlne learnng, after some data capturng, when the RPP plane converges and no changes occurred n t, or a specfc number of teraton s passed, we get IDS planes that has both postve and negatve values. Negwhch are chosen n atve areas show that the actons ths part of space transform the system nto worse state. Therefore, by neglectng the data n these areas, we can flter bad actons. Fnally, by estmatng narrow lnes, and usng ALM, we can construct the fuzzy system. Another problem exsts when the state s n the range of reward area. If we use the orgnal generated fuzzy system, we have vbratons n ths range. It s because the learnng system s not learned how to act when the state s n reward area. To handle ths problem we used fuzzy scalng. In ths knd of scalng, the range of nput varable of fuzzy system wll be scaled proporton to the reward range/nput range: ' In = In Range( Reward( In )) / Range( In ) (6) where In s th nput whch s a part of RPP. Output range wll be scaled by Max ( Range ( Reward ( In ))/ Range ( In )) (7) Fgure 5. Fnal IDS planes for the nverted pendulum system. Whte areas have postve value and darker ones have negatve values. Fgure 6. Selected data 600
4 Therefore, we do not need another fuzzy system; just varables must be scaled and use the generated fuzzy system. Ths approach has some advantages n comparson wth MRIDS, especally when the problem has delays to reach nto a desred state. Also we explctly defne the reward and penalty areas and there s no need to defne how and on what trajectory the system can reach the goal. 3. Results We modeled the well-known nverse pendulum problem wth two nput, theta, θ, and angular velocty (Dtheta), θ. Reward area was chosen between θ = [-0.23, 0.23] radan and θ = [-0.98, 0. 98] radan/s and penalty areas are when each varable s more than 0.9 of nput range. λ s chosen to be 0.9 for rewards and 0.05 for penaltes. Penalty areas are set to be Tme step was chosen to be We used two methods of acton selecton n the be- actons n- gnnng of run. The frst one used random stead of a predefned fuzzy system, so no manpulaton by formula (5) was needed. After sequences and only 32 successes durng t, we got the IDS planes whch are shown n Fg. 5. Success s when the system state s n the reward area. Whte areas are postve and dark areas are negatve. Fgure 6 showss the data that are extracted from all data after removng bad ones that are located on negatve part of IDS planes. The fnal Reward-Penalty Plane s shown n Fg. 7. After applyng ALM, we got a fuzzy system wth only four rules. The surface of nput-output-force s shown n Fg. 8. Fg. 9 shows some random ntal states and ther convergence to the desred pont. Rse tme s 2.71 and overshoot s %0.0. The second method uses an ncorrect fuzzy sys tme tem n ASN wth four rules. After about steps of onlne learnng, the system learned to be stable. Fg 10 shows frst 1000 tme steps and last 1000 ones. Learnng by GARIC takes about tme steps but n our system t takes less than tme steps. In addton, we modeled ball and beam system. It s assumed the system has three nputs, θ, x0, v0 and two outputs x and v. θ s the angle of beam wth horzontal lne passng through the orgn, r0 s the ntal value of the dstance of ball from the orgn. v0 s the ntal value of ball's speed and r and v are the fnal values of dstance and speed. Our goal s to move the ball nto the poston zero, so we defne the RPP wth respect to x and v. To control t, we have two nputs v0 and x0 and one output θ. The results of RALM for some random nputs are shown n Fg. 11. Generated fuzzy system has four rules. The rse tme s 1.6 s and overshoot s %0.0. Table 4. Concluson 1 shows the result of other proposed algorthm based on ALM and FALCON. It can be seen that rse tme s reduced about 13% of supervsed ALM and no over- shoot s detected. ALM s a powerful dea for modelng. We changed t to support renforcement learnng. Our approach uses another plane to get the nformaton of renforcement sgnals. The approach s useful when there s no explct dea about the goodness of an acton, and delayed re- RALM con- wards and penaltes must be consdered. sders these very well. Results show that RALM learns better than other proposed ALM based algorthms. Table 1. Comparng control parameters of 4 control- lng method Fuzzy rules Overshoot Rse tme FALCON-ART % 2.11 Unsupervsed % 1.87 ALM Supervsed ALM RALM Fgure 7. Reward--Plane Fgure 8. Fuzzy system surface 601
5 5. References [1] S.Bagher, G.Yuasa, N.Honda, Fuzzy Controller Desgn by an actve Learnng Method,31 st Symposum of Intellgent Control, SIC 98 [2] S.Bagher, N.Honda, Hardware Smulaton of Bran Learnng Process, 15 th Fuzzy Symposum, Osaka, June 99. [3] S.Bagher, N.Honda, A New Method for Establshng and Savng Fuzzy Membershp Functons, 13 th Fuzzy sym- Computer For posum, Toyama, [4] S.Bagher, N.Honda, Outlnes of a Soft Bran Smulaton, Methodologes for the Concepton, Desgn And Applcaton of Soft Computng, IIZUK, 1998 [5] H.Sagha, S. B. Shourak, M.Dehghan, Genetc Ink Drop Spread, Internatonal Symposum n Intellgent Informaton Technology Applcaton, IITA, Chna,2008 [6] A.Shahd, S.Bagher, Supervsed Actve Learnng Meand Its Hardware thod as an ntellgent lngustc Controller Implementaton, IASTED, Span, 2002 [7] H.Berenj, P.Khedkar, Learnng and Tunng Fuzzy Logc Controllers Through Renforcement, IEEE Transacton on Neural Network, Vol 3, No 5, [8] H.Berenj, P.Khedkar, Usng Fuzzy Logc For Perfor- Learnng, mance Evaluaton n Renforcement NASA-TM [9] R. Sutton,A. Barto. Renforcement Learnng. MIT Press, 1998 Fgure 9. Fnal system output for nverse pendulum. Fgure10. Onlne learnng; Left: Frst 1000 tme steps, Rght: Tme steps between and Fgure 11. Fnal system output for ball and beam problem 602
Parallelism for Nested Loops with Non-uniform and Flow Dependences
Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr
More informationType-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data
Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES
More informationImprovement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration
Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,
More informationCluster Analysis of Electrical Behavior
Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School
More informationSLAM Summer School 2006 Practical 2: SLAM using Monocular Vision
SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,
More informationOPTIMIZATION OF FUZZY RULE BASES USING CONTINUOUS ANT COLONY SYSTEM
Proceedng of the Frst Internatonal Conference on Modelng, Smulaton and Appled Optmzaton, Sharah, U.A.E. February -3, 005 OPTIMIZATION OF FUZZY RULE BASES USING CONTINUOUS ANT COLONY SYSTEM Had Nobahar
More informationA Binarization Algorithm specialized on Document Images and Photos
A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a
More informationSubspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;
Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features
More information(1) The control processes are too complex to analyze by conventional quantitative techniques.
Chapter 0 Fuzzy Control and Fuzzy Expert Systems The fuzzy logc controller (FLC) s ntroduced n ths chapter. After ntroducng the archtecture of the FLC, we study ts components step by step and suggest a
More informationSupport Vector Machines
/9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.
More informationLearning the Kernel Parameters in Kernel Minimum Distance Classifier
Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department
More informationOutline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1
4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:
More informationCompiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz
Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster
More informationPositive Semi-definite Programming Localization in Wireless Sensor Networks
Postve Sem-defnte Programmng Localzaton n Wreless Sensor etworks Shengdong Xe 1,, Jn Wang, Aqun Hu 1, Yunl Gu, Jang Xu, 1 School of Informaton Scence and Engneerng, Southeast Unversty, 10096, anjng Computer
More informationSolving two-person zero-sum game by Matlab
Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by
More informationFuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches
Proceedngs of the Internatonal Conference on Cognton and Recognton Fuzzy Flterng Algorthms for Image Processng: Performance Evaluaton of Varous Approaches Rajoo Pandey and Umesh Ghanekar Department of
More informationGenetic Tuning of Fuzzy Logic Controller for a Flexible-Link Manipulator
Genetc Tunng of Fuzzy Logc Controller for a Flexble-Lnk Manpulator Lnda Zhxa Sh Mohamed B. Traba Department of Mechancal Unversty of Nevada, Las Vegas Department of Mechancal Engneerng Las Vegas, NV 89154-407
More informationAn Image Fusion Approach Based on Segmentation Region
Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua
More informationHigh-Boost Mesh Filtering for 3-D Shape Enhancement
Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,
More informationEdge Detection in Noisy Images Using the Support Vector Machines
Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona
More informationNUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS
ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana
More informationOutline. Type of Machine Learning. Examples of Application. Unsupervised Learning
Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton
More informationSmoothing Spline ANOVA for variable screening
Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory
More informationBiostatistics 615/815
The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts
More informationS1 Note. Basis functions.
S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type
More informationREFRACTION. a. To study the refraction of light from plane surfaces. b. To determine the index of refraction for Acrylic and Water.
Purpose Theory REFRACTION a. To study the refracton of lght from plane surfaces. b. To determne the ndex of refracton for Acrylc and Water. When a ray of lght passes from one medum nto another one of dfferent
More informationActive Contours/Snakes
Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng
More informationOutline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:
Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A
More informationMachine Learning 9. week
Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below
More informationA Fast Visual Tracking Algorithm Based on Circle Pixels Matching
A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng
More information3D vector computer graphics
3D vector computer graphcs Paolo Varagnolo: freelance engneer Padova Aprl 2016 Prvate Practce ----------------------------------- 1. Introducton Vector 3D model representaton n computer graphcs requres
More informationUser Authentication Based On Behavioral Mouse Dynamics Biometrics
User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA
More informationThe Research of Ellipse Parameter Fitting Algorithm of Ultrasonic Imaging Logging in the Casing Hole
Appled Mathematcs, 04, 5, 37-3 Publshed Onlne May 04 n ScRes. http://www.scrp.org/journal/am http://dx.do.org/0.436/am.04.584 The Research of Ellpse Parameter Fttng Algorthm of Ultrasonc Imagng Loggng
More informationA MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS
Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung
More informationLoad Balancing for Hex-Cell Interconnection Network
Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,
More informationLecture 5: Multilayer Perceptrons
Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented
More informationSupport Vector Machines
Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned
More informationMULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION
MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and
More informationDetermining the Optimal Bandwidth Based on Multi-criterion Fusion
Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn
More informationTraining ANFIS Structure with Modified PSO Algorithm
Proceedngs of the 5th Medterranean Conference on Control & Automaton, July 7-9, 007, Athens - Greece T4-003 Tranng ANFIS Structure wth Modfed PSO Algorthm V.Seyd Ghomsheh *, M. Alyar Shoorehdel **, M.
More informationAn Improved Image Segmentation Algorithm Based on the Otsu Method
3th ACIS Internatonal Conference on Software Engneerng, Artfcal Intellgence, Networkng arallel/dstrbuted Computng An Improved Image Segmentaton Algorthm Based on the Otsu Method Mengxng Huang, enjao Yu,
More informationProblem Definitions and Evaluation Criteria for Computational Expensive Optimization
Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty
More informationA New Approach For the Ranking of Fuzzy Sets With Different Heights
New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays
More informationClassifier Selection Based on Data Complexity Measures *
Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.
More informationUnsupervised Learning
Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and
More informationContent Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers
IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth
More informationLearning physical Models of Robots
Learnng physcal Models of Robots Jochen Mück Technsche Unverstät Darmstadt jochen.mueck@googlemal.com Abstract In robotcs good physcal models are needed to provde approprate moton control for dfferent
More informationLife Tables (Times) Summary. Sample StatFolio: lifetable times.sgp
Lfe Tables (Tmes) Summary... 1 Data Input... 2 Analyss Summary... 3 Survval Functon... 5 Log Survval Functon... 6 Cumulatve Hazard Functon... 7 Percentles... 7 Group Comparsons... 8 Summary The Lfe Tables
More informationCS 534: Computer Vision Model Fitting
CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust
More informationAn Optimal Algorithm for Prufer Codes *
J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,
More informationSimulation Based Analysis of FAST TCP using OMNET++
Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months
More informationApplying Continuous Action Reinforcement Learning Automata(CARLA) to Global Training of Hidden Markov Models
Applyng Contnuous Acton Renforcement Learnng Automata(CARLA to Global Tranng of Hdden Markov Models Jahanshah Kabudan, Mohammad Reza Meybod, and Mohammad Mehd Homayounpour Department of Computer Engneerng
More informationFEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur
FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents
More informationR s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes
SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges
More informationA Fast Content-Based Multimedia Retrieval Technique Using Compressed Data
A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,
More informationy and the total sum of
Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton
More informationReading. 14. Subdivision curves. Recommended:
eadng ecommended: Stollntz, Deose, and Salesn. Wavelets for Computer Graphcs: heory and Applcatons, 996, secton 6.-6., A.5. 4. Subdvson curves Note: there s an error n Stollntz, et al., secton A.5. Equaton
More informationVirtual Machine Migration based on Trust Measurement of Computer Node
Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on
More informationOptimizing Document Scoring for Query Retrieval
Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng
More information12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification
Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero
More informationA Simple and Efficient Goal Programming Model for Computing of Fuzzy Linear Regression Parameters with Considering Outliers
62626262621 Journal of Uncertan Systems Vol.5, No.1, pp.62-71, 211 Onlne at: www.us.org.u A Smple and Effcent Goal Programmng Model for Computng of Fuzzy Lnear Regresson Parameters wth Consderng Outlers
More informationWireless Sensor Network Localization Research
Sensors & Transducers 014 by IFSA Publshng, S L http://wwwsensorsportalcom Wreless Sensor Network Localzaton Research Lang Xn School of Informaton Scence and Engneerng, Hunan Internatonal Economcs Unversty,
More informationXV International PhD Workshop OWD 2013, October Machine Learning for the Efficient Control of a Multi-Wheeled Mobile Robot
XV Internatonal PhD Workshop OWD 203, 9 22 October 203 Machne Learnng for the Effcent Control of a Mult-Wheeled Moble Robot Uladzmr Dzomn, Brest State Techncal Unversty (prof. Vladmr Golovko, Brest State
More informationProgramming in Fortran 90 : 2017/2018
Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values
More informationGA-Based Learning Algorithms to Identify Fuzzy Rules for Fuzzy Neural Networks
Seventh Internatonal Conference on Intellgent Systems Desgn and Applcatons GA-Based Learnng Algorthms to Identfy Fuzzy Rules for Fuzzy Neural Networks K Almejall, K Dahal, Member IEEE, and A Hossan, Member
More informationBehavioral Model Extraction of Search Engines Used in an Intelligent Meta Search Engine
Behavoral Model Extracton of Search Engnes Used n an Intellgent Meta Search Engne AVEH AVOUSI Computer Department, Azad Unversty, Garmsar Branch BEHZAD MOSHIRI Electrcal and Computer department, Faculty
More informationSequential search. Building Java Programs Chapter 13. Sequential search. Sequential search
Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to
More informationThe Research of Support Vector Machine in Agricultural Data Classification
The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou
More informationIMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH
IMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH Jyot Joglekar a, *, Shrsh S. Gedam b a CSRE, IIT Bombay, Doctoral Student, Mumba, Inda jyotj@tb.ac.n b Centre of Studes n Resources Engneerng,
More informationPerformance Evaluation of an ANFIS Based Power System Stabilizer Applied in Multi-Machine Power Systems
Performance Evaluaton of an ANFIS Based Power System Stablzer Appled n Mult-Machne Power Systems A. A GHARAVEISI 1,2 A.DARABI 3 M. MONADI 4 A. KHAJEH-ZADEH 5 M. RASHIDI-NEJAD 1,2,5 1. Shahd Bahonar Unversty
More informationUsing Fuzzy Logic to Enhance the Large Size Remote Sensing Images
Internatonal Journal of Informaton and Electroncs Engneerng Vol. 5 No. 6 November 015 Usng Fuzzy Logc to Enhance the Large Sze Remote Sensng Images Trung Nguyen Tu Huy Ngo Hoang and Thoa Vu Van Abstract
More informationFitting: Deformable contours April 26 th, 2018
4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.
More informationAn Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation
17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed
More informationSimulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010
Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement
More informationDesign of Structure Optimization with APDL
Desgn of Structure Optmzaton wth APDL Yanyun School of Cvl Engneerng and Archtecture, East Chna Jaotong Unversty Nanchang 330013 Chna Abstract In ths paper, the desgn process of structure optmzaton wth
More informationMachine Learning: Algorithms and Applications
14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of
More information6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour
6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the
More informationOverview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION
Overvew 2 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Introducton Mult- Smulator MASIM Theoretcal Work and Smulaton Results Concluson Jay Wagenpfel, Adran Trachte Motvaton and Tasks Basc Setup
More informationTsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance
Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for
More informationHelsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)
Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS46: Mnng Massve Datasets Jure Leskovec, Stanford Unversty http://cs46.stanford.edu /19/013 Jure Leskovec, Stanford CS46: Mnng Massve Datasets, http://cs46.stanford.edu Perceptron: y = sgn( x Ho to fnd
More informationVISUAL SELECTION OF SURFACE FEATURES DURING THEIR GEOMETRIC SIMULATION WITH THE HELP OF COMPUTER TECHNOLOGIES
UbCC 2011, Volume 6, 5002981-x manuscrpts OPEN ACCES UbCC Journal ISSN 1992-8424 www.ubcc.org VISUAL SELECTION OF SURFACE FEATURES DURING THEIR GEOMETRIC SIMULATION WITH THE HELP OF COMPUTER TECHNOLOGIES
More informationMathematics 256 a course in differential equations for engineering students
Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the
More informationComparison of traveltime inversions on a limestone structure
Comparson of traveltme nversons on a lmestone structure Comparson of traveltme nversons on a lmestone structure Matthew D. Allen and Robert R. Stewart ABSRAC Four traveltme nverson technques were appled
More informationMOTION BLUR ESTIMATION AT CORNERS
Gacomo Boracch and Vncenzo Caglot Dpartmento d Elettronca e Informazone, Poltecnco d Mlano, Va Ponzo, 34/5-20133 MILANO boracch@elet.polm.t, caglot@elet.polm.t Keywords: Abstract: Pont Spread Functon Parameter
More informationImproving The Test Quality for Scan-based BIST Using A General Test Application Scheme
_ Improvng The Test Qualty for can-based BIT Usng A General Test Applcaton cheme Huan-Chh Tsa Kwang-Tng Cheng udpta Bhawmk Department of ECE Bell Laboratores Unversty of Calforna Lucent Technologes anta
More informationComplexity Analysis of Problem-Dimension Using PSO
Proceedngs of the 7th WSEAS Internatonal Conference on Evolutonary Computng, Cavtat, Croata, June -4, 6 (pp45-5) Complexty Analyss of Problem-Dmenson Usng PSO BUTHAINAH S. AL-KAZEMI AND SAMI J. HABIB,
More informationSteps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices
Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between
More informationEfficient Distributed File System (EDFS)
Effcent Dstrbuted Fle System (EDFS) (Sem-Centralzed) Debessay(Debsh) Fesehaye, Rahul Malk & Klara Naherstedt Unversty of Illnos-Urbana Champagn Contents Problem Statement, Related Work, EDFS Desgn Rate
More informationTerm Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task
Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto
More informationLecture #15 Lecture Notes
Lecture #15 Lecture Notes The ocean water column s very much a 3-D spatal entt and we need to represent that structure n an economcal way to deal wth t n calculatons. We wll dscuss one way to do so, emprcal
More informationStudy on Fuzzy Models of Wind Turbine Power Curve
Proceedngs of the 006 IASME/WSEAS Internatonal Conference on Energy & Envronmental Systems, Chalkda, Greece, May 8-0, 006 (pp-7) Study on Fuzzy Models of Wnd Turbne Power Curve SHU-CHEN WANG PEI-HWA HUANG
More informationCentroid Density of Interval Type-2 Fuzzy Sets: Comparing Stochastic and Deterministic Defuzzification
Centrod Densty of Interval Type-2 Fuzzy Sets: Comparng Stochastc and Determnstc Defuzzfcaton Ondrej Lnda, Mlos Manc Unversty of Idaho Idaho Falls, ID, US olnda@udaho.edu, msko@eee.org bstract Recently,
More informationLearning-Based Top-N Selection Query Evaluation over Relational Databases
Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **
More informationFusion Performance Model for Distributed Tracking and Classification
Fuson Performance Model for Dstrbuted rackng and Classfcaton K.C. Chang and Yng Song Dept. of SEOR, School of I&E George Mason Unversty FAIRFAX, VA kchang@gmu.edu Martn Lggns Verdan Systems Dvson, Inc.
More informationA Saturation Binary Neural Network for Crossbar Switching Problem
A Saturaton Bnary Neural Network for Crossbar Swtchng Problem Cu Zhang 1, L-Qng Zhao 2, and Rong-Long Wang 2 1 Department of Autocontrol, Laonng Insttute of Scence and Technology, Benx, Chna bxlkyzhangcu@163.com
More informationQuality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation
Intellgent Informaton Management, 013, 5, 191-195 Publshed Onlne November 013 (http://www.scrp.org/journal/m) http://dx.do.org/10.36/m.013.5601 Qualty Improvement Algorthm for Tetrahedral Mesh Based on
More informationSum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints
Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan
More informationAVO Modeling of Monochromatic Spherical Waves: Comparison to Band-Limited Waves
AVO Modelng of Monochromatc Sphercal Waves: Comparson to Band-Lmted Waves Charles Ursenbach* Unversty of Calgary, Calgary, AB, Canada ursenbach@crewes.org and Arnm Haase Unversty of Calgary, Calgary, AB,
More informationA DATA ANALYSIS CODE FOR MCNP MESH AND STANDARD TALLIES
Supercomputng n uclear Applcatons (M&C + SA 007) Monterey, Calforna, Aprl 15-19, 007, on CD-ROM, Amercan uclear Socety, LaGrange Par, IL (007) A DATA AALYSIS CODE FOR MCP MESH AD STADARD TALLIES Kenneth
More information