State Indexed Policy Search by Dynamic Programming. Abstract. 1. Introduction. 2. System parameterization. Charles DuHadway
|
|
- Alexia McCoy
- 5 years ago
- Views:
Transcription
1 State Inexe Policy Search by Dynamic Programming Charles DuHaway Yi Gu December 4, 2007 Abstract We consier the reinforcement learning problem of simultaneous trajectory-following an obstacle avoiance by a raio-controlle car. A space-inexe non-stationary controller policy class is chosen that is linear in the features set, where the multiplier of each feature in each controller is learne using the policy search by ynamic programming algorithm. Uner the control of the learne policies the raio-controlle car is shown to be capable of reliably following a preefine trajectory while avoiing point obstacles along its way.. Introuction We consier the reinforcement learning problem of trajectory-following involving iscrete ecisions. Casting the problem as a Markov ecision process (MDP), we have a tuple (S, A, {P sa }, γ, R) enoting, respectively, the set of states, the set of actions, the state transition probabilities, the iscount factor an the rewar function, the objective is to fin a policy set Π opt that is optimal with respect to some choice of R. In cases where the system ynamics, R an Π are linear or linearizable, Π opt can be reaily solve for e.g. by linear quaratic regulators (LQR) []. Our current work however focuses on the case where such linearity oes not hol, an therefore generalizes the class of trajectory-following problems to scenarios involving non-trivial environmental constraints. We use a raio-controlle (RC) car as a test platform, an introuce non-linearities to R an Π through the nee to make iscrete ecisions in trying to avoi a point obstacle along a pre-efine trajectory. We choose the policy search by ynamic programming (PSDP) algorithm as the means to fining Π opt. This is because given a base istribution (inicating roughly how often we expect a goo policy to visit each state) PSDP can efficiently compute Π opt in spite of non-linear R an Π [2], hence our trajectory-following problem fits naturally into this framework. In trajectory-following problems actions customize for local path characteristics are especially useful. Requisite to such exploitation however are: a) a non-stationary policy set i.e. Π changes as the car traverses the trajectory, b) an accurate corresponence between spatial position an the policy use. However, the accuracy in this corresponence is inevitably egrae by run-time uncertainties. Consequently we chose for the policies to be inexe over space instea of time increments, as uner such a scheme the corresponence woul be immune from the sai uncertainties. The goal of our work is to investigate the feasibility of employing PSDP in trajectory-following problems uner a space-inexe scheme. More concretely, we aim to fin a suitable parameterization of the system S, action space A, iscount mechanism, rewar function R an policy class Π, such that the RC car woul reliably follow a preefine trajectory in the absence of obstacles, an avoi obstacles with some margin in their presence. We will first present a space-inexe parameterization of the system, an then efine the policy class an feature set. This leas on to the algorithm s implementation, starting with a brief efinition of PSDP, followe by a section-by-section account of the choices mae for the various functions, parameters an other algorithmic components. The results an accompanying analysis are provie in the final part of the paper. 2. System parameterization Typically, a reasonable system parameterization for a vehicle coul be s [ ] T t = xt yt t ut vt ut vt where x t an y t are the Cartesian coorinates of the vehicle in the worl coorinate system (WCS), t the orientation of the car in the WCS, an u t an v t are the longituinal an lateral velocities in the vehicle coorinate system (VCS). All values are with respect to the center of the vehicle, which is assume to be a rectangle. The subscripts t an t- inicate the time instants to which the values correspon. This parameterization is illustrate graphically in Figure.
2 Figure. Timeinexe states Note that time is not inclue as a state as the mapping of controller to the time instant it is use makes this implicit. Uner the space-inexe scheme for a vehicle riving in R 2 however, each controller is mappe to a subspace in R which we will call a waypoint. We efine each waypoint to be a subspace orthogonal to the trajectory tangent at the point where the waypoint exists. This is illustrate in Figure 2. Figure 2. Space-inexe states To localize the vehicle along a waypoint, we efine l as the cross-track error between where the target trajectory intersects the waypoint an where the center of the vehicle intersects with the same waypoint. This gives a space-inexe system parameterization [ t l u v u v ] T s = where t enotes the time at space inex along the preefine target trajectory, while, u an v are efine similarly as before except they now enote state values at a space inex. Note that the position of the vehicle is implicitly but completely specifie. The simple bicycle moel s & = f ( s, a) is use to escribe the vehicle ynamics, where a A is an action. State propagation from inex to + is achieve by first computing t such that at t + =t + t the vehicle is precisely at waypoint +, then states at + are compute by propagating s forwar through f(s,a) by exactly t. For brevity, states capturing historic values of the steering angle use to moel the steering ynamics are not shown. 3. Policy class The ultimate performance of the algorithm epens critically on the policy class, which is efine by both its format an the features that constitutes it. As mentione previously, the class of policy employe is nonstationary i.e. (Θ)=( ( ),, D- ( D- )), where Θ=(,, D- ), so that we have a one-to-one mapping between each inex an stationary policy ( ). All ( ): S A come form the same policy class, an for simplicity we chose each ( ) to be linear in the feature set i.e. T = g s ( ) ( ) = [ ],,2,3 [ g g g g ] T,4 g( s ) = g is simply the cross-track error l, g 2 is the yaw of the car with respect to the target trajectory tangent at ; for example if the orientation of the vehicle at in WCS is 40 while the trajectory tangent at in WCS is 37, then g 2 =3. Together g an g 2 keep the vehicle close an parallel to the target trajectory. g 3 is the anticipate clearance surplus from the obstacle. For instance if l =.5 cm (.5 cm to the right of the trajectory looking in the irection of travel) as the car is approaching a point obstacle some istance ahea situate 5 cm to the left of the trajectory, an the esire clearance is 20 cm to the right of the obstacle, then g 3 =(.5+5)-20=-3.5 cm i.e. the car nees to be 3.5 cm further to the right to avoi colliing with the obstacle. As a conservative measure, g 3 s value is offset by 5 cm so that g 3 =-25 cm at -20 cm real clearance, increasing to -5 cm at 0 cm real clearance, beyon which its absolute value ecays exponentially while remaining negative. g 3 is intene to steer the car off the target trajectory away an from an obstacle. g 3 =0 when no obstacle is within the control horizon. We allow the car to reevaluate which sie to pass an obstacle on at each step, the rule being that if the obstacle is currently to the left of the car, then pass it on the right an vice versa, an g 3 is compute accoringly. g 4 is the obstacle conitione yaw. Its value is non-zero only when the car is approaching an obstacle an is steering in a irection that woul reuce the clearance. In such cases, g 4 is yaw of the trajectory tangent as measure in the VCS. This feature is inclue to preserve any clearance achieve via g 3, since an existing clearance woul ten to make g 3 small, an a controller without g 4 woul be ominate by g an g 2. This conclues the efinition of the policy class, an we note in particular that the policy class is clearly nonlinear an cannot be easily cast into a form suitable for a linear solver. T 2
3 4. Algorithm implementation 4. Definition We start by consiering a trajectory that spans D space inices an a policy class as efine in 3, we efine the value function V(s) D V ( s) E R( s ) s s ( ) t,..., D i ;,..., D D = i= R( s) + E V s s~ Ps ( s ) [ ( )] +,..., D where s ~ P s ( s) inicates that the expectation is with respect to s rawn from the state transition probability Ps (s). The PSDP algorithm takes the form: for = D-, D-2,,0 Set arg max E [ V ( )] = s ~ µ s, +,..., D Π where we assume we are given a sequence of base istributions µ over the states. PSDP is thus a policy iteration algorithm that yiels an optimal for every space inex. The ynamic programming part is evient in that the solution for µ starts from the en of the trajectory towars the beginning. 4.2 Initialization We now turn our attention to µ, an note that we o not know the set of base istributions (µ 0, µ,, µ D- ) over a pre-efine trajectory. However if we ha a controller that coul simultaneously track a target trajectory an avoi obstacles, then by running the RC car over the trajectory using that controller a large m number of times, we woul see a representative istribution of s at across all m trials. In this way we have effectively sample from µ without knowing the explicit form of µ ; the problem is that if we ha such a controller, we wouln t have neee to run PSDP to begin with. One way aroun this catch 22 is to start PSDP by sampling from a poor approximation of µ i.e.. Run the RC car over the target trajectory without obstacles m times using a suboptimal yet reasonable controller. A obstacles to each of the m trajectories. 2. Derive a set of policies using PSDP base on these m samples. 3. Run the RC car over m trajectories with obstacles using. 4. Iterate over steps 2 an 3 until converges, at which point we eem that we have been sampling from a goo approximation of (µ 0, µ,, µ D- ). In our implementation, the suboptimal controller is erive in two steps. First we compute a noiseless state 3 trajectory using a heuristically-tune PI controller over the target trajectory with no obstacles. The resulting state sequence is use as the starting point for fining a LQR controller using ifferential ynamic programming (DDP). We finally obtain m samples by using the LQR controller in step above. Graphically, the PSDP algorithm as efine in 4. an initialize in 4.2 appears as in Figure 3. Figure 3. PSDP algorithmic flow 4.3 Action space The control of an RC car involves steering an throttle. If we choose to rive the vehicle at constant velocity, then our action space is simplifie to comprise of only steering. The steering angle range of the RC car is a continuous interval from -28 (full lock left) to 28 (full lock right). In orer to keep the calculation of V(s) tractable however this interval is iscretize to 0 points: A={-28, -2.78,, 2.78, 28 }. 4.4 Transition probability Two mechanisms influence the transition from state s to s + viz. system ynamics an noise. The former is capture by f(s,a), while the noise is moele as zeromean Gaussian measurement noise. Consequently, our transition probability is 2 s ~ N s + f ( s, a ) t σ ( ) +, where σ 2 is chosen to be consistent with variance of state measurements obtaine from the RC car s Kalman filter. 4.5 Discount factor Recall that V(s) is the expecte sum of R(s) over istance, an terms for which γ h R(s +h- ) < ε for some small ε are ignore as they have negligible effect (those more than h- inices own the length of the trajectory from ). If γ is close to then the learne policies are anticipatory of the future an ten to be more globally optimal. However the closer γ is to the larger p becomes for a given ε, with a corresponing longer computation time. If γ is close to 0, the computation time woul be shorter, but
4 the policies woul also be less sensitive to future highcost consequences to current actions, making them only locally optimal. We foun that the geometrically ecaying weighting envelope γ imposes coul not yiel a satisfactory balance between global-optimality an computation time for any value between 0 an. Consequently we sacrifice envelope continuity an instea employe a rectangular weighting envelope. V(s) is thus the sum of a fixe number h of rewar functions each of which is weighte equally. 4.6 Rewar function The esire behaviors of the car inclue: a) close tracking of the target trajectory, b) reliable avoiance of obstacles, an c) use of steering actions that are realizable i.e. limite banwith actuation. Since it is more natural to treat these attributes in the negative e.g. we o not wish the car to be far from the target trajectory, R(s) will henceforth refer to a cost function instea of a rewar function. The problem remains unchange, only now we wish to minimize R(s) instea of maximizing it. From these a 4-component cost function was conceive: 2 2 bl + b2q& if no collision R( s ) = 2 c b q& + b b4 otherwise c where q& is erivative of steering angle, c is the istance between the center of the car an a point obstacle, an c is the esire minimum clearance to an obstacle. If c < c, then a collision is consiere to have occurre this effectively treats the car as a circular object an was a conscious an conservative choice. The term involving l penalizes eviation from the target path, the term involving q& penalizes rapi changes in steering, the constant b 4 penalizes a collision an the term involving c an c enhances the repulsion of the car away from the obstacle uring learning. Figure 4 is a visualization ai to R(s), which has iscrete regimes an is clearly non-linear. Note in particular the expecte parabolic appearance of the b contribution, the half cyliner an the slight half cone atop of it in b) attributable to the b 4 an b 3 terms respectively. The weighting parameters b to b 4 were chosen experimentally. 4 (a) No obstacle (b) Collision Figure 4. Cost function visualization Note that at this point, we have also fully specifie the MDP in the context of our trajectory following problem, having efine the tuple (S, A, {P sa }, γ, R). 4.7 Optimization Referring to Figure 3, in each iteration of PSDP we have m sample state trajectories each containing D points. For a DP backup at :. We start with s on the first sample trajectory, note the ranom number see, take action a A. 2. Compute s +,,s +h- (h as efine in 4.5), the s (the subscript V a corresponing costs R(s) an ( ) a inicates unique corresponence to action a ). 3. Re-see the ranom number generator with the value note in, take action a 2 A an repeat step Repeat steps 2 an 3 for actions a 3 to a Repeat steps to 4 for sample trajectories 2 to m. At the en of the 5 steps, we have a recor of how V(s) varies with action at inex for each of the m samples. We then use the coorinate escen algorithm with ynamic resolution to fin the that minimizes the expecte value i.e. we solve ( ) = arg min Es [ V ( s)] ( ) ( ) ~ µ, +,..., + h Π Θ To expeite the numeric search, the starting point is chosen to be the least squares estimate init =(B T B) - B T b where b is the vector of value-minimizing steering angle
5 specific to each trajectory an B is a matrix containing the feature values associate with each trajectory. This is base on the observation that the least squares solution of to iniviual steering actions that minimize iniviual V(s) is often close to the solution that minimize the aggregate. 5. Results an iscussion 5. Performance The learne controllers performe well in both simulation an the physical system. Example trajectories from both are shown in Figure 5. To quantitatively evaluate the performance of a learne controller we use both the cost function an obstacle collision rate. The controller shown in Figure 5 ha a collision rate of 3.3%. The target an actual trajectories are shown in blue an black respectively. Obstacles are shown as ouble re circles. The inner circle enotes the closest the physical car coul pass by the obstacle without collision if perfectly aligne (exactly tangent to the obstacle). The outer circle enotes the farthest the car coul be an still manage to hit the obstacle given worst case alignment (exactly orthogonal to the obstacle). Note the controller perform wells espite the obvious ifference in sampling rates between the actual an simulate systems. Figure 5. Simulate an physical trajectories Circular trajectories were use to test the actual car system ue to physical limitations in our tracking system. Similar performance was observe in simulation for both straight an oval trajectories an we expect our system to perform well over a large variety of trajectories. 5.2 Choosing a policy class an cost function Choosing the appropriate policy class an cost function was non-trivial an require a series of tests to choose between several alternatives. Figure 6 shows the results of one such test comparing two cost functions: one with an accurate, bouning box collision test an one with a conservatively inaccurate bouning circle collision test. In this case the latter s performance far exceee the former s. Collision rate (%) Iterations Bouning Circle Bouning Box Iterations Figure 6. Bouning box vs. circle collision test Bouning Circle Bouning Box Similar tests were use to choose the specific feature set an cost function escribe above. Among other things we learne that the stanar integral term was unnecessary for our learning task. We also foun it important to use a cost function with no flat areas. 5.3 Conclusions We have implemente a system that emonstrates the feasibility an effectiveness of employing PSDP in a state-inexe scheme to solve a reinforcement learning problem with a non-linear policy class an rewar function. Our approach takes full avantage of PSDP s strengths while eliminating its epenence on accurate space-time correlation. 5.4 Next Steps The obvious next step is a etaile comparison of stateinexe PSDP with time-inexe PSDP. We are confient that such a comparison will clearly emonstrate the superiority of state-base PSDP. This shoul be especially clear over long or non-uniform trajectories. Acknowlegments The authors wish to acknowlege valuable inputs from Anrew Ng an Aam Coates uring the course of this research project. References [] Abbeel P., Coates A., Quigley M., an Ng A. Y.. An Application of Reinforcement Learning to Aerobatic Helicopter Flight, in NIPS 9, 2007 [2] Bagnell J. A., Kakae S., Ng Y A., an Schneier J.. Policy search by ynamic programming, in NIPS 6,
Classifying Facial Expression with Radial Basis Function Networks, using Gradient Descent and K-means
Classifying Facial Expression with Raial Basis Function Networks, using Graient Descent an K-means Neil Allrin Department of Computer Science University of California, San Diego La Jolla, CA 9237 nallrin@cs.ucs.eu
More informationAlmost Disjunct Codes in Large Scale Multihop Wireless Network Media Access Control
Almost Disjunct Coes in Large Scale Multihop Wireless Network Meia Access Control D. Charles Engelhart Anan Sivasubramaniam Penn. State University University Park PA 682 engelhar,anan @cse.psu.eu Abstract
More informationTransient analysis of wave propagation in 3D soil by using the scaled boundary finite element method
Southern Cross University epublications@scu 23r Australasian Conference on the Mechanics of Structures an Materials 214 Transient analysis of wave propagation in 3D soil by using the scale bounary finite
More informationParticle Swarm Optimization Based on Smoothing Approach for Solving a Class of Bi-Level Multiobjective Programming Problem
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 17, No 3 Sofia 017 Print ISSN: 1311-970; Online ISSN: 1314-4081 DOI: 10.1515/cait-017-0030 Particle Swarm Optimization Base
More information6 Gradient Descent. 6.1 Functions
6 Graient Descent In this topic we will iscuss optimizing over general functions f. Typically the function is efine f : R! R; that is its omain is multi-imensional (in this case -imensional) an output
More informationDual Arm Robot Research Report
Dual Arm Robot Research Report Analytical Inverse Kinematics Solution for Moularize Dual-Arm Robot With offset at shouler an wrist Motivation an Abstract Generally, an inustrial manipulator such as PUMA
More informationBends, Jogs, And Wiggles for Railroad Tracks and Vehicle Guide Ways
Ben, Jogs, An Wiggles for Railroa Tracks an Vehicle Guie Ways Louis T. Klauer Jr., PhD, PE. Work Soft 833 Galer Dr. Newtown Square, PA 19073 lklauer@wsof.com Preprint, June 4, 00 Copyright 00 by Louis
More informationOnline Appendix to: Generalizing Database Forensics
Online Appenix to: Generalizing Database Forensics KYRIACOS E. PAVLOU an RICHARD T. SNODGRASS, University of Arizona This appenix presents a step-by-step iscussion of the forensic analysis protocol that
More informationSURVIVABLE IP OVER WDM: GUARANTEEEING MINIMUM NETWORK BANDWIDTH
SURVIVABLE IP OVER WDM: GUARANTEEEING MINIMUM NETWORK BANDWIDTH Galen H Sasaki Dept Elec Engg, U Hawaii 2540 Dole Street Honolul HI 96822 USA Ching-Fong Su Fuitsu Laboratories of America 595 Lawrence Expressway
More informationApprenticeship Learning for Reinforcement Learning. with application to RC helicopter flight Ritwik Anand, Nick Haliday, Audrey Huang
Apprenticeship Learning for Reinforcement Learning with application to RC helicopter flight Ritwik Anand, Nick Haliday, Audrey Huang Table of Contents Introduction Theory Autonomous helicopter control
More informationGeneralized Edge Coloring for Channel Assignment in Wireless Networks
Generalize Ege Coloring for Channel Assignment in Wireless Networks Chun-Chen Hsu Institute of Information Science Acaemia Sinica Taipei, Taiwan Da-wei Wang Jan-Jan Wu Institute of Information Science
More informationCharacterizing Decoding Robustness under Parametric Channel Uncertainty
Characterizing Decoing Robustness uner Parametric Channel Uncertainty Jay D. Wierer, Wahee U. Bajwa, Nigel Boston, an Robert D. Nowak Abstract This paper characterizes the robustness of ecoing uner parametric
More informationQuestions? Post on piazza, or Radhika (radhika at eecs.berkeley) or Sameer (sa at berkeley)!
EE122 Fall 2013 HW3 Instructions Recor your answers in a file calle hw3.pf. Make sure to write your name an SID at the top of your assignment. For each problem, clearly inicate your final answer, bol an
More informationBayesian localization microscopy reveals nanoscale podosome dynamics
Nature Methos Bayesian localization microscopy reveals nanoscale poosome ynamics Susan Cox, Ewar Rosten, James Monypenny, Tijana Jovanovic-Talisman, Dylan T Burnette, Jennifer Lippincott-Schwartz, Gareth
More informationMath 131. Implicit Differentiation Larson Section 2.5
Math 131. Implicit Differentiation Larson Section.5 So far we have ealt with ifferentiating explicitly efine functions, that is, we are given the expression efining the function, such as f(x) = 5 x. However,
More informationSolution Representation for Job Shop Scheduling Problems in Ant Colony Optimisation
Solution Representation for Job Shop Scheuling Problems in Ant Colony Optimisation James Montgomery, Carole Faya 2, an Sana Petrovic 2 Faculty of Information & Communication Technologies, Swinburne University
More informationClassical Mechanics Examples (Lagrange Multipliers)
Classical Mechanics Examples (Lagrange Multipliers) Dipan Kumar Ghosh Physics Department, Inian Institute of Technology Bombay Powai, Mumbai 400076 September 3, 015 1 Introuction We have seen that the
More informationMORA: a Movement-Based Routing Algorithm for Vehicle Ad Hoc Networks
: a Movement-Base Routing Algorithm for Vehicle A Hoc Networks Fabrizio Granelli, Senior Member, Giulia Boato, Member, an Dzmitry Kliazovich, Stuent Member Abstract Recent interest in car-to-car communications
More informationEstimating Velocity Fields on a Freeway from Low Resolution Video
Estimating Velocity Fiels on a Freeway from Low Resolution Vieo Young Cho Department of Statistics University of California, Berkeley Berkeley, CA 94720-3860 Email: young@stat.berkeley.eu John Rice Department
More informationFigure 1: Schematic of an SEM [source: ]
EECI Course: -9 May 1 by R. Sanfelice Hybri Control Systems Eelco van Horssen E.P.v.Horssen@tue.nl Project: Scanning Electron Microscopy Introuction In Scanning Electron Microscopy (SEM) a (bunle) beam
More informationPairwise alignment using shortest path algorithms, Gunnar Klau, November 29, 2005, 11:
airwise alignment using shortest path algorithms, Gunnar Klau, November 9,, : 3 3 airwise alignment using shortest path algorithms e will iscuss: it graph Dijkstra s algorithm algorithm (GDU) 3. References
More informationOffloading Cellular Traffic through Opportunistic Communications: Analysis and Optimization
1 Offloaing Cellular Traffic through Opportunistic Communications: Analysis an Optimization Vincenzo Sciancalepore, Domenico Giustiniano, Albert Banchs, Anreea Picu arxiv:1405.3548v1 [cs.ni] 14 May 24
More informationImproving Spatial Reuse of IEEE Based Ad Hoc Networks
mproving Spatial Reuse of EEE 82.11 Base A Hoc Networks Fengji Ye, Su Yi an Biplab Sikar ECSE Department, Rensselaer Polytechnic nstitute Troy, NY 1218 Abstract n this paper, we evaluate an suggest methos
More informationGeneralized Edge Coloring for Channel Assignment in Wireless Networks
TR-IIS-05-021 Generalize Ege Coloring for Channel Assignment in Wireless Networks Chun-Chen Hsu, Pangfeng Liu, Da-Wei Wang, Jan-Jan Wu December 2005 Technical Report No. TR-IIS-05-021 http://www.iis.sinica.eu.tw/lib/techreport/tr2005/tr05.html
More informationResearch Article Inviscid Uniform Shear Flow past a Smooth Concave Body
International Engineering Mathematics Volume 04, Article ID 46593, 7 pages http://x.oi.org/0.55/04/46593 Research Article Invisci Uniform Shear Flow past a Smooth Concave Boy Abullah Mura Department of
More informationParts Assembly by Throwing Manipulation with a One-Joint Arm
21 IEEE/RSJ International Conference on Intelligent Robots an Systems, Taipei, Taiwan, October, 21. Parts Assembly by Throwing Manipulation with a One-Joint Arm Hieyuki Miyashita, Tasuku Yamawaki an Masahito
More information4.2 Implicit Differentiation
6 Chapter 4 More Derivatives 4. Implicit Differentiation What ou will learn about... Implicitl Define Functions Lenses, Tangents, an Normal Lines Derivatives of Higher Orer Rational Powers of Differentiable
More informationCONSTRUCTION AND ANALYSIS OF INVERSIONS IN S 2 AND H 2. Arunima Ray. Final Paper, MATH 399. Spring 2008 ABSTRACT
CONSTUCTION AN ANALYSIS OF INVESIONS IN S AN H Arunima ay Final Paper, MATH 399 Spring 008 ASTACT The construction use to otain inversions in two-imensional Eucliean space was moifie an applie to otain
More informationNavigation Around an Unknown Obstacle for Autonomous Surface Vehicles Using a Forward-Facing Sonar
Navigation Aroun an nknown Obstacle for Autonomous Surface Vehicles sing a Forwar-Facing Sonar Patrick A. Plonski, Joshua Vaner Hook, Cheng Peng, Narges Noori, Volkan Isler Abstract A robotic boat is moving
More informationHere are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.
Preface Here are my online notes for my Calculus I course that I teach here at Lamar University. Despite the fact that these are my class notes, they shoul be accessible to anyone wanting to learn Calculus
More informationLearning convex bodies is hard
Learning convex boies is har Navin Goyal Microsoft Research Inia navingo@microsoftcom Luis Raemacher Georgia Tech lraemac@ccgatecheu Abstract We show that learning a convex boy in R, given ranom samples
More informationTechnical Report TR Navigation Around an Unknown Obstacle for Autonomous Surface Vehicles Using a Forward-Facing Sonar
Technical Report Department of Computer Science an Engineering niversity of Minnesota 4-192 Keller Hall 2 nion Street SE Minneapolis, MN 55455-159 SA TR 15-5 Navigation Aroun an nknown Obstacle for Autonomous
More informationTracking and Regulation Control of a Mobile Robot System With Kinematic Disturbances: A Variable Structure-Like Approach
W. E. Dixon e-mail: wixon@ces.clemson.eu D. M. Dawson e-mail: awson@ces.clemson.eu E. Zergeroglu e-mail: ezerger@ces.clemson.eu Department of Electrical & Computer Engineering, Clemson University, Clemson,
More informationAnyTraffic Labeled Routing
AnyTraffic Labele Routing Dimitri Papaimitriou 1, Pero Peroso 2, Davie Careglio 2 1 Alcatel-Lucent Bell, Antwerp, Belgium Email: imitri.papaimitriou@alcatel-lucent.com 2 Universitat Politècnica e Catalunya,
More informationDivide-and-Conquer Algorithms
Supplment to A Practical Guie to Data Structures an Algorithms Using Java Divie-an-Conquer Algorithms Sally A Golman an Kenneth J Golman Hanout Divie-an-conquer algorithms use the following three phases:
More informationGenetic Fuzzy Logic Control Technique for a Mobile Robot Tracking a Moving Target
.IJCSI.org 67 Genetic Fuzzy Logic Control Technique for a Mobile Robot Tracking a Moving Target Karim Benbouaballah an Zhu Qi-an College of Automation, Harbin Engineering University Harbin, 151, China
More informationA shortest path algorithm in multimodal networks: a case study with time varying costs
A shortest path algorithm in multimoal networks: a case stuy with time varying costs Daniela Ambrosino*, Anna Sciomachen* * Department of Economics an Quantitative Methos (DIEM), University of Genoa Via
More information1 Surprises in high dimensions
1 Surprises in high imensions Our intuition about space is base on two an three imensions an can often be misleaing in high imensions. It is instructive to analyze the shape an properties of some basic
More informationUsing Vector and Raster-Based Techniques in Categorical Map Generalization
Thir ICA Workshop on Progress in Automate Map Generalization, Ottawa, 12-14 August 1999 1 Using Vector an Raster-Base Techniques in Categorical Map Generalization Beat Peter an Robert Weibel Department
More informationFuzzy Learning Variable Admittance Control for Human-Robot Cooperation
Fuzzy Learning ariable Amittance Control for Human-Robot Cooperation Fotios Dimeas an Nikos Aspragathos Abstract This paper presents a metho for variable amittance control in human-robot cooperation tasks,
More informationSkyline Community Search in Multi-valued Networks
Syline Community Search in Multi-value Networs Rong-Hua Li Beijing Institute of Technology Beijing, China lironghuascut@gmail.com Jeffrey Xu Yu Chinese University of Hong Kong Hong Kong, China yu@se.cuh.eu.h
More informationA Convex Clustering-based Regularizer for Image Segmentation
Vision, Moeling, an Visualization (2015) D. Bommes, T. Ritschel an T. Schultz (Es.) A Convex Clustering-base Regularizer for Image Segmentation Benjamin Hell (TU Braunschweig), Marcus Magnor (TU Braunschweig)
More informationDistributed Planning in Hierarchical Factored MDPs
Distribute Planning in Hierarchical Factore MDPs Carlos Guestrin Computer Science Dept Stanfor University guestrin@cs.stanfor.eu Geoffrey Goron Computer Science Dept Carnegie Mellon University ggoron@cs.cmu.eu
More informationLearning Subproblem Complexities in Distributed Branch and Bound
Learning Subproblem Complexities in Distribute Branch an Boun Lars Otten Department of Computer Science University of California, Irvine lotten@ics.uci.eu Rina Dechter Department of Computer Science University
More informationNon-homogeneous Generalization in Privacy Preserving Data Publishing
Non-homogeneous Generalization in Privacy Preserving Data Publishing W. K. Wong, Nios Mamoulis an Davi W. Cheung Department of Computer Science, The University of Hong Kong Pofulam Roa, Hong Kong {wwong2,nios,cheung}@cs.hu.h
More informationIEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 31, NO. 4, APRIL
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 1, NO. 4, APRIL 01 74 Towar Efficient Distribute Algorithms for In-Network Binary Operator Tree Placement in Wireless Sensor Networks Zongqing Lu,
More informationAd-Hoc Networks Beyond Unit Disk Graphs
A-Hoc Networks Beyon Unit Disk Graphs Fabian Kuhn, Roger Wattenhofer, Aaron Zollinger Department of Computer Science ETH Zurich 8092 Zurich, Switzerlan {kuhn, wattenhofer, zollinger}@inf.ethz.ch ABSTRACT
More informationQueueing Model and Optimization of Packet Dropping in Real-Time Wireless Sensor Networks
Queueing Moel an Optimization of Packet Dropping in Real-Time Wireless Sensor Networks Marc Aoun, Antonios Argyriou, Philips Research, Einhoven, 66AE, The Netherlans Department of Computer an Communication
More informationThreshold Based Data Aggregation Algorithm To Detect Rainfall Induced Landslides
Threshol Base Data Aggregation Algorithm To Detect Rainfall Inuce Lanslies Maneesha V. Ramesh P. V. Ushakumari Department of Computer Science Department of Mathematics Amrita School of Engineering Amrita
More informationControlling formations of multiple mobile robots. Jaydev P. Desai Jim Ostrowski Vijay Kumar
Controlling formations of multiple mobile robots Jayev P. Desai Jim Ostrowski Vijay Kumar General Robotics an Active Sensory Perception (GRASP) Laboratory, University of Pennsylvania 340 Walnut Street,
More informationModule13:Interference-I Lecture 13: Interference-I
Moule3:Interference-I Lecture 3: Interference-I Consier a situation where we superpose two waves. Naively, we woul expect the intensity (energy ensity or flux) of the resultant to be the sum of the iniviual
More informationAll-to-all Broadcast for Vehicular Networks Based on Coded Slotted ALOHA
Preprint, August 5, 2018. 1 All-to-all Broacast for Vehicular Networks Base on Coe Slotte ALOHA Mikhail Ivanov, Frerik Brännström, Alexanre Graell i Amat, an Petar Popovski Department of Signals an Systems,
More informationFigure 1: 2D arm. Figure 2: 2D arm with labelled angles
2D Kinematics Consier a robotic arm. We can sen it commans like, move that joint so it bens at an angle θ. Once we ve set each joint, that s all well an goo. More interesting, though, is the question of
More informationOptimal Distributed P2P Streaming under Node Degree Bounds
Optimal Distribute P2P Streaming uner Noe Degree Bouns Shaoquan Zhang, Ziyu Shao, Minghua Chen, an Libin Jiang Department of Information Engineering, The Chinese University of Hong Kong Department of EECS,
More informationEDOVE: Energy and Depth Variance-Based Opportunistic Void Avoidance Scheme for Underwater Acoustic Sensor Networks
sensors Article EDOVE: Energy an Depth Variance-Base Opportunistic Voi Avoiance Scheme for Unerwater Acoustic Sensor Networks Safar Hussain Bouk 1, *, Sye Hassan Ahme 2, Kyung-Joon Park 1 an Yongsoon Eun
More informationFINDING OPTICAL DISPERSION OF A PRISM WITH APPLICATION OF MINIMUM DEVIATION ANGLE MEASUREMENT METHOD
Warsaw University of Technology Faculty of Physics Physics Laboratory I P Joanna Konwerska-Hrabowska 6 FINDING OPTICAL DISPERSION OF A PRISM WITH APPLICATION OF MINIMUM DEVIATION ANGLE MEASUREMENT METHOD.
More informationArchitecture Design of Mobile Access Coordinated Wireless Sensor Networks
Architecture Design of Mobile Access Coorinate Wireless Sensor Networks Mai Abelhakim 1 Leonar E. Lightfoot Jian Ren 1 Tongtong Li 1 1 Department of Electrical & Computer Engineering, Michigan State University,
More informationCoupling the User Interfaces of a Multiuser Program
Coupling the User Interfaces of a Multiuser Program PRASUN DEWAN University of North Carolina at Chapel Hill RAJIV CHOUDHARY Intel Corporation We have evelope a new moel for coupling the user-interfaces
More informationAdjacency Matrix Based Full-Text Indexing Models
1000-9825/2002/13(10)1933-10 2002 Journal of Software Vol.13, No.10 Ajacency Matrix Base Full-Text Inexing Moels ZHOU Shui-geng 1, HU Yun-fa 2, GUAN Ji-hong 3 1 (Department of Computer Science an Engineering,
More informationA Plane Tracker for AEC-automation Applications
A Plane Tracker for AEC-automation Applications Chen Feng *, an Vineet R. Kamat Department of Civil an Environmental Engineering, University of Michigan, Ann Arbor, USA * Corresponing author (cforrest@umich.eu)
More informationRobust Camera Calibration for an Autonomous Underwater Vehicle
obust Camera Calibration for an Autonomous Unerwater Vehicle Matthew Bryant, Davi Wettergreen *, Samer Aballah, Alexaner Zelinsky obotic Systems Laboratory Department of Engineering, FEIT Department of
More informationExercises of PIV. incomplete draft, version 0.0. October 2009
Exercises of PIV incomplete raft, version 0.0 October 2009 1 Images Images are signals efine in 2D or 3D omains. They can be vector value (e.g., color images), real (monocromatic images), complex or binary
More informationModifying ROC Curves to Incorporate Predicted Probabilities
Moifying ROC Curves to Incorporate Preicte Probabilities Cèsar Ferri DSIC, Universitat Politècnica e València Peter Flach Department of Computer Science, University of Bristol José Hernánez-Orallo DSIC,
More informationAutomation of Bird Front Half Deboning Procedure: Design and Analysis
Automation of Bir Front Half Deboning Proceure: Design an Analysis Debao Zhou, Jonathan Holmes, Wiley Holcombe, Kok-Meng Lee * an Gary McMurray Foo Processing echnology Division, AAS Laboratory, Georgia
More informationChapter 5 Proposed models for reconstituting/ adapting three stereoscopes
Chapter 5 Propose moels for reconstituting/ aapting three stereoscopes - 89 - 5. Propose moels for reconstituting/aapting three stereoscopes This chapter offers three contributions in the Stereoscopy area,
More informationMore Raster Line Issues. Bresenham Circles. Once More: 8-Pt Symmetry. Only 1 Octant Needed. Spring 2013 CS5600
Spring 03 Lecture Set 3 Bresenham Circles Intro to Computer Graphics From Rich Riesenfel Spring 03 More Raster Line Issues Fat lines with multiple pixel with Symmetric lines n point geometry how shoul
More informationOptimal path planning in a constant wind with a bounded turning rate
Optimal path planning in a constant win with a boune turning rate Timothy G. McGee, Stephen Spry an J. Karl Herick Center for Collaborative Control of Unmanne Vehicles, University of California, Berkeley,
More informationPreamble. Singly linked lists. Collaboration policy and academic integrity. Getting help
CS2110 Spring 2016 Assignment A. Linke Lists Due on the CMS by: See the CMS 1 Preamble Linke Lists This assignment begins our iscussions of structures. In this assignment, you will implement a structure
More informationIndexing the Edges A simple and yet efficient approach to high-dimensional indexing
Inexing the Eges A simple an yet efficient approach to high-imensional inexing Beng Chin Ooi Kian-Lee Tan Cui Yu Stephane Bressan Department of Computer Science National University of Singapore 3 Science
More informationShift-map Image Registration
Shift-map Image Registration Linus Svärm Petter Stranmark Centre for Mathematical Sciences, Lun University {linus,petter}@maths.lth.se Abstract Shift-map image processing is a new framework base on energy
More informationMessage Transport With The User Datagram Protocol
Message Transport With The User Datagram Protocol User Datagram Protocol (UDP) Use During startup For VoIP an some vieo applications Accounts for less than 10% of Internet traffic Blocke by some ISPs Computer
More informationIntensive Hypercube Communication: Prearranged Communication in Link-Bound Machines 1 2
This paper appears in J. of Parallel an Distribute Computing 10 (1990), pp. 167 181. Intensive Hypercube Communication: Prearrange Communication in Link-Boun Machines 1 2 Quentin F. Stout an Bruce Wagar
More informationA Discrete Search Method for Multi-modal Non-Rigid Image Registration
A Discrete Search Metho for Multi-moal on-rigi Image Registration Alexaner Shekhovtsov Juan D. García-Arteaga Tomáš Werner Center for Machine Perception, Dept. of Cybernetics Faculty of Electrical Engineering,
More informationOverlap Interval Partition Join
Overlap Interval Partition Join Anton Dignös Department of Computer Science University of Zürich, Switzerlan aignoes@ifi.uzh.ch Michael H. Böhlen Department of Computer Science University of Zürich, Switzerlan
More informationWLAN Indoor Positioning Based on Euclidean Distances and Fuzzy Logic
WLAN Inoor Positioning Base on Eucliean Distances an Fuzzy Logic Anreas TEUBER, Bern EISSFELLER Institute of Geoesy an Navigation, University FAF, Munich, Germany, e-mail: (anreas.teuber, bern.eissfeller)@unibw.e
More informationA FUZZY FRAMEWORK FOR SEGMENTATION, FEATURE MATCHING AND RETRIEVAL OF BRAIN MR IMAGES
A FUZZY FRAMEWORK FOR SEGMENTATION, FEATURE MATCHING AND RETRIEVAL OF BRAIN MR IMAGES Archana.S 1 an Srihar.S 2 1 Department of Information Science an Technology, College of Engineering, Guiny archana.santhira@gmail.com
More informationEvolutionary Optimisation Methods for Template Based Image Registration
Evolutionary Optimisation Methos for Template Base Image Registration Lukasz A Machowski, Tshilizi Marwala School of Electrical an Information Engineering University of Witwatersran, Johannesburg, South
More information(a) Find the equation of the plane that passes through the points P, Q, and R.
Math 040 Miterm Exam 1 Spring 014 S o l u t i o n s 1 For given points P (, 0, 1), Q(, 1, 0), R(3, 1, 0) an S(,, 0) (a) Fin the equation of the plane that passes through the points P, Q, an R P Q = 0,
More informationUsing Ray Tracing for Site-Specific Indoor Radio Signal Strength Analysis 1
Using Ray Tracing for Site-Specific Inoor Raio Signal Strength Analysis 1 Michael Ni, Stephen Mann, an Jay Black Computer Science Department, University of Waterloo, Waterloo, Ontario, NL G1, Canaa Abstract
More informationSection 20. Thin Prisms
OPTI-0/0 Geometrical an Instrumental Optics opyright 08 John E. Greivenkamp 0- Section 0 Thin Prisms Thin Prism Deviation Thin prisms introuce small angular beam eviations an are useful as alignment evices.
More informationApproximation with Active B-spline Curves and Surfaces
Approximation with Active B-spline Curves an Surfaces Helmut Pottmann, Stefan Leopolseer, Michael Hofer Institute of Geometry Vienna University of Technology Wiener Hauptstr. 8 10, Vienna, Austria pottmann,leopolseer,hofer
More informationProvisioning Virtualized Cloud Services in IP/MPLS-over-EON Networks
Provisioning Virtualize Clou Services in IP/MPLS-over-EON Networks Pan Yi an Byrav Ramamurthy Department of Computer Science an Engineering, University of Nebraska-Lincoln Lincoln, Nebraska 68588 USA Email:
More informationBIJECTIONS FOR PLANAR MAPS WITH BOUNDARIES
BIJECTIONS FOR PLANAR MAPS WITH BOUNDARIES OLIVIER BERNARDI AND ÉRIC FUSY Abstract. We present bijections for planar maps with bounaries. In particular, we obtain bijections for triangulations an quarangulations
More informationAn Investigation in the Use of Vehicle Reidentification for Deriving Travel Time and Travel Time Distributions
An Investigation in the Use of Vehicle Reientification for Deriving Travel Time an Travel Time Distributions Carlos Sun Department of Civil an Environmental Engineering, University of Missouri-Columbia,
More informationShift-map Image Registration
Shift-map Image Registration Svärm, Linus; Stranmark, Petter Unpublishe: 2010-01-01 Link to publication Citation for publishe version (APA): Svärm, L., & Stranmark, P. (2010). Shift-map Image Registration.
More informationDense Disparity Estimation in Ego-motion Reduced Search Space
Dense Disparity Estimation in Ego-motion Reuce Search Space Luka Fućek, Ivan Marković, Igor Cvišić, Ivan Petrović University of Zagreb, Faculty of Electrical Engineering an Computing, Croatia (e-mail:
More informationAn Adaptive Routing Algorithm for Communication Networks using Back Pressure Technique
International OPEN ACCESS Journal Of Moern Engineering Research (IJMER) An Aaptive Routing Algorithm for Communication Networks using Back Pressure Technique Khasimpeera Mohamme 1, K. Kalpana 2 1 M. Tech
More informationLesson 11 Interference of Light
Physics 30 Lesson 11 Interference of Light I. Light Wave or Particle? The fact that light carries energy is obvious to anyone who has focuse the sun's rays with a magnifying glass on a piece of paper an
More informationChalmers Publication Library
Chalmers Publication Library All-to-all Broacast for Vehicular Networks Base on Coe Slotte ALOHA This ocument has been ownloae from Chalmers Publication Library (CPL). It is the author s version of a work
More informationVerifying performance-based design objectives using assemblybased vulnerability
Verying performance-base esign objectives using assemblybase vulnerability K.A. Porter Calornia Institute of Technology, Pasaena, Calornia, USA A.S. Kiremijian Stanfor University, Stanfor, Calornia, USA
More informationQuestions? Post on piazza, or Radhika (radhika at eecs.berkeley) or Sameer (sa at berkeley)!
EE122 Fall 2013 HW3 Instructions Recor your answers in a file calle hw3.pf. Make sure to write your name an SID at the top of your assignment. For each problem, clearly inicate your final answer, bol an
More informationLab work #8. Congestion control
TEORÍA DE REDES DE TELECOMUNICACIONES Grao en Ingeniería Telemática Grao en Ingeniería en Sistemas e Telecomunicación Curso 2015-2016 Lab work #8. Congestion control (1 session) Author: Pablo Pavón Mariño
More informationComparison of Methods for Increasing the Performance of a DUA Computation
Comparison of Methos for Increasing the Performance of a DUA Computation Michael Behrisch, Daniel Krajzewicz, Peter Wagner an Yun-Pang Wang Institute of Transportation Systems, German Aerospace Center,
More informationCluster Center Initialization Method for K-means Algorithm Over Data Sets with Two Clusters
Available online at www.scienceirect.com Proceia Engineering 4 (011 ) 34 38 011 International Conference on Avances in Engineering Cluster Center Initialization Metho for K-means Algorithm Over Data Sets
More informationMultilevel Linear Dimensionality Reduction using Hypergraphs for Data Analysis
Multilevel Linear Dimensionality Reuction using Hypergraphs for Data Analysis Haw-ren Fang Department of Computer Science an Engineering University of Minnesota; Minneapolis, MN 55455 hrfang@csumneu ABSTRACT
More informationOn the Placement of Internet Taps in Wireless Neighborhood Networks
1 On the Placement of Internet Taps in Wireless Neighborhoo Networks Lili Qiu, Ranveer Chanra, Kamal Jain, Mohamma Mahian Abstract Recently there has emerge a novel application of wireless technology that
More informationIN the last few years, we have seen an increasing interest in
55 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL., NO. 9, SEPTEMBER 009 Calibration of Cameras with Raially Symmetric Distortion Jean-Philippe Tarif, Peter Sturm, Member, IEEE Computer
More informationDesign of Policy-Aware Differentially Private Algorithms
Design of Policy-Aware Differentially Private Algorithms Samuel Haney Due University Durham, NC, USA shaney@cs.ue.eu Ashwin Machanavajjhala Due University Durham, NC, USA ashwin@cs.ue.eu Bolin Ding Microsoft
More informationAn Algorithm for Building an Enterprise Network Topology Using Widespread Data Sources
An Algorithm for Builing an Enterprise Network Topology Using Wiesprea Data Sources Anton Anreev, Iurii Bogoiavlenskii Petrozavosk State University Petrozavosk, Russia {anreev, ybgv}@cs.petrsu.ru Abstract
More informationBlind Data Classification using Hyper-Dimensional Convex Polytopes
Blin Data Classification using Hyper-Dimensional Convex Polytopes Brent T. McBrie an Gilbert L. Peterson Department of Electrical an Computer Engineering Air Force Institute of Technology 9 Hobson Way
More information