Rapid Simultaneous Learning of Multiple Behaviours with a Mobile Robot

Similar documents
A Predictive Controller for Object Tracking of a Mobile Robot

Learning Fuzzy Rules by Evolution for Mobile Agent Control

Prediction-Based Path Planning with Obstacle Avoidance in Dynamic Target Environment

Collision Avoidance with Unity3d

AUTONOMOUS ROBOT NAVIGATION BASED ON FUZZY LOGIC AND REINFORCEMENT LEARNING

Elastic Bands: Connecting Path Planning and Control

CANAL FOLLOWING USING AR DRONE IN SIMULATION

Adaptive Robotics - Final Report Extending Q-Learning to Infinite Spaces

Waypoint Navigation with Position and Heading Control using Complex Vector Fields for an Ackermann Steering Autonomous Vehicle

Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method

MOTION MODELLING USING CONCEPTS OF FUZZY ARTIFICIAL POTENTIAL FIELDS

Hybrid Genetic-Fuzzy Approach to Autonomous Mobile Robot

Safe Prediction-Based Local Path Planning using Obstacle Probability Sections

AUTONOMOUS mobile robots are those robots that have

A Simulation Based Comparative Study of Normalization Procedures in Multiattribute Decision Making

Non-Homogeneous Swarms vs. MDP s A Comparison of Path Finding Under Uncertainty

JOURNAL OF COMPUTERS, VOL. 4, NO. 4, APRIL

A Fuzzy Local Path Planning and Obstacle Avoidance for Mobile Robots

1 Lab 5: Particle Swarm Optimization

Motion Control in Dynamic Multi-Robot Environments

Fuzzy Hierarchical Control for Autonomous Vehicles

Graphs, Search, Pathfinding (behavior involving where to go) Steering, Flocking, Formations (behavior involving how to go)

Research Article Path Planning Using a Hybrid Evolutionary Algorithm Based on Tree Structure Encoding

Kinesthetic Teaching via Fast Marching Square

1 Lab + Hwk 5: Particle Swarm Optimization

EFFICIENT CLUSTERING WITH FUZZY ANTS

Adaptive Speed Control for Autonomous Mobile Robot using Fuzzy Logic Controller

Trajectory Modification Using Elastic Force for Collision Avoidance of a Mobile Manipulator

Fuzzy Logic Based Collision Avoidance For a Mobile Robot

COMPARISON OF ROBOT NAVIGATION METHODS USING PERFORMANCE METRICS

Autonomous Mobile Robots, Chapter 6 Planning and Navigation Where am I going? How do I get there? Localization. Cognition. Real World Environment

Design of Obstacle Avoidance System for Mobile Robot using Fuzzy Logic Systems

Techniques. IDSIA, Istituto Dalle Molle di Studi sull'intelligenza Articiale. Phone: Fax:

Neuro-Fuzzy Inverse Forward Models

Safe Navigation of Mobile Robot Using A* Algorithm

SEMI-ACTIVE CONTROL OF BUILDING STRUCTURES USING A NEURO-FUZZY CONTROLLER WITH ACCELERATION FEEDBACK

ECM A Novel On-line, Evolving Clustering Method and Its Applications

Mobile Robots: An Introduction.

Self-learning Mobile Robot Navigation in Unknown Environment Using Evolutionary Learning

1 Lab + Hwk 5: Particle Swarm Optimization

Traffic Signal Control Based On Fuzzy Artificial Neural Networks With Particle Swarm Optimization

Autonomous Navigation in Unknown Environments via Language Grounding

A new Eulerian computational method for the propagation of short acoustic and electromagnetic pulses

Relation Organization of SOM Initial Map by Improved Node Exchange

Online Simultaneous Localization and Mapping in Dynamic Environments

12 - More Steering. from Milligan, "Artificial Intelligence for Games", Morgan Kaufman, 2006

Particle Systems. Typical Time Step. Particle Generation. Controlling Groups of Objects: Particle Systems. Flocks and Schools

Kinematic Control Algorithms for On-Line Obstacle Avoidance for Redundant Manipulators

Dynamic Memory Allocation for CMAC using Binary Search Trees

Novel Methods for the Design of General Type-2 fuzzy Sets based on Device Characteristics and Linguistic Labels Surveys

Path Planning for a Robot Manipulator based on Probabilistic Roadmap and Reinforcement Learning

CONCLUSION ACKNOWLEDGMENTS REFERENCES

Fuzzy Logic Control of Autonomous Vehicles for Parallel Parking Maneuver

CONTROL ALGORITHM OP THE WALKER CLIMBING OVER OBSTACLES. D.E. Okhotsimski, A.K, Platonov U S S R

Planning for Optimal Robot Trajectory for Visual Inspection

Object Conveyance Algorithm for Multiple Mobile Robots based on Object Shape and Size

A method for depth-based hand tracing

Developing a Tracking Algorithm for Underwater ROV Using Fuzzy Logic Controller

Intelligent Risk Identification and Analysis in IT Network Systems

A nodal based evolutionary structural optimisation algorithm

Building Reliable 2D Maps from 3D Features

Robot Path Planning Method Based on Improved Genetic Algorithm

Lecture 14: Path Planning II. Autonomous Robots, Carnegie Mellon University Qatar

Topological Modeling with Fuzzy Petri Nets for Autonomous Mobile Robots

A New Neuro-Fuzzy Adaptive Genetic Algorithm

Learning Fuzzy Cognitive Maps by a Hybrid Method Using Nonlinear Hebbian Learning and Extended Great Deluge Algorithm

Robot Motion Planning Using Generalised Voronoi Diagrams

Vision Guided AGV Using Distance Transform

Multi- Robot Systems. Ioannis Rekleitis

Mobile Sink to Track Multiple Targets in Wireless Visual Sensor Networks

Collision Avoidance Method for Mobile Robot Considering Motion and Personal Spaces of Evacuees

Statistical Techniques in Robotics (16-831, F10) Lecture #02 (Thursday, August 28) Bayes Filtering

Detecting motion by means of 2D and 3D information

TASK ALLOCATION IN A MULTIPROCESSOR SYSTEM USING FUZZY LOGIC

crrt : Planning loosely-coupled motions for multiple mobile robots

A development of PSD sensor system for navigation and map building in the indoor environment

Fuzzy Logic Controller

Uniform Motion Lab. The position equation for an object moving with a constant velocity is:

S-SHAPED ONE TRAIL PARALLEL PARKING OF A CAR-LIKE MOBILE ROBOT

Final Exam Practice Fall Semester, 2012

AC : ADAPTIVE ROBOT MANIPULATORS IN GLOBAL TECHNOLOGY

Design and Implementation of a Real-Time Autonomous Navigation System Applied to Lego Robots

Research and application of volleyball target tracking algorithm based on surf corner detection

AN IMPROVED K-MEANS CLUSTERING ALGORITHM FOR IMAGE SEGMENTATION

SVG GRAPHICS LANGUAGE AS A DESCRIPTION OF A 2D PATH IN ROBOT PROGRAMMING TASKS

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

Collision-free Path Planning Based on Clustering

Instant Prediction for Reactive Motions with Planning

Trust4All: a Trustworthy Middleware Platform for Component Software

Torque-Position Transformer for Task Control of Position Controlled Robots

MOTION TRAJECTORY PLANNING AND SIMULATION OF 6- DOF MANIPULATOR ARM ROBOT

Joe Buckner & Nir Chezrony

Mobile Robots Path Planning using Genetic Algorithms

MOVING OBJECT DETECTION USING BACKGROUND SUBTRACTION ALGORITHM USING SIMULINK

Everything You Always Wanted To Know About Programming Behaviors But Were Afraid To Ask

Dynamic Sensor-based Path Planning and Hostile Target Detection with Mobile Ground Robots. Matt Epperson Dr. Timothy Chung

Kinematics of Wheeled Robots

Reference Variables Generation Using a Fuzzy Trajectory Controller for PM Tubular Linear Synchronous Motor Drive

Optimal Trajectory Generation for Nonholonomic Robots in Dynamic Environments

Clustering Color/Intensity. Group together pixels of similar color/intensity.

Transcription:

Rapid Simultaneous Learning of Multiple Behaviours with a Mobile Robot Koren Ward School of Information Technology and Computer Science University of Wollongong koren@uow.edu.au www.uow.edu.au/~koren

Abstract 2 Figure 1 Yamabico Mobile Robot Although various robot behaviour learning methods have been available for some time [1] [2] they generally are too slow to be of much practical use. The following slides provide a brief introduction to a novel robot behaviour learning method called Trajectory Velocity Learning (TVL) [3]. The advantage of TVL is that it enables a mobile robot equipped with range sensing devices to automatically acquire multiple adjustable behaviours quickly and simultaneously.

Trajectory Velocity Learning 3 Figure 2 Learning a map between sensors and appropriate trajectory velocities. Trajectory Velocity Learning involves learning a map between the robot's input sensors and appropriate velocities for negotiating the robot's predefined output trajectory commands as shown in Fig. 2. Figure 3 The robot's predefined trajectory commands Each output trajectory command has a predefined radius and maximum velocity for moving in free space. Although various combinations of output trajectory commands can be used, the 7 shown in Fig. 3 have been shown to be adequate for medium sized robots while providing relatively smooth motion when commands are switched

Defining Appropriate Trajectory Velocities 4 Figure 4 Defining appropriate velocities for negotiating the robot's trajectory commands. Before implementing a robot that learns appropriate trajectory velocities, it is necessary to firstly define what appropriate trajectory velocities are. Ideally, a robot following a trajectory that is on a collision course with an object, should slow down and stop just before coming into contact with the object. Also, the deceleration rate should be low enough (i.e. around -0.5m/s 2 ) so as to prevent abrupt changes to the robot's motion. Therefore, if a robot is to appropriately set its velocity during each time step, it should set its velocity according to its predefined deceleration rate and the collision distance with the obstacle so that the robot becomes stationary just before a collision occurs, as shown in Fig 4(a). Alternatively, if the engaged trajectory is not on a collision course with an obstacle, as shown in Fig 4(b), the robot should set its velocity at the trajectory's predefined maximum velocity defined in Fig.3.

Perceiving Appropriate Trajectory Velocities from Sensor Data 5 Figure 5 Appropriate Trajectory Velocities Unfortunately, due to the highly specular nature of sonar signals, trajectory collision points are very difficult to calculate directly from sonar data, particularly within unstructured environments. However, it is relatively easy for the robot to learn associations between sensor data and appropriate trajectory velocities with an associative map (as explained on page 10). By having a learnt associative map that can indicate appropriate trajectory velocities directly from sensor data, as shown in Fig. 5, the robot can not only appropriately control its velocity but can perform a variety of behaviours by making trajectory choices based on this information.

Object Avoidance Behaviour 6 Figure 6 Object avoidance behaviour on a TVL robot. By being able to perceive appropriate trajectory velocities directly from sensor data, obstacle avoidance behaviour can be performed by giving the robot a single instruction to follow fast 1 trajectories nearest to the forward direction, as shown in Fig. 6. This occurs simply because trajectories which lead into free space are perceived (from the learnt map) as having faster appropriate velocities than trajectories that collide with nearby objects. 1 A "fast" trajectory is one that is perceived as having an appropriate trajectory velocity that is a high proportion of its maximum velocity.

Wall Following Behaviour 7 Figure 7 Wall following behaviour on a TVL robot. Similarly, wall following behaviour can also be produced by providing the robot with a single instruction to follow fast trajectories nearest to the closest object. This will cause the robot to follow walls in the direction nearest to the direction it is facing. Alternatively, left wall following behaviour can be produced by instructing the robot to follow fast trajectories nearest to the right of the closest object. Likewise, right wall following behaviour can be produced by instructing the robot to follow fast trajectories nearest to the left of the closest object, as shown in Fig.7.

Goal Seeking Behaviour 8 Figure 8 Goal seeking behaviour on a TVL robot. Goal seeking behaviour is also possible by providing the robot with an instruction to follow fast trajectories toward the perceived goal location, as shown in Fig. 8. Fortunately, this also produces an implied obstacle avoidance capability without the need to switch behaviours because any convex object encountered between the robot and the goal will cause the direct trajectory to be perceived to be slower than those which lead around the obstacle. However, if the robot encounters a deep crevice simply following a fast trajectory toward the goal may not escape the crevice. To escape deep local minimums the robot could attempt to follow walls in both directions for increasing periods of time.

Adjusting the Object Clearance Distances 9 Figure 9 Simulation showing how object clearance distances can be controlled with a velocity threshold. By using a variable velocity threshold 2 to determine if perceived trajectory velocities are considered to be fast or slow, the robot's wall clearance distance can be adjusted. Lowering the velocity threshold results in walls being followed more closely and cautiously at lower speed as shown in Fig. 9 (a) & (c). Conversely, raising the threshold causes the robot to maintain larger object clearances and results in the robot moving faster and more competently through the environment when performing its behaviours as shown in Fig. 9 (b) & (d). 2 The velocity threshold determines if a perceived trajectory velocity is above a specified proportion of its maximum velocity.

TVL Mobile Robot Controller 10 Figure 10 A TVL controller for producing multiple adjustable mobile robot behaviours simultaneously. Figure 10 shows a schematic of a typical TVL controller which summarizes how a mobile robot can become capable of appropriately controlling its velocity and performing a variety of adjustable behaviours by making simple choices based on learnt appropriate trajectory velocities. But how are appropriate trajectory velocities learnt?.....

Learning Appropriate Trajectory Velocities 11 Figure 11 Learning appropriate trajectory velocities by locating trajectory collision points. One way appropriate trajectory velocities can be learnt is by randomly selecting trajectories and slowly following each until a collision or full circle occurs, as shown in Fig 11. When a collision occurs, a preset constant deceleration rate (eg. -0.5m/s 2 ) is used to calculate appropriate trajectory velocities back along the path that lead up to the collision point. These calculated velocities are then associated with the sensor data that occurred at each time step leading up to the collision point. The resulting training patterns are then used to train the associative map using compositional rule inference [5]. If a full circle occurs, the trajectory's maximum velocity is used to associate with the sensor data for training the map.

12 Learning Trajectory Velocities with Fuzzy Associative Memories Figure 12 Learning trajectory velocities with 7 FAMs Fuzzy Associative Maps (FAMs) [4] are very suitable for learning associations between sensor data and trajectory velocities because they can be incrementally trained via compositional rule inference [5] and their output is generalized by taking the weighted average over a small neighborhood of map entries. To learn appropriate velocities associated with all 7 trajectories, 7 FAMs are used, as shown in Fig. 12.

TVL FAMs 13 Sonar Sensors Fuzzy Membership Functions VN N M F VF Very Near Near Medium Far Very Far Figure 13 Inputs and membership functions of FAMs To reduce the input search space, each FAM's inputs are connected only to a sector of sensors that are adjacent to each FAM's respective trajectory. Fig. 13 shows how the sensors and fuzzy membership functions of the FAM matrices belonging to trajectories T0 through to T3L were allocated.

14 Conclusion Figure 14 Wall following behaviour exhibited by the Yamabico robot after 45 minutes of learning within an unstructured laboratory environment. When unstructured environments are interpreted via sonar sensors, like the lab shown in Fig. 14 (a), range readings can be difficult to predict. This is because a sonar signal will only be reflected back to a sonar sensor if the object's surface is almost normal to the sensor's beam. Consequently, objects such as tables, chairs, bookcases, cables, etc. can interrupt, delay or scatter sonar signals. Therefore hardwiring behaviours into mobile robots can be difficult and unreliable within unstructured environments. However, by learning to perceive the environment in terms of appropriate trajectory velocities a mobile robot can acquire a variety of adjustable behaviours quickly and simultaneously in addition to appropriate velocity control, as shown in Fig. 14 (b).

References 15 [1] L. P Kaelbling. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research, Vol.4, pages 237-285, May 1996. [2] M. J. Mataric and D Cliff, Challenges in Evolving Controllers for Physical Robots, Robotics and Autonomous Systems, Vol.19, No.1, pages 67-83, 1996. [3] K. Ward and A. Zelinsky. Acquiring mobile robot behaviours by learning trajectory velocities. Autonomous Robots Journal, Vol.9, No.2, pages 113-133, September, 2000. [4] B. Kosko. Neural Networks and Fuzzy Systems, Englewood Cliffs, NJ, Prentice Hall, Inc. 1992. [5] T. Sudkamp and R, J, Hammell.. Interpolation, Completion and Learning Fuzzy Rules, IEEE Transactions on Systems, Man and Cybernetics, Vol. 24 No. 2 pages 332-342, February 1994.