Acknowledgements. Outline

Similar documents
Heuristic Search for Planning

Principles of AI Planning. Principles of AI Planning. 7.1 How to obtain a heuristic. 7.2 Relaxed planning tasks. 7.1 How to obtain a heuristic

Search Algorithms for Planning

Fast Forward Planning. By J. Hoffmann and B. Nebel

CSC2542 Planning-Graph Techniques

Searching. Assume goal- or utilitybased. Next task to achieve is to determine the best path to the goal

Intelligent Agents. State-Space Planning. Ute Schmid. Cognitive Systems, Applied Computer Science, Bamberg University. last change: 14.

Principles of AI Planning. Principles of AI Planning. 8.1 Parallel plans. 8.2 Relaxed planning graphs. 8.3 Relaxation heuristics. 8.

CSC2542 SAT-Based Planning. Sheila McIlraith Department of Computer Science University of Toronto Summer 2014

Class Overview. Introduction to Artificial Intelligence COMP 3501 / COMP Lecture 2. Problem Solving Agents. Problem Solving Agents: Assumptions

CMU-Q Lecture 6: Planning Graph GRAPHPLAN. Teacher: Gianni A. Di Caro

Set 9: Planning Classical Planning Systems. ICS 271 Fall 2013

INTRODUCTION TO HEURISTIC SEARCH

Heuris'c Search. Reading note: Chapter 4 covers heuristic search.

Heuristic Search and Advanced Methods

Lectures 20, 21: Axiomatic Semantics

Informed search algorithms. Chapter 4

Vertex Cover Approximations

Planning, Execution & Learning 1. Heuristic Search Planning

Class Overview. Introduction to Artificial Intelligence COMP 3501 / COMP Lecture 2: Search. Problem Solving Agents

Informed Search A* Algorithm

Binary Decision Diagrams

Informed Search. Xiaojin Zhu Computer Sciences Department University of Wisconsin, Madison

Principles of AI Planning

EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS

Informed search. Soleymani. CE417: Introduction to Artificial Intelligence Sharif University of Technology Spring 2016

Lecture 3 - States and Searching

Artificial Intelligence

Foundations of Artificial Intelligence

Informed search strategies (Section ) Source: Fotolia

Today s s lecture. Lecture 3: Search - 2. Problem Solving by Search. Agent vs. Conventional AI View. Victor R. Lesser. CMPSCI 683 Fall 2004

Constraint Satisfaction Problems

Last time: Problem-Solving

P and NP (Millenium problem)

ARTIFICIAL INTELLIGENCE. Informed search

A* optimality proof, cycle checking

Heuristic Search. Heuristic Search. Heuristic Search. CSE 3401: Intro to AI & LP Informed Search

Foundations of Artificial Intelligence

4 INFORMED SEARCH AND EXPLORATION. 4.1 Heuristic Search Strategies

CS 416, Artificial Intelligence Midterm Examination Fall 2004

The LAMA Planner. Silvia Richter. Griffith University & NICTA, Australia. Joint work with Matthias Westphal Albert-Ludwigs-Universität Freiburg

11.1 Facility Location

Heuristic Search: A* CPSC 322 Search 4 January 19, Textbook 3.6 Taught by: Vasanth

Heuristic (Informed) Search

Outline. Informed Search. Recall: Uninformed Search. An Idea. Heuristics Informed search techniques More on heuristics Iterative improvement

Informed search algorithms. (Based on slides by Oren Etzioni, Stuart Russell)

Informed Search. Best-first search. Greedy best-first search. Intelligent Systems and HCI D7023E. Romania with step costs in km

Lecture 9. Heuristic search, continued. CS-424 Gregory Dudek

Outline. Best-first search

B2.1 Regression. Planning and Optimization. Planning and Optimization. B2.1 Regression. B2.2 Regression Example. B2.3 Regression for STRIPS Tasks

Lecture 2: NP-Completeness

Planning with Recursive Subgoals

36.1 Relaxed Planning Graphs

Classical Planning Problems: Representation Languages

CS:4420 Artificial Intelligence

Validating Plans with Durative Actions via Integrating Boolean and Numerical Constraints

1 Definition of Reduction

CPS 170: Artificial Intelligence Search

Examples of P vs NP: More Problems

Lecture 4: Search 3. Victor R. Lesser. CMPSCI 683 Fall 2010

Informed search algorithms. Chapter 3 (Based on Slides by Stuart Russell, Dan Klein, Richard Korf, Subbarao Kambhampati, and UW-AI faculty)

Set 9: Planning Classical Planning Systems. ICS 271 Fall 2014

Compiling Uncertainty Away in Non-Deterministic Conformant Planning

CS 4700: Foundations of Artificial Intelligence

CS 380: Artificial Intelligence Lecture #4

Computational problems. Lecture 2: Combinatorial search and optimisation problems. Computational problems. Examples. Example

Example: Map coloring

Informed Search. CS 486/686 University of Waterloo May 10. cs486/686 Lecture Slides 2005 (c) K. Larson and P. Poupart

CMPSCI611: The SUBSET-SUM Problem Lecture 18

CS 4700: Foundations of Artificial Intelligence

Informed (Heuristic) Search. Idea: be smart about what paths to try.

Searching with Partial Information

Informed search. Lirong Xia

Outline for today s lecture. Informed Search. Informed Search II. Review: Properties of greedy best-first search. Review: Greedy best-first search:

University of Nevada, Las Vegas Computer Science 456/656 Fall 2016

3 SOLVING PROBLEMS BY SEARCHING

Artificial Intelligence p.1/49. n-queens. Artificial Intelligence p.2/49. Initial state: the empty board or a board with n random

Outline. Best-first search

Informed search algorithms. (Based on slides by Oren Etzioni, Stuart Russell)

/633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Approximation algorithms Date: 11/27/18

Heuristic (Informed) Search

Automated Planning & Artificial Intelligence

Today. Informed Search. Graph Search. Heuristics Greedy Search A* Search

Heuristic Search. Robert Platt Northeastern University. Some images and slides are used from: 1. CS188 UC Berkeley 2. RN, AIMA

Reductions and Satisfiability

Chapter 10 Part 1: Reduction

CS510 \ Lecture Ariel Stolerman

Boolean Functions (Formulas) and Propositional Logic

! A* is best first search with f(n) = g(n) + h(n) ! Three changes make it an anytime algorithm:

CSE 473. Chapter 4 Informed Search. CSE AI Faculty. Last Time. Blind Search BFS UC-BFS DFS DLS Iterative Deepening Bidirectional Search

CSC384: Intro to Artificial Intelligence Search III

CS 188: Artificial Intelligence. Recap Search I

Informed search algorithms

Artificial Intelligence

Structure and Complexity in Planning with Unary Operators

1. [5 points each] True or False. If the question is currently open, write O or Open.

NP and computational intractability. Kleinberg and Tardos, chapter 8

CS 4700: Artificial Intelligence

Foundations of AI. 8. Satisfiability and Model Construction. Davis-Putnam, Phase Transitions, GSAT and GWSAT. Wolfram Burgard & Bernhard Nebel

Problem Solving: Informed Search

Transcription:

Acknowledgements Heuristic Search for Planning Sheila McIlraith University of Toronto Fall 2010 Many of the slides used in today s lecture are modifications of slides developed by Malte Helmert, Bernhard Nebel, and Jussi Rintanen. Some material comes from papers by Daniel Bryce and Rao Kambhampati. I would like to gratefully acknowledge the contributions of these researchers, a nd thank them for generously permitting me to use aspects of their presentation material. S. McIlraith Heuristic Search for Planning 1 / 50 S. McIlraith Heuristic Search for Planning 2 / 50 A simple heuristic for deterministic planning STRIPS (Fikes & Nilsson, 1971) used the number of state variables that differ in current state s and a STRIPS goal l 1 l n : h(s) := {i {1,..., n} s(a) = l i }. Intuition: more true goal literals closer to the goal STRIPS heuristic (properties?) Note: From now on, for convenience we usually write heuristics as functions of states (as above), not nodes. Node heuristic h is defined from state heuristic h as h (σ) := h(state(σ)). S. McIlraith Heuristic Search for Planning 3 / 50 S. McIlraith Heuristic Search for Planning 4 / 50 Criticism of the STRIPS heuristic What is wrong with the STRIPS heuristic? quite uninformative: the range of heuristic values in a given task is small; typically, most successors have the same estimate very sensitive to reformulation: can easily transform any planning task into an equivalent one where h(s) = 1 for all non-goal states ignores almost all problem structure: heuristic value does not depend on the set of operators! need a better, principled way of coming up with heuristics S. McIlraith Heuristic Search for Planning 5 / 50 S. McIlraith Heuristic Search for Planning 6 / 50

Coming up with heuristics in a principled way Relaxing a problem How do we relax a problem? General procedure for obtaining a heuristic Solve an easier version of the problem. Two common methods: relaxation: consider less constrained version of the problem abstraction: consider smaller version of real problem Both have been very successfully applied in planning. We consider both in this course, beginning with relaxation. (Route planning for a road network) The road network is formalized as a weighted graph over points in the Euclidean plane. The weight of an edge is the road distance between two locations. A relaxation drops constraints of the original problem. (Relaxation for route planning) Use the Euclidean distance x 1 y 1 2 + x 2 y 2 2 as a heuristic for the road distance between (x 1, x 2 ) and (y 1, y 2 ) This is a lower bound on the road distance ( admissible). We drop the constraint of having to travel on roads. S. McIlraith Heuristic Search for Planning 7 / 50 S. McIlraith Heuristic Search for Planning 8 / 50 200 km 200 km 270 km 150 km S. McIlraith Heuristic Search for Planning 9 / 50 S. McIlraith Heuristic Search for Planning 10 / 50 420 km 1 340 km 420 km 1 450 km 130 km S. McIlraith Heuristic Search for Planning 11 / 50 S. McIlraith Heuristic Search for Planning 12 / 50

440 km 460 km 450 km 130 km 450 km 130 km S. McIlraith Heuristic Search for Planning 13 / 50 S. McIlraith Heuristic Search for Planning 14 / 50 460 km 460 km 540 km S. McIlraith Heuristic Search for Planning 15 / 50 S. McIlraith Heuristic Search for Planning 16 / 50 Relaxations for planning Relaxation is a general technique for heuristic design: Straight-line heuristic (route planning): Ignore the fact that one must stay on roads. Manhattan heuristic (15-puzzle): Ignore the fact that one cannot move through occupied tiles. We want to apply the idea of relaxations to planning. Informally, we want to ignore bad side effects of applying operators. S. McIlraith Heuristic Search for Planning 17 / 50 S. McIlraith Heuristic Search for Planning 18 / 50

What is a good or bad effect? Question: Which operator effects are good, and which are bad? Difficult to answer in general, because it depends on context: Locking the entrance door is good if we want to keep burglars out. Locking the entrance door is bad if we want to enter. We will now consider a reformulation of planning tasks that makes the distinction between good and bad effects obvious. S. McIlraith Heuristic Search for Planning 19 / 50 S. McIlraith Heuristic Search for Planning 20 / 50 Notation Review Notation Review (cont.) The notation we use here is a generalization of the notation used in previous introductory lectures, which was based on the GNT textbook. Recall: An operator c, e is a STRIPS operator if 1 precondition c is a conjunction* of literals, and 2 effect e is a conjunction of atomic effects. *We previously used set rather than conjunction. Here we extend the expressiveness of our operator definition as follows: precondition c is an arbitrary propositional formula. (Deterministic) effect e is defined recursively as follows: 1 If a A is a state variable, then a and a are effects (atomic effects). 2 If e 1,..., e n are effects, then e 1 e n is an effect (conjunctive effects). The special case with n = 0 is the empty conjunction. 3 If c is a propositional formula and e is an effect, then c e is an effect (conditional effects). Atomic effects a and a are best understood as assignments a := 1 and a := 0, respectively. S. McIlraith Heuristic Search for Planning 21 / 50 S. McIlraith Heuristic Search for Planning 22 / 50 Positive normal form Positive normal form: existence (operators in positive normal form) An operator o = c, e is in positive normal form if it is in normal form, no negation symbols appear in c, and no negation symbols appear in any effect condition in e. (planning tasks in positive normal form) A planning task A, I, O, G is in positive normal form if all operators in O are in positive normal form and no negation symbols occur in the goal G. Theorem (positive normal form) Every planning task Π has an equivalent planning task Π in positive normal form. Moreover, Π can be computed from Π in polynomial time. Note: Equivalence here means that the represented transition systems of Π and Π, limited to the states that can be reached from the initial state, are isomorphic. We prove the theorem by describing a suitable algorithm. (However, we do not prove its correctness or complexity.) S. McIlraith Heuristic Search for Planning 23 / 50 S. McIlraith Heuristic Search for Planning 24 / 50

Positive normal form: algorithm Transformation of A, I, O, G to positive normal form Convert all operators o O to normal form. Convert all conditions* to negation normal form (NNF). while any condition contains a negative literal a: Let a be a variable which occurs negatively in a condition. A := A {â} for some new state variable â I(â) := 1 I(a) Replace the effect a by (a â) in all operators o O. Replace the effect a by ( a â) in all operators o O. Replace a by â in all conditions. Convert all operators o O to normal form (again). * Here, all conditions refers to all operator preconditions, operator effect conditions and the goal. S. McIlraith Heuristic Search for Planning 25 / 50 S. McIlraith Heuristic Search for Planning 26 / 50 (transformation to positive normal form) (transformation to positive normal form) A = {home, uni, lecture, bike, bike-locked} uni 0, lecture 0} bike bike-locked, bike-locked, bike bike-locked, bike-locked, A = {home, uni, lecture, bike, bike-locked} uni 0, lecture 0} bike bike-locked, bike-locked, bike bike-locked, bike-locked, Identify state variable a occurring negatively in conditions. S. McIlraith Heuristic Search for Planning 27 / 50 S. McIlraith Heuristic Search for Planning 28 / 50 (transformation to positive normal form) (transformation to positive normal form) bike bike-locked, bike-locked, bike bike-locked, bike-locked, Introduce new variable â with complementary initial value. bike bike-locked, bike-locked, bike bike-locked, bike-locked, Identify effects on variable a. S. McIlraith Heuristic Search for Planning 29 / 50 S. McIlraith Heuristic Search for Planning 30 / 50

(transformation to positive normal form) (transformation to positive normal form) bike bike-locked, bike-locked bike-unlocked, bike bike-locked, bike-locked bike-unlocked, Introduce complementary effects for â. bike bike-locked, bike-locked bike-unlocked, bike bike-locked, bike-locked bike-unlocked, Identify negative conditions for a. S. McIlraith Heuristic Search for Planning 31 / 50 S. McIlraith Heuristic Search for Planning 32 / 50 (transformation to positive normal form) (transformation to positive normal form) O = { home bike bike-unlocked, home uni, bike bike-locked, bike-locked bike-unlocked, bike bike-unlocked, bike-locked bike-unlocked, uni, lecture ((bike bike-unlocked) bike) } O = { home bike bike-unlocked, home uni, bike bike-locked, bike-locked bike-unlocked, bike bike-unlocked, bike-locked bike-unlocked, uni, lecture ((bike bike-unlocked) bike) } Replace by positive condition â. S. McIlraith Heuristic Search for Planning 33 / 50 S. McIlraith Heuristic Search for Planning 34 / 50 What does this transformation achieve? We have expanded the size of our domain by introducing new propositions to ensure that all the conditions that affect planning: preconditions conditions of conditional effects goals are expressed in terms of positive literals, and we ve adjusted the effects of operators to ensure that they are consistent with the introduction of these new propositions. S. McIlraith Heuristic Search for Planning 35 / 50 S. McIlraith Heuristic Search for Planning 36 / 50

Relaxed planning tasks: idea Relaxed planning tasks In positive normal form, good and bad effects are easy to distinguish: Effects that make state variables true are good (add effects). Effects that make state variables false are bad (delete effects). *** Idea for the heuristic: Ignore all delete effects. ** (relaxation of operators) The relaxation o + of an operator o = c, e in positive normal form is the operator which is obtained by replacing all negative effects a within e by the do-nothing effect. (relaxation of planning tasks) The relaxation Π + of a planning task Π = A, I, O, G in positive normal form is the planning task Π + := A, I, {o + o O}, G. (relaxation of operator sequences) The relaxation of an operator sequence π = o 1... o n is the operator sequence π + := o 1 +... o n +. S. McIlraith Heuristic Search for Planning 37 / 50 S. McIlraith Heuristic Search for Planning 38 / 50 Relaxed planning tasks: terminology Planning tasks in positive normal form without delete effects are called relaxed planning tasks. Plans for relaxed planning tasks are called relaxed plans. If Π is a planning task in positive normal form and π + is a plan for Π +, then π + is called a relaxed plan for Π. S. McIlraith Heuristic Search for Planning 39 / 50 S. McIlraith Heuristic Search for Planning 40 / 50 for relaxed planning tasks Correctness of the greedy algorithm The relaxed planning task can be solved in polynomial time using a simple greedy algorithm: Greedy planning algorithm for A, I, O +, G s := I π + := ǫ forever: if s = G: return π + else if there is an operator o + O + applicable in s with app o +(s) s: Append such an operator o + to π +. s := app o +(s) else: return unsolvable The algorithm is sound: If it returns a plan, this is indeed a correct solution. If it returns unsolvable, the task is indeed unsolvable What about completeness (termination) and runtime? Each iteration of the loop adds at least one atom to the set of true state variables in s. This guarantees termination after at most A iterations. Thus, the algorithm can clearly be implemented to run in polynomial time. S. McIlraith Heuristic Search for Planning 41 / 50 S. McIlraith Heuristic Search for Planning 42 / 50

Using the greedy algorithm as a heuristic We can apply the greedy algorithm within heuristic search: In a search node σ, solve the relaxation of the planning task with state(σ) as the initial state. Set h(σ) to the length of the generated relaxed plan. Is this an admissible heuristic? Yes if the relaxed plans are optimal (due to the plan preservation corollary). However, usually they are not, because our greedy planning algorithm is very poor. S. McIlraith Heuristic Search for Planning 43 / 50 S. McIlraith Heuristic Search for Planning 44 / 50 Generating an admissible heuristic is NP-hard To obtain an admissible heuristic, we need to generate an optimal relaxed plan. The problem of deciding whether a given relaxed planning task has a length at most K is NP-complete (through a reduction of part of the problem to the set cover problem). Thus, generating an optimal relaxed plan for the purposes of generating a heuristic (not even solving the problem!) is not a good strategy. S. McIlraith Heuristic Search for Planning 45 / 50 S. McIlraith Heuristic Search for Planning 46 / 50 Using relaxations in practice How can we use relaxations for heuristic planning in practice? Different possibilities: Implement an optimal planner for relaxed planning tasks and use its solution lengths as an estimate, even though it is NP-hard. h + heuristic Do not actually solve the relaxed planning task, but compute an estimate of its difficulty in a different way. h max heuristic, h add heuristic Compute a solution for relaxed planning tasks which is not necessarily optimal, but reasonable. h FF heuristic S. McIlraith Heuristic Search for Planning 47 / 50 S. McIlraith Heuristic Search for Planning 48 / 50

The Relaxed Plan Graph Heuristic and FF Why does the greedy algorithm compute low-quality plans? It may apply many operators which are not goal-directed. How can this problem be fixed? Reaching the goal of a relaxed planning task is most easily achieved with forward search. Analyzing relevance of an operator for achieving a goal (or subgoal) is most easily achieved with backward search. Idea: Use a forward-backward algorithm that first finds a path to the goal greedily, then prunes it to a relevant subplan. Does this sound similar to an algorithm we ve seen before? In the tutorial today you will learn about the Relaxed Plan Graph (RPG) heuristic and how it is used in one particular planner, Fast-Forward (FF) (Hoffmannn & Nebel, JAIR-01). Heuristic: Solve the relaxed planning problem using a planning graph approach. Search: Hill-climbing extended by breadth-first search on plateaus and with pruning Pruning: Only those successors are considered that are part of a relaxed solution i.e., the result of so-called helpful actions Fall-back strategy: Complete best-first search S. McIlraith Heuristic Search for Planning 49 / 50 S. McIlraith Heuristic Search for Planning 50 / 50