• Keine Ergebnisse gefunden

Planning and Optimization C1. Delete Relaxation: Relaxed Planning Tasks Gabriele R¨oger and Thomas Keller

N/A
N/A
Protected

Academic year: 2022

Aktie "Planning and Optimization C1. Delete Relaxation: Relaxed Planning Tasks Gabriele R¨oger and Thomas Keller"

Copied!
19
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

C1. Delete Relaxation: Relaxed Planning Tasks

Gabriele R¨oger and Thomas Keller

Universit¨at Basel

October 17, 2018

(2)

Content of this Course

Planning

Classical

Tasks Progression/

Regression Complexity Heuristics

Probabilistic

MDPs Uninformed Search

Heuristic Search Monte-Carlo

Methods

(3)

Heuristics

(4)

Planning as Heuristic Search

Heuristic search is the most common approach to planning.

ingredients: general search algorithm+ heuristic

heuristic estimates cost from a given state to a given goal progression: from varying states sto fixed goalγ

regression: from fixed initial stateI to varying subgoalsϕ Over the next weeks, we study the main ideas

behind heuristics for planning tasks.

(5)

Reminder: Heuristics

Need to Catch Up?

We assume familiarity with heuristics and their properties:

heuristich:S R+0 ∪ {∞}

perfect heuristich: h(s) cost of optimal solution from s (∞if unsolvable)

properties of heuristicsh:

safe: (h(s) =∞ ⇒h(s) =∞) for all statess goal-aware:h(s) = 0 for all goal statess admissible: h(s)h(s) for all statess

consistent: h(s)cost(o) +h(s0) for all transitionsso s0 connections between these properties

If you are not familiar with these topics, we recommend Chapters 13–14 of the Foundations of Artificial Intelligence course at https://dmi.unibas.ch/de/studium/

computer-science-informatik/fs18/

lecture-foundations-of-artificial-intelligence/

(6)

Coming Up with Heuristics

(7)

A Simple Heuristic for Propositional Planning Tasks

STRIPS (Fikes & Nilsson, 1971) used the number of state variables that differ in current states and a STRIPS goalv1∧ · · · ∧vn:

h(s) :=|{i ∈ {1, . . . ,n} |s 6|=vi}|.

Intuition: more satisfied goal atoms closer to the goal STRIPS heuristic (a.k.a. goal-count heuristic) (properties?)

(8)

Criticism of the STRIPS Heuristic

What is wrong with the STRIPS heuristic?

quite uninformative:

the range of heuristic values in a given task is small;

typically, most successors have the same estimate very sensitive toreformulation:

can easily transform any planning task into an equivalent one whereh(s) = 1 for all non-goal states (how?)

ignores almost all problem structure:

heuristic value does not depend on the set of operators!

need a better, principled way of coming up with heuristics

(9)

Coming Up with Heuristics in a Principled Way

General Procedure for Obtaining a Heuristic

Simplify the problem, for example by removing problem constraints.

Solve the simplified problem (ideally optimally).

Use the solution cost for the simplified problem as a heuristic for the real problem.

As heuristic values are computed for every generated search state, it is important that they can be computedefficiently.

(10)

Relaxing a Problem: Example

Example (Route Planning in a Road Network)

The road network is formalized as a weighted graph over points in the Euclidean plane. The weight of an edge is theroad distance between two locations.

Example (Relaxation for Route Planning) Use theEuclidean distance p

|x1−x2|2+|y1−y2|2

as a heuristic for the road distance betweenhx1,y1i andhx2,y2i This isa lower bound on the road distance ( admissible).

We drop the constraint of having to travel on roads.

(11)

Heuristics Coming Up with Heuristics Relaxed Planning Tasks Summary

Planning Heuristics: Main Concepts

Major ideas for heuristics in the planning literature:

delete relaxation abstraction landmarks critical paths network flows potential heuristics

(12)

Planning Heuristics: Main Concepts

Major ideas for heuristics in the planning literature:

delete relaxation abstraction landmarks critical paths network flows potential heuristics

We will consider most of them in this course.

(13)

Relaxed Planning Tasks

(14)

Content of this Course: Heuristics

Heuristics

Delete Relaxation Relaxed Tasks Relaxed Task Graphs

Relaxation Heuristics Abstraction

Landmarks Potential Heuristics Cost Partitioning

(15)

Delete Relaxation: Idea

Inpositive normal form(Chapter A6, remember?), good and bad effects are easy to distinguish:

Effects that make state variables true are good (add effects).

Effects that make state variables false are bad (delete effects).

Idea ofdelete relaxation heuristics: ignore all delete effects.

(16)

Delete-Relaxed Planning Tasks

Definition (Delete Relaxation of Operators)

Thedelete relaxationo+ of an operator o in positive normal form is the operator obtained by replacing all negative effects¬a withineff(o) by the do-nothing effect >.

Definition (Delete Relaxation of Propositional Planning Tasks) Thedelete relaxationΠ+ of a propositional planning task Π =hV,I,O, γi in positive normal form is the planning task Π+:=hV,I,{o+|o ∈O}, γi.

Definition (Delete Relaxation of Operator Sequences)

Thedelete relaxationof an operator sequenceπ =ho1, . . . ,oni is the operator sequenceπ+:=ho1+, . . . ,on+i.

Note: “delete” is often omitted: relaxation,relaxed

(17)

Relaxed Planning Tasks: Terminology

Planning tasks in positive normal form without delete effects are called relaxed planning tasks.

Plans for relaxed planning tasks are called relaxed plans.

If Π is a planning task in positive normal form and π+ is a plan for Π+, then π+ is called arelaxed plan for Π.

(18)

Summary

(19)

Summary

A general way to come up with heuristics:

solve a simplified version of the real problem, for example by removing problem constraints.

delete relaxation: given a task in positive normal form, discard all delete effects

Referenzen

ÄHNLICHE DOKUMENTE

Definition 2 (Logical Formula over Finite-Domain State Variables).. Let V be a set of finite- domain

The last missing piece is a definition of regression through operators, describing exactly in which states s applying a given operator o leads to a state satisfying a given formula

decision problem analogue of satisficing planning Definition (Bounded-Cost Plan Existence). The bounded-cost plan existence problem ( BCPlanEx ) is the following

Delete relaxation is a simplification in the sense that it is never harder to solve a relaxed task than the original one. Delete-relaxed tasks have a

C3.1 Optimal Relaxed Plans C3.2 AND/OR Graphs C3.3 Forced Nodes?. C3.4 Most/Least Conservative Valuations

If n is a node in an AND/OR graph that is forced true, then this can be derived by a sequence of applications of Rule T-(∧) and Rule

Therefore, we estimate the cost of an OR node as the minimum of the costs of its successors.. I To make an AND node true, all its successors must be made

I The AND/OR graph that is obtained by removing all nodes with infinite cost from this reduced graph is called the best achiever graph for h max /h add. I We write G max and G add