• Keine Ergebnisse gefunden

C1.1Heuristics C1.1HeuristicsC1.2ComingUpwithHeuristicsC1.3RelaxedPlanningTasksC1.4Summary PlanningandOptimization PlanningandOptimization ContentofthisCourse

N/A
N/A
Protected

Academic year: 2022

Aktie "C1.1Heuristics C1.1HeuristicsC1.2ComingUpwithHeuristicsC1.3RelaxedPlanningTasksC1.4Summary PlanningandOptimization PlanningandOptimization ContentofthisCourse"

Copied!
5
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Planning and Optimization

C1. Delete Relaxation: Relaxed Planning Tasks

Gabriele R¨oger and Thomas Keller

Universit¨at Basel

October 17, 2018

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 1 / 19

Planning and Optimization

October 17, 2018 — C1. Delete Relaxation: Relaxed Planning Tasks

C1.1 Heuristics

C1.2 Coming Up with Heuristics C1.3 Relaxed Planning Tasks C1.4 Summary

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 2 / 19

Content of this Course

Planning

Classical

Tasks Progression/

Regression Complexity Heuristics

Probabilistic

MDPs Uninformed Search

Heuristic Search Monte-Carlo

Methods

C1. Delete Relaxation: Relaxed Planning Tasks Heuristics

C1.1 Heuristics

(2)

C1. Delete Relaxation: Relaxed Planning Tasks Heuristics

Planning as Heuristic Search

I Heuristic searchis the most common approach to planning.

I ingredients: general search algorithm+heuristic

I heuristic estimates cost from a given state to a given goal

I progression: from varying statessto fixed goalγ

I regression: from fixed initial stateI to varying subgoalsϕ

I Over the next weeks, we study the main ideas behind heuristics for planning tasks.

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 5 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Heuristics

Reminder: Heuristics

Need to Catch Up?

I We assume familiarity with heuristics and their properties:

I heuristich:S R+0 ∪ {∞}

I perfect heuristich: h(s) cost of optimal solution froms (∞if unsolvable)

I properties of heuristicsh:

I safe: (h(s) =∞ ⇒h(s) =∞) for all statess

I goal-aware: h(s) = 0 for all goal statess

I admissible: h(s)h(s) for all statess

I consistent: h(s)cost(o) +h(s0) for all transitionsso s0

I connections between these properties

I If you are not familiar with these topics, we recommend Chapters 13–14 of theFoundations of Artificial Intelligence course athttps://dmi.unibas.ch/de/studium/

computer-science-informatik/fs18/

lecture-foundations-of-artificial-intelligence/

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 6 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Coming Up with Heuristics

C1.2 Coming Up with Heuristics

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 7 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Coming Up with Heuristics

A Simple Heuristic for Propositional Planning Tasks

STRIPS (Fikes & Nilsson, 1971) used the number of state variables that differ in current states and a STRIPS goalv1∧ · · · ∧vn:

h(s) :=|{i ∈ {1, . . . ,n} |s 6|=vi}|.

Intuition: more satisfied goal atoms closer to the goal STRIPS heuristic (a.k.a. goal-count heuristic)(properties?)

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 8 / 19

(3)

C1. Delete Relaxation: Relaxed Planning Tasks Coming Up with Heuristics

Criticism of the STRIPS Heuristic

What is wrong with the STRIPS heuristic?

I quiteuninformative:

the range of heuristic values in a given task is small;

typically, most successors have the same estimate

I very sensitive toreformulation:

can easily transform any planning task into an equivalent one where h(s) = 1 for all non-goal states (how?)

I ignores almost allproblem structure:

heuristic value does not depend on the set of operators!

need a better, principled way of coming up with heuristics

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 9 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Coming Up with Heuristics

Coming Up with Heuristics in a Principled Way

General Procedure for Obtaining a Heuristic

I Simplify the problem, for example by removing problem constraints.

I Solve the simplified problem (ideally optimally).

I Use the solution cost for the simplified problem as a heuristic for the real problem.

As heuristic values are computed for every generated search state, it is important that they can be computedefficiently.

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 10 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Coming Up with Heuristics

Relaxing a Problem: Example

Example (Route Planning in a Road Network)

The road network is formalized as a weighted graph over points in the Euclidean plane. The weight of an edge is the road distance between two locations.

Example (Relaxation for Route Planning) Use the Euclidean distance p

|x1−x2|2+|y1−y2|2

as a heuristic for the road distance between hx1,y1i andhx2,y2i This is a lower bound on the road distance ( admissible).

We drop the constraint of having to travel on roads.

C1. Delete Relaxation: Relaxed Planning Tasks Coming Up with Heuristics

Planning Heuristics: Main Concepts

Major ideas for heuristics in the planning literature:

I delete relaxation

I abstraction

I landmarks

I critical paths

I network flows

I potential heuristics

We will consider most of them in this course.

(4)

C1. Delete Relaxation: Relaxed Planning Tasks Relaxed Planning Tasks

C1.3 Relaxed Planning Tasks

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 13 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Relaxed Planning Tasks

Content of this Course: Heuristics

Heuristics

Delete Relaxation Relaxed Tasks Relaxed Task Graphs

Relaxation Heuristics Abstraction

Landmarks Potential Heuristics Cost Partitioning

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 14 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Relaxed Planning Tasks

Delete Relaxation: Idea

In positive normal form(Chapter A6, remember?), good and bad effects are easy to distinguish:

I Effects that make state variables true are good (add effects).

I Effects that make state variables false are bad (delete effects).

Idea of delete relaxation heuristics: ignore all delete effects.

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 15 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Relaxed Planning Tasks

Delete-Relaxed Planning Tasks

Definition (Delete Relaxation of Operators)

Thedelete relaxationo+ of an operator o in positive normal form is the operator obtained by replacing all negative effects ¬a within eff(o) by the do-nothing effect>.

Definition (Delete Relaxation of Propositional Planning Tasks) Thedelete relaxationΠ+ of a propositional planning task Π =hV,I,O, γi in positive normal form is the planning task Π+:=hV,I,{o+|o ∈O}, γi.

Definition (Delete Relaxation of Operator Sequences)

Thedelete relaxationof an operator sequence π=ho1, . . . ,oni is the operator sequence π+:=ho1+, . . . ,on+i.

Note: “delete” is often omitted: relaxation,relaxed

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 16 / 19

(5)

C1. Delete Relaxation: Relaxed Planning Tasks Relaxed Planning Tasks

Relaxed Planning Tasks: Terminology

I Planning tasks in positive normal form without delete effects are calledrelaxed planning tasks.

I Plans for relaxed planning tasks are calledrelaxed plans.

I If Π is a planning task in positive normal form andπ+ is a plan for Π+, thenπ+ is called arelaxed plan for Π.

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 17 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Summary

C1.4 Summary

G. R¨oger, T. Keller (Universit¨at Basel) Planning and Optimization October 17, 2018 18 / 19

C1. Delete Relaxation: Relaxed Planning Tasks Summary

Summary

I A general way to come up with heuristics:

solve asimplified version of the real problem, for example by removing problem constraints.

I delete relaxation: given a task in positive normal form, discard all delete effects

Referenzen

ÄHNLICHE DOKUMENTE

A7.1 Invariants A7.2 Mutexes A7.3 Reformulation A7.4 Summary.. Keller (Universit¨ at Basel) Planning and Optimization September 30, 2019 2

I Propositional planning tasks compactly represent transition systems and are suitable as inputs for planning algorithms. I They are based on concepts from propositional logic,

Propositional planning tasks compactly represent transition systems and are suitable as inputs for planning algorithms. They are based on concepts from propositional logic, enhanced

A6.2 Positive Normal Form A6.3 Example and Discussion A6.4 STRIPS..

Planning tasks in positive normal form without delete effects are called relaxed planning tasks. Plans for relaxed planning tasks are called

C3.1 Optimal Relaxed Plans C3.2 AND/OR Graphs C3.3 Forced Nodes?. C3.4 Most/Least Conservative Valuations

Therefore, we estimate the cost of an OR node as the minimum of the costs of its successors.. I To make an AND node true, all its successors must be made

I The AND/OR graph that is obtained by removing all nodes with infinite cost from this reduced graph is called the best achiever graph for h max /h add. I We write G max and G add