• Keine Ergebnisse gefunden

Planning and Optimization E1. Landmarks: MHS & Uniform Cost Partitioning Heuristic Gabriele R¨oger and Thomas Keller

N/A
N/A
Protected

Academic year: 2022

Aktie "Planning and Optimization E1. Landmarks: MHS & Uniform Cost Partitioning Heuristic Gabriele R¨oger and Thomas Keller"

Copied!
27
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

E1. Landmarks: MHS & Uniform Cost Partitioning Heuristic

Gabriele R¨oger and Thomas Keller

Universit¨at Basel

November 12, 2018

(2)

Landmarks

(3)

Content of this Course

Planning

Classical

Tasks Progression/

Regression Complexity Heuristics

Probabilistic

MDPs Uninformed Search

Heuristic Search Monte-Carlo

Methods

(4)

Content of this Course: Heuristics

Heuristics

Delete Relaxation Abstraction

Landmarks

Action Landmarks MHS Heuristic

Uniform Cost Partitioning Cut Landmarks LM-Cut Heuristic Network Flows

Potential Heuristics Cost Partitioning

(5)

Landmarks

Basic Idea: Something that must happen in every solution For example

some operator must be applied some atom must be true some formula must be true

→Derive heuristic estimate from this kind of information.

We will only consider disjunctive action landmarks.

(6)

Landmarks

Basic Idea: Something that must happen in every solution For example

some operator must be applied some atom must be true some formula must be true

→Derive heuristic estimate from this kind of information.

We will only considerdisjunctive action landmarks.

(7)

Disjunctive Action Landmarks

Definition (Disjunctive Action Landmark)

Lets be a state of planning task Π =hV,I,O, γi.

Adisjunctive action landmarkfor s is a set of operatorsL⊆O such that every plan fors (= label path froms to a goal state) contains an operator fromL.

Thecostof landmark Lis cost(L) = mino∈Lcost(o).

(8)

Example Task

Two trucks, one airplane

Airplane can fly between locations A3 and B1

Trucks can drive arbitrarily between locations A1, A2, and A3 Package to be transported from A1 to B1

Operators

Load(v, `)andUnload(v, `)for vehiclev and location` Drive(t, `, `0)for truckt and locations`, `0

Fly(`, `0)for locations`, `0

A1 A2

A3 B1

t1

t2

f

(9)

Example: Disjunctive Action Landmarks

L1 ={Load(Truck1,A1),Load(Truck2,A1)} and L2 ={Fly(B1, A3)} are disjunctive action landmarks.

A1 A2

A3 B1

t1

t2

f

What other disjunctive action landmarks are there?

(10)

Remarks

Not every landmark is informative.

For example: If the initial state is not already a goal state then the set of all operators is a disjunctive action landmark.

Deciding whether a given operator set is a disjunctive action landmark is as hard as the plan existence problem.

(11)

Exploiting Disjunctive Action Landmarks

How can we exploit a given setL of disjunctive action landmarks?

Sum of costsP

L∈Lcost(L)?

not admissible!

Maximize costs maxL∈Lcost(L)?

usually very weak heuristic

better: hitting setsor cost partitioning

(12)

Minimum Hitting Set Heuristic

(13)

Content of this Course: Heuristics

Heuristics

Delete Relaxation Abstraction

Landmarks

Action Landmarks MHS Heuristic

Uniform Cost Partitioning Cut Landmarks LM-Cut Heuristic Network Flows

Potential Heuristics Cost Partitioning

(14)

Hitting Sets

Definition (Hitting Set)

LetX be a set,F ={F1, . . . ,Fn} ⊆2X be a family of subsets of X andc :X →R+0 be a cost function for X.

Ahitting setis a subsetH⊆X that “hits” all subsets in F, i.e., H∩F 6=∅ for allF ∈ F. The costofH isP

x∈Hc(x).

Aminimum hitting set (MHS)is a hitting set with minimal cost.

MHS is a “classical” NP-complete problem (Karp, 1972)

(15)

Example: Hitting Sets

Example

X ={o1,o2,o3,o4}

F={{o4},{o1,o2},{o1,o3},{o2,o3}}

c(o1) = 3, c(o2) = 4, c(o3) = 5, c(o4) = 0

minimum hitting set: {o1,o2,o4}with cost 3 + 4 + 0 = 7

(16)

Example: Hitting Sets

Example

X ={o1,o2,o3,o4}

F={{o4},{o1,o2},{o1,o3},{o2,o3}}

c(o1) = 3, c(o2) = 4, c(o3) = 5, c(o4) = 0

minimum hitting set: {o1,o2,o4}with cost 3 + 4 + 0 = 7

(17)

Hitting Sets for Disjunctive Action Landmarks

Idea: disjunctive action landmarks are interpreted as Idea: instance of minimum hitting set

Definition (Hitting Set Heuristic)

LetL be a set of disjunctive action landmarks. The hitting set heuristichMHS(L) is defined as the cost of a minimum hitting set forL with c(o) =cost(o).

Proposition (Hitting Set Heuristic is Admissible)

LetL be a set of disjunctive action landmarks for state s.

Then hMHS(L)is an admissible estimate for s.

Why?

(18)

Hitting Set Heuristic: Discussion

The hitting set heuristic is the best possibleheuristic that only uses the given information. . .

. . . but is NP-hard to compute.

Use approximations that can be efficiently computed.

Now: uniform cost partitioning

Later in the course: optimal cost partitioning

(19)

Uniform Cost Partitioning

(20)

Content of this Course: Heuristics

Heuristics

Delete Relaxation Abstraction

Landmarks

Action Landmarks MHS Heuristic

Uniform Cost Partitioning Cut Landmarks LM-Cut Heuristic Network Flows

Potential Heuristics Cost Partitioning

(21)

Uniform Cost Partitioning (1)

Idea: Distribute cost of operators uniformly among the landmarks.

Definition (Uniform Cost Partitioning Heuristic for Landmarks) LetL be a set of disjunctive action landmarks.

Theuniform cost partitioning heuristic hUCP(L) is defined as hUCP(L) =X

L∈L

min

o∈Lc0(o) with c0(o) =cost(o)/|{L∈ L |o ∈L}|.

(22)

Landmarks Minimum Hitting Set Heuristic Uniform Cost Partitioning Summary

Uniform Cost Partitioning (2)

Theorem (Uniform Cost Partitioning Heuristic is Admissible) LetL be a set of disjunctive action landmarks for state s ofΠ.

Then hUCP(L) is an admissibleheuristic estimate for s.

Letπ =ho1, . . . ,oni be an optimal plan fors. ForL∈ Ldefine a new cost functioncostL ascostL(o) =c0(o) ifo ∈L and

costL(o) = 0 otherwise. Let ΠL be a modified version of Π, where for all operatorso the cost is replaced withcostL(o). We make three independent observations:

1 For L∈ L the valuecost0(L) := mino∈Lc0(o) is an admissible estimate for s in ΠL.

2 π is also a plan fors in ΠL, so hΠ

L(s)≤Pn

i=1costL(oi).

3 P

L∈LcostL(o) =cost(o) for each operatoro.

. . .

(23)

Uniform Cost Partitioning (2)

Theorem (Uniform Cost Partitioning Heuristic is Admissible) LetL be a set of disjunctive action landmarks for state s ofΠ.

Then hUCP(L) is an admissibleheuristic estimate for s.

Proof.

Letπ=ho1, . . . ,oni be an optimal plan fors. ForL∈ Ldefine a new cost functioncostL ascostL(o) =c0(o) ifo ∈L and

costL(o) = 0 otherwise. Let ΠL be a modified version of Π, where for all operatorso the cost is replaced withcostL(o). We make three independent observations:

1 For L∈ Lthe value cost0(L) := mino∈Lc0(o) is an admissible estimate for s in ΠL.

2 π is also a plan fors in ΠL, so hΠ

L(s)≤Pn

i=1costL(oi).

3 P

L∈LcostL(o) =cost(o) for each operator o.

. . .

(24)

Uniform Cost Partitioning (3)

Proof (continued).

Together, this leads to the following inequality (subscripts indicate for which task the heuristic is computed):

hΠUCP(L) =X

L∈L

cost0(L)

(1)

≤ X

L∈L

hΠ

L(s)

(2)

≤ X

L∈L n

X

i=1

costL(oi) =

n

X

i=1

X

L∈L

costL(oi)

(3)=

n

X

i=1

cost(o) =hΠ(s)

(25)

Relationship

Theorem

LetL be a set of disjunctive action landmarks for state s.

ThenhUCP(L)≤hMHS(L)≤h(s).

(Proof omitted.)

(26)

Summary

(27)

Summary

Disjunctive action landmark: setL of operators such that every plan uses some operator from L

The costof the landmark is the cost of its cheapest operator.

Hitting setsyield the most accurate heuristic for a given set of disjunctive action landmarks, but the computation is NP-hard.

Uniform cost partitioningis a polynomial approach for the computation of informative heuristics from

disjunctive action landmarks.

Referenzen

ÄHNLICHE DOKUMENTE

Same algorithm can be used for disjunctive action landmarks, where we also have a minimal saturated cost function. Definition (MSCF for Disjunctive

For abstraction heuristics and disjunctive action landmarks, we know how to determine an optimal cost partitioning, using linear programming. Although solving a linear program

domain-independent planning = general problem solving classical planning = the “easy case”. (deterministic, fully observable etc.) still

The merge steps combine two abstract transition systems by replacing them with their synchronized product. The shrink steps make an abstract system smaller by abstracting

I Hitting sets over all cut landmarks yield a perfect heuristic for delete-free planning tasks. I The LM-cut heuristic is an admissible heuristic based on

Theorem (Uniform Cost Partitioning Heuristic is Admissible) Let L be a set of disjunctive action landmarks for state s of ΠG. Then h UCP (L) is an admissible heuristic estimate

Performs Bellman backup in each encountered state Uses admissible heuristic for states not updated before.. Motivation Asynchronous VI RTDP

Initialization: expand decision node and all its chance node successors, then initialize all ˆ V k with admissible heuristic Backup function: Bellman backups & solved