• Keine Ergebnisse gefunden

Planning and Optimization E3. Landmarks: LM-Cut Heuristic Malte Helmert and Gabriele R¨oger

N/A
N/A
Protected

Academic year: 2022

Aktie "Planning and Optimization E3. Landmarks: LM-Cut Heuristic Malte Helmert and Gabriele R¨oger"

Copied!
7
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Planning and Optimization

E3. Landmarks: LM-Cut Heuristic

Malte Helmert and Gabriele R¨ oger

Universit¨ at Basel

November 18, 2020

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 1 / 28

Planning and Optimization

November 18, 2020 — E3. Landmarks: LM-Cut Heuristic

E3.1 i-g Form

E3.2 Cut Landmarks

E3.3 The LM-Cut Heuristic E3.4 Summary & Outlook

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 2 / 28

Content of this Course

Planning

Classical

Foundations Logic Heuristics Constraints

Probabilistic

Explicit MDPs Factored MDPs

Content of this Course: Constraints

Constraints

Landmarks RTG Landmarks

MHS Heuristic

LM-Cut Heuristic Cost

Partitioning Network

Flows

Operator

Counting

(2)

Roadmap for this Chapter

I We first introduce a new normal form

for delete-free STRIPS tasks that simplifies later definitions.

I We then present a method that computes disjunctive action landmarks for such tasks.

I We conclude with the LM-cut heuristic that builds on this method.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 5 / 28

E3. Landmarks: LM-Cut Heuristic i-g Form

E3.1 i-g Form

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 6 / 28

E3. Landmarks: LM-Cut Heuristic i-g Form

Delete-Free STRIPS Planning Task in i-g Form (1)

In this chapter, we only consider delete-free STRIPS tasks in a special form:

Definition (i-g Form for Delete-free STRIPS)

A delete-free STRIPS planning task hV , I , O, γi is in i-g form if I V contains atoms i and g

I Initially exactly i is true: I (v ) = T iff v = i I g is the only goal atom: γ = {g }

I Every action has at least one precondition.

E3. Landmarks: LM-Cut Heuristic i-g Form

Transformation to i-g Form

Every delete-free STRIPS task Π = hV , I , O, γi can easily be transformed into an analogous task in i-g form.

I If i or g are in V already, rename them everywhere.

I Add i and g to V .

I Add an operator h{i}, {v ∈ V | I(v ) = T}, {}, 0i.

I Add an operator hγ, {g }, {}, 0i.

I Replace all operator preconditions > with i . I Replace initial state and goal.

For the remainder of this chapter, we assume tasks in i-g form.

(3)

E3. Landmarks: LM-Cut Heuristic i-g Form

Example: Delete-Free Planning Task in i-g Form

Example

Consider a delete-free STRIPS planning task hV , I, O, γi with V = {i , a, b, c , d , g }, I = {i 7→ T} ∪ {v 7→ F | v ∈ V \ {i}}, γ = {g } and operators

I o blue = h{i }, {a, b}, {}, 4i, I o green = h{i}, {a, c}, {}, 5i, I o black = h{i}, {b, c}, {}, 3i, I o red = h{b, c }, {d }, {}, 2i, and I o orange = h{a, d }, {g }, {}, 0i.

optimal solution to reach g from i:

I plan: ho blue , o black , o red , o orange i

I cost: 4 + 3 + 2 + 0 = 9 (= h + (I ) because plan is optimal)

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 9 / 28

E3. Landmarks: LM-Cut Heuristic Cut Landmarks

E3.2 Cut Landmarks

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 10 / 28

E3. Landmarks: LM-Cut Heuristic Cut Landmarks

Justification Graphs

Definition (Precondition Choice Function)

A precondition choice function (pcf) P : O → V for a delete-free STRIPS task Π = hV , I, O, γi in i-g form maps each operator to one of its preconditions (i.e. P (o) ∈ pre(o ) for all o ∈ O).

Definition (Justification Graphs)

Let P be a pcf for hV , I , O, γi in i-g form. The justification graph for P is the directed, edge-labeled graph J = hV , Ei, where

I the vertices are the variables from V , and

I E contains an edge P(o) − → o a for each o ∈ O, a ∈ add(o).

E3. Landmarks: LM-Cut Heuristic Cut Landmarks

Example: Justification Graph

Example (Precondition Choice Function)

P(o

blue

) = P(o

green

) = P(o

black

) = i, P(o

red

) = b, P(o

orange

) = a P

0

(o

blue

) = P

0

(o

green

) = P

0

(o

black

) = i, P

0

(o

red

) = c, P

0

(o

orange

) = d

i

a

b

c

d

g

o

blue

= h{i }, {a, b}, {}, 4i

o

green

= h{i }, {a, c }, {}, 5i

o

black

= h{i}, {b, c }, {}, 3i

o

red

= h{b, c }, {d }, {}, 2i

o

orange

= h{a, d }, {g }, {}, 0i

(4)

E3. Landmarks: LM-Cut Heuristic Cut Landmarks

Cuts

Definition (Cut)

A cut in a justification graph is a subset C of its edges such that all paths from i to g contain an edge from C .

i

a

b

c

d

g

o

blue

= h{i}, {a, b}, {}, 4i o

green

= h{i}, {a, c}, {}, 5i o

black

= h{i }, {b, c}, {}, 3i o

red

= h{b, c}, {d}, {}, 2i o

orange

= h{a, d}, {g}, {}, 0i

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 13 / 28

E3. Landmarks: LM-Cut Heuristic Cut Landmarks

Cuts are Disjunctive Action Landmarks

Theorem (Cuts are Disjunctive Action Landmarks) Let P be a pcf for hV , I, O , γi (in i-g form) and C be a cut in the justification graph for P.

The set of edge labels from C (formally {o | hv , o, v 0 i ∈ C }) is a disjunctive action landmark for I .

Proof idea:

I The justification graph corresponds to a simpler problem where some preconditions (those not picked by the pcf) are ignored.

I Cuts are landmarks for this simplified problem.

I Hence they are also landmarks for the original problem.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 14 / 28

E3. Landmarks: LM-Cut Heuristic Cut Landmarks

Example: Cuts in Justification Graphs

Example (Landmarks) I L 1 = {o orange } (cost = 0) I L 3 = {o red } (cost = 2)

I L 2 = {o green , o black } (cost = 3) I L 4 = {o green , o blue } (cost = 4)

i

a

b

c

d

g

o

blue

= h{i}, {a, b}, {}, 4i o

green

= h{i}, {a, c}, {}, 5i o

black

= h{i }, {b, c}, {}, 3i o

red

= h{b, c}, {d}, {}, 2i o

orange

= h{a, d}, {g}, {}, 0i

E3. Landmarks: LM-Cut Heuristic Cut Landmarks

Power of Cuts in Justification Graphs

I Which landmarks can be computed with the cut method?

I all interesting ones!

Proposition (perfect hitting set heuristics)

Let L be the set of all “cut landmarks” of a given planning task with initial state I . Then h MHS (L) = h + (I ).

Hitting set heuristic for L is perfect.

Proof idea:

I Show 1:1 correspondence of hitting sets H for L and plans, i.e., each hitting set for L corresponds to a plan,

and vice versa.

(5)

E3. Landmarks: LM-Cut Heuristic The LM-Cut Heuristic

E3.3 The LM-Cut Heuristic

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 17 / 28

E3. Landmarks: LM-Cut Heuristic The LM-Cut Heuristic

LM-Cut Heuristic: Motivation

I In general, there are exponentially many pcfs, hence computing all relevant landmarks is not tractable.

I The LM-cut heuristic is a method that chooses pcfs and computes cuts in a goal-oriented way.

I As a side effect, it computes

I a cost partitioning over multiple instances of h

max

that is also I a saturated cost partitioning over disjunctive action landmarks.

currently one of the best admissible planning heuristic

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 18 / 28

E3. Landmarks: LM-Cut Heuristic The LM-Cut Heuristic

LM-Cut Heuristic

h LM-cut : Helmert & Domshlak (2009) Initialize h LM-cut (I ) := 0. Then iterate:

1

Compute h max values of the variables. Stop if h max (g ) = 0.

2

Compute justification graph G for the P that chooses preconditions with maximal h max value

3

Determine the goal zone V g of G that consists of all nodes that have a zero-cost path to g.

4

Compute the cut L that contains the labels of all edges hv , o, v 0 i such that v 6∈ V g , v 0 ∈ V g and v can be reached from i without traversing a node in V g .

It is guaranteed that cost(L) > 0.

5

Increase h LM-cut (I) by cost(L).

6

Decrease cost(o) by cost(L) for all o ∈ L.

E3. Landmarks: LM-Cut Heuristic The LM-Cut Heuristic

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d 5

3 3 1 1 0

g 5

4 4 1 1 0

1

Compute h

max

values of the variables

1

Compute h

max

values of the variables. Stop if h

max

(g) = 0.

2

Compute justification graph

3

Determine goal zone

4

Compute cut

5

Increase h

LM-cut

(I) by cost(L)

6

Decrease cost(o) by cost(L) for all o ∈ L

o

blue

= h{i}, {a, b}, {}, 4i o

green

= h{i }, {a, c }, {}, 5i o

black

= h{i }, {b, c }, {}, 3i o

red

= h{b, c}, {d }, {}, 0i o

orange

= h{a, d}, {g }, {}, 0i

round P(o

orange

) P(o

red

) landmark cost

1 d b {o

red

} 2

{o

green

, o

blue

} {o

green

, o

black

}

h

LM-cut

(I) 2

(6)

E3. Landmarks: LM-Cut Heuristic The LM-Cut Heuristic

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d

5 3

3

1 1 0

g

5 4

4

1 1 0

1

Compute h

max

values of the variables

1

Compute h

max

values of the variables. Stop if h

max

(g ) = 0.

2

Compute justification graph

3

Determine goal zone

4

Compute cut

5

Increase h

LM-cut

(I ) by cost(L)

6

Decrease cost(o) by cost(L) for all o ∈ L

o

blue

= h{i }, {a, b}, {}, 0i o

green

= h{i}, {a, c}, {}, 1i o

black

= h{i }, {b, c}, {}, 3i o

red

= h{b, c}, {d}, {}, 0i o

orange

= h{a, d}, {g}, {}, 0i

round P(o

orange

) P(o

red

) landmark cost

1 d b {o

red

} 2

2 a b {o

green

, o

blue

} 4

{o

green

, o

black

}

h

LM-cut

(I ) 6

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 21 / 28

E3. Landmarks: LM-Cut Heuristic The LM-Cut Heuristic

Example: Computation of LM-Cut

i 0

a

4 0

0

b

3 0

0

c

3 1

1

0

d

5 3 3 1

1

0

g

5 4 4 1

1

0

1

Compute h

max

values of the variables

1

Compute h

max

values of the variables. Stop if h

max

(g) = 0.

2

Compute justification graph

3

Determine goal zone

4

Compute cut

5

Increase h

LM-cut

(I) by cost(L)

6

Decrease cost(o) by cost(L) for all o ∈ L

o

blue

= h{i}, {a, b}, {}, 0i o

green

= h{i }, {a, c }, {}, 0i o

black

= h{i }, {b, c }, {}, 2i o

red

= h{b, c}, {d }, {}, 0i o

orange

= h{a, d}, {g }, {}, 0i

round P(o

orange

) P(o

red

) landmark cost

1 d b {o

red

} 2

2 a b {o

green

, o

blue

} 4

3 d c {o

green

, o

black

} 1

h

LM-cut

(I) 7

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 22 / 28

E3. Landmarks: LM-Cut Heuristic The LM-Cut Heuristic

Properties of LM-Cut Heuristic

Theorem

Let hV , I, O , γi be a delete-free STRIPS task in i-g normal form.

The LM-cut heuristic is admissible: h LM-cut (I) ≤ h (I ).

Proof omitted.

If Π is not delete-free, we can compute h LM-cut on Π + . Then h LM-cut is bounded by h + .

E3. Landmarks: LM-Cut Heuristic Summary & Outlook

E3.4 Summary & Outlook

(7)

E3. Landmarks: LM-Cut Heuristic Summary & Outlook

Summary

I Cuts in justification graphs are a general method to find disjunctive action landmarks.

I The minimum hitting set over all cut landmarks is a perfect heuristic for delete-free planning tasks.

I The LM-cut heuristic is an admissible heuristic based on these ideas.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 25 / 28

E3. Landmarks: LM-Cut Heuristic Summary & Outlook

Literature (1)

References on landmark heuristics:

Julie Porteous, Laura Sebastia and Joerg Hoffmann.

On the Extraction, Ordering, and Usage of Landmarks in Planning.

Proc. ECP 2001, pp. 174–182, 2013.

Introduces landmarks.

Malte Helmert and Carmel Domshlak.

Landmarks, Critical Paths and Abstractions: What’s the Difference Anyway?

Proc. ICAPS 2009, pp. 162–169, 2009.

Introduces cut landmarks and LM-cut heuristic.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 18, 2020 26 / 28

E3. Landmarks: LM-Cut Heuristic Summary & Outlook

Literature (2)

Lin Zhu and Robert Givan.

Landmark Extraction via Planning Graph Propagation.

Doctoral Consortium ICAPS 2003, 2003.

Core idea for complete landmark generation.

Emil Keyder, Silvia Richter and Malte Helmert.

Sound and Complete Landmarks for And/Or Graphs Proc. ECAI 2010 , pp. 335–340, 2010.

Introduces landmarks from AND/OR graphs and usage of Π m compilation.

E3. Landmarks: LM-Cut Heuristic Summary & Outlook

Literature (3)

Silvia Richter and Matthias Westphal.

The LAMA Planner: Guiding Cost-Based Anytime Planning with Landmarks.

JAIR 39 (2010) , pp. 127–177, 2010.

Introduces landmark-count heuristic and contains another landmark generation method.

Erez Karpas and Carmel Domshlak.

Cost-Optimal Planning with Landmarks.

Proc. IJCAI 2009, pp. 1728–1733, 2009.

Introduces admissible variant of landmark heuristic.

Referenzen

ÄHNLICHE DOKUMENTE

some operator must be applied (action landmark) some atomic proposition must hold (fact landmark) some formula must be true (formula landmark).. → Derive heuristic estimate from

Same algorithm can be used for disjunctive action landmarks, where we also have a minimal saturated cost function. Definition (MSCF for Disjunctive

Same algorithm can be used for disjunctive action landmarks, where we also have a minimal saturated cost function. Definition (MSCF for Disjunctive

I Adding more constraints can only remove feasible solutions I Fewer feasible solutions can only increase objective value I Higher objective value means better informed heuristic

Adding more constraints can only remove feasible solutions Fewer feasible solutions can only increase objective value Higher objective value means better informed heuristic Are

Quality of given policy π can be computed (via LP or backward induction) or approximated arbitrarily closely (via iterative policy evaluation) in small SSPs or MDPs Impossible

The merge steps combine two abstract transition systems by replacing them with their synchronized product. The shrink steps make an abstract system smaller by abstracting

I Hitting sets over all cut landmarks yield a perfect heuristic for delete-free planning tasks. I The LM-cut heuristic is an admissible heuristic based on