• Keine Ergebnisse gefunden

Planning and Optimization E3. Landmarks: LM-Cut Heuristic Malte Helmert and Gabriele R¨oger

N/A
N/A
Protected

Academic year: 2022

Aktie "Planning and Optimization E3. Landmarks: LM-Cut Heuristic Malte Helmert and Gabriele R¨oger"

Copied!
53
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Planning and Optimization

E3. Landmarks: LM-Cut Heuristic

Malte Helmert and Gabriele R¨ oger

Universit¨at Basel

(2)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Content of this Course

Planning

Classical

Foundations Logic Heuristics Constraints

Probabilistic

Explicit MDPs

Factored MDPs

(3)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Content of this Course: Constraints

Constraints

Landmarks RTG Landmarks

MHS Heuristic

LM-Cut Heuristic Cost

Partitioning Network

Flows

Operator

Counting

(4)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Roadmap for this Chapter

We first introduce a new normal form

for delete-free STRIPS tasks that simplifies later definitions.

We then present a method that computes disjunctive action landmarks for such tasks.

We conclude with the LM-cut heuristic

that builds on this method.

(5)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

i-g Form

(6)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Delete-Free STRIPS Planning Task in i-g Form (1)

In this chapter, we only consider delete-free STRIPS tasks in a special form:

Definition (i-g Form for Delete-free STRIPS)

A delete-free STRIPS planning task hV , I , O, γi is in i-g form if V contains atoms i and g

Initially exactly i is true: I (v) = T iff v = i g is the only goal atom: γ = {g }

Every action has at least one precondition.

(7)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Transformation to i-g Form

Every delete-free STRIPS task Π = hV , I , O , γi can easily be transformed into an analogous task in i-g form.

If i or g are in V already, rename them everywhere.

Add i and g to V .

Add an operator h{i }, {v ∈ V | I (v) = T}, {}, 0i.

Add an operator hγ, {g }, {}, 0i.

Replace all operator preconditions > with i . Replace initial state and goal.

For the remainder of this chapter, we assume tasks in i-g form.

(8)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Transformation to i-g Form

Every delete-free STRIPS task Π = hV , I , O , γi can easily be transformed into an analogous task in i-g form.

If i or g are in V already, rename them everywhere.

Add i and g to V .

Add an operator h{i }, {v ∈ V | I (v) = T}, {}, 0i.

Add an operator hγ, {g }, {}, 0i.

Replace all operator preconditions > with i . Replace initial state and goal.

For the remainder of this chapter, we assume tasks in i-g form.

(9)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Delete-Free Planning Task in i-g Form

Example

Consider a delete-free STRIPS planning task hV , I , O, γi with V = {i , a, b, c , d , g }, I = {i 7→ T} ∪ {v 7→ F | v ∈ V \ {i}}, γ = {g } and operators

o

blue

= h{i }, {a, b}, {}, 4i, o

green

= h{i }, {a, c}, {}, 5i, o

black

= h{i }, {b, c }, {}, 3i, o

red

= h{b, c }, {d }, {}, 2i, and o

orange

= h{a, d }, {g }, {}, 0i.

optimal solution?

(10)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Delete-Free Planning Task in i-g Form

Example

Consider a delete-free STRIPS planning task hV , I , O, γi with V = {i , a, b, c , d , g }, I = {i 7→ T} ∪ {v 7→ F | v ∈ V \ {i}}, γ = {g } and operators

o

blue

= h{i }, {a, b}, {}, 4i, o

green

= h{i }, {a, c}, {}, 5i, o

black

= h{i }, {b, c }, {}, 3i, o

red

= h{b, c }, {d }, {}, 2i, and o

orange

= h{a, d }, {g }, {}, 0i.

optimal solution to reach g from i : plan: ho

blue

, o

black

, o

red

, o

orange

i

cost: 4 + 3 + 2 + 0 = 9 (= h

+

(I) because plan is optimal)

(11)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Cut Landmarks

(12)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Justification Graphs

Definition (Precondition Choice Function)

A precondition choice function (pcf) P : O → V for a delete-free STRIPS task Π = hV , I , O, γi in i-g form maps each operator to one of its preconditions (i.e. P(o) ∈ pre(o) for all o ∈ O).

Definition (Justification Graphs)

Let P be a pcf for hV , I , O, γi in i-g form. The justification graph for P is the directed, edge-labeled graph J = hV , E i, where

the vertices are the variables from V , and

E contains an edge P(o) − →

o

a for each o ∈ O , a ∈ add(o).

(13)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Justification Graph

Example (Precondition Choice Function)

P(oblue) =P(ogreen) =P(oblack) =i,P(ored) =b,P(oorange) =a

P0(oblue) =P0(ogreen) =P0(oblack) =i,P0(ored) =c,P0(oorange) =d

i

a

b

c

d

g

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

(14)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Justification Graph

Example (Precondition Choice Function)

P(oblue) =P(ogreen) =P(oblack) =i,P(ored) =b,P(oorange) =a P0(oblue) =P0(ogreen) =P0(oblack) =i,P0(ored) =c,P0(oorange) =d

i

a

b

c

d

g

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

(15)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Cuts

Definition (Cut)

A cut in a justification graph is a subset C of its edges such that all paths from i to g contain an edge from C .

i

a

b

c

d

g

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

(16)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Cuts

Definition (Cut)

A cut in a justification graph is a subset C of its edges such that all paths from i to g contain an edge from C .

i

a

b

c

d

g

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

(17)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Cuts are Disjunctive Action Landmarks

Theorem (Cuts are Disjunctive Action Landmarks) Let P be a pcf for hV , I , O, γi (in i-g form) and C be a cut in the justification graph for P.

The set of edge labels from C (formally {o | hv, o, v

0

i ∈ C }) is a disjunctive action landmark for I .

Proof idea:

The justification graph corresponds to a simpler problem where some preconditions (those not picked by the pcf) are ignored.

Cuts are landmarks for this simplified problem.

Hence they are also landmarks for the original problem.

(18)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Cuts in Justification Graphs

Example (Landmarks) L

1

= {o

orange

} (cost = 0)

L

3

= {o

red

} (cost = 2)

L

2

= {o

green

, o

black

} (cost = 3) L

4

= {o

green

, o

blue

} (cost = 4)

i

a

b

c

d

g

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

(19)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Cuts in Justification Graphs

Example (Landmarks) L

1

= {o

orange

} (cost = 0)

L

3

= {o

red

} (cost = 2)

L

2

= {o

green

, o

black

} (cost = 3)

L

4

= {o

green

, o

blue

} (cost = 4)

i

a

b

c

d

g

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

(20)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Cuts in Justification Graphs

Example (Landmarks) L

1

= {o

orange

} (cost = 0) L

3

= {o

red

} (cost = 2)

L

2

= {o

green

, o

black

} (cost = 3)

L

4

= {o

green

, o

blue

} (cost = 4)

i

a

b

c

d

g

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

(21)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Cuts in Justification Graphs

Example (Landmarks) L

1

= {o

orange

} (cost = 0) L

3

= {o

red

} (cost = 2)

L

2

= {o

green

, o

black

} (cost = 3) L

4

= {o

green

, o

blue

} (cost = 4)

i

a

b

c

d

g

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

(22)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Power of Cuts in Justification Graphs

Which landmarks can be computed with the cut method?

all interesting ones!

Proposition (perfect hitting set heuristics)

Let L be the set of all “cut landmarks” of a given planning task with initial state I . Then h

MHS

(L) = h

+

(I ).

Hitting set heuristic for L is perfect.

Proof idea:

Show 1:1 correspondence of hitting sets H for L and plans, i.e., each hitting set for L corresponds to a plan,

and vice versa.

(23)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Power of Cuts in Justification Graphs

Which landmarks can be computed with the cut method?

all interesting ones!

Proposition (perfect hitting set heuristics)

Let L be the set of all “cut landmarks” of a given planning task with initial state I . Then h

MHS

(L) = h

+

(I ).

Hitting set heuristic for L is perfect.

Proof idea:

Show 1:1 correspondence of hitting sets H for L and plans, i.e., each hitting set for L corresponds to a plan,

and vice versa.

(24)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Power of Cuts in Justification Graphs

Which landmarks can be computed with the cut method?

all interesting ones!

Proposition (perfect hitting set heuristics)

Let L be the set of all “cut landmarks” of a given planning task with initial state I . Then h

MHS

(L) = h

+

(I ).

Hitting set heuristic for L is perfect.

Proof idea:

Show 1:1 correspondence of hitting sets H for L and plans, i.e., each hitting set for L corresponds to a plan,

and vice versa.

(25)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

The LM-Cut Heuristic

(26)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

LM-Cut Heuristic: Motivation

In general, there are exponentially many pcfs, hence computing all relevant landmarks is not tractable.

The LM-cut heuristic is a method that chooses pcfs and computes cuts in a goal-oriented way.

As a side effect, it computes

a cost partitioning over multiple instances ofhmax that is also asaturated cost partitioningover disjunctive action landmarks.

currently one of the best admissible planning heuristic

(27)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

LM-Cut Heuristic

h

LM-cut

: Helmert & Domshlak (2009) Initialize h

LM-cut

(I) := 0. Then iterate:

1

Compute h

max

values of the variables. Stop if h

max

(g ) = 0.

2

Compute justification graph G for the P that chooses preconditions with maximal h

max

value

3

Determine the goal zone V

g

of G that consists of all nodes that have a zero-cost path to g .

4

Compute the cut L that contains the labels of all edges hv, o, v

0

i such that v 6∈ V

g

, v

0

∈ V

g

and v can be reached from i without traversing a node in V

g

.

It is guaranteed that cost(L) > 0.

5

Increase h

LM-cut

(I ) by cost(L).

6

Decrease cost(o ) by cost(L) for all o ∈ L.

(28)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i

0

a

4 0 0

b

3 0 0

c

3 1 1 0

d

5 3 3 1 1 0

g

5 4 4 1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

{ored} {ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 0

(29)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d 5

3 3 1 1 0

g 5

4 4 1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost 1

{ored} {ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 0

(30)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d 5

3 3 1 1 0

g 5

4 4 1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b

{ored} {ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 0

(31)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d 5

3 3 1 1 0

g 5

4 4 1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b

{ored} {ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 0

(32)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d 5

3 3 1 1 0

g 5

4 4 1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

{ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 0

(33)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d 5

3 3 1 1 0

g 5

4 4 1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},2i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

{ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 2

(34)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d 5

3 3 1 1 0

g 5

4 4 1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

{ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 2

(35)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d

5

3

3 1 1 0

g

5

4

4 1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2

{ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 2

(36)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d

5 3

3

1 1 0

g

5 4

4

1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b

{ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 2

(37)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d

5 3

3

1 1 0

g

5 4

4

1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b

{ogreen,oblue} {ogreen,oblack}

hLM-cut(I) 2

(38)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d

5 3

3

1 1 0

g

5 4

4

1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

{ogreen,oblack}

hLM-cut(I) 2

(39)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d

5 3

3

1 1 0

g

5 4

4

1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},4i ogreen=h{i},{a,c},{},5i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

{ogreen,oblack}

hLM-cut(I) 6

(40)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a 4

0 0

b 3

0 0

c 3

1 1 0

d

5 3

3

1 1 0

g

5 4

4

1 1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},0i ogreen=h{i},{a,c},{},1i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

{ogreen,oblack}

hLM-cut(I) 6

(41)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a

4

0

0

b

3

0

0

c

3

1

1 0

d

5 3 3

1

1 0

g

5 4 4

1

1 0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},0i ogreen=h{i},{a,c},{},1i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

3

{ogreen,oblack}

hLM-cut(I) 6

(42)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a

4 0

0

b

3 0

0

c

3 1

1

0

d

5 3 3 1

1

0

g

5 4 4 1

1

0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},0i ogreen=h{i},{a,c},{},1i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

3 d c

{ogreen,oblack}

hLM-cut(I) 6

(43)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a

4 0

0

b

3 0

0

c

3 1

1

0

d

5 3 3 1

1

0

g

5 4 4 1

1

0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},0i ogreen=h{i},{a,c},{},1i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

3 d c

{ogreen,oblack}

hLM-cut(I) 6

(44)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a

4 0

0

b

3 0

0

c

3 1

1

0

d

5 3 3 1

1

0

g

5 4 4 1

1

0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},0i ogreen=h{i},{a,c},{},1i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

3 d c {ogreen,oblack} 1

hLM-cut(I) 6

(45)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a

4 0

0

b

3 0

0

c

3 1

1

0

d

5 3 3 1

1

0

g

5 4 4 1

1

0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},0i ogreen=h{i},{a,c},{},1i oblack=h{i},{b,c},{},3i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

3 d c {ogreen,oblack} 1

hLM-cut(I) 7

(46)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a

4 0

0

b

3 0

0

c

3 1

1

0

d

5 3 3 1

1

0

g

5 4 4 1

1

0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},0i ogreen=h{i},{a,c},{},0i oblack=h{i},{b,c},{},2i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

3 d c {ogreen,oblack} 1

hLM-cut(I) 7

(47)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Example: Computation of LM-Cut

i 0

a

4 0

0

b

3 0

0

c

3 1 1

0

d

5 3 3 1 1

0 g

5 4 4 1 1

0

1 Computehmaxvalues of the variables

1 Computehmaxvalues of the variables. Stop ifhmax(g) = 0.

2 Compute justification graph

3 Determine goal zone

4 Compute cut

5 IncreasehLM-cut(I) bycost(L)

6 Decreasecost(o) bycost(L) for allo∈L

oblue=h{i},{a,b},{},0i ogreen=h{i},{a,c},{},0i oblack=h{i},{b,c},{},2i ored=h{b,c},{d},{},0i oorange=h{a,d},{g},{},0i

round P(oorange) P(ored) landmark cost

1 d b {ored} 2

2 a b {ogreen,oblue} 4

3 d c {ogreen,oblack} 1

hLM-cut(I) 7

(48)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Properties of LM-Cut Heuristic

Theorem

Let hV , I , O, γi be a delete-free STRIPS task in i-g normal form.

The LM-cut heuristic is admissible: h

LM-cut

(I) ≤ h

(I ).

Proof omitted.

If Π is not delete-free, we can compute h

LM-cut

on Π

+

.

Then h

LM-cut

is bounded by h

+

.

(49)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Summary & Outlook

(50)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Summary

Cuts in justification graphs are a general method to find disjunctive action landmarks.

The minimum hitting set over all cut landmarks is a perfect heuristic for delete-free planning tasks.

The LM-cut heuristic is an admissible heuristic

based on these ideas.

(51)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Literature (1)

References on landmark heuristics:

Julie Porteous, Laura Sebastia and Joerg Hoffmann.

On the Extraction, Ordering, and Usage of Landmarks in Planning.

Proc. ECP 2001, pp. 174–182, 2013.

Introduces landmarks.

Malte Helmert and Carmel Domshlak.

Landmarks, Critical Paths and Abstractions: What’s the Difference Anyway?

Proc. ICAPS 2009, pp. 162–169, 2009.

Introduces cut landmarks and LM-cut heuristic.

(52)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Literature (2)

Lin Zhu and Robert Givan.

Landmark Extraction via Planning Graph Propagation.

Doctoral Consortium ICAPS 2003, 2003.

Core idea for complete landmark generation.

Emil Keyder, Silvia Richter and Malte Helmert.

Sound and Complete Landmarks for And/Or Graphs Proc. ECAI 2010 , pp. 335–340, 2010.

Introduces landmarks from AND/OR graphs

and usage of Π

m

compilation.

(53)

i-g Form Cut Landmarks The LM-Cut Heuristic Summary & Outlook

Literature (3)

Silvia Richter and Matthias Westphal.

The LAMA Planner: Guiding Cost-Based Anytime Planning with Landmarks.

JAIR 39 (2010) , pp. 127–177, 2010.

Introduces landmark-count heuristic and contains another landmark generation method.

Erez Karpas and Carmel Domshlak.

Cost-Optimal Planning with Landmarks.

Proc. IJCAI 2009, pp. 1728–1733, 2009.

Introduces admissible variant of landmark heuristic.

Referenzen

ÄHNLICHE DOKUMENTE

some operator must be applied (action landmark) some atomic proposition must hold (fact landmark) some formula must be true (formula landmark).. → Derive heuristic estimate from

I The minimum hitting set over all cut landmarks is a perfect heuristic for delete-free planning tasks. I The LM-cut heuristic is an admissible heuristic based on

Same algorithm can be used for disjunctive action landmarks, where we also have a minimal saturated cost function. Definition (MSCF for Disjunctive

Same algorithm can be used for disjunctive action landmarks, where we also have a minimal saturated cost function. Definition (MSCF for Disjunctive

I Adding more constraints can only remove feasible solutions I Fewer feasible solutions can only increase objective value I Higher objective value means better informed heuristic

Adding more constraints can only remove feasible solutions Fewer feasible solutions can only increase objective value Higher objective value means better informed heuristic Are

The merge steps combine two abstract transition systems by replacing them with their synchronized product. The shrink steps make an abstract system smaller by abstracting

I Hitting sets over all cut landmarks yield a perfect heuristic for delete-free planning tasks. I The LM-cut heuristic is an admissible heuristic based on