• Keine Ergebnisse gefunden

Planning and Optimization

N/A
N/A
Protected

Academic year: 2022

Aktie "Planning and Optimization"

Copied!
7
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Planning and Optimization

E1. Constraints: Introduction

Malte Helmert and Gabriele R¨oger

Universit¨at Basel

November 16, 2020

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 1 / 26

Planning and Optimization

November 16, 2020 — E1. Constraints: Introduction

E1.1 Constraint-based Heuristics E1.2 Multiple Heuristics

E1.3 Summary

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 2 / 26

Content of this Course

Planning

Classical

Foundations Logic Heuristics Constraints

Probabilistic

Explicit MDPs Factored MDPs

Content of this Course: Constraints

Constraints

Landmarks Cost Partitioning

Network Flows Operator Counting

(2)

E1. Constraints: Introduction Constraint-based Heuristics

E1.1 Constraint-based Heuristics

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 5 / 26

E1. Constraints: Introduction Constraint-based Heuristics

Coming Up with Heuristics in a Principled Way

General Procedure for Obtaining a Heuristic Solve a simplified version of the problem.

Major ideas for heuristics in the planning literature:

I delete relaxation I abstraction I landmarks I critical paths I network flows I potential heuristic

Landmarks, network flows and potential heuristics are based on constraints that can be specified for a planning task.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 6 / 26

E1. Constraints: Introduction Constraint-based Heuristics

Constraints: Example

1 2 3 4 5 6

C B A

Images from wikimedia

E1. Constraints: Introduction Constraint-based Heuristics

Constraints: Example

Example

Consider a FDR planning taskhV,I,O, γi with I V ={robot-at,dishes-at} with

I dom(robot-at) ={A1, . . . ,C3,B4,A5, . . . ,B6}

I dom(dishes-at) ={Table,Robot,Dishwasher}

I I ={robot-at7→C1,dishes-at7→Table}

I operators

I move-x-y to move from cellx to adjacent celly I pickup dishes, and

I load dishes into the dishwasher.

I γ = (robot-at=B6)∧(dishes-at= Dishwasher)

(3)

E1. Constraints: Introduction Constraint-based Heuristics

Constraints

Some heuristics exploit constraints that describe something that holds in every solution of the task.

For instance, every solution is such that

I a variable takes some value in at least one visited state.

(afact landmark constraint)

I at least one action from a set of actions must be applied. (a disjunctive action landmarkconstraint)

I fact consumption and production is “balanced”. (a network flowconstraint)

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 9 / 26

E1. Constraints: Introduction Constraint-based Heuristics

Fact Landmarks: Example

Which values dorobot-at anddishes-at take in every solution?

1 2 3 4 5 6

C B A

I robot-at = C1,dishes-at= Table (initial state) I robot-at = B6,dishes-at= Dishwasher (goal state) I robot-at = A1,robot-at = B3, robot-at= B4,

robot-at = B5,robot-at = A6, dishes-at= Robot

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 10 / 26

E1. Constraints: Introduction Constraint-based Heuristics

Constraints

Some heuristics exploit constraints that describe something that holds in every solution of the task.

For instance, every solution is such that

I a variable takes some value in at least one visited state.

(afact landmark constraint) I an action must be applied.

(an action landmark constraint)

I fact consumption and production is “balanced”. (a network flowconstraint)

E1. Constraints: Introduction Constraint-based Heuristics

Action Landmarks: Example

Which actions must be applied in every solution?

1 2 3 4 5 6

C B A

I pickup I load

I move-B3-B4 I move-B4-B5

(4)

E1. Constraints: Introduction Constraint-based Heuristics

Constraints

Some heuristics exploit constraints that describe something that holds in every solution of the task.

For instance, every solution is such that

I a variable takes some valuein at least one visited state.

(afact landmark constraint) I an action must be applied.

(an action landmark constraint)

I at least one action from a set of actions must be applied.

(adisjunctive action landmark constraint)

I fact consumption and production is “balanced”. (a network flowconstraint)

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 13 / 26

E1. Constraints: Introduction Constraint-based Heuristics

Disjunctive Action Landmarks: Example

Which set of actions is such that at least one must be applied?

1 2 3 4 5 6

C B A

I {pickup}

I {load}

I {move-B3-B4}

I {move-B4-B5}

I {move-A6-B6,move-B5-B6}

I {move-A3-B3,move-B2-B3,move-C3-B3}

I {move-B1-A1,move-A2-A1}

I . . .

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 14 / 26

E1. Constraints: Introduction Constraint-based Heuristics

Constraints

Some heuristics exploit constraints that describe something that holds in every solution of the task.

For instance, every solution is such that

I a variable takes some value in at least one visited state.

(afact landmark constraint)

I at least one action from a set of actions must be applied.

(adisjunctive action landmark constraint) I fact consumption and production is “balanced”.

(anetwork flowconstraint)

E1. Constraints: Introduction Constraint-based Heuristics

Network Flow: Example

Consider the fact robot-at =B1.

How often are actions used that enter this cell?

1 2 3 4 5 6

C B A

Answer: as often as actions that leave this cell

If Counto denotes how often operatoro is applied, we have:

Countmove-A1-B1+ Countmove-B2-B1+ Countmove-C1-B1= Countmove-B1-A1+ Countmove-B1-B2+ Countmove-B1-C1

(5)

E1. Constraints: Introduction Multiple Heuristics

E1.2 Multiple Heuristics

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 17 / 26

E1. Constraints: Introduction Multiple Heuristics

Combining Admissible Heuristics Admissibly

Major ideas to combine heuristics admissibly:

I maximize

I canoncial heuristic (for abstractions) I minimum hitting set (for landmarks) I cost partitioning

I operator counting

Often computed as solution to a(integer) linear program.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 18 / 26

E1. Constraints: Introduction Multiple Heuristics

Combining Heuristics Admissibly: Example

Example

Consider an FDR planning task hV,I,{o1,o2,o3,o4}, γi with V ={v1,v2,v3} withdom(v1) ={A,B} and

dom(v2) = dom(v3) ={A,B,C},I ={v1 7→A,v27→A,v3 7→A}, o1=hv1 = A,v1 := B,1i

o2=hv2 = A∧v3= A,v2 := B∧v3 := B,1i o3=hv2 = B,v2 := C,1i

o4=hv3 = B,v3 := C,1i andγ = (v1= B)∧(v2 = C)∧(v3= C).

LetC be the pattern collection that contains all atomic projections.

What is the canonical heuristic function hC?

Answer: Lethi :=hvi. Then hC = max{h1+h2,h1+h3}.

E1. Constraints: Introduction Multiple Heuristics

Reminder: Orthogonality and Additivity

Why can we addh1 andh2 (h1 andh3) admissibly?

Theorem (Additivity for Orthogonal Abstractions)

Let hα1, . . . ,hαn be abstraction heuristics of the same transition system such thatαi andαj are orthogonal for all i 6=j .

ThenPn

i=1hαi is a safe, goal-aware, admissible and consistent heuristic forΠ.

Consistency proof exploits thatevery concrete transition induces state-changing transition in at most one abstraction.

(6)

E1. Constraints: Introduction Multiple Heuristics

Combining Heuristics Admissibly: Example

Let h=h1+h2+h3. Where is consistency violated?

h1

1

A A

0

B o1

o1

o2,o3,o4 oo22,,oo33,,oo44

h2

2

A A

1

B B

0

C o2

o2 oo33

o1,o4

o1,o4 o1,o4 oo11,,oo44

Here:

h(BAA) =4 h(BBB) =2 h2 andh3 not additive because ofo2 h3

2

A A

1

B B

0

C o2

o2 oo44

o1,o3

o1,o3 oo11,,oo33 o1,o3

Consider solution ho1,o2,o3,o4i

Consider solutionho1,o2,o3,o4i Consider solutionho1,o2,o3,o4i Consider solutionho1,o2,o3,o4i Consider solutionho1,o2,o3,o4i Consider solutionho1,o2,o3,o4i

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 21 / 26

E1. Constraints: Introduction Multiple Heuristics

Inconsistency of h

2

and h

3

The reason that h2 andh3 are not additive is because the cost ofo2 is considered in both.

Is there anything we can do about this?

Solution: We can ignore the cost ofo2 in one heuristic by setting its cost to 0 (e.g.,cost3(o2) = 0).

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 22 / 26

E1. Constraints: Introduction Multiple Heuristics

Combining Heuristics Admissibly: Example

Let h0 =h1+h2+h03, whereh03=hv3 assumingcost3(o2) = 0.

h1

1

A A

0

B o1

o1

o2,o3,o4 oo22,,oo33,,oo44

h2

2

A A

1

B B

0

C o2

o2 oo33

o1,o4

o1,o4 o1,o4 oo11,,oo44 Here:

h0(BAA) =3 h0(BBB) = 2 h2 andh03 are additive h03

1

A A

1

B B

0

C o2

0-cost

o2 oo44

o1,o3

o1,o3 oo11,,oo33 o1,o3

Consider solution ho1,o2,o3,o4i

Consider solutionho1,o2,o3,o4i Consider solutionho1,o2,o3,o4i Consider solutionho1,o2,o3,o4i Consider solutionho1,o2,o3,o4i Consider solutionho1,o2,o3,o4i

E1. Constraints: Introduction Multiple Heuristics

Cost partitioning

Using the cost of every operator only in one heuristic is called a zero-one cost partitioning.

More generally, heuristics are additive if all operator costs are distributed in a way that the sum of the individual costs is no larger than the cost of the operator.

This can also be expressed as a constraint, the cost partitioning constraint:

n

X

i=1

costi(o)≤cost(o) for allo ∈O (more details later)

(7)

E1. Constraints: Introduction Summary

E1.3 Summary

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 25 / 26

E1. Constraints: Introduction Summary

Summary

I Landmarks and network flows areconstraints that describe something that holds in every solution of the task.

I Heuristics can be summed up admissibly if thecost partitioning constraint is satisfied.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 16, 2020 26 / 26

Referenzen

ÄHNLICHE DOKUMENTE

I The objective value of an integer program that minimizes this cost subject to the flow constraints is a lower bound on the plan cost (i.e., an admissible heuristic estimate).. I

The objective value of an integer program that minimizes this cost subject to the flow constraints is a lower bound on the plan cost (i.e., an admissible heuristic estimate)..

I used to traverse explicated tree from root node to a leaf I maps decision nodes to a probability distribution over actions. (usually as a function over a decision node and

C2.1 The Domination Lemma C2.2 The Relaxation Lemma C2.3 Further Properties C2.4 Greedy Algorithm C2.5 SummaryG. Keller (Universit¨ at Basel) Planning and Optimization October 17,

I The merge steps combine two abstract transition systems by replacing them with their synchronized product. I The shrink steps make an abstract system smaller by abstracting

The objective value of an integer program that minimizes this cost subject to the flow constraints is a lower bound on the plan cost (i.e., an admissible heuristic estimate)..

Use it in an eager greedy search on the instances in the directory castle and compare the heuristic values of the initial state with the cost of an optimal relaxed plan, the

Complete the implementation in the methods build compatibility graph and compute heuristic to create the compatibility graph for a given pattern collection and for computing