• Keine Ergebnisse gefunden

Planning and Optimization E8. Operator Counting Malte Helmert and Gabriele R¨oger

N/A
N/A
Protected

Academic year: 2022

Aktie "Planning and Optimization E8. Operator Counting Malte Helmert and Gabriele R¨oger"

Copied!
7
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Planning and Optimization

E8. Operator Counting

Malte Helmert and Gabriele R¨oger

Universit¨at Basel

November 25, 2020

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 1 / 27

Planning and Optimization

November 25, 2020 — E8. Operator Counting

E8.1 Introduction

E8.2 Operator-counting Framework E8.3 Properties

E8.4 Summary

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 2 / 27

Content of this Course

Planning

Classical

Foundations Logic Heuristics Constraints

Probabilistic

Explicit MDPs Factored MDPs

Content of this Course: Constraints

Constraints

Landmarks

Cost Partitioning Network

Flows

Operator Counting

(2)

E8. Operator Counting Introduction

E8.1 Introduction

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 5 / 27

E8. Operator Counting Introduction

Reminder: Flow Heuristic

In the previous chapter, we usedflow constraints to describe how often operators must be usedin each plan.

Example (Flow Constraints)

Let Π be a planning problem with operators{ored,ogreen,oblue}.

The flow constraint for some atoma is the constraint 1 +Countogreen =Countored. In natural language, this flow constraint expresses that

every plan usesored once morethan ogreen.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 6 / 27

E8. Operator Counting Introduction

Reminder: Flow Heuristic

Let us now observe how each flow constraint alters the operator count solution space.

“plans that use as often as ”

“plans that use once more than ”

0 0 0 0 0 0

1 0 0 1 0 0 2 1 1 0 1 1

3 2 2

1 2 0 2 0 1

1 1 0

2 2 1 2 2 0

1 1 1 0 2 2

3 1 0 2 1 0

0 0 1

3 0 2

E8. Operator Counting Operator-counting Framework

E8.2 Operator-counting Framework

(3)

E8. Operator Counting Operator-counting Framework

Operator Counting

Operator counting

I generalizes this idea to a framework that allows to admissibly combine different heuristics.

I useslinear constraints . . .

I . . . that describenumber of occurrencesof an operator . . . I . . . and must be satisfied byevery plan.

I provides declarative way to describe knowledge about solutions.

I allows reasoning about solutionsto derive heuristic estimates.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 9 / 27

E8. Operator Counting Operator-counting Framework

Operator-counting Constraint

Definition (Operator-counting Constraints)

Let Π be a planning task with operatorsO and let s be a state.

LetV be the set of integer variablesCounto for eacho ∈O. A linear inequality overV is called an operator-counting constraint fors if for every planπ for s setting each Counto to the number of occurrences ofo inπ is a feasible variable assignment.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 10 / 27

E8. Operator Counting Operator-counting Framework

Operator-counting Heuristics

Definition (Operator-counting IP/LP Heuristic)

The operator-counting integer program IPC for a setC of operator-counting constraints for states is

Minimize X

o∈O

cost(o)·Counto subject to C andCounto ≥0 for allo∈O,

where O is the set of operators.

TheIP heuristic hIPC is the objective value of IPC,

the LP heuristic hLPC is the objective value of its LP-relaxation.

If the IP/LP is infeasible, the heuristic estimate is ∞.

E8. Operator Counting Operator-counting Framework

Operator-counting Constraints

I Adding more constraints can only remove feasible solutions I Fewer feasible solutions can only increase objective value I Higher objective value means better informed heuristic Are there operator-counting constraints other than flow constraints?

(4)

E8. Operator Counting Operator-counting Framework

Reminder: Minimum Hitting Set for Landmarks

Variables

Non-negative variable Appliedo for each operatoro Objective

Minimize P

ocost(o)·Appliedo

Subject to

X

o∈L

Appliedo ≥1 for all landmarksL

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 13 / 27

E8. Operator Counting Operator-counting Framework

Operator Counting with Disjunctive Action Landmarks

Variables

Non-negative variableCounto for each operator o Objective

MinimizeP

ocost(o)·Counto

Subject to

X

o∈L

Counto ≥1 for all landmarks L

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 14 / 27

E8. Operator Counting Operator-counting Framework

New: Post-hoc Optimization Constraints

For set of abstractions{α1, . . . , αn}:

Variables

Non-negative variables Counto for all operators o ∈O Counto ·cost(o) is cost incurred by operatoro

Objective Minimize P

o∈Ocost(o)·Counto

Subject to X

o∈O:oaffectsTαcost(o)·Counto ≥hα(s) for α∈ {α1, . . . , αn} cost(o)·Counto ≥0 for all o∈O

E8. Operator Counting Operator-counting Framework

Example

“plans that use at least once”

“plans where and

cost 4 or more together” “plans that use once more than ”

0 0 1 2 0 1 2 0 1 3 0 2 1 1 2

3 2 2

1 2 0 1 0 0 1 0 0

1 1 0 2 2 0

1 3 1 1 2 1

3 1 0 2 1 0

0 0 0

· · · 2 2 1

2 2 1

(5)

E8. Operator Counting Operator-counting Framework

Further Examples?

I The definition of operator-counting constraints can be extended to groups of constraints and auxiliary variables.

I With this extended definition we could also cover more heuristics, e.g., the perfect delete-relaxation heuristich+.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 17 / 27

E8. Operator Counting Properties

E8.3 Properties

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 18 / 27

E8. Operator Counting Properties

Admissibility

Theorem (Operator-counting Heuristics are Admissible) The IP and the LP heuristic are admissible.

Proof.

Let C be a set of operator-counting constraints for state s andπ be an optimal plan for s. The number of operator occurrences ofπ are a feasible solution for C. As the IP/LP minimizes the total plan cost, the objective value cannot exceed the cost of π and is therefore an admissible estimate.

E8. Operator Counting Properties

Dominance

Theorem

Let C and C0 be sets of operator-counting constraints for s and let C ⊆C0. ThenIPC ≤IPC0 andLPC ≤LPC0.

Proof.

Every feasible solution ofC0 is also feasible forC. As the LP/IP is a minimization problem, the objective value subject toC can therefore not be larger than the one subject toC0.

Adding more constraints can only improve the heuristic estimate.

(6)

E8. Operator Counting Properties

Heuristic Combination

Operator counting as heuristic combination

I Multiple operator-counting heuristics can be combined by computinghLPC /hIPC for theunion of their constraints.

I This is anadmissiblecombination.

I Never worse than maximum of individual heuristics I Sometimes even better than their sum

I We already know a way of admissibly combining heuristics:

cost partitioning.

⇒How are they related?

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 21 / 27

E8. Operator Counting Properties

Connection to Cost Partitioning

Theorem

Let C1, . . . ,Cn be sets of operator-counting constraints for s and C=Sn

i=1Ci. Then hLPC is theoptimal general cost partitioning over the heuristics hLPC

i .

Proof ommitted.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 22 / 27

E8. Operator Counting Properties

Comparison to Optimal Cost Partitioning

I some heuristics aremore compact if expressed as operator counting

I some heuristics cannot be expressedas operator counting I operator counting IP even better than

optimal cost partitioning

I Cost partitioning maximizes, so heuristics must be encoded perfectly to guarantee admissibility.

Operator counting minimizes, so missing information just makes the heuristic weaker.

E8. Operator Counting Summary

E8.4 Summary

(7)

E8. Operator Counting Summary

Summary

I Many heuristics can be formulated in terms of operator-counting constraints.

I The operator counting heuristic framework allows to combine the constraintsand to reason on the entire encoded declarative knowledge.

I The heuristic estimate for the combined constraints can be better than the one of the best ingredient heuristic but never worse.

I Operator counting is equivalent to optimal general cost partitioning over individual constraints.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 25 / 27

E8. Operator Counting Summary

Literature (1)

Florian Pommerening, Gabriele R¨oger and Malte Helmert.

Getting the Most Out of Pattern Databases for Classical Planning.

Proc. IJCAI 2013, pp. 2357–2364, 2013.

Introducespost-hoc optimization and points out relation to canonical heuristic.

Blai Bonet.

An Admissible Heuristic for SAS+ Planning Obtained from the State Equation.

Proc. IJCAI 2013, pp. 2268–2274, 2013.

Suggests combinationof flow constraints and landmark constraints.

M. Helmert, G. R¨oger (Universit¨at Basel) Planning and Optimization November 25, 2020 26 / 27

E8. Operator Counting Summary

Literature (2)

Tatsuya Imai and Alex Fukunaga.

A Practical, Integer-linear Programming Model for the Delete-relaxation in Cost-optimal Planning.

Proc. ECAI 2014, pp. 459–464, 2014.

IP formulation of h+.

Florian Pommerening, Gabriele R¨oger, Malte Helmert and Blai Bonet.

LP-based Heuristics for Cost-optimal Planning.

Proc. ICAPS 2014, pp. 226–234, 2014.

Systematic introduction of operator-counting framework.

Referenzen

ÄHNLICHE DOKUMENTE

define a limit N on the number of states allowed in each factor in each iteration, select two factors we would like to merge merge them if this does not exhaust the state number

Landmarks, network flows and potential heuristics are based on constraints that can be specified for a planning task.... Constraint-based Heuristics Multiple

some operator must be applied (action landmark) some atomic proposition must hold (fact landmark) some formula must be true (formula landmark).. → Derive heuristic estimate from

I The minimum hitting set over all cut landmarks is a perfect heuristic for delete-free planning tasks. I The LM-cut heuristic is an admissible heuristic based on

Asynchronous VI performs backups for individual states Different approaches lead to different backup orders Can significantly reduce computation. Guaranteed to converge if all

I Quality of given policy π can be computed (via LP or backward induction) or approximated arbitrarily closely (via iterative policy evaluation) in small SSPs or MDPs I Impossible

Quality of given policy π can be computed (via LP or backward induction) or approximated arbitrarily closely (via iterative policy evaluation) in small SSPs or MDPs Impossible

Real-time Dynamic Programming (RTDP) generates hash map with state-value estimates of relevant states.. uses admissible heuristic to achieve convergence albeit not updating