• Keine Ergebnisse gefunden

Planning and Optimization A7. Invariants, Mutexes and Finite Domain Representation Gabriele R¨oger and Thomas Keller

N/A
N/A
Protected

Academic year: 2022

Aktie "Planning and Optimization A7. Invariants, Mutexes and Finite Domain Representation Gabriele R¨oger and Thomas Keller"

Copied!
32
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

A7. Invariants, Mutexes and Finite Domain Representation

Gabriele R¨oger and Thomas Keller

Universit¨at Basel

October 8, 2018

(2)

Content of this Course

Planning

Classical

Tasks Progression/

Regression Complexity Heuristics

Probabilistic

MDPs Uninformed Search

Heuristic Search Monte-Carlo

Methods

(3)

Invariants

(4)

Invariants

When we as humans reason about planning tasks, we implicitly make use of “obvious” properties of these tasks.

Example: we are never in two places at the same time We can represent such properties as logical formulas ϕ that are true in all reachable states.

Example: ϕ=¬(at-uniat-home)

Such formulas are called invariants of the task.

(5)

Invariants: Definition

Definition (Invariant)

Aninvariant of a planning task Π with state variables V is a logical formulaϕoverV such that s |=ϕ

for all reachable statess of Π.

(6)

Invariants Mutexes FDR Planning Tasks FDR Task Semantics SAS+Planning Tasks Summary

Computing Invariants

Theoretically, testing if an arbitrary formula ϕ is an invariant is as hard as planningitself.

proof idea: a planning task is unsolvable iff the negation of its goal is an invariant

Still, many practical invariant synthesis algorithms exist.

To remain efficient (= polynomial-time), these algorithms only compute a subsetof all useful invariants.

sound, but notcomplete

Empirically, they tend to at least find the “obvious”

invariants of a planning task.

(7)

Exploiting Invariants

Invariants have many uses in planning:

Regression search:

Prune states that violate (are inconsistent with) invariants.

Planning as satisfiability:

Add invariantsto a SAT encoding of a planning task to get tighter constraints.

Reformulation:

Derive amore compact state space representation (i.e., with fewer unreachable states).

We now briefly discuss the last point because it is important forplanning tasks in finite-domain representation,

introduced in the following chapter.

(8)

Mutexes

(9)

Mutexes

Invariants that take the form ofbinary clauses are called mutexes because they express that certain variable assignments cannot be simultaneously true and are hencemutually exclusive.

Example (Blocks World)

The invariant¬A-on-B∨ ¬A-on-Cstates that A-on-Band A-on-Care mutex.

We say that a largerset of literals is mutually exclusive if every subset of two literals is a mutex.

Example (Blocks World)

Every pair in{B-on-A,C-on-A,D-on-A,A-clear} is mutex.

(10)

Encoding Mutex Groups as Finite-Domain Variables

LetL={`1, . . . , `n} be mutually exclusive literals overn different variablesVL={v1, . . . ,vn}.

Then the planning task can be rephrased using a single finite-domain(i.e., non-binary) state variablevL

withn+ 1 possible values in place of then variables in VL: n of the possible values represent situations

in which exactly oneof the literals in Lis true.

The remaining value represents situations in which none of the literalsin L is true.

Note: If we can prove that one of the literals inL

must be true in each state (i.e.,`1∨ · · · ∨`n is an invariant), this additional value can be omitted.

In many cases, the reduction in the number of variables dramatically improves performance of a planning algorithm.

(11)

FDR Planning Tasks

(12)

Reminder: Blocks World with Boolean State Variables

Example

s(A-on-B) =F s(A-on-C) =F s(A-on-table) =T s(B-on-A) =T s(B-on-C) =F s(B-on-table) =F s(C-on-A) =F s(C-on-B) =F s(C-on-table) =T 29 = 512 states

A B

C

Note: it may be useful to add auxiliary state variables like A-clear.

(13)

Blocks World with Finite-Domain State Variables

Use three finite-domain state variables:

below-a: {b,c,table}

below-b: {a,c,table}

below-c: {a,b,table}

Example

s(below-a) = table s(below-b) = a s(below-c) = table

33 = 27 states

A B

C

Note: it may be useful to add auxiliary state variables like above-a.

(14)

Finite-Domain State Variables

Definition (Finite-Domain State Variable) Afinite-domain state variableis a symbolv

with an associatedfinite domain, i.e., a non-empty finite set.

We writedom(v) for the domain of v.

Example (Blocks World)

v=above-a, dom(above-a) ={b,c,nothing}

This state variable encodes the same information as the propositional variablesB-on-A,C-on-A andA-clear.

(15)

Finite-Domain States

Definition (Finite-Domain State)

LetV be a finite set of finite-domain state variables.

Astate overV is an assignments :V →S

v∈Vdom(v) such thats(v)∈dom(v) for all v ∈V.

Example (Blocks World)

s ={above-a7→nothing,above-b7→a,above-c7→b, below-a7→b,below-b7→c,below-c7→table}

(16)

Finite-Domain Formulas

Definition (Finite-Domain Formula)

Logical formulas over finite-domain state variablesV are defined identically to the propositional case,

except that instead of atomic formulas of the formv0 ∈V0 with propositional state variablesV0, there are atomic formulas of the formv =d, wherev ∈V andd ∈dom(v).

Example (Blocks World)

The formula (above-a= nothing)∨ ¬(below-b= c) corresponds to the formulaA-clear∨ ¬B-on-C.

(17)

Finite-Domain Effects

Definition (Finite-Domain Effect)

Effects over finite-domain state variablesV are defined identically to the propositional case,

except that instead of atomic effects of the formv0 and¬v0 with propositional state variablesv0 ∈V0, there are atomic effects of the formv :=d, wherev ∈V andd ∈dom(v).

Example (Blocks World) The effect

(below-a:= table)∧((above-b= a)B(above-b:= nothing)) corresponds to the effect

A-on-table∧ ¬A-on-B∧ ¬A-on-C∧(A-on-BB(B-clear∧ ¬A-on-B)).

finite-domain operators,effect conditionsetc. follow

(18)

Planning Tasks in Finite-Domain Representation

Definition (Planning Task in Finite-Domain Representation) Aplanning task in finite-domain representation

orFDR planning task is a 4-tuple Π =hV,I,O, γiwhere V is a finite set of finite-domain state variables, I is a state over V called theinitial state,

O is a finite set offinite-domain operators overV, and γ is a formula overV called thegoal.

(19)

FDR Task Semantics

(20)

FDR Task Semantics: Introduction

We have now defined what FDR tasks look like.

We still have to define their semantics.

Because they are similar to propositional planning tasks, we can define their semantics in a very similar way.

(21)

Direct vs. Compilation Semantics

We describe two ways of defining semantics for FDR tasks:

directly, mirroring our definitions for propositional tasks bycompilation to propositional tasks

Comparison of the semantics:

The two semantics are equivalent in terms of thereachable state space and hence in terms of the set of solutions.

(We will not prove this.)

They are notequivalent w.r.t. the set of allstates.

Where the distinction matters, we use thedirect semantics in this course unless stated otherwise.

(22)

Conflicting Effects

As with propositional planning tasks, there is a subtlety:

what should an effect of the formv:= a∧v:= b mean?

For FDR tasks, the common convention is to make thisillegal, i.e., to make an operator inapplicable if it would lead to conflicting effects.

(23)

Consistency Condition and Applicability

Definition (Consistency Condition)

Lete be an effect over finite-domain state variablesV. Theconsistency conditionfor e,consist(e) is defined as

^

v∈V

^

d,d0∈dom(v),d6=d0

¬(effcond(v :=d,e)∧effcond(v :=d0,e)).

Definition (Applicable FDR Operator) An FDR operatoro isapplicable in a state s ifs |=pre(o)∧consist(eff(o)).

The definitions ofsJoKetc. then follow in the natural way.

(24)

Reminder: Semantics of Propositional Planning Tasks

Reminder from Chapter A4:

Definition (Transition System Induced by a Prop. Planning Task) The propositional planning task Π =hV,I,O, γi induces

the transition systemT(Π)=hS,L,c,T,s0,S?i, where S is the set of all valuations of V,

L is the set of operatorsO,

c(o) =cost(o) for all operatorso ∈O,

T ={hs,o,s0i |s ∈S, o applicable ins, s0 =sJoK}, s0=I, and

S? ={s ∈S |s |=γ}.

(25)

Semantics of Planning Tasks

A definition that works for both types of planning tasks:

Definition (Transition System Induced by a Planning Task) The planning task Π =hV,I,O, γi induces

the transition systemT(Π)=hS,L,c,T,s0,S?i, where S is the set of states over V,

L is the set of operatorsO,

c(o) =cost(o) for all operatorso ∈O,

T ={hs,o,s0i |s ∈S, o applicable ins, s0 =sJoK}, s0=I, and

S? ={s ∈S |s |=γ}.

Planning taskhere refers to either a propositional or FDR task.

(26)

Compilation Semantics

Definition (Induced Propositional Planning Task) Let Π =hV,I,O, γibe an FDR planning task.

Theinduced propositional planning taskΠ0 is the (regular) planning task Π0=hV0,I0,O0, γ0i, where

V0 ={hv,di |v ∈V,d ∈dom(v)}

I0(hv,di) =Tiff I(v) =d

O0 andγ0 are obtained from O andγ by replacing each operator preconditionpre(o) bypre(o)consist(eff(o)), and then

replacing each atomic formulav =d by the propositionhv,di, replacing each atomic effectv :=d by the effect

hv,di ∧V

d0∈dom(v)\{d}¬hv,d0i.

(27)

SAS + Planning Tasks

(28)

SAS

+

Planning Tasks

Definition (SAS+ Planning Task)

An FDR planning task Π =hV,I,O, γiis called aSAS+ planning task if

there are no conditional effects in O, and

all operator preconditions in O and the goal formula γ are conjunctions of atoms.

(29)

SAS

+

vs. STRIPS

SAS+ is the analogue of STRIPS planning tasks for FDR induced propositional planning task of a SAS+ task is a STRIPS planning task after simplification (consistency conditions simplify to⊥ or>) FDR tasks obtained by mutex-based reformulation of STRIPS planning tasks are SAS+ tasks

(30)

Summary

(31)

Summary

Invariants are common properties of all reachable states, expressed as logical formulas.

Mutexes are invariants that express that certain pairs of literals are mutually exclusive.

Planning tasks in finite-domain representation (FDR) are an alternative to propositional planning tasks.

FDR tasks are often more compact (have fewer states).

This makes many planning algorithms more efficient when working with a finite-domain representation.

SAS+ tasks are a restricted form of FDR tasks where only conjunctions of atoms are allowed in the preconditions, effects and goal. No conditional effects are allowed.

(32)

Quiz

kahoot.it

Referenzen

ÄHNLICHE DOKUMENTE

For this purpose, we will use propositional logic, which allows expressing information about 2 n states as logical formulas over n state variables... Reminder:

The last missing piece is a definition of regression through operators, describing exactly in which states s applying a given operator o leads to a state satisfying a given formula

decision problem analogue of satisficing planning Definition (Bounded-Cost Plan Existence). The bounded-cost plan existence problem ( BCPlanEx ) is the following

Planning tasks in positive normal form without delete effects are called relaxed planning tasks. Plans for relaxed planning tasks are called

Delete relaxation is a simplification in the sense that it is never harder to solve a relaxed task than the original one. Delete-relaxed tasks have a

C3.1 Optimal Relaxed Plans C3.2 AND/OR Graphs C3.3 Forced Nodes?. C3.4 Most/Least Conservative Valuations

If n is a node in an AND/OR graph that is forced true, then this can be derived by a sequence of applications of Rule T-(∧) and Rule

The objective value of an integer program that minimizes this cost subject to the flow constraints is a lower bound on the plan cost (i.e., an admissible heuristic estimate)..