Planning and Optimization A7. Invariants, Mutexes and Finite Domain Representation Gabriele R¨oger and Thomas Keller

(1)

A7. Invariants, Mutexes and Finite Domain Representation

Gabriele R¨oger and Thomas Keller

Universit¨at Basel

October 8, 2018

(2)

Content of this Course

Planning

Classical

Tasks Progression/

Regression Complexity Heuristics

Probabilistic

MDPs Uninformed Search

Heuristic Search Monte-Carlo

Methods

(3)

Invariants

(4)

Invariants

When we as humans reason about planning tasks, we implicitly make use of “obvious” properties of these tasks.

Example: we are never in two places at the same time We can represent such properties as logical formulas ϕ that are true in all reachable states.

Example: ϕ=¬(at-uni∧at-home)

Such formulas are called invariants of the task.

(5)

Invariants: Definition

Definition (Invariant)

Aninvariant of a planning task Π with state variables V is a logical formulaϕoverV such that s |=ϕ

for all reachable statess of Π.

(6)

Invariants Mutexes FDR Planning Tasks FDR Task Semantics SAS⁺Planning Tasks Summary

Computing Invariants

Theoretically, testing if an arbitrary formula ϕ is an invariant is as hard as planningitself.

proof idea: a planning task is unsolvable iff the negation of its goal is an invariant

Still, many practical invariant synthesis algorithms exist.

To remain efficient (= polynomial-time), these algorithms only compute a subsetof all useful invariants.

sound, but notcomplete

Empirically, they tend to at least find the “obvious”

invariants of a planning task.

(7)

Exploiting Invariants

Invariants have many uses in planning:

Regression search:

Prune states that violate (are inconsistent with) invariants.

Planning as satisfiability:

Add invariantsto a SAT encoding of a planning task to get tighter constraints.

Reformulation:

Derive amore compact state space representation (i.e., with fewer unreachable states).

We now briefly discuss the last point because it is important forplanning tasks in finite-domain representation,

introduced in the following chapter.

(8)

Mutexes

(9)

Mutexes

Invariants that take the form ofbinary clauses are called mutexes because they express that certain variable assignments cannot be simultaneously true and are hencemutually exclusive.

Example (Blocks World)

The invariant¬A-on-B∨ ¬A-on-Cstates that A-on-Band A-on-Care mutex.

We say that a largerset of literals is mutually exclusive if every subset of two literals is a mutex.

Every pair in{B-on-A,C-on-A,D-on-A,A-clear} is mutex.

(10)

Encoding Mutex Groups as Finite-Domain Variables

LetL={`₁, . . . , `n} be mutually exclusive literals overn different variablesV_L={v₁, . . . ,v_n}.

Then the planning task can be rephrased using a single finite-domain(i.e., non-binary) state variablev_L

withn+ 1 possible values in place of then variables in VL: n of the possible values represent situations

in which exactly oneof the literals in Lis true.

The remaining value represents situations in which none of the literalsin L is true.

Note: If we can prove that one of the literals inL

must be true in each state (i.e.,`₁∨ · · · ∨`_n is an invariant), this additional value can be omitted.

In many cases, the reduction in the number of variables dramatically improves performance of a planning algorithm.

(11)

FDR Planning Tasks

(12)

Reminder: Blocks World with Boolean State Variables

Example

s(A-on-B) =F s(A-on-C) =F s(A-on-table) =T s(B-on-A) =T s(B-on-C) =F s(B-on-table) =F s(C-on-A) =F s(C-on-B) =F s(C-on-table) =T 2⁹ = 512 states

A B

C

Note: it may be useful to add auxiliary state variables like A-clear.

(13)

Blocks World with Finite-Domain State Variables

Use three finite-domain state variables:

below-a: {b,c,table}

below-b: {a,c,table}

below-c: {a,b,table}

Example

s(below-a) = table s(below-b) = a s(below-c) = table

3³ = 27 states

A B

C

Note: it may be useful to add auxiliary state variables like above-a.

(14)

Finite-Domain State Variables

Definition (Finite-Domain State Variable) Afinite-domain state variableis a symbolv

with an associatedfinite domain, i.e., a non-empty finite set.

We writedom(v) for the domain of v.

v=above-a, dom(above-a) ={b,c,nothing}

This state variable encodes the same information as the propositional variablesB-on-A,C-on-A andA-clear.

(15)

Finite-Domain States

Definition (Finite-Domain State)

LetV be a finite set of finite-domain state variables.

Astate overV is an assignments :V →S

v∈Vdom(v) such thats(v)∈dom(v) for all v ∈V.

s ={above-a7→nothing,above-b7→a,above-c7→b, below-a7→b,below-b7→c,below-c7→table}

(16)

Finite-Domain Formulas

Definition (Finite-Domain Formula)

Logical formulas over finite-domain state variablesV are defined identically to the propositional case,

except that instead of atomic formulas of the formv⁰ ∈V⁰ with propositional state variablesV⁰, there are atomic formulas of the formv =d, wherev ∈V andd ∈dom(v).

The formula (above-a= nothing)∨ ¬(below-b= c) corresponds to the formulaA-clear∨ ¬B-on-C.

(17)

Finite-Domain Effects

Definition (Finite-Domain Effect)

Effects over finite-domain state variablesV are defined identically to the propositional case,

except that instead of atomic effects of the formv⁰ and¬v⁰ with propositional state variablesv⁰ ∈V⁰, there are atomic effects of the formv :=d, wherev ∈V andd ∈dom(v).

Example (Blocks World) The effect

(below-a:= table)∧((above-b= a)B(above-b:= nothing)) corresponds to the effect

A-on-table∧ ¬A-on-B∧ ¬A-on-C∧(A-on-BB(B-clear∧ ¬A-on-B)).

finite-domain operators,effect conditionsetc. follow

(18)

Planning Tasks in Finite-Domain Representation

Definition (Planning Task in Finite-Domain Representation) Aplanning task in finite-domain representation

orFDR planning task is a 4-tuple Π =hV,I,O, γiwhere V is a finite set of finite-domain state variables, I is a state over V called theinitial state,

O is a finite set offinite-domain operators overV, and γ is a formula overV called thegoal.

(19)

FDR Task Semantics

(20)

FDR Task Semantics: Introduction

We have now defined what FDR tasks look like.

We still have to define their semantics.

Because they are similar to propositional planning tasks, we can define their semantics in a very similar way.

(21)

Direct vs. Compilation Semantics

We describe two ways of defining semantics for FDR tasks:

directly, mirroring our definitions for propositional tasks bycompilation to propositional tasks

Comparison of the semantics:

The two semantics are equivalent in terms of thereachable state space and hence in terms of the set of solutions.

(We will not prove this.)

They are notequivalent w.r.t. the set of allstates.

Where the distinction matters, we use thedirect semantics in this course unless stated otherwise.

(22)

Conflicting Effects

As with propositional planning tasks, there is a subtlety:

what should an effect of the formv:= a∧v:= b mean?

For FDR tasks, the common convention is to make thisillegal, i.e., to make an operator inapplicable if it would lead to conflicting effects.

(23)

Consistency Condition and Applicability

Definition (Consistency Condition)

Lete be an effect over finite-domain state variablesV. Theconsistency conditionfor e,consist(e) is defined as

^

v∈V

^

d,d⁰∈dom(v),d6=d⁰

¬(effcond(v :=d,e)∧effcond(v :=d⁰,e)).

Definition (Applicable FDR Operator) An FDR operatoro isapplicable in a state s ifs |=pre(o)∧consist(eff(o)).

The definitions ofsJoKetc. then follow in the natural way.

(24)

Reminder: Semantics of Propositional Planning Tasks

Reminder from Chapter A4:

Definition (Transition System Induced by a Prop. Planning Task) The propositional planning task Π =hV,I,O, γi induces

the transition systemT(Π)=hS,L,c,T,s0,S?i, where S is the set of all valuations of V,

L is the set of operatorsO,

c(o) =cost(o) for all operatorso ∈O,

T ={hs,o,s⁰i |s ∈S, o applicable ins, s⁰ =sJoK}, s0=I, and

S? ={s ∈S |s |=γ}.

(25)

Semantics of Planning Tasks

A definition that works for both types of planning tasks:

Definition (Transition System Induced by a Planning Task) The planning task Π =hV,I,O, γi induces

the transition systemT(Π)=hS,L,c,T,s0,S?i, where S is the set of states over V,

L is the set of operatorsO,

c(o) =cost(o) for all operatorso ∈O,

T ={hs,o,s⁰i |s ∈S, o applicable ins, s⁰ =sJoK}, s0=I, and

S? ={s ∈S |s |=γ}.

Planning taskhere refers to either a propositional or FDR task.

(26)

Compilation Semantics

Definition (Induced Propositional Planning Task) Let Π =hV,I,O, γibe an FDR planning task.

Theinduced propositional planning taskΠ⁰ is the (regular) planning task Π⁰=hV⁰,I⁰,O⁰, γ⁰i, where

V⁰ ={hv,di |v ∈V,d ∈dom(v)}

I⁰(hv,di) =Tiff I(v) =d

O⁰ andγ⁰ are obtained from O andγ by replacing each operator preconditionpre(o) bypre(o)∧consist(eff(o)), and then

replacing each atomic formulav =d by the propositionhv,di, replacing each atomic effectv :=d by the effect

hv,di ∧V

d⁰∈dom(v)\{d}¬hv,d⁰i.

(27)

SAS ⁺ Planning Tasks

(28)

SAS

⁺

Planning Tasks

Definition (SAS⁺ Planning Task)

An FDR planning task Π =hV,I,O, γiis called aSAS⁺ planning task if

there are no conditional effects in O, and

all operator preconditions in O and the goal formula γ are conjunctions of atoms.

(29)

SAS

⁺

vs. STRIPS

SAS⁺ is the analogue of STRIPS planning tasks for FDR induced propositional planning task of a SAS⁺ task is a STRIPS planning task after simplification (consistency conditions simplify to⊥ or>) FDR tasks obtained by mutex-based reformulation of STRIPS planning tasks are SAS⁺ tasks

(30)

Summary

(31)

Summary

Invariants are common properties of all reachable states, expressed as logical formulas.

Mutexes are invariants that express that certain pairs of literals are mutually exclusive.

Planning tasks in finite-domain representation (FDR) are an alternative to propositional planning tasks.

FDR tasks are often more compact (have fewer states).

This makes many planning algorithms more efficient when working with a finite-domain representation.

SAS⁺ tasks are a restricted form of FDR tasks where only conjunctions of atoms are allowed in the preconditions, effects and goal. No conditional effects are allowed.

(32)

Planning and Optimization A7. Invariants, Mutexes and Finite Domain Representation Gabriele R¨oger and Thomas Keller

Content of this Course

Invariants

Invariants

Invariants: Definition

Computing Invariants

Exploiting Invariants

Mutexes

Mutexes

Encoding Mutex Groups as Finite-Domain Variables

FDR Planning Tasks

Reminder: Blocks World with Boolean State Variables

A B

C

Blocks World with Finite-Domain State Variables

A B

C

Finite-Domain State Variables

Finite-Domain States

Finite-Domain Formulas

Finite-Domain Effects

Planning Tasks in Finite-Domain Representation

FDR Task Semantics

FDR Task Semantics: Introduction

Direct vs. Compilation Semantics

Conflicting Effects

Consistency Condition and Applicability

Reminder: Semantics of Propositional Planning Tasks

Semantics of Planning Tasks

Compilation Semantics

SAS ⁺ Planning Tasks

SAS

Planning Tasks

SAS

vs. STRIPS

Summary

Summary

Quiz

kahoot.it

Planning and Optimization A7. Invariants, Mutexes and Finite Domain Representation Gabriele R¨oger and Thomas Keller

Content of this Course

Invariants

Invariants

Invariants: Definition

Computing Invariants

Exploiting Invariants

Mutexes

Mutexes

Encoding Mutex Groups as Finite-Domain Variables

FDR Planning Tasks

Reminder: Blocks World with Boolean State Variables

A B

C

Blocks World with Finite-Domain State Variables

A B

C

Finite-Domain State Variables

Finite-Domain States

Finite-Domain Formulas

Finite-Domain Effects

Planning Tasks in Finite-Domain Representation

FDR Task Semantics

FDR Task Semantics: Introduction

Direct vs. Compilation Semantics

Conflicting Effects

Consistency Condition and Applicability

Reminder: Semantics of Propositional Planning Tasks

Semantics of Planning Tasks

Compilation Semantics

SAS + Planning Tasks

SAS

Planning Tasks

SAS

vs. STRIPS

Summary

Summary

Quiz

kahoot.it

SAS ⁺ Planning Tasks