Planning and Optimization D1. Abstractions: Formal Deﬁnition and Heuristics Gabriele R¨oger and Thomas Keller

(1)

D1. Abstractions: Formal Definition and Heuristics

Gabriele R¨oger and Thomas Keller

Universit¨at Basel

October 29, 2018

(2)

Content of this Course

Planning

Classical

Tasks Progression/

Regression Complexity Heuristics

Probabilistic

MDPs Uninformed Search

Heuristic Search Monte-Carlo

Methods

(3)

Content of this Course: Heuristics

Heuristics

Delete Relaxation

Abstraction

Abstractions in General

Pattern Databases

Merge &

Shrink Landmarks

Potential Heuristics Cost Partitioning

(4)

Abstractions

(5)

Abstracting a Transition System

Abstracting a transition system meansdropping some distinctions between states, whilepreserving the transition behaviouras much as possible.

An abstraction of a transition system T is defined by an abstraction mapping α that defines which states of T should be distinguished and which ones should not.

FromT andα, we compute anabstract transition systemT^α which is similar to T, but smaller.

The abstract goal distances(goal distances in T^α) are used as heuristic estimates for goal distances inT.

(6)

Computing the Abstract Transition System

GivenT andα, how do we compute T^α? Requirement

We want to obtain anadmissible heuristic.

Hence,h^∗(α(s)) (in the abstract state spaceT^α) should never overestimateh^∗(s) (in the concrete state spaceT).

An easy way to achieve this is to ensure thatall solutions in T are also present inT^α:

Ifs is a goal state inT, then α(s) is a goal state in T^α. IfT has a transition froms to t, thenT^α

has a transition from α(s) to α(t).

(7)

Example Task: One Package, Two Trucks

Example (One Package, Two Trucks)

Consider the following FDR planning taskhV,I,O, γi:

V ={p,tA,tB}with dom(p) ={L,R,A,B}

dom(tA) = dom(tB) ={L,R}

I ={p 7→L,t_A7→R,t_B 7→R}

O ={pickup_i,j |i ∈ {A,B},j ∈ {L,R}}

∪ {drop_i,j |i ∈ {A,B},j ∈ {L,R}}

∪ {move_i,j_,j⁰ |i ∈ {A,B},j,j⁰ ∈ {L,R},j 6=j⁰}, where pickup_i,j =hti=j∧p=j,p:=i,1i

drop_i,j =ht_i =j∧p=i,p:=j,1i

move_i,j,j⁰ =ht_i =j,t_i :=j⁰,1i

γ = (p= R)

(8)

Concrete Transition System of Example Task

LRR LLL LLR

LRL

ALR

ALL

BLL

BRL

ARL

ARR

BRR

BLR

RRR RRL

RLR

RLL

State {p 7→i,t_A 7→j,t_B 7→k} is depicted asijk. Transition labels are again not shown. For example, the transition from LLL to ALL has the label pickup_A,L.

(9)

Abstract Transition System of Example Task

LRR

LLR

LLL

LRL LLR

LRL LLL

ALR ARL

ALL ARR

BLL

BRL

BRR

BLR ALR ARL

ARR ALL

BLL BRR

BLR BRL

RRR RRL

RLR

RLLRLL RRL

RLR RRR

State {p 7→i,t_A 7→j,t_B 7→k} is depicted asijk. Transition labels are again not shown. For example, the transition from LLL to ALL has the label pickup_A,L.

(10)

Abstractions

Definition (Abstraction)

LetT =hS,L,c,T,s0,S?ibe a transition system.

Anabstraction(also: abstractionfunction, abstraction mapping) ofT is a functionα:S →S^α defined on the states of T, whereS^α is an arbitrary set.

Without loss of generality, we require thatα is surjective.

Intuition: α maps the states of T to another (usually smaller) abstractstate space.

(11)

Abstract Transition System

Definition (Abstract Transition System)

LetT =hS,L,c,T,s₀,S_?ibe a transition system, and letα:S →S^α be an abstraction of T.

Theabstract transition system induced byα, in symbols T^α, is the transition systemT^α=hS^α,L,c,T^α,s₀^α,S_?^αidefined by:

T^α={hα(s), `, α(t)i | hs, `,ti ∈T} s₀^α=α(s₀)

S_?^α={α(s)|s ∈S?}

(12)

Terminology

LetT be a transition system andα be an abstraction of T. T is called the concrete transition system.

T^α is called the abstract transition system.

Similarly: concrete/abstract state space, concrete/abstract transition, etc.

(13)

Practical Requirements for Abstractions

To be useful in practice, an abstraction heuristic must be efficiently computable. This gives us two requirements forα:

For a given state s, theabstract state α(s) must be efficiently computable.

For a given abstract stateα(s), theabstract goal distance h^∗(α(s)) must be efficiently computable.

There are a number of ways of achieving these requirements:

pattern database heuristics(Culberson & Schaeffer, 1996) merge-and-shrink abstractions (Dr¨ager, Finkbeiner &

Podelski, 2006)

Cartesian abstractions (Ball, Podelski & Rajamani, 2001) structural patterns (Katz & Domshlak, 2008b)

(14)

Homomorphisms and Isomorphisms

(15)

Homomorphisms and Isomorphisms

The abstraction mappingα that transforms T toT^α is also called a strict homomorphismfromT toT^α. Roughly speaking, in mathematics a homomorphism is a property-preserving mapping between structures.

A stricthomomorphism is one where no additional features are introduced. A non-strict homomorphism in planning would mean that the abstract transition system may include additional transitions and goal states not induced byα.

We only consider strict homomorphisms in this course.

Ifα is bijective, it is called an isomorphismbetween T and T^α, and the two transition systems are called isomorphic.

(16)

Isomorphic Transition Systems

The notion of isomorphic transition systems is important enough to warrant a formal definition:

Definition (Isomorphic Transition Systems)

LetT =hS,L,c,T,s₀,S_?iand T⁰=hS⁰,L⁰,c⁰,T⁰,s₀⁰,S_?⁰i be transition systems.

We say thatT is isomorphic toT⁰, in symbolsT ∼ T⁰, if there exist bijective functionsϕ:S →S⁰ andλ:L→L⁰ such that:

s −→^` t ∈T iff ϕ(s)−−→^λ(`) ϕ(t)∈T⁰, c⁰(λ(`)) =c(`) for all`∈L, ϕ(s₀) =s₀⁰, and

s ∈S? iff ϕ(s)∈S_?⁰.

(17)

Graph-Equivalent Transition Systems

Sometimes a weaker notion of equivalence is useful:

Definition (Graph-Equivalent Transition Systems)

LetT =hS,L,c,T,s0,S?iand T⁰=hS⁰,L⁰,c,T⁰,s₀⁰,S_?⁰i be transition systems.

We say thatT is graph-equivalent to T⁰, in symbolsT ∼ T^G ⁰, if there exists a bijective functionϕ:S →S⁰ such that:

There is a transitions −→^` t ∈T with c(`) =k iff there is a transition ϕ(s) ^`

0

−→ϕ(t)∈T⁰ with c⁰(`⁰) =k, ϕ(s0) =s₀⁰, and

s ∈S_? iff ϕ(s)∈S_?⁰.

Note: The labels of T and T⁰ do not matter except that transitionsof the same costmust be preserved.

(18)

Isomorphism vs. Graph Equivalence

(∼) and (∼) are equivalence relations.^G

Two isomorphic transition systems are interchangeable for all practical intents and purposes.

Two graph-equivalent transition systems are interchangeable for most intents and purposes.

In particular, their goal distances are identical.

Isomorphism implies graph equivalence, but not vice versa.

(19)

Abstraction Heuristics

(20)

Abstraction Heuristics

Definition (Abstraction Heuristic)

Letα:S →S^α be an abstraction of a transition systemT. Theabstraction heuristic induced byα, writtenh^α,

is the heuristic functionh^α:S →R⁺₀ ∪ {∞} defined as h^α(s) =h^∗_Tα(α(s)) for all s ∈S, whereh^∗_Tα denotes the goal distance function in T^α. Notes:

h^α(s) =∞ if no goal state ofT^α is reachable fromα(s) We also apply abstraction terminology to planning tasks Π, which stand for their induced transition systems.

For example, an abstraction of Π is an abstraction ofT(Π).

(21)

Abstraction Heuristics: Example

LRR

LLR

LLL

LRL LLR

LRL LLL

ALR ARL

ALL ARR

BLL

BRL

BRR

BLR ALR ARL

ARR ALL

BLL BRR

BLR BRL

RRR RRL

RLR

RLLRLL RRL

RLR RRR

h^α({p7→L,tA 7→R,tB 7→R}) = 3

(22)

Consistency of Abstraction Heuristics (1)

Theorem (Consistency and Admissibility ofh^α) Letα be an abstraction of a transition systemT. Then h^α is safe, goal-aware, admissible and consistent.

Proof.

We prove goal-awareness and consistency;

the other properties follow from these two.

LetT =hS,L,c,T,s₀,S_?i.

LetT^α =hS^α,L,c,T^α,s₀^α,S_?^αi.

Goal-awareness: We need to show thath^α(s) = 0 for alls ∈S?, so lets ∈S?. Thenα(s)∈S_?^α by the definition of abstract

transition systems, and henceh^α(s) =h^∗_Tα(α(s)) = 0. . . .

(23)

Consistency of Abstraction Heuristics (1)

Theorem (Consistency and Admissibility ofh^α) Letα be an abstraction of a transition systemT. Then h^α is safe, goal-aware, admissible and consistent.

Proof.

We prove goal-awareness and consistency;

the other properties follow from these two.

LetT =hS,L,c,T,s₀,S_?i.

LetT^α =hS^α,L,c,T^α,s₀^α,S_?^αi.

Goal-awareness: We need to show that h^α(s) = 0 for alls ∈S?, so lets ∈S?. Thenα(s)∈S_?^α by the definition of abstract

transition systems, and henceh^α(s) =h^∗_Tα(α(s)) = 0. . . .

(24)

Consistency of Abstraction Heuristics (2)

Proof (continued).

Consistency: Consider any state transition s −→^` t ofT. We need to showh^α(s)≤c(`) +h^α(t).

By the definition ofT^α, we getα(s)−→^` α(t)∈T^α. Hence,α(t) is a successor of α(s) in T^α via the label`.

We get:

h^α(s) =h^∗_Tα(α(s))

≤c(`) +h_T^∗α(α(t))

=c(`) +h^α(t),

where the inequality holds because perfect goal distancesh^∗_Tα

are consistent inT^α.

(The shortest path fromα(s) to the goal in T^α cannot be longer than the shortest path fromα(s) to the goal via α(t).)

(25)

Consistency of Abstraction Heuristics (2)

Proof (continued).

We get:

≤c(`) +h_T^∗α(α(t))

=c(`) +h^α(t),

(26)

Consistency of Abstraction Heuristics (2)

Proof (continued).

We get:

≤c(`) +h_T^∗α(α(t))

=c(`) +h^α(t),

(27)

Coarsenings and Refinements

(28)

Abstractions of Abstractions

Since abstractions map transition systems to transition systems, they arecomposable:

Using a first abstractionα:S →S⁰, map T toT^α.

Using a second abstraction β:S⁰ →S⁰⁰, map T^α to (T^α)^β. The result isthe same as directly using the abstraction (β◦α):

Let γ :S →S⁰⁰ be defined asγ(s) = (β◦α)(s) =β(α(s)).

Then T^γ = (T^α)^β.

(29)

Abstractions of Abstractions: Example (1)

LRR LLL LLR

LRL

ALR

ALL

BLL

BRL

ARL

ARR

BRR

BLR

RRR RRL

RLR

RLL

transition systemT

(30)

Abstractions of Abstractions: Example (2)

LRR

LRL LRR

LRL LLL LLRLLR

LLL

ALR

ALL ALR

ALL

ARL

ARR ARL

ARR

BLR BLL BRR

BRL BLL

BLR BRR

BRL

RRR RRLRRL

RRR

RLR

RLLRLL

RLR

Transition systemT⁰ as an abstraction ofT (ignore t_B)

(31)

Abstractions of Abstractions: Example (2)

LRR

LRL LRR

LRL LLL LLRLLR

LLL

ALR

ALL ALR

ALL

ARL

ARR ARL

ARR

BRR BLL BLR

BRL

BLL BLR

BRL BRR

RRR RRLRRL

RRR

RLR

RLLRLL

RLR

Transition systemT⁰ as an abstraction ofT (ignore t_B)

(32)

Abstractions of Abstractions: Example (3)

LRR LLL LLR

LRL LRR

LLR

LRL LLL

ALR ARL

ALL ARR ALR ARL

ARR ALL

BLL

BRL

BLR

BRR BLL BLR

BRR BRL

RRR RRL

RLR

RLLRLL RRL

RLR RRR

Transition system T⁰⁰ as an abstraction ofT⁰ (ignore t_A)

(33)

Abstractions of Abstractions: Example (3)

LRR LLL LLR

LRL LRR

LLR

LRL LLL

ALR ARL

ALL ARR ALR ARL

ARR ALL

BLL

BRL

BRR

BLR BLL BRR

BLR BRL

RRR RRL

RLR

RLLRLL RRL

RLR RRR

Transition system T⁰⁰ as an abstraction ofT (ignoret_A andt_B)

(34)

Coarsenings and Refinements

Definition (Coarsening and Refinement)

Letα andγ be abstractions of the same transition system such thatγ =β◦α for some functionβ.

Thenγ is called acoarsening ofα andα is called arefinementof γ.

(35)

Heuristic Quality of Refinements

Theorem (Heuristic Quality of Refinements)

Letα andγ be abstractions of the same transition system such thatα is a refinement ofγ.

Then h^α dominates h^γ.

In other words,h^γ(s)≤h^α(s)≤h^∗(s) for all states s.

(36)

Heuristic Quality of Refinements: Proof

Proof.

Sinceα is a refinement ofγ,

there exists a functionβ with γ =β◦α.

For all statess of Π, we get:

h^γ(s) =h^∗_Tγ(γ(s))

=h^∗_Tγ(β(α(s)))

=h^β_Tα(α(s))

≤h^∗_Tα(α(s))

=h^α(s),

where the inequality holds becauseh_T^βα is an admissible heuristic in the transition systemT^α.

(37)

Summary

(38)

Summary

Abstractionis one of the principled ways of deriving heuristics.

An abstractionis a functionα that maps the states S of a transition system to another (usually smaller) setS^α. Thisinduces an abstract transition systemT^α, which behaves like the original transition system T except that states

mapped to the same abstract state cannot be distinguished.

Abstractions α induceabstraction heuristics h^α: h^α(s) is the goal distance of α(s) in the abstract transition system.

Abstraction heuristics are safe,goal-aware,admissible andconsistent.

Abstractions can be composed, leading to coarservs. finer abstractions. Heuristics for finer abstractions dominate those for coarser ones.