E1. Landmarks: MHS & Uniform Cost Partitioning Heuristic
Gabriele R¨oger and Thomas Keller
Universit¨at Basel
November 12, 2018
Landmarks
Content of this Course
Planning
Classical
Tasks Progression/
Regression Complexity Heuristics
Probabilistic
MDPs Uninformed Search
Heuristic Search Monte-Carlo
Methods
Content of this Course: Heuristics
Heuristics
Delete Relaxation Abstraction
Landmarks
Action Landmarks MHS Heuristic
Uniform Cost Partitioning Cut Landmarks LM-Cut Heuristic Network Flows
Potential Heuristics Cost Partitioning
Landmarks
Basic Idea: Something that must happen in every solution For example
some operator must be applied some atom must be true some formula must be true
→Derive heuristic estimate from this kind of information.
We will only consider disjunctive action landmarks.
Landmarks
Basic Idea: Something that must happen in every solution For example
some operator must be applied some atom must be true some formula must be true
→Derive heuristic estimate from this kind of information.
We will only considerdisjunctive action landmarks.
Disjunctive Action Landmarks
Definition (Disjunctive Action Landmark)
Lets be a state of planning task Π =hV,I,O, γi.
Adisjunctive action landmarkfor s is a set of operatorsL⊆O such that every plan fors (= label path froms to a goal state) contains an operator fromL.
Thecostof landmark Lis cost(L) = mino∈Lcost(o).
Example Task
Two trucks, one airplane
Airplane can fly between locations A3 and B1
Trucks can drive arbitrarily between locations A1, A2, and A3 Package to be transported from A1 to B1
Operators
Load(v, `)andUnload(v, `)for vehiclev and location` Drive(t, `, `0)for truckt and locations`, `0
Fly(`, `0)for locations`, `0
A1 A2
A3 B1
t1
t2
f
Example: Disjunctive Action Landmarks
L1 ={Load(Truck1,A1),Load(Truck2,A1)} and L2 ={Fly(B1, A3)} are disjunctive action landmarks.
A1 A2
A3 B1
t1
t2
f
What other disjunctive action landmarks are there?
Remarks
Not every landmark is informative.
For example: If the initial state is not already a goal state then the set of all operators is a disjunctive action landmark.
Deciding whether a given operator set is a disjunctive action landmark is as hard as the plan existence problem.
Exploiting Disjunctive Action Landmarks
How can we exploit a given setL of disjunctive action landmarks?
Sum of costsP
L∈Lcost(L)?
not admissible!
Maximize costs maxL∈Lcost(L)?
usually very weak heuristic
better: hitting setsor cost partitioning
Minimum Hitting Set Heuristic
Content of this Course: Heuristics
Heuristics
Delete Relaxation Abstraction
Landmarks
Action Landmarks MHS Heuristic
Uniform Cost Partitioning Cut Landmarks LM-Cut Heuristic Network Flows
Potential Heuristics Cost Partitioning
Hitting Sets
Definition (Hitting Set)
LetX be a set,F ={F1, . . . ,Fn} ⊆2X be a family of subsets of X andc :X →R+0 be a cost function for X.
Ahitting setis a subsetH⊆X that “hits” all subsets in F, i.e., H∩F 6=∅ for allF ∈ F. The costofH isP
x∈Hc(x).
Aminimum hitting set (MHS)is a hitting set with minimal cost.
MHS is a “classical” NP-complete problem (Karp, 1972)
Example: Hitting Sets
Example
X ={o1,o2,o3,o4}
F={{o4},{o1,o2},{o1,o3},{o2,o3}}
c(o1) = 3, c(o2) = 4, c(o3) = 5, c(o4) = 0
minimum hitting set: {o1,o2,o4}with cost 3 + 4 + 0 = 7
Example: Hitting Sets
Example
X ={o1,o2,o3,o4}
F={{o4},{o1,o2},{o1,o3},{o2,o3}}
c(o1) = 3, c(o2) = 4, c(o3) = 5, c(o4) = 0
minimum hitting set: {o1,o2,o4}with cost 3 + 4 + 0 = 7
Hitting Sets for Disjunctive Action Landmarks
Idea: disjunctive action landmarks are interpreted as Idea: instance of minimum hitting set
Definition (Hitting Set Heuristic)
LetL be a set of disjunctive action landmarks. The hitting set heuristichMHS(L) is defined as the cost of a minimum hitting set forL with c(o) =cost(o).
Proposition (Hitting Set Heuristic is Admissible)
LetL be a set of disjunctive action landmarks for state s.
Then hMHS(L)is an admissible estimate for s.
Why?
Hitting Set Heuristic: Discussion
The hitting set heuristic is the best possibleheuristic that only uses the given information. . .
. . . but is NP-hard to compute.
Use approximations that can be efficiently computed.
Now: uniform cost partitioning
Later in the course: optimal cost partitioning
Uniform Cost Partitioning
Content of this Course: Heuristics
Heuristics
Delete Relaxation Abstraction
Landmarks
Action Landmarks MHS Heuristic
Uniform Cost Partitioning Cut Landmarks LM-Cut Heuristic Network Flows
Potential Heuristics Cost Partitioning
Uniform Cost Partitioning (1)
Idea: Distribute cost of operators uniformly among the landmarks.
Definition (Uniform Cost Partitioning Heuristic for Landmarks) LetL be a set of disjunctive action landmarks.
Theuniform cost partitioning heuristic hUCP(L) is defined as hUCP(L) =X
L∈L
min
o∈Lc0(o) with c0(o) =cost(o)/|{L∈ L |o ∈L}|.
Landmarks Minimum Hitting Set Heuristic Uniform Cost Partitioning Summary
Uniform Cost Partitioning (2)
Theorem (Uniform Cost Partitioning Heuristic is Admissible) LetL be a set of disjunctive action landmarks for state s ofΠ.
Then hUCP(L) is an admissibleheuristic estimate for s.
Letπ =ho1, . . . ,oni be an optimal plan fors. ForL∈ Ldefine a new cost functioncostL ascostL(o) =c0(o) ifo ∈L and
costL(o) = 0 otherwise. Let ΠL be a modified version of Π, where for all operatorso the cost is replaced withcostL(o). We make three independent observations:
1 For L∈ L the valuecost0(L) := mino∈Lc0(o) is an admissible estimate for s in ΠL.
2 π is also a plan fors in ΠL, so h∗Π
L(s)≤Pn
i=1costL(oi).
3 P
L∈LcostL(o) =cost(o) for each operatoro.
. . .
Uniform Cost Partitioning (2)
Theorem (Uniform Cost Partitioning Heuristic is Admissible) LetL be a set of disjunctive action landmarks for state s ofΠ.
Then hUCP(L) is an admissibleheuristic estimate for s.
Proof.
Letπ=ho1, . . . ,oni be an optimal plan fors. ForL∈ Ldefine a new cost functioncostL ascostL(o) =c0(o) ifo ∈L and
costL(o) = 0 otherwise. Let ΠL be a modified version of Π, where for all operatorso the cost is replaced withcostL(o). We make three independent observations:
1 For L∈ Lthe value cost0(L) := mino∈Lc0(o) is an admissible estimate for s in ΠL.
2 π is also a plan fors in ΠL, so h∗Π
L(s)≤Pn
i=1costL(oi).
3 P
L∈LcostL(o) =cost(o) for each operator o.
. . .
Uniform Cost Partitioning (3)
Proof (continued).
Together, this leads to the following inequality (subscripts indicate for which task the heuristic is computed):
hΠUCP(L) =X
L∈L
cost0(L)
(1)
≤ X
L∈L
h∗Π
L(s)
(2)
≤ X
L∈L n
X
i=1
costL(oi) =
n
X
i=1
X
L∈L
costL(oi)
(3)=
n
X
i=1
cost(o) =h∗Π(s)
Relationship
Theorem
LetL be a set of disjunctive action landmarks for state s.
ThenhUCP(L)≤hMHS(L)≤h∗(s).
(Proof omitted.)
Summary
Summary
Disjunctive action landmark: setL of operators such that every plan uses some operator from L
The costof the landmark is the cost of its cheapest operator.
Hitting setsyield the most accurate heuristic for a given set of disjunctive action landmarks, but the computation is NP-hard.
Uniform cost partitioningis a polynomial approach for the computation of informative heuristics from
disjunctive action landmarks.