• Keine Ergebnisse gefunden

Planning and Optimization B4. Practical Issues of Regression Search Gabriele R¨oger and Thomas Keller

N/A
N/A
Protected

Academic year: 2022

Aktie "Planning and Optimization B4. Practical Issues of Regression Search Gabriele R¨oger and Thomas Keller"

Copied!
13
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

B4. Practical Issues of Regression Search

Gabriele R¨oger and Thomas Keller

Universit¨at Basel

October 15, 2018

(2)

Content of this Course

Planning

Classical

Tasks Progression/

Regression Complexity Heuristics

Probabilistic

MDPs Uninformed Search

Heuristic Search Monte-Carlo

Methods

(3)

Regression Search

regression search

search from goal to initial state

propositional formulas represent sets of states regression computes possible predecessor states for a set of states and an operator.

(4)

Unpromising Branches

(5)

Emptiness and Subsumption Testing

The following two tests are useful when performing regression searches to avoid exploring unpromising branches:

Test that regr(ϕ,o) does not represent the empty set (which would mean that search is in a dead end).

For example,regr(p,ha,¬pi)≡a∧(⊥ ∨(p∧ ¬>))≡ ⊥.

Test that regr(ϕ,o) does not represent a subset ofϕ (which would mean that the resulting search state is worse than ϕand can be pruned).

For example,regr(a,hb,ci)≡a∧b.

Both of these problems areNP-complete.

(6)

Formula Growth

(7)

Formula Growth

The formularegr(regr(. . .regr(ϕ,on). . . ,o2),o1) may have size O(|ϕ||o1||o2|. . .|on−1||on|), i.e., the product of the sizes ofϕand the operators.

worst-case exponentialsize Ω(|ϕ|n) Logical Simplifications

⊥ ∧ϕ≡ ⊥,> ∧ϕ≡ϕ,⊥ ∨ϕ≡ϕ,> ∨ϕ≡ >

a∨ϕ≡a∨ϕ[⊥/a],¬a∨ϕ≡ ¬a∨ϕ[>/a], a∧ϕ≡a∧ϕ[>/a],¬a∧ϕ≡ ¬a∧ϕ[⊥/a]

idempotence, absorption, commutativity, associativity, . . .

(8)

Restricting Formula Growth in Search Trees

Problem very big formulas obtained by regression Cause disjunctivityin the (NNF) formulas

(formulaswithout disjunctions easily convertible to monomials`1∧ · · · ∧`nwhere `i are literals andn is at most the number of state variables) Idea split disjunctive formulas when generating search trees

(9)

Unrestricted Regression: Search Tree Example

Unrestricted regression: do not treat disjunctions specially Goalγ =a∧b, initial stateI ={a7→F,b7→F,c 7→F}.

γ=a∧b

¬a∧a

(¬c∨a)∧b (¬c∨a)∧ ¬a

(¬c∨a)∧b

h¬a,bi

hb,¬c Bai h¬a,bi

hb,¬cBai

(10)

Full Splitting: Search Tree Example

Full splitting: always split all disjunctive formulas

Goalγ =a∧b, initial stateI ={a7→F,b7→F,c 7→F}.

(¬c∨a)∧b in DNF: (¬c ∧b)∨(a∧b) split into ¬c∧b and a∧b

γ =a∧b

¬a∧a

¬c∧b (duplicate of γ) a∧b

¬c∧ ¬a

¬c∧b

h¬a,bi

hb,¬c Bai hb,¬c Bai

h¬a,bi hb,¬c Bai

(11)

General Splitting Strategies

Alternatives:

1 Do nothing (unrestricted regression).

2 Always eliminate all disjunctivity (full splitting).

3 Reduce disjunctivity if formula becomes too big.

Discussion:

With unrestricted regression formulas may havesizes that are exponential in the number of state variables.

With full splitting search tree can beexponentially bigger than without splitting.

The third option lies between these two extremes.

(12)

Summary

(13)

Summary

When applying regression in practice, we need to consider emptiness testing to prune dead-end search states subsumption testing to prune dominated search states logical simplificationsand splittingto restrict formula growth

Referenzen

ÄHNLICHE DOKUMENTE

(formulas without disjunctions easily convertible to monomials ` 1 ∧ · · · ∧ ` n where ` i are literals and n is at most the number of state variables) Idea split disjunctive

decision problem analogue of satisficing planning Definition (Bounded-Cost Plan Existence). The bounded-cost plan existence problem ( BCPlanEx ) is the following

I Coming up with good abstractions automatically is the main research challenge when applying abstraction heuristics

D3.1 Projections and Pattern Database Heuristics D3.2 Implementing PDBs: Precomputation.. D3.3 Implementing PDBs: Lookup

Formally, pattern database heuristics are abstraction heuristics induced by a particular class of abstractions called projections.

The objective value of an integer program that minimizes this cost subject to the flow constraints is a lower bound on the plan cost (i.e., an admissible heuristic estimate)..

Performs Bellman backup in each encountered state Uses admissible heuristic for states not updated before.. Motivation Asynchronous VI RTDP

Each decision of MCTS tree policy can be viewed as multi-armed