Title: Planning as Satisfiability
1Planning as Satisfiability
2Outline
0. Overview of Planning 1. Modeling and Solving
Planning Problems as SAT - SATPLAN 2. Improved
Encodings using Graph Analysis - BLACKBOX 3.
Improved Encodings using Compiled Control
Knowledge
3Overview of Planning
- Find a sequence of operators that transform an
initial state to a goal state - State complete truth assignment to a set of
variables (fluents) - Goal partial truth assignment (set of states)
- Action a partial function State State
- specified by three sets of variables preconditio
n, add list, delete list
4Abdundance of Negative Complexity Results
- I. Domain-independent planning PSPACE-complete
- (Chapman 1987 Bylander 1991 Backstrom 1993)
- II. Domain-dependent planning NP-complete
- (Chenoweth 1991 Gupta and Nau 1992)
- III. Approximate planning NP-complete
- (Selman 1994)
5Planning as Inference
- Planning as first-order theorem proving (Green
1969) - computationally infeasible
- STRIPS (Fikes Nilsson 1971)
- very hard
- Partial-order planning (modal truth criteria)
(Tate 1977, Chapman 1985, McAllester 1991, Smith
Peot 1993) - can be more efficient, but still hard (Minton,
Bresina, Drummond 1994) - SATPLAN planning as propositional reasoning
6Part 1 Modeling and Solving Planning Problems
as SAT
7SAT Encodings
- Planning Problem -gt Propositional CNF by axiom
schemas - Discrete time, modeled by integers
- state predicates indexed by time at which they
hold - action predicates indexed by time at which
action begins - each action takes 1 time step
- many actions may occur at the same step
8Encoding Conventions
- Actions imply preconditions and effects
- fly(x,y,i) ? at(x,i) route(x,y) at(y,i1)
- Conflicting actions cannot occur at same time (A
deletes a precondition of B) - fly(x,y,i) y?z ? ?fly(x,z,i)
- If something changes, an action must have caused
it (Explanatory Frame Axioms) - at(x,i) ?at(x,i1) ? ? y . route(x,y)
fly(x,y,i) - Initial and final states hold
- at(NY,0) ... at(LA,9) ...
9Modeling Tricks
- Can often dramatically reduce size of problem by
modeling techniques - move(x,y,z,i) requires n4 vars
- pickup(x,y,i), putdown(x,z,i) requires 2n3 vars
- State-based encodings eliminate all action
variables (compile away) - at(x,i) ? at(x,i1) ? ? y . route(x,y)
at(y,i1) - at(x,i) x?y ? ?at(y,i)
10Solution to a Planning Problem
- A solution is specified by any model (satisfying
truth assignment) of the conjunction of the
axioms describing the initial state, goal state,
and operators - Easy to convert back to a STRIPS-style plan
11SATPLAN
instantiated propositional clauses
instantiate
axiom schemas
problem description
length
mapping
SAT engine(s)
interpret
satisfying model
plan
12SAT Algorithms
- Systematic Search
- DP (Davis Putnam Logemann Loveland)backtrack
search unit propagation - satz (Chu Min Li) - variable selection by forward
checking max unit props - relsat (Bayardo) - dependency directed
backtracking add new clauses at dead-ends - Local Search
- Inspired by Mins-Conflict algorithm (Adorf,
Johnson, Minton, Philips, Laird) - GSAT (Selman), Walksat (Selman, Kautz
Cohen)greedy local search noise to escape
minima
13Planning Benchmark Test Set
- Extension of Graphplan test set
- blocks world - up to 18 blocks, 1019 states
- logistics - complex, highly-parallel
transportation domain. - Logistics.d
- 2,165 possible actions per time slot
- 1016 legal configurations (22000 states)
- optimal solution contains 74 distinct actions
over 14 time slots - Problems of this size never previously handled by
state-space planning systems
14Scaling Up Logistics Planning
10000
1000
100
Graphplan
DP
log solution time
10
DP/Satz
Walksat
1
0.1
0.01
log.d
log.b
log.a
log.c
rocket.a
rocket.b
15Randomized Restarts
- Solution randomize the systematic solver
- Add noise to the heuristic branching (variable
choice) function - Cutoff and restart search after a fixed number of
backtracks - In practice rapid restarts with low cutoff can
dramatically improve performance - (Gomes 1996, Gomes, Kautz, and Selman 1997,
1998)
16Increased Predictability
10000
1000
100
Satz
log solution time
10
Satz/Rand
1
0.1
0.01
log.d
log.b
log.a
log.c
rocket.a
rocket.b
17What SATPLAN Shows
- General propositional theorem provers can compete
with state of the art specialized planning
systems - New, highly tuned variations of DP surprising
powerful - result of sharing ideas and code in large SAT/CSP
research community - specialized engines can catch up, but by then new
general techniques - Radically new stochastic approaches to SAT can
provide very low exponential scaling - 2 orders magnitude speedup on hard benchmark
problems
18Why SATPLAN Works
- More flexible than forward or backward chaining
- Systematic most unit propagation at most highly
constrained states - Stochastic iterative repair
- Randomized algorithms less likely to get trapped
along bad paths
19Part 2 Improved Encodings by Graph Analysis
The BLACKBOX Planner
20Graphplan
- Planning as graph search (Blum Furst 1995)
- Set new paradigm for planning
- Like SATPLAN...
- Two phases instantiation of propositional
structure, followed by search - Unlike SATPLAN...
- Interleaves instantiation and pruning of plan
graph - Employs specialized search engine
- Graphplan - better instantiation
- SATPLAN - better search
21Graph Pruning
- Graphplan instantiates in a forward direction,
pruning unreachable nodes - conflicting actions are mutex
- if all actions that add two facts are mutex, the
facts are mutex - if the preconditions for an action are mutex, the
action is unreachable! - In logical terms limited application of negative
binary propagation - given ? P V ? Q, P V R V S V ...
- infer ? Q V R V S V ...
22The Plan Graph
Facts
Facts
Actions
Facts
Facts
Actions
...
...
...
...
mutually exclusive
preconditions
add effects
delete effects
23Translation of Plan Graph
Act1
Pre1
Fact
Pre2
Act2
Fact ? Act1 ? Act2 Act1 ? Pre1 ? Pre2 Act1 ?
Act2
24General Limited Inference
- Generated wff can be further simplified by
consistency propagation techniques - Compact (Crawford Auton 1996)
- unit propagation is Wff inconsistant by
resolution against unit clauses? - O(n)
- failed literal rule is Wff P inconsistant
by unit propagation? - O(n2)
- binary failed literal rule is Wff P V Q
inconsistant by unit propagation? - O(n3)
- Complements domain specific limited inference
- Discovers hidden local structure!
25General Limited Inference
26Blackbox
Plan Graph
Mutex computation
STRIPS
Translator
CNF
Simplifier
General Stochastic / Systematic SAT engines
Solution
CNF
27Blackbox Results
28Applicability
- When is the BlackBox approach not a good idea?
- when domain too large for propositional planning
approaches - when long sequential plans are needed
- when solution time dominated by reachability
analysis (plan-graph generation), not extraction - when optimal or near optimal planning not
necessary
29Part 3 Improved Encodings Compiling Control
Knowledge
30Kinds of Control Knowledge
- About domain itself
- a truck is only in one location
- About good plans
- do not remove a package from its destination
location - About how to search
- plan air routes before land routes
31Expressing Knowledge
- Such information is traditionally incorporated in
the planning algorithm itself - or in a special programming language
- Instead use additional declarative axioms
- (Bacchus 1995 Kautz 1998 Chen, Kautz, Selman
1999) - Problem instance operator axioms initial and
goal axioms control axioms - Control knowledge constraints on search and
solution spaces - Independent of any search engine strategy
32Axiomatic Control Knowledge
- State Invariant A truck is at only one location
- at(truck,loc1,i) loc1 ¹ loc2 É Ø
at(truck,loc2,i) - Optimality Do not return a package to a location
- at(pkg,loc,i) Ø at(pkg,loc,i1) iltj É Ø
at(pkg,loc,j) - Simplifying Assumption Once a truck is loaded,
it should immediately move - Ø in(pkg,truck,i) in(pkg,truck,i1)
at(truck,loc,i1) É Ø at(truck,loc,i2)
33Adding Control Kx to SATPLAN
Problem Specification Axioms
Control Knowledge Axioms
Instantiated Clauses
As control knowledge increases, Core shrinks!
SAT Simplifier
SAT Core
SAT Engine
34Tradeoffs of Control Knowledge
- If the planning domain is inherently intractable,
how can any amount of control knowledge make
planning tractable? - by reducing solution quality
- optimal planning - NP-Hard
- non-optimal - (maybe) Polynomial
- Issue speed / quality tradeoff
- Case study Control Knowledge in TLPLAN and
BlackBox - TLPLAN (Bacchus 1996) simple forward-chaining
search with strong control rules
35TLPlan
Temporal Logic Control Formula
36Temporal Logic for Control
- I. Rules involves only static information
- II. Rules depends on the current state
- III. Rules depends on the current state and
requires dynamic user-defined predicates
37Category I Control Rules
Goal
Initial
a
a
a
SFO
ORL
NYC
Do NOT unload an object from an airplane unless
the object is at its goal destination
38Pruning the Planning GraphCategory I Rules
Facts
Facts
Actions
Facts
Facts
Actions
...
...
...
...
39Effect of Graph Pruning
40Category II Control Rules
a
SFO
ORL
NYC
Do NOT move an airplane if there is an object in
the airplane that needs to be unloaded at that
location.
41Control by Adding Constraints
Control Rules
Planning Formula
Constraints Clauses
42Blackbox with Control Knowledge(Logistics domain)
43 Comparison between Blackbox and TLPlan(Parallel
Plan Length)
44Comparison between Blackbox and TLPlan(Running
Time)
45Comparison
- TLPlan (without Control)
- Intractable.
- TLPlan (with Control)
- fastest, but limited parallelism
- Blackbox (without Control)
- slower, high parallelism
- Blackbox (with Control)
- faster, high parallelism
46Summary
- Easy to encode domain-specific knowledge in the
planning as satisfiablity frame - Key to order-of-magnitude scaling
- Propositional logic, temporal logic, ...
- Can be applied before/after SAT encoding
- Can control time / quality tradeoff
- Power of underlying SAT engines gives option of
finding higher quality solutions - Heuristics are independent from the SAT engine
- Can use same axioms for radically different
problem solvers
47How to Generate Control Kx
- Introspection
- Try to capture obvious inferences that are hard
to deduce - EBL (Minton, Kambhampati)
- Generalize trace of previous problem solving
- Static analysis (Smith, Etzioni, Knoblock, Peot)
- Analyze operators
- Inductive Logic Programming (Huang, Selman,
Kautz) - Find rules that hold for a set of previous
high-quality solution plans
48Conclusions
- Propositional approaches to Open-Loop planning
using general SAT engines are highly competitive
with specialized planning algorithms - Synergy with Plan Graph approaches
- Can effectively employ purely declarative control
knowledge - Biggest limitation domains where number of
objects is too large to instantiate