Title: Learning control knowledge and case-based planning
1Learning control knowledgeand case-based
planning
Jim Blythe, with additional slides from
presentations by Manuela Veloso
2Motivation
- Planning is hard. PSpace-hard.
- BUT.. this is a worst-case result
- In many domains there may exist efficient
strategies for planning - May be able to derive them automatically from
experience
3Controlling search
- Every planning algorithm does search
- Given a choice point, if makes incorrect choice,
needs to backtrack and try other choices - If we can make the right choice the first time
4Prodigy
- Explicit search control rules can apply to any
decision point - Many different learning approaches have been
implemented - Relatively old planning approach
5Learning methods in Prodigy
6Overview of Prodigy planning algorithm
7(No Transcript)
8Prodigy algorithm
9Prodigy algorithm, part II
10Decision points in Prodigy
11Example domain process planning
12Example control rules in Prodigy
13Review of explanation-based learning
MV
- Inputs
- Target concept definition
- Training example
- Domain theory
- Operationality criterion
- Output
- Generalization of the training example that is
- Sufficient to describe the target concept, and
- Satisfies the operationality criterion
14The safe-to-stack example
MV
- Input
- Target concept safe-to-stack(x,y)
- Training example
- on(obj1, obj2)
- isa(obj1, box) isa(obj2, endtable)
- color(obj1, red) color(obj2, blue)
- volume(obj1, 1) density(obj1, 0.1),
15The safe-to-stack example, cont.
MV
- Input
- Domain theory
- Not(fragile(y)) or lighter(x, y) gt
safe-to-stack(x,y) - Volume(x,v) and density(x,d) gt weight(x, vd)
- Weight(x1, w1) and weight(x2, w2) and less(w1,
w2) - gt lighter(x1, x2)
- Isa(x, endtable) gt weight(x, 5)
- Less(0.1, 5),
- Operationality criterion
- Learned description should use terms that
describe objects directly, or are easy to
evaluate, e.g less
16The safe-to-stack example
MV
- Explain why obj1 is safe-to-stack on obj2
- Construct a proof
- Do goal regression regress target concept
through the proof structure - Proof isolates relevant features
17Generating operational knowledge
MV
- Generalize proof
- Sometimes, simply replace constants by variables
- Prove that all identified relevant features are
necessary in general - Output
- volume(x,v1) and density(x,d1) and isa(y,
endtable) - and less(v1d1, 5)
- gt safe-to-stack(x,y)
18Using EBL to improve plan quality
- Given planning domain, evaluation function
- planners plan, a better plan
- Learn control knowledge to produce the better
plan - Explanation used explain why the alternative
plan is better - Target concept control rules that make choices
based on the planner state and meta-state
19EBL in Prodigy
- Used by Minton (88) to improve efficiency of
planning - Version used in Quality (95) to improve quality
of solution
20Architecture of Quality system
21Explaining better plans recursively
22Explaining better plans recursivelytarget
concept shared subgoal
23Example from process planning
24(No Transcript)
25Learned rules
26Discussion
- EBL is always correct, but Quality isnt only
learns why plan B is better than plan A - No guarantee of optimality
- Linear additive evaluation function how well
does this model metrics we care about? - Generality of control rules