Learning control knowledge and case-based planning

About This Presentation

Title:

Learning control knowledge and case-based planning

Description:

Prodigy. Explicit search control rules can apply to any decision point ... EBL in Prodigy. Used by Minton (88) to improve efficiency of planning ... – PowerPoint PPT presentation

Number of Views:29

Avg rating:3.0/5.0

Slides: 27

Provided by: jimbl

Learn more at: http://www.isi.edu

Category:

more less

Transcript and Presenter's Notes

Title: Learning control knowledge and case-based planning

1
Learning control knowledgeand case-based
planning
Jim Blythe, with additional slides from
presentations by Manuela Veloso
2
Motivation

Planning is hard. PSpace-hard.
BUT.. this is a worst-case result
In many domains there may exist efficient
strategies for planning
May be able to derive them automatically from
experience

3
Controlling search

Every planning algorithm does search
Given a choice point, if makes incorrect choice,
needs to backtrack and try other choices
If we can make the right choice the first time

4
Prodigy

Explicit search control rules can apply to any
decision point
Many different learning approaches have been
implemented
Relatively old planning approach

5
Learning methods in Prodigy
6
Overview of Prodigy planning algorithm
7
(No Transcript)
8
Prodigy algorithm
9
Prodigy algorithm, part II
10
Decision points in Prodigy
11
Example domain process planning
12
Example control rules in Prodigy
13
Review of explanation-based learning
MV

Inputs
Target concept definition
Training example
Domain theory
Operationality criterion
Output
Generalization of the training example that is
Sufficient to describe the target concept, and
Satisfies the operationality criterion

14
The safe-to-stack example
MV

Input
Target concept safe-to-stack(x,y)
Training example
on(obj1, obj2)
isa(obj1, box) isa(obj2, endtable)
color(obj1, red) color(obj2, blue)
volume(obj1, 1) density(obj1, 0.1),

15
The safe-to-stack example, cont.
MV

Input
Domain theory
Not(fragile(y)) or lighter(x, y) gt
safe-to-stack(x,y)
Volume(x,v) and density(x,d) gt weight(x, vd)
Weight(x1, w1) and weight(x2, w2) and less(w1,
w2)
gt lighter(x1, x2)
Isa(x, endtable) gt weight(x, 5)
Less(0.1, 5),
Operationality criterion
Learned description should use terms that
describe objects directly, or are easy to
evaluate, e.g less

16
The safe-to-stack example
MV

Explain why obj1 is safe-to-stack on obj2
Construct a proof
Do goal regression regress target concept
through the proof structure
Proof isolates relevant features

17
Generating operational knowledge
MV

Generalize proof
Sometimes, simply replace constants by variables
Prove that all identified relevant features are
necessary in general
Output
volume(x,v1) and density(x,d1) and isa(y,
endtable)
and less(v1d1, 5)
gt safe-to-stack(x,y)

18
Using EBL to improve plan quality

Given planning domain, evaluation function
planners plan, a better plan
Learn control knowledge to produce the better
plan
Explanation used explain why the alternative
plan is better
Target concept control rules that make choices
based on the planner state and meta-state

19
EBL in Prodigy

Used by Minton (88) to improve efficiency of
planning
Version used in Quality (95) to improve quality
of solution

20
Architecture of Quality system
21
Explaining better plans recursively
22
Explaining better plans recursivelytarget
concept shared subgoal
23
Example from process planning
24
(No Transcript)
25
Learned rules
26
Discussion

EBL is always correct, but Quality isnt only
learns why plan B is better than plan A
No guarantee of optimality
Linear additive evaluation function how well
does this model metrics we care about?
Generality of control rules

Write a Comment

User Comments (0)