Title: Learning from Only Positive Examples in Learning By Observation
1Learning from Only Positive Examples in Learning
By Observation
- Tolga Konik
- University of Michigan
2GOAL
- Generate AI agents by observing expert task
execution - Engineering Goal
- Reduce the cost of agent development
- Reduce the expertise required to develop agent
development. - AI Goal
- Agents that improve themselves observing experts
3Learning Framework
Environmental Interface
Environment
Expert
Behavior trace
Annotations
Behavior Recorder
Annotated Behavior trace
Knowledge Generator
Episodic Database
Background Knowledge
rules
Training Set Generator
Concept Learner (ILP)
examples
4Learning with Redux
Redux
Environmental Interface
Environment
Expert
Behavior trace
Annotations
Behavior Recorder
Annotated Behavior trace
Knowledge Generator
Episodic Database
Background Knowledge
rules
Training Set Generator
Concept Learner (ILP)
examples
5Current Experiments
Expert Soar Agent
Environmental Interface
Environment
Expert
Behavior trace
Annotations
Behavior Recorder
Annotated Behavior trace
Knowledge Generator
Episodic Database
Background Knowledge
rules
Training Set Generator
Concept Learner (ILP)
examples
6Learning Framework Mode2
Environmental Interface
Environment
Expert
Behavior trace
New Agent Program
Annotations
Behavior Recorder
external
Internal
Annotated Behavior trace
Knowledge Generator
Episodic Database
Background Knowledge
rules
Training Set Generator
Concept Learner (ILP)
examples
7Experiments in Haunt 2 Domain
8Move-to example
r3
d1
d2
d3
d4
move-to-area
move-to-via-node
move-to-connected-node
9An Example in Haunt Domain
move-to-area(Area)
move-to-via-node(Node)
move-to-connected-node(Node)
10An Example in Haunt Domain
move-to-area(Area)
move-to-via-node(Node)
move-to-connected-node(Node)
11An Example in Haunt Domain
r3
d1
r1
move-to-area(Area)
move-to-via-node(Node)
move-to-connected-node(Node)
- Correct selection condition for move-to-via-node
12(No Transcript)
13Example GenerationOperator Concepts
A
positive
negative
14Example GenerationOperator Concepts
A
B
negative
positive
15Learning Examples
- A Positive Example
- selection(Sit20, move-to-via-node(d1) )
-
16General to Special Search with positive and
negative examples
17General to Special Search with positive and
negative examples
18General to Special Search with positive and
negative examples
19General to Special Search with positive and
negative examples
20General to Special Search with positive and
negative examples
21Problem in Choosing Parameters
- Selection(move-to-via-node)
move-to-via-node
move-to-connected-node
22Problem in Choosing Parameters
- Selection(move-to-via-node)
Negative
Positive
move-to-via-node
move-to-connected-node
23Specific to General Learning with Positive
Examples only
d1
Positive
Difficult to deal with inconsistent examples
24General to Specific Learning with Positive
Examples Only
Positive
25General to Specific Learning with Positive
Examples Only
d1
Positive
26Learning Examples
- A Positive Example of move-to-via-node
-
27Learning Examples
- Random Examples of move-to-via-node
-
- For each positive example, use the same situation
with parameters selected in other situations -
28Nuggets
- Move-to operators are learned in Haunt domain
- 3 mins of trace
- 35000 situations
- 10 min to prepare examples
- 20 min for learning.
29Coals
- Missing Components
- It is still research not a tool