Title: MAS and MASRL Framework A OO Framework for Building Software Agents that Learn through Reinforcement
1MAS and MAS-RL Framework A OO Framework for
Building Software Agents that Learn through
Reinforcement Learning
- José A. R. P. Sardinha, Ruy L. Milidiú, Carlos J.
P. Lucena, Patrick Paranhos - TecComm Group Software Engineering Laboratory
- PUC Rio, Brazil
2Main Goals
- Build Applications using a Muti-Agent System in a
Distributed Environment - Diminish Development time
- Code Reuse
- Extend the MAS Framework
- Learning through Reinforcement Learning
- Competition Environments
- Games
- Auctions
- E-commerce
3Motivations
- The OO Framework was built to reuse code for the
following Applications - VGroups Sardinha - 2001
- Agents that build Consumer Groups
- Babilônia Clark 2002, Sardinha 2003
- Agents that perform Automatic Negotiation
- TIC-TAC-TOE Sardinha 2003
- Agents that learn to play
- Trading Agent Competition Melcop - 2003
- Agents to participate in the TAC Competition
- Gaia Methodology
- Models for the analysis and design phase
4Trading Agent Competition
5Introduction
- Agent Definition
- An autonomous entity driven by
- beliefs, goals, capabilities, plans
- and a number of behavioral properties, such as
- autonomy, adaptation, interaction, learning,
mobility, etc. - MAS Definition
- An Environment with
- Agents
- Objects
6Introduction
- Agent Type
- Cognitive Agents and Reactive Agents
- Relationship between Agents and Objects
- Some flaws related to design and implementation
of Agents - The most practical programming language to
implement the agent technology - Large Scale Multi-Agent System
- System with many Agents
- Complex Properties
- Solve Complex Problems
- Distributed Problem Solving
7MAS and MAS-RL Framework
8MAS and MAS-RL Framework
- MAS Framework
- An easy mapping from the models built in the Gaia
Methodology to OO Code - Interaction Protocols and Services are divided in
the OO Code - MAS-RL Framework
- Extension of the MAS Framework 3 classes were
added to introduce Reinforcement Learning in
Competition Environment
9Gaia Methodology
10MAS Framework
11MAS Framework
- Hot Spots
- Agent, AgentMessage, AgentBlackboardInfo,
InteractionProtocols, AgentInterface - Frozen Spots
- AgentCommunicationLayer, ProcessMessageThread
12MAS-RL extensions to MAS
13How to Instantiate the MAS Framework
14How to Instantiate the MAS-RL Framework
15Design Phase Decisions
- How to Model the Decision Tree ?
- Information of the Decision Points
- Decisions
- Size of the Tree
- Which Evaluation Function to Use ?
- Machine Learning Technique
- Percpetron, BackPropagation, PLS, etc.
- Which Reinforcement Learning Mechanism to Use?
- TD Learning, Adaptive Dynamic Programming
16Design Phase Decisions
- Reinforcement Learning Management
- Critical Agents
- Learning Goals
- Training Competitors
- Training Rounds Schedule
- Current Ad hoc solution
- Future To be conceptually added to the
Framework
17Reinforcement Learning Adjustments and Testing
Phase
- Reinforcement Learning
- Reinforcement Learning Parameter adjustments
- Testing
- Benchmarks
18Trading Agent Competition
19Trading Agent Competition
20The MAS
Demand Segmentation
Captures Auction Information
Classifies Customers
Sensors
Allocation
Market Knowledge Base
Supervisor
Market
Results of the Solver
Obtains Segmentation Results
Classifies the Solvers Results
Negotiates the Goods
Sends Buying Orders
Solver
Offering Segmentation
Buying Orders
Negotiator
21Decision Tree Modeling
- D1,D2,D3 -gt
- Sale(t?t) max(Sale(t), askPrice(t)?.
?askPrice), - where
- ? 1n
- ?askPrice askPrice(t) askPrice(t-?t)
- R1,R2,R3 -gt
- askPrice askPrice(t) ?. ?askPrice
- where
- ? 1n
- ?askPrice askPrice(t) askPrice(t-?t)
Negotiator Decision
D1
D2
D3
Market Response
R1
R2
R3
22Evaluation Function and Reinforcement Learning
- Evaluation Function Perceptron
- 8 inputs for the ?askPrice
- 8 inputs for the Sale(t)askPrice(t)
- Reinforcement Rule (Temporal Difference Learning)
- S(t- ?t) S(t) ß.(reward(t-?t)
(S(t)-S(t?t))) - S(t) Expected Output
- ß Reinforcement Learning rate
- reward(t) 0 or 1
23Reinforcement Learning Management
- Critical Agents
- Negotiator Agent
- Learning Goals
- Buy the Goods in the Buying Order
- Training Competitors
- Dummies, LivingAgentLike
- Training Rounds Schedule
- MAS and 7 Dummies
- MAS, LivingAgentLike, and 6 Dummies
- MAS, 2 LivingAgentLike and Dummies
- MAS, and 6 LivingAgentLike
24The Negotiator Agent Class Diagram
25Testing Phase
- Evolution of the MAS versions
26Final Comments
- A framework conceived from applications
- Re-use code and reduce development time
- A framework in evolution
- Complex Phase
- Reinforcement Learning Management
- Testing phase
27Future Work
- Add the Reinforcement Learning Management to
MAS-RL - Add classes and a Tool
- Instantiate the framework with different
Evaluation Functions - PLS Regression Technique
- Add TAC MAS Agents to the RL Management
- Offering Segmentation Agent
28Questions?