SelfOrganisation in Agent Organisations - PowerPoint PPT Presentation

1 / 25
About This Presentation
Title:

SelfOrganisation in Agent Organisations

Description:

solution to the problem of maintaining large, complex ... continuous = keep improving = optimising. Related Work. Self-organisation by reactive agents ... – PowerPoint PPT presentation

Number of Views:35
Avg rating:3.0/5.0
Slides: 26
Provided by: usersEcs
Category:

less

Transcript and Presenter's Notes

Title: SelfOrganisation in Agent Organisations


1
Self-Organisation in Agent Organisations
  • Ramachandra Kota, Nicholas Gibbins
  • and Nicholas R. Jennings
  • School of Electronics and Computer Science
  • University of Southampton, U.K.

2
Motivation
  • Autonomic systems
  • computing systems with self-management
  • solution to the problem of maintaining large,
    complex computing systems? (Kephart and Chess,
    2003)?
  • Self-organising multi-agent systems
  • autonomous
  • adaptive (continuous over time)
  • decentralised and robust
  • a paradigm to develop autonomic systems (Tesauro
    et al., 2004)?

3
Research Objective
  • For problem-solving agent organisations with
  • co-operative agents working together
  • receive, allocate and execute dynamic stream of
    tasks
  • representing a distributed computing system
  • Develop a decentralised structural adaptation
    method
  • local interactions gt used by every agent gt
    robust
  • continuous gt keep improving gt optimising

4
Related Work
  • Self-organisation by reactive agents
  • Centralised mechanisms requiring specialised
    agents
  • Organisation self-design (OSD) based spawning and
    merging agents
  • Adaptation in agent networks (contd.)

5
More Related Work
  • Adaptation in agent networks
  • Does not work for models with multiple kinds of
    links (organisational relations)
  • The links of an agent do not affect its own
    performance
  • (no organisational load)
  • Ignores computational load incurred for
    adaptation
  • (meta-reasoning)
  • Not designed to handle systems where agents join
    and leave over time (dynamic organisations)

6
Organisation Model
  • A problem-solving agent ax
  • provides a set of services Sx
  • has a limited computational capacity Lx
  • when assigned a service instance (SI) sij,
    either-
  • allocates the SI to another agent, or
  • executes the SI and then allocates the subsequent
    dependent SIs

si1 s1, p1
root SI
Example of a task
si3 s0, p3
si0 s0, p0
si2 s2, p2
si4 s3, p4
leaf SIs
7
Organisation Model Structure
  • The relation between two agents determines-
  • the knowledge held about each other
  • the interactions permitted between them
  • Three levels of relationships-
  • Acquaintance no interaction
  • Peer low frequency of interaction
  • Authority (superior-subordinate) high frequency
    of interaction

aX
az
aY
aW
Example of an organisation structure
8
Organisation at work an example
aX
si1 s1, p1
root SI
si3 s0, p3
az
aY
si0 s0, p0
si2 s2, p2
si4 s3, p4
leaf SIs
aW
Task
Organisation
9
Organisation Model Performance Evaluation
  • costORG depends on communication messages passed
  • Agent capacity Lx consumed by load lx as-
  • lx ? pi M. ? mj,x R.rx
  • load execution allocation reorganisation
  • load due to allocation (mj,x) depends on
    structure
  • rewardORG depends on speed of task completion
  • profitORG rewardORG - costORG

sii ? WxE
sij ? WxF
10
Adaptation example scenario
aX
si1 s1, p1
root SI
az
si3 s0, p3
aY
si0 s0, p0
si2 s2, p2
si4 s3, p4
leaf SIs
aW
Task
Organisation
11
Adaptation state transitions
12
Adaptation Method
  • Formulated using the decision theoretic approach
  • Changing the relation denoted as actions

Peers
Subordinate
Subordinate
Peers
Subordinate
Just acquaintances
Just acquaintances
  • Pairs of agents jointly estimate the expected
    utility (to the organisation) of changing their
    relation
  • Calculated for every possible action from the
    state and the action with best expected utility
    chosen

13
Expected Utility Calculation
  • Expected utility calculation based on the history
    of interactions and delegations involving the two
    agents
  • Using a Value function of the form
  • Vx,y ?loadx ?loady ?loadOA ?costcomm
    ?costreorg

14
Attribute values form_subr(x,y) action
  • ?Loadx - M Asgx,total filledx(ttotal) /
    ttotal
  • M management coefficient
  • Asgx,total total number of SIs allocated by ax
  • ttotal total number of time-steps of existence
    of ax
  • filledx(ttotal) number of time-steps within
    ttotal that axs capacity was filled with load

x
x
x
x
x
The attribute values are calculated on basis of
past interactions and delegations involving the
two agents
15
Meta-Reasoning
  • Reasoning about reorganisation with another agent
    needs computation (lx ...... R.rx)
  • R computation required for reasoning with
    another agent
  • rx number of acquaintances that ax reasoned with
  • ax should chose k acquaintances for reasoning
    in a time-step
  • what is the value of k?
  • which k acquaintances?

16
Meta-Reasoning contd.
  • Reduced to coupon collectors problem
  • Randomised approach
  • E (X) n ln(n) O(n)
  • can chose the k acquaintances randomly
  • Value of k decided at a time-step t as
  • kt max1, (Lx - lx)/R, acqtsxchangedx,t-1/kt-1

17
Dynamic Organisations
  • Agents join and leave the organisation over time
  • Forming relation with a new agent
  • no history of interactions gt no data for
    expected utility
  • explore versus exploit trade-off

18
Dynamic Organisations contd.
  • Same principle as in WOLF (Win Or Learn Fast)
  • agent with unused capacity gt winning gt ignores
    new agents
  • agents with waiting queue of SIs gt losing gt
    actively seek subordinates
  • searches amongst the new incoming agents for
    those providing in-demand services and forms a
    superior-subordinate relation

19
Empirical Evaluation
  • Simulation Parameters
  • Heterogeneity of the agents
  • Similarity of the tasks
  • Comparison Methods
  • Central an omniscient external centralised
    allocator performs the allocations ? maximum
    profit obtained
  • Random agents randomly reorganise their relations

20
Results Static Orgs
  • free-Adapt no reorganisation load (R 0)
  • all-Adapt k is always equal to total
    acquaintances

21
Results Static Orgs
Experiments by varying the reorganisation
coefficient R
22
Results Dynamic Orgs
  • free-Adapt no reorganisation load (R 0)
  • nowolf-Adapt no WOLF component in the algorithm

23
Results Dynamic Orgs
A sample scenario
24
Future Work
  • Organisations where the agents internal
    characteristics change over time
  • Environments where the kind of similarity present
    within tasks changes over time
  • Change proactively when the distribution of
    future tasks is known
  • Explore a principled Reinforcement Learning
    approach for adaptation

25
Thank you!!
  • ??
Write a Comment
User Comments (0)
About PowerShow.com