Learning Bayesian Metanetworks from Data with Multilevel Uncertainty - PowerPoint PPT Presentation

1 / 40
About This Presentation
Title:

Learning Bayesian Metanetworks from Data with Multilevel Uncertainty

Description:

Xk1 : order flowers. Xk2 : order wine. Xr1 : visit football match. Xr2 : visit ... have shown the effectiveness of the proposed models and learning procedures ... – PowerPoint PPT presentation

Number of Views:27
Avg rating:3.0/5.0
Slides: 41
Provided by: vaganterzi3
Category:

less

Transcript and Presenter's Notes

Title: Learning Bayesian Metanetworks from Data with Multilevel Uncertainty


1
Learning Bayesian Metanetworks from Data with
Multilevel Uncertainty
  • Vagan Terziyan, Oleksandra Vitko
  • vagan_at_it.jyu.fi, vitko_at_kture.kharkov.ua
  • University of Jyväskylä ,
  • Kharkov National University of Radioelectronics

AIAI-2004 (WCC 2004), Toulouse, France 24 August
2004
2
Contents
Oleksandra Vitko Department of Artificial
Intelligence Kharkov National University of
Radioelectronics (Ukraine) http//www.cs.jyu.fi/ai
/oleksandra
  • Bayesian Metanetworks
  • Metanetworks for managing conditional
    dependencies
  • Metanetworks for managing feature relevance
  • Learning Bayesian Metanetworks from Data
  • Conclusions

Vagan Terziyan Industrial Ontologies
Group Department of Mathematical Information
Technologies University of Jyvaskyla
(Finland) http//www.cs.jyu.fi/ai/vagan
This presentation http//www.cs.jyu.fi/ai/AIAI-20
04.ppt
3
Bayesian Metanetworks
4
Bayesian Metanetwork
  • Definition. The Bayesian Metanetwork is a set of
    Bayesian networks, which are put on each other in
    such a way that the elements (nodes or
    conditional dependencies) of every previous
    probabilistic network depend on the local
    probability distributions associated with the
    nodes of the next level network.

5
Two-level Bayesian C-Metanetwork for Managing
Conditional Dependencies
6
Contextual and Predictive Attributes
air pressure
dust
humidity
temperature
Machine
emission
Environment
Sensors
X
x5
x6
x7
x2
x3
x4
x1
contextual attributes
predictive attributes
7
Contextual Effect on Conditional Probability (1)
X
x5
x6
x7
x2
x3
x4
x1
contextual attributes
predictive attributes
Assume conditional dependence between predictive
attributes (causal relation between physical
quantities)
xt
some contextual attribute may effect directly
the conditional dependence between predictive
attributes but not the attributes itself
xk
xr
8
Contextual Effect on Conditional Probability (2)
  • X x1, x2, , xn predictive attribute with n
    values
  • Z z1, z2, , zq contextual attribute with q
    values
  • P(YX) p1(YX), p2(YX), , p r(YX)
    conditional dependence attribute (random
    variable) between X and Y with r possible values
  • P(P(YX)Z) conditional dependence between
    attribute Z and attribute P(YX)

9
Contextual Effect on Conditional Probability (3)
Xt1 I am in Paris Xt2 I am in Moscow
xt
P1(Xr Xk ) Xk1 Xk2
Xr1 0.3 0.9
Xr2 0.4 0.5
Xr1 visit football match Xr2 visit
girlfriend
Xk1 order flowers Xk2 order wine
xr
xk
P2(Xr Xk ) Xk1 Xk2
Xr1 0.1 0.2
Xr2 0.8 0.7
Xr Make a visit
Xk Order present
10
Contextual Effect on Conditional Probability (4)
Xt1 I am in Paris Xt2 I am in Moscow
xt
P( P (Xr Xk ) Xt ) Xt1 Xt2
P1(Xr Xk ) 0.7 0.2
P2(Xr Xk ) 0.3 0.8
xr
xk
P1(Xr Xk ) Xk1 Xk2
Xr1 0.3 0.9
Xr2 0.4 0.5
P2(Xr Xk ) Xk1 Xk2
Xr1 0.1 0.2
Xr2 0.8 0.7
11
Contextual Effect on Unconditional Probability (1)
X
x5
x6
x7
x2
x3
x4
x1
contextual attributes
predictive attributes
Assume some predictive attribute is a random
variable with appropriate probability
distribution for its values
xt
P(X)
some contextual attribute may effect directly
the probability distribution of the predictive
attribute
X
x1
x4
x2
x3
xk
12
Contextual Effect on Unconditional Probability (2)
  •   X x1, x2, , xn predictive attribute with
    n values
  •   Z z1, z2, , zq contextual attribute
    with q values and P(Z) probability distribution
    for values of Z
  • P(X) p1(X), p2(X), , pr(X) probability
    distribution attribute for X (random variable)
    with r possible values (different possible
    probability distributions for X) and P(P(X)) is
    probability distribution for values of attribute
    P(X)
  •    P(YX) is a conditional probability
    distribution of Y given X
  •    P(P(X)Z) is a conditional probability
    distribution for attribute P(X) given Z

13
Contextual Effect on Unconditional Probability (3)
P( P (Xk ) Xt ) Xt1 Xt2
P1(Xk ) 0.4 0.9
P2(Xk ) 0.6 0.1
Xt1 I am in Paris Xt2 I am in Moscow
xt
P1(Xk)
P2(Xk)
0.7
0.5
0.3
Xk
Xk
0.2
Xk1
Xk2
Xk1
Xk2
Xk1 order flowers Xk2 order wine
xk
Xk Order present
14
Causal Relation between Conditional Probabilities
xm
xn
P(P(Xn Xm))
P(Xn Xm)
P2(XnXm)
P3(XnXm)
P1(XnXm)
P(P(Xr Xk)P(Xn Xm))
P(P(Xr Xk))
There might be causal relationship between two
pairs of conditional probabilities
P(Xr Xk)
P2(XrXk)
P1(XrXk)
xk
xr
15
Two-level Bayesian C-Metanetwork for managing
conditional dependencies
16
Example of Bayesian C-Metanetwork
The nodes of the 2nd-level network correspond to
the conditional probabilities of the 1st-level
network P(BA) and P(YX). The arc in the
2nd-level network corresponds to the conditional
probability P(P(YX)P(BA))
17
Two-level Bayesian R-Metanetwork for Modelling
Relevant Features Selection
18
Feature relevance modelling (1)
We consider relevance as a probability of
importance of the variable to the inference of
target attribute in the given context. In such
definition relevance inherits all properties of a
probability.
19
Feature relevance modelling (2)
20
General Case of Managing Relevance (1)
Predictive attributes   X1 with values
x11,x12,,x1nx1 X2 with values
x21,x22,,x2nx2 XN with values
xn1,xn2,,xnnxn   Target attribute   Y with
values y1,y2,,yny.   Probabilities P(X1),
P(X2),, P(XN) P(YX1,X2,,XN).   Relevancies ?X
1 P(?(X1) yes) ?X2 P(?(X2)
yes) ?XN P(?(XN) yes) Goal to
estimate P(Y).
21
General Case of Managing Relevance (2)
Probability P(XN)
22
Example of Relevance Bayesian Metanetwork (1)
Conditional relevance !!!
23
Example of Relevance Bayesian Metanetwork (2)
24
Learning Bayesian Metanetworks from Data
25
Learning Task
  • Given training set D of training examples ltX1,
    X2, Xn, Ygt
  • Goal is to restore
  • the set of levels of Bayesian Metanetwork l1,,
    l2,, lL, each level is a Bayesian network
  • the interlevel links for each pair of successive
    levels lr , lr1
  • the network structure and parameters at each
    level, particularly probabilities P(vi) and
    P(viparents(vi)) for each variable vi.

26
Learning Bayesian Metanetwork
  • Use well-known learning methods for learning
    component Bayesian networks on each level of the
    Metanetwork
  • Add procedures for learning interlevel
    relationships for the case of multilevel
    probabilistic Metanetworks

27
Learning Process
Stage 1. Division of attributes on the levels
Stage 2. Learning the network structure
Stage 3. Learning the interlevel links to the
subsequent level
Stage 4. Learning the network parameters
over all levels of Metanetwork
28
Stage 1.
  • Division of attributes among the levels
  • The task of this stage is to divide the input
    vector of attributes ltX1, X2, Xngt into the
    predictive, contextual and perhaps metacontextual
    attributes.

X
x5
x6
x7
x2
x3
x4
x1
contextual attributes
predictive attributes
29
Stage 2.
  • Learning the network structure at the current
    level of Metanetwork
  • can be made by well-known methods with good
    performance
  • (Cheng-Greiner method,
  • KA2 algorithm, etc.)

30
Stage 3.
  • Learning the interlevel links between the current
    and subsequent levels
  • This is a new stage that has been added
    specifically for a Bayesian Metanetwork learning.
  • Differs for the C-Metanetwork and for the
    R-Metanetwork.

31
Learning Interlevel Links in C-Metanetwork
. . .
32
Different probability tables
corresponding to different contexts are
associated with vertexes of the second-level
Bayesian network
33
Context variables in C-Metanetwork
context random variable U
context random variable W
P(WU)
34
Learning Interlevel Links in R-Metanetwork
. . .
35
Different relevancies
corresponding to different contexts are
associated with vertexes of the second-level
Bayesian network
36
Context variables in R-Metanetwork
context random variable U
context random variable W
P(WU)
37
Stage 4.
  • Learning the parameters in the network at the
    current level
  • is made by the standard
  • procedure just taking into
  • account the dynamics of
  • parameters values in
  • different contexts

38
When Bayesian Metanetworks ?
  • Bayesian Metanetwork can be considered as very
    powerful tool in cases where structure (or
    strengths) of causal relationships between
    observed parameters of an object essentially
    depends on context (e.g. external environment
    parameters)
  • Also it can be considered as a useful model for
    such an object, which diagnosis depends on
    different set of observed parameters depending on
    the context.

39
Conclusions
  • The main challenge of this work is the extension
    of the standard Bayesian learning procedures with
    the algorithm of learning the interlevel links
  • The experiments on the data from the
    highly-contextual domain have shown the
    effectiveness of the proposed models and learning
    procedures

40
Read more about Bayesian Metanetworks in
Terziyan V., A Bayesian Metanetwork, In In
International Journal on Artificial Intelligence
Tools, Vol. 14, Ns. 3-4, World Scientific (to
appear).
http//www.cs.jyu.fi/ai/IJAIT-2003.doc
Terziyan V., Vitko O., Bayesian Metanetwork for
Modelling User Preferences in Mobile Environment,
In German Conference on Artificial Intelligence
(KI-2003), Hamburg, Germany, September 15-18,
2003.
http//www.cs.jyu.fi/ai/papers/KI-2003.pdf
Vitko O. The Multilevel Probabilistic Networks
for Modelling Complex Information Systems under
Uncertainty. Ph.D. Thesis, Kharkov National
University of Radioelectronics, 2003.
Write a Comment
User Comments (0)
About PowerShow.com