Expressive Graphical Models in Variational Approximations: Chain-Graphs and Hidden Variables - PowerPoint PPT Presentation

1 / 22
About This Presentation
Title:

Expressive Graphical Models in Variational Approximations: Chain-Graphs and Hidden Variables

Description:

Expressive Graphical Models in Variational Approximations: Chain-Graphs and Hidden Variables Tal El-Hay & Nir Friedman School of Computer Science & Engineering – PowerPoint PPT presentation

Number of Views:130
Avg rating:3.0/5.0
Slides: 23
Provided by: tale154
Category:

less

Transcript and Presenter's Notes

Title: Expressive Graphical Models in Variational Approximations: Chain-Graphs and Hidden Variables


1
Expressive Graphical Models in Variational
Approximations Chain-Graphs and Hidden Variables
  • Tal El-Hay Nir Friedman
  • School of Computer Science EngineeringHebrew
    University

2
Inference in Graphical Models
  • Exact Inference
  • NP-hard, in general
  • Can be efficient for certain classes
  • What do we do when exact inference is
    intractable?
  • Resort to approximate methods
  • Approximate inference is also NP-hard
  • But, specific approximation methods work for
    specific classes of models
  • ? Need to enrich approximate methods

3
Variational Approximations
  • Approximate the posterior of a complex model
    using a simpler distribution
  • Choice of a simpler model ? method Mean field,
    Structured approximations, and Mixture models

4
Variational Approximations
  • Approximate the posterior of a complex model
    using a simpler distribution
  • Choice of a simpler model ? method Mean field,
    Structured approximations, and Mixture models

5
Variational Approximations
  • Approximate the posterior of a complex model
    using a simpler distribution
  • Choice of a simpler model ? method Mean field,
    Structured approximations, and Mixture models

6
Variational Approximations
  • Approximate the posterior of a complex model
    using a simpler distribution
  • Choice of a simpler model ? method Mean field,
    Structured approximations, and Mixture models

7
Enhancing Variational Approximations
  • Basic tradeoff
  • accuracy ? complexity
  • Goal
  • New families of approximating distributions
  • ?better tradeoff

8
Outline
  • Structured variational approximations review
  • Using chain-graphs
  • Adding hidden variables
  • Discussion

9
Structured Approximations
10
Structured Approximations
  • Goal Maximize the following functional
  • ? FQ is a lower bound on the log likelihood
  • If Q is tractable then FQ might be tractable

11
Structured Approximations
  • To characterize the maximum point we definethe
    generalized functional
  • Differentiation yields the following equation
  • ? approximates using
    the lower bound on the local distribution

12
Structured Approximations
  • Optimization
  • Asynchronous updates guaranties convergence
  • Efficient calculation of the update formulas

13
Chain Graph Approximations
  • Posterior distributions can be modeled as chain
    graphs

14
Chain Graph Approximations
  • Chain graph distributionswhere are
    potential functions on subsets of T
  • Generalize both Bayesian networks and Markov
    networks
  • A simple approximation example

15
Chain Graph Approximations
  • Optimization
  • where

16
Adding Hidden Variables
  • Potential pitfall Multi-modal distributions
  • Jaakkola Jordan Use mixture models
  • Modeling assumption Factorized mixture
    components
  • GeneralizationStructured approximation with an
    extra set of hidden variables
  • Approximating distribution

17
Adding Hidden Variables Intuition
  • Lower bound improvement potentialwhere I(TV)
    is the mutual information
  • Capture correlations in a compact manner

18
Adding Hidden Variables Prospects
  • Lower bound improvement potentialwhere I(TV)
    is the mutual information
  • Describing correlations in a compact manner

19
Relaxing the lower bound
  • Rewriting the lower bound on the
    log-likelihoodwhere
  • The conditional entropy does not decompose
  • ? The lower bound is intractable

20
Relaxing the lower bound
  • Using the following convexity bound
  • Introducing extra variational parameters
  • The relaxed lower bound becomes tractable

21
Optimization
  • Bayesian network parameters
  • Smoothing parameters
  • Asynchronous updates guaranties convergence

22
Results
KL Bound
Number of time slices
Number of time slices
23
Discussion
  • Extending representational features of
    approximating distributions ?Better tradeoff ?
  • Addition of hidden variables improves
    approximation
  • Derivations of different methods use a uniform
    machinery
  • Future directions
  • Saving computations by planning the order of
    updates
  • Structure of the approximating distribution
Write a Comment
User Comments (0)
About PowerShow.com