Title: Simple Instances of Swendsen-Wang
Simple Instances of Swendsen-Wang and RJMCMC
- Daniel Eaton
- CPSC 540
- December 13, 2005
A Pedagogical Project (1/1)
- Two algorithms implemented
- 1. SW for image denoising
- 2. RJMCMC for model selection in a regression problem (in real-time!)
- Similar to Frank Dellaert's demo at ICCV 2005
Swendsen-Wang for Denoising (1/8)
[Figure: original image, version with zero-mean noise added, thresholded result]
Swendsen-Wang for Denoising (2/8)
- Model
- Ising prior
- Smoothness: a priori we expect neighbouring pixels to be the same
- Likelihood (a sketch of both terms follows)
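A minimal sketch of the two model terms, assuming binary labels x_i in {-1, +1}, an Ising coupling J > 0, and independent Gaussian pixel noise with variance sigma^2; the exact parameterisation used in the project may differ:

```latex
% Ising prior: neighbouring pixels are encouraged to agree
p(x) \propto \exp\Big( J \sum_{(i,j) \in E} \mathbf{1}[x_i = x_j] \Big)

% Likelihood: each observed pixel y_i is the true label plus zero-mean noise
p(y \mid x) = \prod_i \mathcal{N}\big(y_i \,;\, x_i, \sigma^2\big)
```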
Swendsen-Wang for Denoising (3/8)
- Sampling from the posterior
- Easy using the Hastings algorithm
- Propose to flip the value at a single pixel (i, j)
- Accept with the usual Hastings acceptance probability (a sketch follows this list)
- The ratio has a simple expression involving the number of disagreeing edges before/after the flip
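A minimal sketch of one such single-site flip, assuming the binary-label Ising prior and Gaussian likelihood sketched above; the array names (x, y), the coupling J, and the noise level sigma are illustrative, not taken from the original code:

```python
import numpy as np

def single_site_hastings_step(x, y, J, sigma, rng):
    """One single-site Metropolis-Hastings flip for Ising-prior denoising.
    Assumes labels x in {-1, +1}, observed image y, coupling J, noise std sigma."""
    H, W = x.shape
    i, j = rng.integers(H), rng.integers(W)    # site chosen uniformly at random
    x_old, x_new = x[i, j], -x[i, j]           # proposed flip

    # Prior term: J times the change in the number of agreeing neighbour edges
    neighbours = []
    if i > 0:
        neighbours.append(x[i - 1, j])
    if i < H - 1:
        neighbours.append(x[i + 1, j])
    if j > 0:
        neighbours.append(x[i, j - 1])
    if j < W - 1:
        neighbours.append(x[i, j + 1])
    agree_old = sum(n == x_old for n in neighbours)
    agree_new = sum(n == x_new for n in neighbours)
    log_prior_ratio = J * (agree_new - agree_old)

    # Likelihood term: Gaussian noise centred on the label value
    log_lik_ratio = ((y[i, j] - x_old) ** 2 - (y[i, j] - x_new) ** 2) / (2 * sigma ** 2)

    # The flip proposal is symmetric, so the Hastings ratio is just the posterior ratio
    if np.log(rng.random()) < log_prior_ratio + log_lik_ratio:
        x[i, j] = x_new
    return x
```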
Swendsen-Wang for Denoising (4/8)
- Problem: convergence is slow
Many steps are needed to fill this hole, since update locations are chosen uniformly at random!
Swendsen-Wang for Denoising (5/8)
- SW to the rescue
- Flip whole chunks of the image at once
- BUT: make this a reversible Metropolis-Hastings step so that we are still sampling from the right posterior (a sketch follows)
[Figure: one SW step]
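A minimal sketch of one cluster move in this spirit, written as a single-cluster (Wolff-style) variant of SW so that the reversible MH structure is explicit: bonds between agreeing neighbours are switched on with probability 1 - exp(-J), the cluster containing a random seed is grown, and the proposed flip is accepted with just the likelihood ratio (for a binary model the prior and cluster-growing proposal terms cancel). Names and the exact variant are assumptions, not necessarily the project's implementation:

```python
import numpy as np

def sw_cluster_flip_step(x, y, J, sigma, rng):
    """One Swendsen-Wang style cluster flip as a reversible MH move.
    Assumes the binary Ising prior / Gaussian likelihood sketched earlier."""
    H, W = x.shape
    p_bond = 1.0 - np.exp(-J)          # bond probability between agreeing neighbours

    # Grow the cluster containing a uniformly chosen seed pixel
    seed = (int(rng.integers(H)), int(rng.integers(W)))
    label = x[seed]
    cluster, frontier = {seed}, [seed]
    while frontier:
        i, j = frontier.pop()
        for ni, nj in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1)):
            if (0 <= ni < H and 0 <= nj < W and (ni, nj) not in cluster
                    and x[ni, nj] == label and rng.random() < p_bond):
                cluster.add((ni, nj))
                frontier.append((ni, nj))

    # Propose flipping the whole cluster; accept with the likelihood ratio only,
    # since the prior and proposal terms cancel for binary labels
    new_label = -label
    log_lik_ratio = sum(
        ((y[i, j] - label) ** 2 - (y[i, j] - new_label) ** 2) / (2 * sigma ** 2)
        for (i, j) in cluster
    )
    if np.log(rng.random()) < log_lik_ratio:
        for (i, j) in cluster:
            x[i, j] = new_label
    return x
```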
Swendsen-Wang for Denoising (6/8)
[Figure: Hastings vs. SW]
Swendsen-Wang for Denoising (7/8)
- Demo
- Hastings vs. SW
- Hastings is allowed to iterate more often than SW to account for the difference in computational cost
Swendsen-Wang for Denoising (8/8)
- Conclusion: SW is ill-suited for this task
- Discriminative edge probabilities are very important to convergence
- Makes large steps at the start (if initialization is uniform) but slows near convergence (in the presence of small disconnected regions); ultimately, it becomes a single-site update algorithm
- Extra parameter to tune/anneal
- Does what it claims to: energy is minimized faster than with Hastings alone
RJMCMC for Regression (1/5)
- Data randomly generated from a line on [-1, 1] with zero-mean Gaussian noise
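A tiny sketch of how such data might be generated; the sample size, true slope/intercept and noise level below are illustrative values, not the ones used in the demo:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50                                    # illustrative sample size
x = rng.uniform(-1.0, 1.0, size=n)        # inputs on [-1, 1]
slope, intercept, sigma = 1.0, 0.5, 0.2   # illustrative true parameters
y = slope * x + intercept + rng.normal(0.0, sigma, size=n)
```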
RJMCMC for Regression (2/5)
- Allow two models to compete at explaining this data (uniform prior over models)
- 1. Linear (parameters: slope, y-intercept)
- 2. Constant (parameter: offset)
RJMCMC for Regression (3/5)
- Heavy-handedly solve a simple model selection problem with RJMCMC
- Recall: ensuring reversibility is one (convenient) way of constructing a Markov chain that has the posterior we want to sample from as its invariant distribution
- RJMCMC/TDMCMC is just a formalism for ensuring reversibility for proposals that jump between models of varying dimension (constant: 1 param., linear: 2 params.); the general acceptance probability is sketched below
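For reference, the general reversible-jump acceptance probability (Green, 1995) in standard notation; u and u' are the auxiliary variables used to match dimensions, and the slides' own notation may differ:

```latex
% Move from (k, \theta_k) to (k', \theta_{k'}) via (\theta_{k'}, u') = T(\theta_k, u)
\alpha = \min\left\{ 1,\;
  \frac{p(y \mid k', \theta_{k'})\, p(\theta_{k'} \mid k')\, p(k')\, q(k \mid k')\, g'(u')}
       {p(y \mid k,  \theta_{k})\,  p(\theta_{k}  \mid k)\,  p(k)\,  q(k' \mid k)\, g(u)}
  \left| \frac{\partial(\theta_{k'}, u')}{\partial(\theta_k, u)} \right|
\right\}
```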
RJMCMC for Regression (4/5)
- Given initial starting conditions (model type, parameters), the chain has two types of moves
- 1. Parameter update (probability 0.6)
- Within the current model, propose new parameters (Gaussian proposal) and accept with the ordinary Hastings ratio
- 2. Model change (probability 0.4)
- Propose a model swap
- If going from constant -> linear, uniformly sample a new slope
- Accept with the special RJ ratio (see me/my tutorial for details); a sketch of the full chain follows this list
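A minimal sketch of the whole chain under some explicit assumptions: a known noise level, uniform parameter priors on bounded ranges, and a slope proposal equal to the slope prior, so the RJ ratio for the model-change move collapses to a likelihood ratio (the Jacobian is 1). All names and numerical values are illustrative, not taken from the original demo:

```python
import numpy as np

SIGMA = 0.2          # assumed known observation noise std
SLOPE_RANGE = 2.0    # slope prior: Uniform(-SLOPE_RANGE, SLOPE_RANGE)
OFFSET_RANGE = 2.0   # offset / y-intercept prior: Uniform(-OFFSET_RANGE, OFFSET_RANGE)

def log_lik(model, params, x, y):
    """Gaussian log-likelihood under the constant or linear model."""
    mean = params[0] if model == "const" else params[0] * x + params[1]
    return -0.5 * np.sum((y - mean) ** 2) / SIGMA ** 2

def in_support(model, params):
    """True if the parameters lie inside their uniform prior ranges."""
    if model == "const":
        return abs(params[0]) <= OFFSET_RANGE
    return abs(params[0]) <= SLOPE_RANGE and abs(params[1]) <= OFFSET_RANGE

def rjmcmc(x, y, n_iter=20000, rng=None):
    rng = rng or np.random.default_rng()
    model, params = "const", np.array([0.0])          # arbitrary initial state
    samples = []
    for _ in range(n_iter):
        if rng.random() < 0.6:
            # 1. Parameter update: Gaussian random-walk proposal within the
            #    current model, accepted with the ordinary Metropolis ratio
            #    (flat priors contribute nothing inside their support).
            prop = params + rng.normal(0.0, 0.1, size=params.shape)
            log_r = log_lik(model, prop, x, y) - log_lik(model, params, x, y)
            if in_support(model, prop) and np.log(rng.random()) < log_r:
                params = prop
        else:
            # 2. Model change: keep the offset, and when jumping up draw a new
            #    slope from its prior.  With uniform model priors and the slope
            #    proposal equal to the slope prior, the RJ acceptance ratio
            #    reduces to the likelihood ratio (the Jacobian is 1).
            if model == "const":
                new_model = "lin"
                new_params = np.array([rng.uniform(-SLOPE_RANGE, SLOPE_RANGE), params[0]])
            else:
                new_model = "const"
                new_params = np.array([params[1]])
            log_r = log_lik(new_model, new_params, x, y) - log_lik(model, params, x, y)
            if np.log(rng.random()) < log_r:
                model, params = new_model, new_params
        samples.append((model, params.copy()))
    return samples
```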
RJMCMC for Regression (5/5)
- Model selection: approximate the marginal likelihood of each model by the number of samples drawn from it
- If a model is better at explaining the data, the chain will spend more time using it
- Demo
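With uniform model priors, the posterior model probabilities can be read straight off the chain's visit counts; a sketch assuming the rjmcmc and data-generation snippets above:

```python
samples = rjmcmc(x, y)
burn_in = len(samples) // 10                      # discard an initial fraction
visits = [m for m, _ in samples[burn_in:]]
p_linear = visits.count("lin") / len(visits)
print(f"P(linear | data) ~ {p_linear:.3f}, P(constant | data) ~ {1 - p_linear:.3f}")
```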
Questions?
- Thanks for listening
- I'll be happy to answer any questions over a beer later tonight!