Spatial processes and statistical modelling - PowerPoint PPT Presentation

About This Presentation

Title:

Spatial processes and statistical modelling

Description:

profound implications for mathematical formulation and computational tractability ... The classic Ising model (for ferromagnetism) is the symmetric autologistic model ... – PowerPoint PPT presentation

Number of Views:86

Avg rating:3.0/5.0

Slides: 39

Provided by: Peter647

Category:

more less

Transcript and Presenter's Notes

Title: Spatial processes and statistical modelling

1
Spatial processes and statistical modelling

Peter Green
University of Bristol, UK
BCCS GMCSS 2008/09 Lecture 8

2
Spatial indexing

Continuous space
Discrete space
lattice
irregular - general graphs
areally aggregated
Point processes
other object processes

3
Space vs. time

apparently slight difference
profound implications for mathematical
formulation and computational tractability

4
Requirements of particular application domains

agriculture (design)
ecology (sparse point pattern, poor data?)
environmetrics (space/time)
climatology (huge physical models)
epidemiology (multiple indexing)
image analysis (huge size)

5
Key themes

conditional independence
graphical/hierarchical modelling
aggregation
analysing dependence between differently indexed
data
opportunities and obstacles
literal credibility of models
Bayes/non-Bayes distinction blurred

6
Why build spatial dependence into a model?

No more reason to suppose independence in
spatially-indexed data than in a time-series
However, substantive basis for form of spatial
dependent sometimes slight - very often space is
a surrogate for missing covariates that are
correlated with location

7
Discretely indexed data
8
Modelling spatial dependence in
discretely-indexed fields

Direct
Indirect
Hidden Markov models
Hierarchical models

9
Hierarchical models, using DAGs

Variables at several levels - allows modelling of
complex systems, borrowing strength, etc.

10
Modelling with undirected graphs

Directed acyclic graphs are a natural
representation of the way we usually specify a
statistical model - directionally
disease ? symptom
past ? future
parameters ? data
whether or not causality is understood.
But sometimes (e.g. spatial models) there is no
natural direction

11
Conditional independence

In model specification, spatial context often
rules out directional dependence (that would have
been acceptable in time series context)

12
Conditional independence

In model specification, spatial context often
rules out directional dependence

X20
X21
X22
X23
X24
X10
X11
X12
X13
X14
X00
X01
X02
X03
X04
13
Conditional independence

In model specification, spatial context often
rules out directional dependence

X20
X21
X22
X23
X24
X10
X11
X12
X13
X14
X00
X01
X02
X03
X04
14
Directed acyclic graph
a
b
c
in general
d
for example
p(a,b,c,d)p(a)p(b)p(ca,b)p(dc)
In the RHS, any distributions are legal, and
uniquely define joint distribution
15
Undirected (CI) graph
Regular lattice, irregular graph, areal data...
X20
X21
X22
Absence of edge denotes conditional independence
given all other variables
X10
X11
X12
X00
X01
X02
But now there are non-trivial constraints on
conditional distributions
16
Undirected (CI) graph
Suppose we assume
(?)
X20
X21
X22
X10
X11
X12
then
clique
X00
X01
X02
and so
The Hammersley-Clifford theorem says essentially
that the converse is also true - the only sure
way to get a valid joint distribution is to use
(?)
17
Hammersley-Clifford
A positive distribution p(X) is a Markov random
field
X20
X21
X22
X10
X11
X12
if and only if it is a Gibbs distribution
X00
X01
X02
- Sum over cliques C (complete subgraphs)
18
Partition function
Almost always, the constant of proportionality in
X20
X21
X22
X10
X11
X12
is not available in tractable form an obstacle
to likelihood or Bayesian inference about
parameters in the potential functions Physicists
call the partition function
X00
X01
X02
19
Markov properties for undirected graphs

The situation is a bit more complicated than it
is for DAGs. There are 4 kinds of Markovness
P pairwise
Non-adjacent pairs of variables are conditionally
independent given the rest
L local
Conditional only on adjacent variables
(neighbours), each variable is independent of all
others

G global
Any two subsets of variables separated by a third
are conditionally independent given the values of
the third subset.
F factorisation
the joint distribution factorises as a product of
functions of cliques
In general these are different, but F?G?L?P
always. For a positive distribution, they are all
the same.

21
Gaussian Markov random fields spatial
autoregression
If VC(XC) is -?ij(xi-xj)2/2 for Ci,j and 0
otherwise, then
is a multivariate Gaussian distribution, and
is the univariate Gaussian distribution
22
A B C D
Gaussian random fields
A B C D
Inverse of (co)variance matrix dependent case
A
B
C
D
23
Non-Gaussian Markov random fields
Pairwise interaction random fields with less
smooth realisations obtained by replacing squared
differences by a term with smaller tails, e.g.
24
Discrete-valued Markov random fields
Besag (1974) introduced various cases of
for discrete variables, e.g. auto-logistic
(binary variables), auto-Poisson (local
conditionals are Poisson), auto-binomial, etc.
25
Auto-logistic model
(Xi 0 or 1)
- a very useful model for dependent binary
variables (NB various parameterisations)
26
Statistical mechanics models
The classic Ising model (for ferromagnetism) is
the symmetric autologistic model on a square
lattice in 2-D or 3-D. The Potts model is the
generalisation to more than 2 colours
and of course you can usefully un-symmetrise this.
27
Auto-Poisson model
For integrability, ?ij must be ?0, so this
only models negative dependence very limited use.
28
Hierarchical models and hidden Markov processes
29
Chain graphs

If both directed and undirected edges, but no
directed loops
can rearrange to form global DAG with undirected
edges within blocks

30
Chain graphs

If both directed and undirected edges, but no
directed loops
can rearrange to form global DAG with undirected
edges within blocks
Hammersley-Clifford within blocks

31
Hidden Markov random fields

We have a lot of freedom modelling
spatially-dependent continuously-distributed
random fields on regular or irregular graphs
But very little freedom with discretely
distributed variables
? use hidden random fields, continuous or
discrete
compatible with introducing covariates, etc.

32
Hidden Markov models
e.g. Hidden Markov chain
z0
z1
z2
z3
z4
hidden
y1
y2
y3
y4
observed
33
Hidden Markov random fields
Unobserved dependent field
Observed conditionally-independent discrete field
(a chain graph)
34
Spatial epidemiology applications
relative risk
expected cases
cases

independently, for each region i. Options
CAR, CARwhite noise (BYM, 1989)
Direct modelling of ,e.g. SAR
Mixture/allocation/partition models
Covariates, e.g.

35
Spatial epidemiology applications
Spatial contiguity is usually somewhat idealised
36
Spatial epidemiology applications

Richardson Green (JASA, 2002) used a hidden
Markov random field model for disease mapping

observed incidence
relative risk parameters
expected incidence
hidden MRF
37
Chain graph for disease mapping
based on Potts model
38
Larynx cancer in females in France
SMRs

Write a Comment

User Comments (0)