Title: Clustering of brain tumours through constrained manifold learning using class information
1Clustering of brain tumours through constrained
manifold learning using class information
- Raúl Cruz and Alfredo Vellido
- rcruz, avellido_at_lsi.upc.edu
2Contents
- Introduction
- GTM model
- t-GTM model
- Class-tGTM
- Human brain tumour data
- Experimental results
- Conclusions
- References
3Introduction
- The cluster structure through Model-based
clustering p(X). - In classification, we model the relationship
between the labels C and the data by p(CX). - We aim to discover the cluster structure of the
data while taking into account the available
class information in a semi-supervised approach.
4The Generative Topographic Mapping
- The GTM is a non-linear latent variable model
- It defines a mapping from a latent space to the
data space, generating a probability density
within the latter - The GTM can be used for multivariate data
visualization and clustering - It can be seen as a constrained Mixture of
Distributions or as a probabilistic alternative
to the Self-Organizing Map (SOM).
5GTM an illustration
Image borrowed from Bishop, C.M., Svensén, M.
Williams, C.K.I., (1998) Developments of the
Generative Topographic Mapping. Neurocomputing,
21(1-3)
6GTM the algorithm (1)
- A regular grid of nodes ui is defined in latent
space, with a prior probability
- A regular, fixed grid of non-linear basis
functions provides the non-linear mapping into
data space, defined by - This mapping defines a set of reference vectors
-
- in data space
7GTM the algorithm (2)
- Each of the reference vectors yk forms the centre
of an isotropic Gaussian distribution in data
space - and, therefore, the data p.d.f. for the GTM model
is given by - REMEMBER The GTM can be understood as a
constrained mixture of distributions.
8GTM the algorithm (3)
- Given that the GTM is a parametric probability
density model, it can be fitted to the data. The
corresponding log-likelihood is defined as - The EM algorithm can be used to calculate the
parameters of the model
9The t-GTM model
- What if outliers are present in data?
- The use of Gaussian in standard GTM is likely
to negatively bias the estimation of the
adaptative parameters, distorting the clustering
results. - The GTM was redefined as a constrained
mixture of Student t-distributions the t-GTM
(Vellido, 2006), aiming to increase the
robustness of the model towards outliers.
10The class-tGTM
- Class separability might be improved if the
clustering model accounted for the available
class information. - this can be achieved by modelling the joint
density
11Human brain tumour data
- The data used in this study consist of 98 single
voxel PROBE (PROton Brain Exam system) spectra
acquired in vivo for five viable tumour types
astrocytomas, glioblastomas, metastases,
meningiomas, and oligodendrogliomas (typology
that will be used in this study as class
information) and from cystic regions (associated
to tumours) - A procedure based on Multivariate Bayesian
Variable Selection was used elsewhere (Lee et al
2000), to describe the data set in the form of
six frequency intensities, corresponding to fatty
acids, lactate, a compound-unassigned peak,
glutamine, choline, and taurine-inositol.
12Experimental results (1)
- The differences in class separability between the
models with and without class information were
quantified through the following entropy-like
measure
where
set of tumour types
is the total number of tumour spectra
is the number of tumour spectra assigned to the
kth cluster
is the number of tumour spectra from tumour type
i assigned to cluster k
,
13Experimental results artificial data (2)
- We used a synthetic data set, consisting of 1,200
data points, sampled from four neatly separated
Gaussian distributions with centres located at
and they were subsequently
corrupted by four extra features consisting of
Gaussian noise.
Figure 1. Representation, on the t-GTM
2-dimensional latent space, of the synth data set
described in the main text. The representation is
based on the mean posterior distributions for the
1200 data points. Points belonging to each of the
four original Gaussians (classes) are plotted
using a different symbol. The axes of the plot
are the elements of the latent vector and convey
no meaning by themselves. For that reason, axes
are kept unlabeled. (Left) t-GTM without class
information. (Right) class-t-GTM.
14Experimental results tumour data (3)
Figure 2 Cysts (circles) vs Tumour tissue
Class-tGTM
t-GTM
Figure 3. The brain tumour MRS data set using
the full tumour typology. Symbols Cysts
(circles) astrocytomas (black dots)
glioblastomas (black rhombus) metastases
(five-pointed stars) meningiomas (white
rhombus) oligodendrogliomas (asterisks).
15Experimental results (4)
Table 1. Entropy results for the different models
and data sets analyzed. In all cases, and for the
sake of brevity, results are reported only for
three values of ? (although results for other
values are consistent with the reported ones).
Brain simplified refers to the simplification in
which data are separated into cysts versus tumour
tissue, with the five tumour types reduced to a
single class. Brain refers to the full tumour
typology.
16Conclusions and future work
- Adding class information to a clustering process
has the potential of improving the class
separation provided by the resulting clusters. - We have clustered human brain tumour MRS data
using a constrained generative model, in a
variant known to behave robustly in the presence
of outliers the t-GTM. This model has been
extended to account for class information in a
semi-supervised manner, and experiments carried
out on both synthetic and the MRS data have shown
that class-t-GTM improves class discrimination. - This semi-supervised clustering model might also
be used to visualize new and unlabelled MR
spectra and infer their typology. - An alternative and perhaps more principled way to
estimate the joint density would entail
considering a mixture model with both Gaussian or
t-Student components for the continuous data and
multinomial components for the class information
(binary, categorical)
17References
- 1 M. Prastawa, E. Bullitt, S. Ho, G. Gerig, A
brain tumor segmentation framework based on
outlier detection, Medical Image Analysis 8
(2004), 275-283. - 2 C.M. Bishop, M. Svensén, C.K.I. Williams,
GTM The Generative Topographic Mapping, Neural
Computation 10 (1998), 215-234. - 3 T. Kohonen, Self-organizing Maps, 3rd ed.
Springer-Verlag, Berlin, 2000. - 4 D. Peel, G.J. McLachlan, Robust mixture
modelling using the t distribution, Statistics
and Computing 10 (2000), 339348. - 5 H.X. Wang, Q.B. Zhang, B. Luo, S. Wei, Robust
mixture modelling using multivariate
t-distribution with missing information, Pattern
Recognition Letters 25 (2004) 701710. - 6 A. Vellido, Missing data imputation through
GTM as a mixture of t-distributions. Neural
Networks, In press. - 7 Z. Ghahramani and M.I. Jordan, Supervised
learning from incomplete data via an EM approach,
in J.D. Cowan, G. Tesauro, J. Alspector (eds.)
Advances in Neural Information Processing Systems
6. Morgan-Kaufmann Publishers, San Francisco, CA,
120-127, 1994. - 8 Y. Sun, P. Tino, I. Nabney, Visualization of
incomplete data using class information
constraints, in J. Winkler, M. Niranjan (eds.)
Uncertainty in Geometric Computations, Kluwer
Academic Publishers, The Netherlands, 165-174,
2002. - 9 Y. Huang, P.J.G. Lisboa, W. El-Deredy, Tumour
grading from Magnetic Resonance Spectroscopy A
comparison of feature extraction with variable
selection, Statistics in Medicine 22 (2003),
147-164.
18References
- 10 P.J.G. Lisboa, A. Vellido, H. Wong,
Outstanding issues for clinical decision support
with Neural Networks, in H. Malmgren, M. Borga,
L. Niklasson (eds.), Artificial Neural Networks
in Medicine and Biology, Springer, London, 63-71,
2000. - 11 A. Vellido, P.J.G. Lisboa, Functional
topographic mapping for robust handling of
outliers in brain tumour data, in M. Verleysen
(ed.), Proceedings of the ESANN05, D-Side
Publications, Bruges, 133-138, 2005. - 12 A. Vellido, A., P.J.G. Lisboa, Handling
outliers in brain tumour MRS data analysis
through robust topographic mapping, Computers in
Biology and Medicine, 36(10), 1049-1063. - 13 A. Vellido, A., P.J.G. Lisboa, K. Meehan,
The Generative Topographic Mapping as a
principled model for data visualization and
market segmentation an electronic commerce case
study, International Journal of Computers,
Systems, and Signals 1 (2000), 119-138. - 14 Y.Y.B. Lee, Y. Huang, W. El-Deredy, P.J.G.
Lisboa, C. Arus, P. Harris, Robust methodology
for the discrimination of brain tumours from in
vivo magnetic resonance spectra, in IEE
Proceedings of the 1st International Conference
on Advances in Medical Signal and Information
Processing, 88-95, 2000. - 15 C.M. Bishop, M. Svensén, C.K.I. Williams,
Developments of the Generative Topographic
Mapping, Neurocomputing 21 (1998), 203-224.