Title: Data Mining Information Extraction Techniques: Principal Component Images
1Data Mining / Information Extraction
TechniquesPrincipal Component Images
- Don Hillger
- NOAA/NESDIS/RAMMT
- CIRA / Colorado State University
- hillger_at_cira.colostate.edu
- 20-21 August 2003
2Principal Component Image (PCI) transformation
of multi-spectral imagery
- Terminology/Definitions
- PCI Principal Component Image a new image
combination - Eigenvectors transformation vectors to create
PCIs from multi-spectral imagery - Eigenvalues explained variances (weights) of
the principal component images
3Why transform imagery?
- To simplify multi-spectral imagery by reducing
redundancy to obtain the independent information - A new set of images that are optimal
combinations of the original spectral-band images
for extracting the variance in the available
imagery - Important image combinations for detection of
atmospheric and surface features in
multi-spectral data
4GOES Imager bands
5General Case
- band(N) ? PCI(N)
- The number of component images resulting from a
PCI transformation is equal to the number of
spectral-band images input. - The sum of the explained variances of the
component images is equal to the sum of the
explained variances of the original images (the
same information content as the original imagery
expressed in a new form)
6General Case
- PCI E _at_ B
- where
- PCI transformed set of N images, at M
horizontal locations (pixels) - E N by N transformation matrix. The rows of E
are the eigenvectors of the symmetric matrix with
elements determined by the covariance of each
band with every other band (summed over M pixels) - B set of imagery from N bands, viewing a scene
at M horizontal locations (pixels)
7Two-dimensional Case
- pci1 e1 _at_ band1 e2 _at_ band2
- pci2 f1 _at_ band1 f2 _at_ band2
- where
- pci1 and pci2 Principal Component Images
(PCIs) - band1 and band2 band images
- e and f linear transformation vectors
(eigenvectors, or rows in the eigenvector matrix
E). - In the two-dimensional case
- pci1 usually contains the information that is
common to the band1 and band2 images - pci2 contains the information that is different
between the band1 and band2 images.
82-dimensional case Montserrat / Soufriere Hills
volcano
2 PCIs
2 bands
92-dimensional case Montserrat / Soufriere Hills
volcano
Comparison to ash-cloud analysis
10GOES 5-band Imager Covariance Matrix
11GOES 5-bandPrincipal Component Matrix
125-band transform (GOES Imager)
135-band transform (GOES Imager)
145-band transform(GOES Imager)
5 bands
5 PCIs
155 bands (GOES Imager)
165 PCIs (GOES Imager)
17Signal-to-Noise(GOES Imager)
? 5 bands
5 PCIs ?
1819-band transform(GOES Sounder)
19 bands
19 PCIs
1919-band transform (GOES Sounder)
2019-band transform (GOES Sounder)
2119 bands (GOES Sounder)
2219 PCIs (GOES Sounder)
23Signal-to-Noise(GOES Sounder)
? 19 bands
19 PCIs ?
24Analysis of MODIS
25Analysis of MODIS
26Northeast UT fog/status 7 Dec 2002 18 UTC
27Northeast UT fog/status 12 Dec 2002 18 UTC
28Principal Component Images of fire hot spots and
smoke
clouds
clouds
smoke
smoke
rings of fire
Arizona fires 21 June 2002 1806 UTC (MODIS)
29Principal Component Images of fire hot spots and
smoke
smoke
rings of fire
smoke
Arizona fires 23 June 2002 1754 UTC (MODIS)
30In conclusionWhy transform imagery?
- To simplify multi-spectral imagery by reducing
redundancy to obtain the independent information - A new set of images that are optimal
combinations of the original spectral-band images
for extracting the variance in the available
imagery - Important image combinations for detection of
atmospheric and surface features in
multi-spectral data