Title: Vectors and more on masks
1Vectors and more on masks
- Vector space theory applies directly to several
image processing/representation problems
2Image as a sum of basic images
What if every persons portrait photo could be
expressed as a sum of 20 special images? ? We
would only need 20 numbers to model any photo ?
sparse rep on our Smart card.
3Efaces
100 x 100 images of faces are approximated by a
subspace of only 4 100 x 100 images, the mean
image plus a linear combination of the 3 most
important eigenimages
4The image as an expansion
5Different bases, different properties revealed
6Fundamental expansion
7Basis gives structural parts
8Vector space review, part 1
9Vector space review, Part 2
2
10A space of images in a vector space
- M x N image of real intensity values has
dimension D M x N - Can concatenate all M rows to interpret an image
as a D dimensional 1D vector - The vector space properties apply
- The 2D structure of the image is NOT lost
11Orthonormal basis vectors help
12Represent S 10, 15, 20
13Projection of vector U onto V
14Normalized dot product
Can now think about the angle between two
signals, two faces, two text documents,
15Every 2x2 neighborhood has some constant, some
edge, and some line component
Confirm that basis vectors are orthonormal
16Roberts basis cont.
If a neighborhood N has large dot product with a
basis vector (image), then N is similar to that
basis image.
17Standard 3x3 image basis
Structureless and relatively useless!
18Frie-Chen basis
Confirm that bases vectors are orthonormal
19Structure from Frie-Chen expansion
Expand N using Frie-Chen basis
20Sinusoids provide a good basis
21Sinusoids also model well in images
22Operations using the Fourier basis
23A few properties of 1D sinusoids
They are orthogonal
Are they orthonormal?
24F(x,y) as a sum of sinusoids
25Spatial direction and frequency in 2D
26Continuous 2D Fourier Transform
To compute F(u,v) we do a dot product of our
image f(x,y) with a specific sinusoid with
frequencies u and v
27Power spectrum from FT
28Examples from images
Done with HIPS in 1997
29Descriptions of former spectra
30Discrete Fourier Transform
Do N x N dot products and determine where the
energy is. High energy in parameters u and v
means original image has similarity to those
sinusoids.
31Bandpass filtering
32Convolution of two functions in the spatial
domain is equivalent to pointwise multiplication
in the frequency domain
33LOG or DOG filter
- Laplacian of Gaussian
- Approx
- Difference of Gaussians
34LOG filter properties
35Mathematical model
361D model rotate to create 2D model
371D Gaussian and 1st derivative
382nd derivative then all 3 curves
39Laplacian of Gaussian as 3x3
40G(x,y) Mexican hat filter
41Convolving LOG with region boundary creates a
zero-crossing
Mask h(x,y)
Input f(x,y)
Output f(x,y) h(x,y)
42(No Transcript)
43LOG relates to animal vision
441D EX.
Artificial Neural Network (ANN) for computing
g(x) f(x) h(x) level 1 cells feed 3 level 2
cells level 2 cells integrate 3 level 1 input
cells using weights -1,2,-1
45Experience the Mach band effect
46Simple model of a neuron
47Output conditioning threshold versus smoother
output signal
483D situation in the eye
Neuron c has input to neuron A but - input to
neuron B. Neuron d has input to neuron B but
input to neuron A. Neuron b gives no input to
neuron B it is not in the receptive field of B.
49Receptive fields
50Experiments with cats/monkeys
- Stabilize/drug animal to stare
- Place delicate probe in visual network
- Move step edge across FOV
- Probe shows response function when the edge
images to receptive field - Slightly moving the probe produces similar signal
when edge is nearby
51Canny edge detector uses LOG filter
52Summary of LOG filter
- Convenient filter shape
- Boundaries detected as 0-crossings
- Psychophysical evidence that animal visual
systems might work this way (your testimony) - Physiological evidence that real NNs work as the
ANNs