Title: Vectors [and more on masks]
1Vectors and more on masks
- Vector space theory applies directly to several
image processing/representation problems
2Image as a sum of basic images
What if every persons portrait photo could be
expressed as a sum of 20 special images? ? We
would only need 20 numbers to model any photo ?
sparse rep on our Smart card.
3The image as an expansion
4Different bases, different properties revealed
5Fundamental expansion
6Basis gives structural parts
7Vector space defs., part 1
8Vector space defs. Part 2
2
9A space of images in a vector space
- M x N image of real intensity values has
dimension D M x N - Can concatenate all M rows to interpret an image
as a D dimensional 1D vector - The vector space properties apply
- The 2D structure of the image is NOT lost
10Orthonormal basis vectors help
11Represent S 10, 15, 20
12Projection of vector U onto V
13Normalized dot product
Can now think about the angle between two
signals, two faces, two text documents,
14Every 2x2 neighborhood has some constant, some
edge, and some line component
Confirm that basis vectors are orthonormal
15Roberts basis cont.
If a neighborhood N has large dot product with a
basis vector (image), then N is similar to that
basis image.
16Standard 3x3 image basis
Structureless and relatively useless!
17Frie-Chen basis
Confirm that bases vectors are orthonormal
18Structure from Frie-Chen expansion
Expand N using Frie-Chen basis
19Sinusoids provide a good basis
20Sinusoids also model well in images
21Operations using the Fourier basis
22A few properties of 1D sinusoids
They are orthogonal
Are they orthonormal?
23F(x,y) as a sum of sinusoids
24Spatial direction and frequency in 2D
25Continuous 2D Fourier Transform
To compute F(u,v) we do a dot product of our
image f(x,y) with a specific sinusoid with
frequencies u and v
26Power spectrum from FT
27Examples from images
Done with HIPS in 1997
28Descriptions of former spectra
29Discrete Fourier Transform
Do N x N dot products and determine where the
energy is. High energy in parameters u and v
means original image has similarity to those
sinusoids.
30Bandpass filtering
31Convolution of two functions in the spatial
domain is equivalent to pointwise multiplication
in the frequency domain
32LOG or DOG filter
- Laplacian of Gaussian
- Approx
- Difference of Gaussians
33LOG filter properties
34Mathematical model
351D model rotate to create 2D model
361D Gaussian and 1st derivative
372nd derivative then all 3 curves
38Laplacian of Gaussian as 3x3
39G(x,y) Mexican hat filter
40Convolving LOG with region boundary creates a
zero-crossing
Mask h(x,y)
Input f(x,y)
Output f(x,y) h(x,y)
41(No Transcript)
42LOG relates to animal vision
431D EX.
Artificial Neural Network (ANN) for computing
g(x) f(x) h(x) level 1 cells feed 3 level 2
cells level 2 cells integrate 3 level 1 input
cells using weights -1,2,-1
44Experience the Mach band effect
45Simple model of a neuron
46Output conditioning threshold versus smoother
output signal
473D situation in the eye
Neuron c has input to neuron A but - input to
neuron B. Neuron d has input to neuron B but
input to neuron A. Neuron b gives no input to
neuron B it is not in the receptive field of B.
48Receptive fields
49Experiments with cats/monkeys
- Stabilize/drug animal to stare
- Place delicate probe in visual network
- Move step edge across FOV
- Probe shows response function when the edge
images to receptive field - Slightly moving the probe produces similar signal
when edge is nearby
50Canny edge detector uses LOG filter
51Summary of LOG filter
- Convenient filter shape
- Boundaries detected as 0-crossings
- Psychophysical evidence that animal visual
systems might work this way (your testimony) - Physiological evidence that real NNs work as the
ANNs