Title: Computational%20Vision
1Computational Vision
- Jitendra Malik
- University of California, Berkeley
2What is in an image?
The input is just an array of brightness values
humans perceive structure in it.
3From Pixels to Perception
outdoor wildlife
4If visual processing was purely feedforward(it
isnt)
5Boundaries of image regions defined by a number
of attributes
- Brightness/color
- Texture
- Motion
- Binocular disparity
- Familiar configuration
6(No Transcript)
7(No Transcript)
8Grouping is hierarchical
Perceptual organization forms a tree
Image
BG
L-bird
R-bird
bush
far
grass
beak
body
beak
body
head
eye
eye
head
Two segmentations are consistent when they can
be explained by the same segmentation tree
- A,C are refinements of B
- A,C are mutual refinements
- A,B,C represent the same percept
9Humans assign a depth ordering to surfaces across
a contour
- R1 appears in front of R2
- R2 appears in front of R3
This can be done for images of natural scenes
10Figure-Ground Labeling
-
- - red is near blue is far
11Figure/Ground Organization
- A contour belongs to one of the two (but not
both) abutting regions.
Important for the perception of shape
12Some other aspects of perceptual organization
13What do we see here?
14And here?
15Some Pictorial Cues
16Support, Size
2
?
3
?
1
?
17Cast Shadows
18Shading
19Measuring Surface Orientation
20Binocular Stereopsis
21Optical flow for a pilot
22Object Category Recognition
23Shape variation within a category
- DArcy Thompson On Growth and Form, 1917
- studied transformations between shapes of
organisms
24Attneaves Cat (1954)Line drawings convey most
of the information
25Objects are in Scenes
26Human stick figure from single image
Input image
Stick figure
Support masks
27 This is hard
- Variety of poses
- Clothing
- Missing parts
- Small support for parts
- Background clutter
28 Taxonomy and Partonomy
- Taxonomy E.g. Cats are in the order Felidae
which in turn is in the class Mammalia - Recognition can be at multiple levels of
categorization, or be identification at the level
of specific individuals , as in faces. - Partonomy Objects have parts, they have
subparts and so on. The human body contains the
head, which in turn contains the eyes. - These notions apply equally well to scenes and to
activities. - Psychologists have argued that there is a
basic-level at which categorization is fastest
(Eleanor Rosch et al). - In a partonomy each level contributes useful
information for recognition.
29Visual Control of Action
- Locomotion
- Navigation/Way-finding
- Obstacle Avoidance
- Manipulation
- Grasping
- Pick and Place
- Tool use
30(No Transcript)
31Camera Obscura(Reinerus Gemma-Frisius, 1544)
32Camera Obscura(Angelo Sala, 1576-1637)
33(No Transcript)
34(No Transcript)