Title: Mid level vision, neglected yet still important
1Mid level vision, neglected yet still important
- Ken Nakayama
- Harvard University
221st C challenge
- Existence and variation of occlusion and variable
sources of illumination pose unsolved problems
for vision
3- Object representation needs an intermediate level
format - Low level vision alone is not even explanatory
for wide range of visual processes (motion,
stereo, search) - Missing -- a satisfactory scientific description
of surface level vision -
41970s Visual take-over of the whole brain
1950s
5Macaque monkey brain flattened
Visual regionsshown in color
6Global division ofthe visual system
dorsal
(where, how)
ventral
what
7action
objectrecognition
8action
objectrecognition
9Kanizsa
Phenomenology, reviving the Gestalt approach
Level surfaces Method phenomenology Practition
er Gaetano Kanizsa
new concepts amodal and modal completion
10modal competion (in front)
Amodal competion (behind)
11Inferences, but at what level ?
12(No Transcript)
13Suggests that thereis a completion process
within thevisual system
Amodal completion trumps knowledge of horses
14Amodal completion allows fragments to be grouped
and thus recognized (strongest evidence)
spot the 5 letter Bs
From Bregman, 1990
15Occlusion and the problem of segmentation for
object recognition
What belongstogether ?
Border ownership issues - for 3-D scenes,
borderscannot be shared. Border dispute needs
resolution
Rule - border belongs to the closest surface
16Problem of segmentationKaniszas figure
Normal or amputee ?
17Border ownership dictatedby lines
preventsmodal and amodal completion
18New sources of evidence
Stereoscopic disparity
Surface in front owns the border. Thus face
on right is broken up, on left is OK
Nakayama et al.Perception 89 - faces easier to
recognize on left
19Stereoscopic depth also determines
borderownership between regions. Nearer surface
will ownthe border (for opaque surfaces)
Nakayama Shimojo stereo demonstrations
20Image level cant even explain much lower level
vision
- Deployment of attention, motion perception,
texture, visual search
21(No Transcript)
22Surfaces needed for much lower visual function
23He and Nakayama search task
Used stereo vision
Nature (1992)
24(No Transcript)
25Random dot stereogram
The correspondence problem an image based problem
26(No Transcript)
27occluding surfaces
What gives rise to unpaired points?
What would happen if we presented unpaired
points by themselves?
28DaVinci stereopsis
(Nakayama Shimojo)
29Scene depth from unpaired gaps
Gillam and Nakayama, 1999
30Forest vs plane
A plane is a surface Which can occlude, a set of
random sticks cannot
31Planes vs sticks
Gillam and Nakayama, 200
32Level of processinghigh or low level inference?
Hypothesis - inferences learned via associative
cortical learning
33generic view principle
when faced with more than one surface
interpretation of an image, the visual system
assumes it is viewing the scene from a generic,
not accidental, vantage point.
34Some counterintuitive observations
folded wings?
LE
RE
Why dont we interpolate depth and see folded
wings and cards?
35(No Transcript)
36Accidental vs generic vantage points
37- accidental view
- generic view
38(No Transcript)
39generic view principle
- when faced with more than one surface
interpretation of an image, the visual system
assumes it is viewing the scene from a generic,
not accidental, vantage point.
40(No Transcript)
41Some counterintuitive observations
folded wings?
LE
RE
Why dont we interpolate depth and see folded
wings and cards?
42(No Transcript)
43(No Transcript)
44neural mechanisms ofsurface representation ?
- Cells in V2 respond to subjective contours
Strategy vary stimuli in ways that lead
to Appearance and disappearance of subjective
contours
45Physiological correlates of illusory contours in
singleunit recordings
Recordings from a single cell in area V2 of monkey
46Bakin, Nakayama, and Gilbert, 2000
47Cells coding Border ownership?
Edgar Rubin figure and ground
Von der Heydt Et al.
Edge labeling?
contrast polarity vs edge labeling
48image based response
49Border ownership based response
50Border ownership cells
Von der Heydt and colleagues
51Von der Heydt (1984)
V2
Bakin, Nakayama, Gilbert (2000)
V2
DaVinci stereopsis
Border ownership cells (V2)
Mechanistic account of surface representation? --gt
5221st C challenge
- Existence and variation of occlusion and variable
sources of illumination pose unsolved problems
for vision
53- Object representation needs an intermediate level
format - Low level vision alone is not even explanatory
for wide range of visual processes (motion,
stereo, search) - Missing -- a satisfactory scientific description
of surface level vision - -- demos the importance for illumination for
object recognition
54importance of shadow processing
Ted Adelson
55Shadow processing in object recognition
56Outline is very destructive to seeing regionsas
shaded. Line is interpreted as a
bounding Contour of an object
57(No Transcript)