Title: Visual Perception in Humans and Machines
1Visual Perception in Humans and Machines
- Kostas Daniilidis
- Assistant Professor
- GRASP Lab
- University of Pennsylvania
2Examples
- How do we (humans) recognize faces ? Make a
machine find President Clintons face in the web
3(No Transcript)
4An interdisciplinary definition
- Computer Vision is devoted to the discovery of
algorithms, representations, and architectures
that embody the principles of visual
capabilities. - What are visual capabilities?
- Recognizing objects and faces
- Estimating shapes and distances
- Moving, grasping
5Relation to other fields
- Computer Vision is inspired from Biological
Vision (Phenomenology and Models in Psychophysics
and in Neurobiology) but does not try to imitate
the nature's architecture or algorithms. - Biological Vision and Psychophysics may find
computational models discovered in Computer
Vision useful for explaining nature.
6Target problem in computer vision
- Compute properties of the 3D world from one or
more digital images - These properties may be
- dynamic (observer and object motion)
- geometric (distances, object shape)
- enabling recognition
- The result may be an action (grasp an object,
avoid an obstacle)
7What is an image ?
- A gray-value image is just a set of numbers
(usually from 0 to 255)
8An image is a set of numbers
- 175 189 190 188 199 197 196 193
- 181 189 191 194 198 196 191 179
- 189 191 197 198 200 195 173 129
- 192 194 198 200 194 161 116 116
- 198 200 200 190 152 113 116 119
- 201 202 185 135 105 103 114 119
- 205 180 121 89 104 101 109 114
- 177 105 88 90 100 103 101 105
9An image is a surface I(x,y)
10Basic image processing operations
- Smoothing and Noise Removal
11Blur removal
12Edge detection
13(Sub) sampling
- Shannon Theorem Sampling frequency must be
greater than the maximal frequency in the image
(therefore smooth before subsample)
14Brightness perception
15How do we perceive distances?
- Perspective distortion in texture, contour,
shading, and a-priori knowledge - Stereopsis (what most people believe)
- Motion
16The Eye as a Pinhole CameraPerspective
Projection
Z
X
u X/Z
17Ames Illusion
18Perspective Illusions
19Quiz
- From which points in space is a rectangle viewed
as a square (more difficult an ellipse viewed as
a circle ?
- Be careful The center of
- the ellipse in the image
- is not the projection of the
- center of the ellipse on the floor!
20The power of vanishing points
- Perspective projection preserves
- cross-ratio AC/AD BC/BD is the same on the
street and in the image. If A is a vanishing
point AC/AD 1. -
We measure A,B,C,D in pixels in the image and
form cross ratio for image and for the street.
BC is computed from the equality of the two
ratios.
A
B
C
D
21Stereopsis
Infer depth from the disparity between the
positions of the same feature in left and right
image
22Stereo Reconstruction
23Stereo-disparity estimation
- Search at every pixel for candidates of maximum
correlation between left and right - Estimate 3D-coordinates of point
24The power of visual motion
- Most of the animals have monocular vision (left
and right visual fields do not overlap) - 8 of the population can not see stereo
- Stereopsis is limited to a very short depth of
field (10m).
25(No Transcript)
26Kinetic depth effect(moving dots)
27Self- and object-motion
28Motion artifacts
29Structure from Motion
- Given a sequence of images
- find
- 1. Ego-motion
- 2. 3D-structure
- 3. Independent motions
- applying only the assumption of
- rigidity.
30Motion Field and Heading Direction
31Depth map
32Temporal aliasing
- Wagon Wheel Illusion A wheel with a periodic
radial pattern is perceived to move backwards
depending on the relation between the speed, the
radius of the wheel, and the period of the
pattern (www.cstr.ed.ac.uk/rjc/wagonWheel)
33Aperture problem
- Inside a small aperture displaying a small line
we can estimate only the motion direction
perpendicular to the line.
34My wife and my mother-in-law
- The role of the
- focus of attention