1
Vision Strategies and Tools You Can Use
  • RSS II Lecture 5
  • September 16, 2005
  • Prof. Teller

2
Today
  • Strategies
  • What do we want from a vision system?
  • Tools
  • Pinhole camera model
  • Vision algorithms
  • Development
  • Carmen Module APIs, brainstorming

3
Vision System Capabilities
  • Material identification
  • Are there bricks in vicinity? If so, where?
  • Motion freedom
  • What local motion freedom does robot have?
  • Manipulation support
  • Is brick pose amenable to grasping/placement?
  • Is robot pose correct for grasping/placement?
  • Localization
  • Where is robot, with respect to provided map?
  • Which way to home base? To a new region?
  • Has the robot been in this region before?

4
Material Identification
  • Detecting bricks when they're present
  • How?
  • Locate bricks
  • In which coordinate system?
  • Estimate range and bearing: how?

5
Gauge distance from apparent size?
  • Yes; under what assumption?

6
Pinhole Camera Model (physical)
[Figure: world point P = (0, y, z) projects through the pinhole at the world origin O onto the image plane z = -1 inside the camera enclosure, giving the inverted image point p = (0, -y/z). World coordinates: (x, y, z); image-plane coordinates: (u, v).]
Notes: the diagram is drawn in the plane x = 0; the image-space u-axis points out of the diagram; the world-space x-axis points out of the diagram; both coordinate systems are left-handed.
7
Pinhole Camera Model (virtual)
(Virtual image plane placed 1 unit in front of the pinhole; no inversion.)
[Figure: world points P1 = (0, y1, z1) and P2 = (0, y2, z2) on the same ray through the pinhole O (at z = 0) both project to the image point p = (0, y/z) on the virtual image plane z = 1.]
(All points along the ray Op project to the same image point p!)
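A minimal numerical sketch of the projection above (Python with numpy; all values hypothetical): dividing a camera-frame point by its depth lands it on the virtual image plane z = 1, and every point along the ray Op gives the same answer.

    import numpy as np

    def project_pinhole(P):
        """Project camera-frame point P = (x, y, z) onto the
        virtual image plane z = 1 by dividing out depth."""
        x, y, z = P
        assert z > 0, "point must be in front of the pinhole"
        return np.array([x / z, y / z])

    P1 = np.array([0.0, 2.0, 4.0])
    P2 = 2.5 * P1                    # another point on the ray O-P1
    print(project_pinhole(P1))       # [0.  0.5]
    print(project_pinhole(P2))       # [0.  0.5] -- same image point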
8
Perspective: Apparent Size
  • Apparent object size decreases with depth (perpendicular distance from the camera image plane)

9
Perspective: Apparent Size
10
What assumptions yield depth?
11
Ground plane assumption
  • Requires additional metric information
  • Think of this as a constraint on camera and world structure
  • Plane in scene, with two independent marked
    lengths
  • Can measure distance to, or size of, objects on
    the plane
  • …but where do the marked lengths come from?

[Figure: ground plane receding from the camera, with marked distances of 1 m, 2 m, 3 m, and 4 m.]
12
Camera Calibration
  • Maps 3D world points wP to 2D image points Ip
  • Map can be factored into two operations
  • Extrinsic (rigid-body) calibration (situates
    camera in world)
  • Intrinsic calibration (warps rays through optics
    onto image)

[Figure: world frame (wO; wX, wY, wZ), camera frame (cO; cX, cY, cZ), and a width × height pixel image plane with origin IO and principal point (u0, v0); the extrinsic map A carries the world point wP into the camera frame as cP, and the intrinsic map K carries it to the image point Ip.]
World coordinates (arbitrary choice); camera coordinates (e.g., cm); image coordinates (pixels).
A(3×4) = (R(3×3) | t(3×1))
Ip = K (1/cZ) A wP = K (1/cZ) (R | t) wP
Ip(3×1) = K(3×3) (1/cZ) A(3×4) wP(4×1)
13
World-to-Camera Transform
  • Relabels world-space points w.r.t. camera body
  • Extrinsic (rigid-body) calibration (situates
    camera in world)

[Figure: rigid-body map A carries the world point wP (world frame wO; wX, wY, wZ) into the camera frame (cO; cX, cY, cZ) as cP on the virtual image plane cZ = 1.]
World coordinates (arbitrary choice); camera coordinates (e.g., cm).
A(3×4) = (R(3×3) | t(3×1))
cP = (1/cZ) (R | t) wP
cP(3×1) = (1/cZ) A(3×4) wP(4×1)
Note the effect of the division by cZ: no scaling necessary!
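A sketch of the extrinsic step alone (numpy; R, t, and the world point are hypothetical). Normalizing by cZ places the result on the virtual plane cZ = 1 with no extra scaling, as the note says.

    import numpy as np

    R = np.eye(3)                          # world-to-camera rotation
    t = np.array([0.0, 0.0, 2.0])          # world-to-camera translation
    A = np.hstack([R, t.reshape(3, 1)])    # A(3x4) = (R | t)

    wP = np.array([1.0, -0.5, 2.0, 1.0])   # homogeneous world point
    cP = A @ wP                            # camera-frame point; cZ = 4 here
    print(cP / cP[2])                      # [ 0.25 -0.125 1. ] on plane cZ = 1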
14
Camera-to-Image Transform
  • Maps 2D camera points to 2D image plane
  • Models ray path through camera optics and body to
    CCD

[Figure: camera-frame point cP on the virtual plane cZ = 1 is mapped by K to the image point Ip in a width × height pixel image with origin IO and principal point (u0, v0).]
Camera coordinates (e.g., cm); image coordinates (pixels).
Ip = K cP
Ip(3×1) = K(3×3) cP(3×1)
Matrix K captures the camera's intrinsic parameters:
  • a, b: horizontal and vertical scale factors (equal iff pixel elements are square)
  • u0, v0: principal point, i.e., the point at which the optical axis pierces the image plane
  • c: image-element (CCD) skew, usually 0

             ( a  c  u0 )
    K(3×3) = ( 0  b  v0 )
             ( 0  0   1 )
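As a sketch, the same K assembled from those parameters and applied to a normalized camera-frame point (numbers are made up; square pixels, so a = b):

    import numpy as np

    def make_K(a, b, u0, v0, c=0.0):
        """Intrinsic matrix: scale factors a, b; principal point
        (u0, v0); CCD skew c, usually 0."""
        return np.array([[a,   c,   u0],
                         [0.0, b,   v0],
                         [0.0, 0.0, 1.0]])

    K = make_K(a=500.0, b=500.0, u0=320.0, v0=240.0)
    cP = np.array([0.1, -0.2, 1.0])   # camera point already on plane cZ = 1
    print(K @ cP)                     # [370. 140.   1.] -> pixel (u, v)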
15
End-to-End Transformation
[Figure: end-to-end composition; the extrinsic map A carries the world point wP into the camera frame, division by cZ places it on the virtual plane cZ = 1, and the intrinsic map K carries it to the pixel point Ip near the principal point (u0, v0).]
World coordinates (arbitrary choice); camera coordinates (e.g., cm); image coordinates (pixels).
A(3×4) = (R(3×3) | t(3×1))
Ip = K (1/cZ) A wP = K (1/cZ) (R | t) wP
Ip(3×1) = K(3×3) (1/cZ) A(3×4) wP(4×1)
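Chaining the two stages gives the end-to-end map; a sketch with the same hypothetical K, R, t as above:

    import numpy as np

    def world_to_pixels(wP, K, R, t):
        """Ip = K (1/cZ) (R | t) wP, with wP a homogeneous 4-vector."""
        A = np.hstack([R, t.reshape(3, 1)])   # extrinsic: world -> camera
        cP = A @ wP
        return K @ (cP / cP[2])               # intrinsic after dividing by cZ

    K = np.array([[500.0,   0.0, 320.0],
                  [  0.0, 500.0, 240.0],
                  [  0.0,   0.0,   1.0]])
    Ip = world_to_pixels(np.array([1.0, -0.5, 2.0, 1.0]),
                         K, np.eye(3), np.array([0.0, 0.0, 2.0]))
    print(Ip[:2])                             # pixel coordinates (u, v)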
16
Example: Metric Ground Plane
  • Make camera frame and world frame coincident. Thus R = I(3×3), t = 0(3×1), A(3×4) = (R | t) as before
  • Lay out a tape measure on the line x = 0, y = -h
  • Mark off points at (e.g.) 50-cm intervals
  • What is the functional form of the map u = f(wx, wy, wz)?

Ip = K (1/cZ) A wP = K (1/cZ) (I | 0) wP
For an image point Ip = (u, v, 1) and a tape mark at (0, -h, cZ):
Ip = K (1/cZ) (0, -h, cZ)^T = K (0, -h/cZ, 1)^T = (u0, -bh/cZ + v0, 1)^T
Measure h; observe (cZi, vi) repeatedly; solve for u0, b, v0.
[Figure: camera at height h above the ground plane y = -h, virtual image plane at z = 1, tape marks at z1 = 2, z2 = 3, z3 = 4.]
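Since v = v0 - b·h/cZ is linear in the unknowns (b, v0), the repeated observations can be solved by least squares; u0 is simply the mean of the observed u's, because the tape lies on x = 0. A sketch with synthetic, hypothetical measurements:

    import numpy as np

    h = 0.4                                    # measured camera height (m)
    z = np.array([2.0, 3.0, 4.0, 5.0])         # depths cZi of the tape marks
    v_obs = np.array([140.0, 173.3, 190.0, 200.0])   # observed pixel rows vi

    # v = v0 - b*h/z  ->  rows [-h/z, 1] times (b, v0)^T equal v
    M = np.column_stack([-h / z, np.ones_like(z)])
    (b, v0), *_ = np.linalg.lstsq(M, v_obs, rcond=None)
    print(b, v0)    # recovers roughly b = 500, v0 = 240 for this synthetic data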
17
Vision System Capabilities
  • Material identification
  • Are there bricks in vicinity? If so, where?
  • Motion freedom
  • What local motion freedom does robot have?
  • Manipulation support
  • Is brick pose amenable to grasping/placement?
  • Is robot pose correct for grasping/placement?
  • Localization
  • Where is robot, with respect to provided map?
  • Which way to home base? To a new region?
  • Has the robot been in this region before?

18
Motion Freedom
  • What can be inferred from the image?

19
Freespace Map
  • Discretize bearing; classify surface type

20
Freespace Map Ideas
  • Use a simple color classifier (see the sketch after this list)
  • Train on road, sidewalk, grass, leaves, etc.
  • Training could be done offline, or in a start-of-mission calibration phase adapted from RSS II Lab 2
  • For each wedge of disk, could report distance to nearest obstruction
  • Careful: how will your code deal with varying lighting conditions?
  • Finally, can fuse (or confirm) with laser data
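A minimal sketch of the wedge idea (class names, hue values, and the majority threshold are all invented for illustration): classify pixels with a nearest-mean hue classifier, then call a bearing wedge free if most of its pixels look traversable.

    import numpy as np

    # Hypothetical training result: mean hue per surface class (0..180 scale)
    class_hues = {"road": 15.0, "grass": 60.0, "leaves": 30.0}
    traversable = {"road", "grass"}

    def classify_hue(h):
        """Nearest-mean hue classifier; train offline or at mission start."""
        return min(class_hues, key=lambda c: abs(class_hues[c] - h))

    def freespace_wedges(hue_image, n_wedges=8):
        """Split image columns into bearing wedges; a wedge is free iff
        a majority of its pixels classify as traversable surface."""
        free = []
        for wedge in np.array_split(hue_image, n_wedges, axis=1):
            labels = [classify_hue(h) for h in wedge.ravel()]
            frac = sum(lab in traversable for lab in labels) / len(labels)
            free.append(frac > 0.5)
        return free

    hue_image = np.random.uniform(0, 90, size=(24, 64))   # stand-in hue image
    print(freespace_wedges(hue_image))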

21
Vision System Capabilities
  • Material identification
  • Are there bricks in vicinity? If so, where?
  • Motion freedom
  • What local motion freedom does robot have?
  • Manipulation support
  • Is brick pose amenable to grasping/placement?
  • Is robot pose correct for grasping/placement?
  • Localization
  • Where is robot, with respect to provided map?
  • Which way to home base? To a new region?
  • Has the robot been in this region before?

22
Manipulation Support
  • Two options
  • Manipulate brick into appropriate grasp pose
  • Plan motion to approach the (fixed-pose) brick

[Figure: manipulation and/or motion plan carrying the brick from its initial pose to the desired grasp pose.]
How? Hint: compute moments.
How to disambiguate edge-on vs. end-on?
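One way to read the hint (a sketch, not necessarily the intended method): treat the brick's pixels as a mass distribution; the first moments give the centroid, and the eigenvectors of the second-moment matrix give the principal axis, hence a grasp orientation. The eigenvalue ratio is one cue for the edge-on vs. end-on ambiguity.

    import numpy as np

    def blob_pose(mask):
        """Centroid and principal-axis angle of a binary brick mask,
        from first and second image moments."""
        ys, xs = np.nonzero(mask)
        cx, cy = xs.mean(), ys.mean()                 # first moments: centroid
        cov = np.cov(np.stack([xs - cx, ys - cy]))    # second central moments
        evals, evecs = np.linalg.eigh(cov)
        major = evecs[:, np.argmax(evals)]            # principal axis direction
        theta = np.arctan2(major[1], major[0])
        elongation = evals.max() / max(evals.min(), 1e-9)  # edge-on/end-on cue
        return (cx, cy), theta, elongation

    mask = np.zeros((20, 40), dtype=bool)
    mask[8:12, 5:35] = True                 # a long, thin "brick"
    print(blob_pose(mask))                  # theta near 0, large elongation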
23
Vision System Capabilities
  • Material identification
  • Are there bricks in vicinity? If so, where?
  • Motion freedom
  • What local motion freedom does robot have?
  • Manipulation support
  • Is brick pose amenable to grasping/placement?
  • Is robot pose correct for grasping/placement?
  • Localization
  • Where is robot, with respect to provided map?
  • Which way to home base? To a new region?
  • Has the robot been in this region before?

24
Localization support
  • Localization w.r.t. a known map
  • See localization lecture from RSS I
  • Features: curb cuts, vertical building edges
  • Map format not yet defined; one of your tasks

[Figure: map features L1, L2, L3 and the resulting locus of likely robot poses.]
25
Localization support
  • Weaker localization model
  • Create (virtual) landmarks at intervals
  • Chain each landmark to predecessor
  • Recognize when landmark is revisited
  • Record direction from landmark to its neighbors (see the sketch after this list)
  • Is this map topological or metrical?
  • Does it support homing? Exploration?
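A sketch of the weaker model as a data structure (all names hypothetical): landmarks chained to their predecessors, each recording bearings to its neighbors. Connectivity plus directions, with no global coordinates, makes the map topological rather than metrical, and the stored reverse bearings are what support homing.

    import math
    from dataclasses import dataclass, field

    @dataclass
    class Landmark:
        signature: list                                # e.g., a coarse hue histogram
        neighbors: dict = field(default_factory=dict)  # landmark id -> bearing (rad)

    class TopoMap:
        """Chain of virtual landmarks: topological, not metrical."""
        def __init__(self):
            self.landmarks = {}
            self.last_id = None

        def add(self, lid, signature, bearing_from_prev=None):
            """Create a landmark and chain it to its predecessor."""
            self.landmarks[lid] = Landmark(signature)
            prev = self.last_id
            if prev is not None and bearing_from_prev is not None:
                self.landmarks[prev].neighbors[lid] = bearing_from_prev
                # reverse bearing supports homing back along the chain
                self.landmarks[lid].neighbors[prev] = bearing_from_prev + math.pi
            self.last_id = lid

    m = TopoMap()
    m.add("L1", [0.2, 0.5, 0.3])
    m.add("L2", [0.1, 0.6, 0.3], bearing_from_prev=0.8)
    print(m.landmarks["L2"].neighbors)    # bearing back toward L1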

26
Visual basis for landmarks
  • Desire a visual property that
  • Is nearly invariant to large robot rotations
  • Is nearly invariant to small robot translations
  • Has a definable scalar distance d(b, c); why?
  • Possible approaches
  • Hue histograms (coarsely discretized; see the sketch below)
  • Ordered hue lists (e.g., of vertical strips)
  • Skylines (must segment ground from sky)
  • Some hybrid of vision and laser data
  • Careful: think about ambiguity in the scene

[Figure: candidate landmark views a, b, c with uncertain correspondences between them.]
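A sketch of the first approach (bin count and revisit threshold invented): a coarsely discretized hue histogram barely changes under large rotations of a panoramic sweep or small translations, and an L1 difference supplies the scalar d(b, c) used to decide whether a landmark is being revisited.

    import numpy as np

    def hue_signature(hues, n_bins=12):
        """Coarse, normalized hue histogram as a landmark signature."""
        hist, _ = np.histogram(hues, bins=n_bins, range=(0.0, 180.0))
        return hist / max(hist.sum(), 1)

    def d(sig_b, sig_c):
        """Scalar distance between signatures; small d suggests a revisit."""
        return np.abs(sig_b - sig_c).sum()

    b = hue_signature(np.random.uniform(0, 180, 5000))
    c = hue_signature(np.random.uniform(0, 180, 5000))
    print(d(b, c) < 0.1)    # 0.1 is a tunable, hypothetical threshold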
27
Today
  • Strategies
  • What do we want from a vision system?
  • Tools
  • Pinhole camera model
  • Vision algorithms
  • Development
  • Carmen Module APIs, brainstorming

28
Carmen Module APIs
  • Vision module handles (processes) image stream
  • Must export more compact representation than
    images
  • What representation(s) should module export?
  • Features? Distances? Landmarks? Directions?
    Maps?
  • What questions should module answer?
  • Are there collectable blocks nearby? Where?
  • Has robot been here before? With what
    confidence?
  • Which direction(s) will get me closer to home?
  • Which direction(s) will explore new regions?
  • Which directions are physically possible for
    robot?
  • What non-vision data should module accept?
  • Commands to establish a new visual landmark?
  • Notification of rotation in place? Translation?
  • Spiral development
  • Put simple APIs in place, even if performance is
    stubbed
  • Get someone else to exercise them; revise appropriately (see the stub sketch below)
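In the spiral spirit, the module's contract can be pinned down before any vision works. The sketch below is an illustration only (Python, hypothetical names), not Carmen's actual C/IPC interface:

    class VisionModule:
        """Stubbed exports: compact answers, never raw images."""

        def blocks_nearby(self):
            """-> list of (range_m, bearing_rad) for collectable blocks."""
            return []                    # stub: no detections yet

        def been_here_before(self):
            """-> (bool, confidence in [0, 1])."""
            return (False, 0.0)          # stub

        def homeward_bearings(self):
            """-> list of bearings (rad) likely to lead toward home."""
            return []                    # stub

        def note_motion(self, d_theta, d_xy):
            """Accept non-vision input: rotation/translation notifications."""
            pass                         # stub

    # Teammates can exercise the API immediately and drive revisions,
    # even while every answer is stubbed.
    vm = VisionModule()
    print(vm.blocks_nearby(), vm.been_here_before())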

29
Conclusion
  • One plausible task decomposition
  • Bottom-up operating scenario
  • Related existing and new vision tools
  • Functional view: what are the APIs?
  • Exhortation to spiral development

30
  • vision lecture plan (seth)
  • review pinhole camera model
  • show that, since depth is unknown, scale of a
    scene object is unknown
  • (show picture of rob and me?)
  • but under certain assumptions, we can measure
    an object in the scene
  • simplest such assumption is the "ground
    plane" assumption
  • show how the extent of the object can be
    "read off" from image
  • show intuitive trig, then show in terms of
    calibration matrix K
  • how do we know that the object isn't just a "flash in the pan"?
  • i.e., it could be a transient object: a person, animal, something blowing in the wind
  • answer: look for confirmatory evidence over multiple image frames!
  • and from multiple vantage points (see the sketch below)
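A sketch of that confirmation step (window size and threshold hypothetical): report an object only once it has been re-detected in enough recent frames, which filters out "flash in the pan" transients.

    from collections import deque

    class PersistenceFilter:
        """Confirm a detection only if seen in k of the last n frames."""
        def __init__(self, n=10, k=7):
            self.history = deque(maxlen=n)
            self.k = k

        def update(self, detected_this_frame):
            self.history.append(bool(detected_this_frame))
            return sum(self.history) >= self.k

    pf = PersistenceFilter()
    for seen in [1, 1, 0, 1, 1, 1, 1, 0, 1, 1]:
        confirmed = pf.update(seen)
    print(confirmed)    # True: the object persisted across frames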