Title: Object Recognition
1Object Recognition
2What is object recognition?
- Perception of objects is different for humans
than for computers. - For humans perception of familiar items.
- For computers perception of familiar patterns.
- Are they really the same thing?
3What do we mean by objects
- What we call object recognition may also be
called pattern recognition. - A pattern is an arrangement of descriptors.
- Descriptors may have more forms, but they are
primarily vectors and strings.
4More generally
- Object recognition is the process whereby
observers are able to recognize three-dimensional
objects despite receiving only two-dimensional
input that varies greatly depending on viewing
conditions.(2)
52 main approaches
- Decision-theoretic
- Patterns described using quantitative
descriptors. - Structural
- Patterns represented by symbolic information.
- Strings, for example.
6Decision Theoretic
- Based on discriminant functions
- Let x (x1, x2, , Xn)T represent an
n-dimensional pattern vector - Let W (w1, w2, ,wW) be pattern classes.
7Basic problem of decision-theoretic
- We want to find W decision functions d1(x),
d2(x),,dw(x) with the property - If a pattern x belongs to class wi, then
di(x) gt dj(x), - where j 1, 2, , W j ! i
8In other words..
- We want to classify x, which is a pattern.
- We are given a finite set of classes of objects.
- We want to categorize the pattern x into one of
the classes. - To do so, we apply x to all decision functions,
and categorize x to the class of best fit.
9Structural
- Represents objects as strings, trees, graphs..
- Define descriptors and recognition rules base on
the representations.
10What does finite classification imply?
- The idea of a finite set of classes is quite
limiting. - Corresponds with industries use of object
recognition very application specific. - Indicates that computer object recognition
techniques lack some abilities which are simple
for humans.
11Differences in classification
- Techniques thus far only classify objects based
on their shape, color, texture, etc. These are
only representative of the light reflected by an
object. - Humans classify objects many ways, including an
objects function.
12For example
- We classify a ring of rocks with a fire inside as
a fire pit. - We classify a board as a joist once it is
installed as support for the floor. - We classify our computer as a paperweight once it
is more than five years old.
13Correlation
- Given an image, we want to find all places in the
image which contain a subimage, also called a
template. - Very useful for answering where is the x in
this picture?
14Notice..
- Recognition models typically rely on input from
optical sensors. - Such input is represented entirely in
two-dimensional space.
15Is a 3D representation necessary?
- DARPA challenge was not successfully completed.
- Armys LADAR sensors, which provide depth data,
have demonstrated more capability.
163D Object recognition with neural trees
- First stage extracts features from the input
range images. - These features are used in the second stage to
group image pixels into different surface patches
according to the six surface classes proposed by
the differential geometry.(4)
17Invariants
- Basic idea
- D(g(A),g(B)) D(A,B)
- For all g in transformation group G
- Limitations
- There are very many possible transformations in
G, and computation times becomes a problem.
18Varying goals of object recognition
- Are we looking for that object?
- Face recognition
- Are we looking for one of those objects?
- Web search for 1987 Chevy pickup.
19Notice
- Just because an object exists in an image doesnt
mean it is recognizable. - Example from Late Night with Conan OBrien
20We dont know what this is
21Recognizable as a human face
22Recognizable as the pope
23The Punchline
24Histogram approach
- Vary bad results for images with
- Much noise
- Small target objects
- With tightly controlled conditions, moderate
success can be achieved.
25Noisy histograms
26Noisy histograms
27Noisy histograms
28Correlation example
29Create template
- Actual template size 32X32
30Acquire input image
- Actual image size 1600X1200
31Compute correlation image
- Actual image size 1600X1200
32Show areas of best match
- Actual image size 1600X1200
33Find flower with more noise
34Correlation image
35Area of best match?
36Templates for a coin
37Acquire target image
38Compute correlation image
39Display area of best match
40Finding coin among noise
41Correlation image
42Brightest coin wrong one
43Among different noise
44Correlation image
45Coin found
46Structural approach to stapler
- Acquire source stapler image
47Segment and find the stapler edge
48Compute the boundary
- Image recreated from computed boundary
49Select boundary points
- Boundary points at distance of 8
50Image recreated from boundary points
51Compare to a different view
52Segment and acquire boundary
53Image redrawn from boundary data
54Boundary points selected at a distance of 8
55Redrawn from selected boundary points
56Final step
- Compare the chain code strings of the 2 sets of
boundary points.
57Finding boundaries with noise
- Custom filters for each target image may be
required
58Conclusions
- Modern object recognition techniques can provide
much functionality in controlled environments. - Simulation of human object recognition
capabilities is a long way off.
59Best way to search for objects
- The best approach to create an image search
engine requires extensive human labor involving
organizing every image in the database into its
correct hierarchical position. - Input as text can provide as much functionality
as input from images in this approach.
60References
- Digital Image Processing using Matlab
- Prentice Hall, ISBN 0-13-008519-7
- Michael Tarr, Brown University
- http//www.cog.brown.edu/tarr/pdf/Tarr02ECS.pdf
search'object20recognition - 3D Object Recognition by Neural
Trees http//csdl.computer.org/comp/proceedings/i
cip/1997/8183/03/81830408abs.htm - Longjin Jan Latecki, CIS 601 Lecture notes
- http//www.cis.temple.edu/latecki/CIS601-04/lect
ures_fall04.htm