Partbased Object DetectionRecognition - PowerPoint PPT Presentation

1 / 27

About This Presentation

Title:

Partbased Object DetectionRecognition

Description:

... matching an image to a model built from a large number of training images. Model distortion using rotation distortion and length distortion for pairs of parts ... – PowerPoint PPT presentation

Number of Views:25

Avg rating:3.0/5.0

Slides: 28

Provided by: robf165

Category:

more less

Transcript and Presenter's Notes

Title: Partbased Object DetectionRecognition

1
Part-based Object Detection/Recognition

Longbin Chen

2
Recap of last Talk

We can extract salient features (e.g. SIFT
feature) from images
These salient features can be used for the tasks
of object recognition using the bag of words
model
Bag of words model, however, ignores the
position of feature points

3
Part-Based Object Recognition/Detection

Representation
Models to represent the geometry relationship
between parts
Learning
Find the parameters for the models
Recognition
Recognize/locate the objects

4
Representation

Object as set of parts
Generative representation
Model
Relative locations between parts
Appearance of part
Issues
How to model location
How to represent appearance
Sparse or dense (pixels or regions)
How to handle occlusion/clutter

Figure from Fischler73
5
Todays Coverage

Constellation Model
Fergus, R. , Perona, P. and Zisserman, A. Object
Class Recognition by Unsupervised Scale-Invariant
Learning, CVPR 2003. (Winner of CVPR 2003 Best
Pape prize)
Pictorial Structure Representation
P. Felzenszwalb and D. Huttenlocher. Pictorial
structures for object recognition. International,
Journal of Computer Vision, 6155-79, January
2005.
Shape Matching
A. C. Berg, T. L. Berg, J. Malik. Shape Matching
and Object Recognition using Low Distortion
Correspondence, CVPR 2005

6
Todays Coverage

Constellation Model
Fergus, R. , Perona, P. and Zisserman, A. Object
Class Recognition by Unsupervised Scale-Invariant
Learning, CVPR 2003. (Winner of CVPR 2003 Best
Pape prize)
Pictorial Structure Representation
P. Felzenszwalb and D. Huttenlocher. Pictorial
structures for object recognition. International,
Journal of Computer Vision, 6155-79, January
2005.
Shape Matching
A. C. Berg, T. L. Berg, J. Malik. Shape Matching
and Object Recognition using Low Distortion
Correspondence, CVPR 2005

7
Part-based Matching

Model with P parts
Image with N possible locations for each part

NP combinations!!!
Use a vector h, h P, and h(i) k means the
ith part of the object is assigned to kth
detected features, h(i) 0 means ith part is
occluded

8
Model Representation(1)

In a query image, assume that we identify N
interesting feature points with Location X,
scales S, and appearance A
We now make a Bayesian decision, R

9
Model Representation(2)

Factorized

10
Model Representation(3)

Appearance Model
Notes
G is the Gaussian Distribution
Each part p has a Gaussian density
Dp is the pth entry of vector d, the occlusion
vector, d sign(h)

11
Model Representation(3)

Shape Model
A joint Gaussian distribution
The covariance matrix is a full matrix
Alpha is the background area, f is the
normalization factor(?)

12
Model Representation(4)

Relative Scale Model
Occlusion Model
Where M is the mean of the Poission distribution

13
Learning(1)

Task to estimate
ML estimate using training images

14
Learning(2)

Varying levels of supervision
Unsupervised
Image labels
Object centroid/bounding box
Segmented object
Manual correspondence (typically sub-optimal)
Generative models naturally incorporate labelling
information (or lack of it)
If the correspondence of parts and detected
features in the training data are not given, use
EM algorithm to estimate the parameters

Contains a motorbike
15
Learning using EM

Task Estimation of model parameters

Chicken and Egg type problem, since we initially
know neither
Model parameters
- Assignment of regions to parts

Let the assignments be a hidden variable and use
EM algorithm to learn them and the model
parameters

16
Learning procedure

Find regions their location appearance

Initialize model parameters

Use EM and iterate to convergence

E-step Compute assignments for which regions
belong to which part M-step Update model
parameters

Trying to maximize likelihood consistency in
shape appearance

17
Todays Coverage

Constellation Model
Fergus, R. , Perona, P. and Zisserman, A. Object
Class Recognition by Unsupervised Scale-Invariant
Learning, CVPR 2003. (Winner of CVPR 2003 Best
Pape prize)
Pictorial Structure Representation
P. Felzenszwalb and D. Huttenlocher. Pictorial
structures for object recognition. International,
Journal of Computer Vision, 6155-79, January
2005.
Shape Matching
A. C. Berg, T. L. Berg, J. Malik. Shape Matching
and Object Recognition using Low Distortion
Correspondence, CVPR 2005

18
Pictorial Structure

Represent an object by a collection of parts
arranged in a deformable configuration
Model appearance of each part separately
Deformable configuration using spring-like
connection between pairs of parts
Matching as an optimization problem

19
Efficient Optimization Search

Previous methods have used heuristics or local
search techniques that do not find an optimal
solution and depend on having good initialization
This paper presents an efficient search for the
optimal solution given that
the graph G be acyclic (i.e., form a tree).
the relationships between connected pairs of
parts is a Mahalanobis distance between
transformed locations

20
Mahalanobis DistanceBob Fisher

The distance between two N dimensional points
scaled by the statistical variation in each
component of the point. For example, if X and Y
are two points from the same distribution which
has covariance matrix , then the Mahalanobis
distance is given by
Sqrt(((X-Y)) C(-1)(X-Y))
The Mahalanobis distance is the same as the
Euclidean distance if the covariance matrix is
the identity matrix.
A common usage in computer vision systems is for
comparing feature vectors whose elements are
quantities having different ranges and amounts of
variation, such as a 2-vector recording the
properties of area and perimeter.

21
Todays Coverage

Constellation Model
Fergus, R. , Perona, P. and Zisserman, A. Object
Class Recognition by Unsupervised Scale-Invariant
Learning, CVPR 2003. (Winner of CVPR 2003 Best
Pape prize)
Pictorial Structure Representation
P. Felzenszwalb and D. Huttenlocher. Pictorial
structures for object recognition. International,
Journal of Computer Vision, 6155-79, January
2005.
Shape Matching
A. C. Berg, T. L. Berg, J. Malik. Shape Matching
and Object Recognition using Low Distortion
Correspondence, CVPR 2005

22
A 1-1 matching problem
Database of Templates
Query Image
Best matching template is a helicopter
23
A 1-1 matching problem

Find a correspondence between the query image and
each template
Evaluate correspondence based on
Similarity of appearance near feature points
Similarity in configuration of the feature points
(distortion)

24
An Integer Quadratic Programming Problem

Use a binary matrix x to represent a
correspondence, x(i,j)1 iff template point i
maps to query point j
An Integer Quadratic Programming Problem

25
Measuring Distortion(Similarity in Configuration)
Query
Template
Rij
Si'j'
Measure distortion in vectors between pairs of
feature points - R and S same length for
rotations - R and S same direction for scalings
26
Modeling Distortion

Formulation
da penalizes the change in direction
dr penalizes the change in length
are constants

27
Experiment Face Detection Result
28
Caltech 101 Recognition Results
102 way Alternative Forced Choice test (15
training examples per class)
Chance 1 N.N. whole image 16 Discrimina
tive version of Constellation Model 27 N.N.
Geometric Blur Descriptors 38 Low Distortion
Correspondence (GBIQP) 45
102 way confusion matrix
100
0
29
Summary

Constellation Model
Gaussian model for each parts appearance
Gaussian model for shape
Gaussian model for relative scale
Poission model for occlusion
Pictorial Structure
Model shape using pairs of parts
Geometry Distortion
One to one Match, instead matching an image to a
model built from a large number of training
images
Model distortion using rotation distortion and
length distortion for pairs of parts