Title: Image Matching via Saliency Region Correspondences
1 Image Matching via Saliency Region Correspondences
- Alexander Toshev
- Jianbo Shi
- Kostas Daniilidis
IEEE Conference on Computer Vision and Pattern
Recognition (CVPR), 2007
2 Outline
- Introduction
- Joint-Image Graph (JIG) Matching Model
- Optimization in the JIG
- Estimation of Dense Correspondences
- Implementation Details
- Experiments
- Conclusion
3 Outline
- Introduction
- Joint-Image Graph (JIG) Matching Model
- Optimization in the JIG
- Estimation of Dense Correspondences
- Implementation Details
- Experiments
- Conclusion
4 Introduction
- Correspondence estimation is one of the fundamental challenges in computer vision and lies at the core of many problems.
- A common approach is to find correspondences between interest points, whose power lies in their ability to robustly capture discriminative image structures.
5 Introduction
- Feature-based approaches suffer from the ambiguity of local feature descriptors.
- One way to address matching ambiguities is to provide grouping constraints via segmentation.
- Disadvantage: segmentations can change drastically even for small deformations of the scene.
6 Introduction
- Example (figure)
- Improvement: matching by modeling in one score function both the coherence of regions and the similarity of the features across the images.
7 Introduction
- We call a pair of corresponding regions co-salient and define co-saliency as follows:
- Each region in the pair should exhibit strong internal coherence with respect to the background in its image.
- The correspondence between the regions from the two images should be supported by high similarity of features extracted from these regions.
8 Outline
- Introduction
- Joint-Image Graph (JIG) Matching Model
- Optimization in the JIG
- Estimation of Dense Correspondences
- Implementation Details
- Experiments
- Conclusion
9 Joint-Image Graph Matching Model
- To formalize this model we introduce the joint-image graph (JIG), which contains:
- vertices: the pixels of both images
- edges: intra-image similarities and inter-image feature matches
- A good cluster in the JIG consists of a pair of coherent segments describing corresponding scene parts from the two images.
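The construction can be summarized compactly; the block layout below is a sketch in LaTeX notation, where the relative weighting of the inter-image block is an assumption (it is not given on the slides):

    W \;=\; \begin{pmatrix} W_1 & C \\ C^{\top} & W_2 \end{pmatrix}

Here W_1 and W_2 hold the intra-image similarities of the two images, and C holds the inter-image feature-match weights between pixels of the first and the second image.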
10 Joint-Image Graph Matching Model
11 Joint-Image Graph Matching Model
- In order to combine the robustness of matching via local features with the descriptive power of salient segments, we detect clusters in the JIG. Each cluster:
- represents a pair of co-salient regions
- contains pixels from both images
- forms coherent and perceptually salient regions in the images (intra-image similarity criterion)
- matches well according to the feature descriptors (inter-image similarity criterion)
12 Joint-Image Graph Matching Model
- Intra-image similarity: the image segmentation score is the Normalized Cut criterion applied to both segments (eq. (2)).
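Eq. (2) itself is missing from the slide; a standard way to write the Normalized Cut (normalized association) score for a pair of segments with indicator vectors v^(1), v^(2), affinities W_1, W_2 and degree matrices D_1, D_2 is sketched below. The paper's exact normalization may differ:

    S_{\mathrm{intra}}\!\left(v^{(1)}, v^{(2)}\right) \;=\; \sum_{s=1,2}
        \frac{v^{(s)\top} W_s\, v^{(s)}}{v^{(s)\top} D_s\, v^{(s)}}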
13 Joint-Image Graph Matching Model
- Inter-image similarity (eq. (3))
- This function measures the strength of the connections between the region in one image and the corresponding region in the other.
- Correspondences between pixels are also weakly connected to their neighboring pixels, since the exact pixel location of a match is uncertain.
- If we use the same indicator vector for both segments of a co-salient pair, the inter-image score takes the form of eq. (3).
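Eq. (3) is also missing; its general shape rewards segment pairs that are strongly linked by the correspondence weights C. The normalization by the degree matrices of C is an assumption here:

    S_{C}\!\left(v^{(1)}, v^{(2)}\right) \;=\;
        \frac{v^{(1)\top} C\, v^{(2)}}
             {\sqrt{\left(v^{(1)\top} D_{C}\, v^{(1)}\right)\left(v^{(2)\top} D_{C^{\top}}\, v^{(2)}\right)}}

with D_C and D_{C^T} the diagonal row- and column-degree matrices of C.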
14 Joint-Image Graph Matching Model
- The correspondence matrix C is defined in terms of feature correspondences encoded in a matrix F.
- C should select from F pixel matches which connect each pixel of one of the images with at most one pixel of the other image.
- This can be written as a set of selection constraints on C, sketched below.
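A sketch of these selection constraints, with F the matrix of raw feature matches (the paper's exact formulation may differ):

    C_{ij} \in \{0,\, F_{ij}\}, \qquad
    \sum_{j} \mathbf{1}\!\left[C_{ij} > 0\right] \le 1, \qquad
    \sum_{i} \mathbf{1}\!\left[C_{ij} > 0\right] \le 1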
15 Joint-Image Graph Matching Model
- Matching score function: we should maximize the sum of the scores in eq. (2) and eq. (3).
- In the case of several pairs of co-salient regions we introduce one indicator vector per pair, packed in a matrix V.
- We then need to maximize the combined score over V and C, subject to V being a valid indicator matrix and C satisfying the selection constraints.
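Putting the two criteria together, the optimization problem can be sketched as follows (notation as above; this paraphrases the slide rather than reproducing the paper's exact formula):

    \max_{V,\,C}\; S(V, C) \;=\; S_{\mathrm{intra}}(V) + S_{C}(V, C)
    \quad\text{s.t. } V \text{ a binary indicator matrix},\;
    C \text{ satisfying the selection constraints}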
16 Joint-Image Graph Matching Model
- The above optimization problem is NP-hard even for fixed C.
- We relax the indicator vectors to real numbers.
- Following [12] it can be shown that the problem is equivalent to a trace maximization over the relaxed segmentation and the correspondence matrix (eq. (4)), where F is the matrix containing feature similarities across the images and the constraints enforce that C selects, for each pixel in one of the images, at most one pixel in the other image onto which it can be mapped.
[12] S. Yu and J. Shi. Multiclass spectral clustering. In ICCV, 2003.
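Following the relaxation of [12], eq. (4) can be sketched as a trace maximization over the joint-image graph; the precise scaling of the correspondence block is an assumption here:

    \max_{V,\,C}\; \operatorname{tr}\!\left(V^{\top} W(C)\, V\right)
    \quad\text{s.t.}\quad V^{\top} D\, V = I_k, \qquad
    W(C) = \begin{pmatrix} W_1 & C \\ C^{\top} & W_2 \end{pmatrix}

where D is the degree matrix of W(C) and C ranges over the selection constraints derived from F.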
17 Joint-Image Graph Matching Model
18 Outline
- Introduction
- Joint-Image Graph (JIG) Matching Model
- Optimization in the JIG
- Estimation of Dense Correspondences
- Implementation Details
- Experiments
- Conclusion
19 Optimization in the JIG
- In order to optimize the matching score function we adopt an iterative two-step approach.
- First step: we maximize the score with respect to V for given C; this step amounts to a synchronization of the soft segmentations of the two images based on the current correspondences.
- Second step: we find an optimal correspondence matrix C given the joint segmentation V.
20 Optimization in the JIG
- Segmentation synchronization
- For fixed C the optimization problem from eq. (4) can be solved in closed form: the maximum is attained for eigenvectors of a generalized eigenvalue problem on the JIG weight matrix.
- Due to clutter in C this may lead to erroneous solutions.
- We therefore assume that the joint soft segmentation lies in the subspace spanned by the soft segmentations U_1 and U_2 of the separate images, where U_1, U_2 are eigenvectors of the corresponding generalized eigenvalue problems for each of the images.
21 Optimization in the JIG
- Segmentation synchronization
- Hence we can write V = U R, where U is the joint image segmentation subspace basis and R holds the coordinates of the joint soft segmentation in this subspace.
- With this subspace restriction for V the score function can be written in terms of R alone (eq. (5)), subject to an orthogonality constraint on R, where the original JIG weight matrix is restricted to the segmentation subspaces.
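A sketch of the subspace-restricted problem (5), with U = blkdiag(U_1, U_2) the per-image eigenvector bases; replacing the degree normalization by a plain orthogonality constraint on R is an assumption:

    \max_{R}\; \operatorname{tr}\!\left(R^{\top} \bar{W} R\right)
    \quad\text{s.t.}\quad R^{\top} R = I_k, \qquad
    \bar{W} = U^{\top} W(C)\, U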
22 Optimization in the JIG
- Segmentation synchronization
- If we write R in terms of the subspace basis coordinates R_1 and R_2 for the two images,
- then the score function can be decomposed into two terms as in eq. (6).
23 Optimization in the JIG
- Segmentation synchronization: in eq. (6)
- The first term serves as a regularizer which emphasizes eigenvectors in the subspaces with larger eigenvalues, i.e. those describing clearer segments.
- The second term is a correlation between the segmentations of both images, weighted by the correspondences in C; it measures the quality of the match.
24 Optimization in the JIG
- Segmentation synchronization
- The optimal R in eq. (5) is attained for the eigenvectors with the largest eigenvalues of the restricted JIG weight matrix.
- This restricted matrix is small: its size is only twice the number of eigenvectors kept per image.
- The weight matrix in eq. (4), in contrast, has much higher dimension (the number of pixels in both images).
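A minimal numerical sketch of this step in Python; the function names, the symmetric degree normalization used in place of the generalized eigenproblem, and the dense numpy eigensolver are my assumptions, not the paper's implementation:

    import numpy as np

    def soft_segmentation(W, k):
        """Top-k eigenvectors of the normalized affinity D^-1/2 W D^-1/2,
        a standard surrogate for the Ncut generalized eigenproblem."""
        d_inv_sqrt = 1.0 / np.sqrt(np.maximum(W.sum(axis=1), 1e-12))
        Wn = W * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
        evals, evecs = np.linalg.eigh(Wn)            # eigenvalues in ascending order
        return evecs[:, -k:], evals[-k:]

    def synchronize(W1, W2, C, k):
        """Align the two per-image segmentation subspaces using the
        correspondences C; returns the joint soft segmentation V = U R."""
        U1, _ = soft_segmentation(W1, k)
        U2, _ = soft_segmentation(W2, k)
        n1 = W1.shape[0]
        # Block-diagonal subspace basis U and restricted JIG matrix (2k x 2k).
        U = np.zeros((n1 + W2.shape[0], 2 * k))
        U[:n1, :k], U[n1:, k:] = U1, U2
        W_jig = np.block([[W1, C], [C.T, W2]])
        W_bar = U.T @ W_jig @ U
        _, R = np.linalg.eigh(W_bar)
        return U @ R[:, -k:]                         # rows = pixel embeddings

The rows of the returned matrix are the per-pixel embedding vectors discussed on the following slides.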
25 Optimization in the JIG
- Segmentation synchronization
26 Optimization in the JIG
- Segmentation synchronization: a different view of the above process can be obtained by representing the eigenvectors by their rows; denote by u_i the i-th row of the subspace basis.
- We can then assign to each pixel in the image a k-dimensional vector, which we call the embedding vector of this pixel.
- The segmentation synchronization can be viewed as a rotation of the segmentation embeddings of both images such that corresponding pixels are close in the embedding space.
27 Optimization in the JIG
Figure 4: the embedding view of the segmentation synchronization (corresponding pixels become close after rotation of the embeddings).
28 Optimization in the JIG
- Obtaining discrete co-salient regions: from the synchronized segmentation eigenvectors we can extract regions.
- Suppose x_i is the embedding vector of a particular pixel i.
- The binary mask which describes a segment is a column vector computed from these embedding vectors.
- Such a mask describes a segment in the JIG and therefore represents a pair of corresponding segments in the two images.
- The matching score between the segments can then be defined from the JIG score of the mask.
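One possible discretization, assigning each pixel to its strongest embedding dimension and scoring the mask by its normalized association on the JIG; both choices are simplifying assumptions, since the slide's defining formula is not shown:

    import numpy as np

    def cosalient_masks(V):
        """V: (n1+n2) x k synchronized soft segmentation.
        Returns k binary masks over the JIG; mask t covers the pixels
        (from both images) assigned to co-salient pair t."""
        labels = np.argmax(np.abs(V), axis=1)        # strongest dimension per pixel
        return [(labels == t).astype(float) for t in range(V.shape[1])]

    def pair_score(mask, W_jig):
        """Match score of one co-salient pair: normalized association
        of the mask on the joint-image graph."""
        d = W_jig.sum(axis=1)
        return float(mask @ W_jig @ mask) / max(float(mask @ (d * mask)), 1e-12)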
29 Optimization in the JIG
- Optimizing the correspondence matrix: after we have obtained the joint segmentation, we seek the C that maximizes the score in eq. (4).
- In order to obtain a fast solution we relax the problem by removing the last inequality constraint; the resulting optimum (eq. (7)) is expressed in terms of the embedding vectors of the pixels.
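A sketch of this update; since the closed form of eq. (7) is not reproduced on the slide, the code below scores each candidate match in F by the proximity of the two pixel embeddings and then greedily enforces the one-to-one selection, which is an assumption:

    import numpy as np

    def update_correspondences(F, V, n1):
        """F: n1 x n2 feature-match weights; V: (n1+n2) x k embeddings."""
        X1, X2 = V[:n1], V[n1:]
        # Weight each candidate match by the similarity of the embeddings.
        S = F * np.maximum(X1 @ X2.T, 0.0)
        C = np.zeros_like(F)
        used_rows, used_cols = set(), set()
        # Greedy one-to-one selection in decreasing score order.
        for idx in np.argsort(S, axis=None)[::-1]:
            i, j = np.unravel_index(idx, S.shape)
            if S[i, j] <= 0:
                break
            if i in used_rows or j in used_cols:
                continue
            C[i, j] = F[i, j]
            used_rows.add(i)
            used_cols.add(j)
        return C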
30 Optimization in the JIG
- Algorithm 1
1. Initialize C from the feature matches F and compute the JIG weight matrix.
2. Compute the segmentation subspaces as the eigenvectors with the largest eigenvalues of the per-image problems.
3. Find the optimal segmentation subspace alignment by computing the eigenvectors of the restricted JIG weight matrix.
4. Compute the optimal C as in eq. (7).
5. If C differs from the previous iteration, go to step 3.
6. Obtain the pairs of corresponding segments; the score of each pair is the match score for that co-salient region.
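A compact, self-contained toy run of the alternation in Algorithm 1 on random data; every modeling choice below (the normalization, the simplified per-row match selection in step 4, the convergence test) is an assumption made for illustration, not the paper's exact procedure:

    import numpy as np

    rng = np.random.default_rng(0)
    n1, n2, k = 40, 50, 4

    # Toy intra-image affinities (symmetric, nonnegative) and sparse feature matches.
    A1, A2 = rng.random((n1, n1)), rng.random((n2, n2))
    W1, W2 = (A1 + A1.T) / 2, (A2 + A2.T) / 2
    F = rng.random((n1, n2)) * (rng.random((n1, n2)) < 0.05)

    def top_eigvecs(W, k):
        d = 1.0 / np.sqrt(np.maximum(W.sum(axis=1), 1e-12))
        _, vecs = np.linalg.eigh(W * d[:, None] * d[None, :])
        return vecs[:, -k:]

    U = np.zeros((n1 + n2, 2 * k))
    U[:n1, :k], U[n1:, k:] = top_eigvecs(W1, k), top_eigvecs(W2, k)   # step 2

    C = F.copy()                                                      # step 1
    for it in range(10):
        W_jig = np.block([[W1, C], [C.T, W2]])
        _, R = np.linalg.eigh(U.T @ W_jig @ U)                        # step 3
        V = U @ R[:, -k:]                                             # pixel embeddings
        sim = np.maximum(V[:n1] @ V[n1:].T, 0.0)
        keep = (sim >= sim.max(axis=1, keepdims=True)) & (sim > 0)
        C_new = np.where(keep, F, 0.0)                                # step 4 (simplified)
        if np.allclose(C_new, C):                                     # step 5
            break
        C = C_new

    print("stopped after", it + 1, "iterations; kept", int((C > 0).sum()), "matches")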
31 Outline
- Introduction
- Joint-Image Graph (JIG) Matching Model
- Optimization in the JIG
- Estimation of Dense Correspondences
- Implementation Details
- Experiments
- Conclusion
32 Estimation of Dense Correspondences
- Initially we choose a sparse set of feature matches F, extracted using a feature detector.
- In order to obtain a denser set of correspondences we use a larger set of matches F' between features extracted everywhere in the image.
- Since this set can potentially contain many more wrong matches than F, running Algorithm 1 directly on F' does not always give satisfactory results.
33 Estimation of Dense Correspondences
- We prune F' based on the solution obtained with F, by combining:
- the similarity between co-salient regions obtained for the old feature set F; using the embedding view of the segmentation synchronization from fig. 4, this translates to Euclidean distances in the joint segmentation space, weighted by the eigenvalues;
- the feature similarity from the new set F'.
34 Estimation of Dense Correspondences
- Suppose two pixels i and j have embedding coordinates x_i and x_j obtained from the solution with F.
- Then feature similarities re-weighted by the distance between x_i and x_j embody both requirements from above.
- Finally, the entries of the re-weighted matrix are scaled such that its largest value is 1.
- The new co-salient regions are obtained as a solution of the same matching problem, with the re-weighted F' in place of F.
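A sketch of this re-weighting; the Gaussian form, the scale parameter sigma, and the omission of the eigenvalue weighting mentioned on the previous slide are my simplifications:

    import numpy as np

    def reweight_dense_matches(F_dense, V, n1, sigma=0.5):
        """F_dense: n1 x n2 dense feature similarities;
        V: (n1+n2) x k embeddings from the sparse-match solution."""
        X1, X2 = V[:n1], V[n1:]
        # Squared Euclidean distances between embeddings of candidate pairs.
        d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(axis=-1)
        F_new = F_dense * np.exp(-d2 / (2.0 * sigma ** 2))
        return F_new / max(F_new.max(), 1e-12)   # scale so the largest entry is 1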
35 Estimation of Dense Correspondences
- Algorithm 2: matching algorithm
1. Extract F conservatively using a feature detector.
2. Solve the matching problem with F using alg. 1.
3. Extract F' using features extracted everywhere in the image.
4. Re-weight F' using the embedding vectors (the rows of the synchronized segmentation) and scale it so that its maximal element is 1.
5. Solve the matching problem with the re-weighted F' using alg. 1.
36 Outline
- Introduction
- Joint-Image Graph (JIG) Matching Model
- Optimization in the JIG
- Estimation of Dense Correspondences
- Implementation Details
- Experiments
- Conclusion
37 Implementation Details
- Inter-image similarities
- The feature correspondence matrix F is based on an affine covariant region detector.
- For comparison, each feature is represented by a descriptor extracted from the detected region, which is used to evaluate the appearance similarity between two interest points.
38 Implementation Details
- Inter-image similarities
- We define a similarity between pixels i and j lying in the interest point regions; it is composed of two terms:
- the 1st term measures the appearance similarity between the regions in which i and j lie;
- the 2nd term measures their geometric compatibility with respect to the affine transformation mapping one region onto the other.
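A sketch of such a two-term similarity; the Gaussian forms, the parameters sigma_app and sigma_geo, and the use of the affine transfer error are assumptions made for illustration:

    import numpy as np

    def pixel_similarity(p, q, desc_p, desc_q, A, t, sigma_app=0.4, sigma_geo=5.0):
        """p, q: pixel coordinates (2-vectors) inside two matched interest regions;
        desc_p, desc_q: descriptors of those regions;
        (A, t): affine map of the first region onto the second."""
        # 1st term: appearance similarity of the two interest regions.
        app = np.exp(-np.linalg.norm(desc_p - desc_q) ** 2 / (2 * sigma_app ** 2))
        # 2nd term: geometric compatibility of q with the affine transfer of p.
        geo = np.exp(-np.linalg.norm(A @ p + t - q) ** 2 / (2 * sigma_geo ** 2))
        return app * geo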
39 Implementation Details
- Inter-image similarities
- Provided we have extracted two feature sets, one from each image, as described above,
- the final match score for a pair of pixels equals the largest match score supported by any pair of feature points.
- In this way pixels on different sides of corresponding image contours in both images get connected,
- so shape information is encoded in F.
40 Implementation Details
41 Implementation Details
- Inter-image similarities
- The final F is obtained by pruning: we retain only matches whose score exceeds a threshold.
- For feature extraction we use the MSER detector combined with the SIFT descriptor [4].
- For the dense correspondences we use features extracted on a dense grid in the image, with the same descriptor.
[10] T. Tuytelaars and L. V. Gool. Matching widely separated views based on affine invariant regions. IJCV, 59(1):61-85, 2004.
[4] D. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2):91-110, 2004.
42 Implementation Details
- Intra-image similarities: the matrices W_1 and W_2 for each image are based on intervening contours.
- Two pixels i and j from the same image belong to the same segment if there are no edges with large magnitude which spatially separate them.
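A sketch of an intervening-contour affinity; the sampling of the connecting line and the scale parameter sigma are assumptions:

    import numpy as np

    def intervening_contour_affinity(p, q, edge_mag, n_samples=20, sigma=0.1):
        """Affinity between pixels p and q of one image: high if no strong edge
        lies on the straight line between them. edge_mag: HxW edge magnitudes."""
        ts = np.linspace(0.0, 1.0, n_samples)
        line = p[None, :] * (1 - ts[:, None]) + q[None, :] * ts[:, None]
        rows = np.clip(np.round(line[:, 0]).astype(int), 0, edge_mag.shape[0] - 1)
        cols = np.clip(np.round(line[:, 1]).astype(int), 0, edge_mag.shape[1] - 1)
        max_edge = edge_mag[rows, cols].max()    # strongest intervening contour
        return float(np.exp(-max_edge ** 2 / (2 * sigma ** 2)))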
43 Implementation Details
- Algorithm settings
- The optimal dimension of the segmentation subspaces in step 2 depends on the area of the segments in the images: to capture small detailed regions we need more eigenvectors.
- For the experiments we used a fixed subspace dimension.
- The threshold on F is determined so that initially we obtain approx. 200-400 matches; a single fixed value is used in our experiments.
44 Implementation Details
- Time complexity
- The time complexity of steps 1-2 in alg. 1 corresponds to the complexity of the Ncut segmentation [12].
- The complexity of line 3 is that of computing the full SVD of a small dense matrix (the JIG weight matrix restricted to the segmentation subspaces).
- Line 4 scales with the number of interest point matches.
- Line 6 scales with the number of pixels and extracted segments.
45 Implementation Details
- Time complexity
- In alg. 2 we use alg. 1 twice; step 4 adds only the cost of re-weighting the dense matches.
- The total complexity of alg. 1 is dominated by the Ncut segmentation of the two images.
- We can precompute the segmentation for an image and reuse it every time we match this image.
46 Outline
- Introduction
- Joint-Image Graph (JIG) Matching Model
- Optimization in the JIG
- Estimation of Dense Correspondences
- Implementation Details
- Experiments
- Conclusion
47 Experiments
- We conduct two experiments:
- detection of matching regions
- place recognition
- Datasets: ICCV 2005 Computer Vision Contest, subsets Test4 and Final5,
- containing 38 and 29 images of buildings respectively;
- each building is shown under different viewpoints.
48 Experiments
- Detection of Matching Regions
49 Experiments
- Detection of Matching Regions
50 Experiments
- Detection of Matching Regions
- We detect matching regions, enhance the feature matches, and segment common objects in manually selected image pairs.
- Shown are the 30 matches with the highest score in the output correspondence matrix,
- and the top 6 matching regions.
51 Experiments
- Detection of Matching Regions
- Finding the correct match for a given point may fail, usually because:
- the appearance similarity to the true matching point is not as high as the score of the best matches (it is not ranked high in the initial F), or
- there are several matches with high scores due to similar or repeating structure.
52 Experiments
- Detection of Matching Regions
- To compare quantitatively the initial and the improved set of feature matches, we count how many of the top 30, 60, and 90 best matches are correct.
53 Experiments
- Place Recognition
- Test4 and Final5 have each been split into two subsets: an exemplar set and a query set.
- The query set contains 19 images for Test4 and 22 for Final5, while the exemplar sets contain 9 and 16 images respectively.
- Each query image is compared with all exemplar images, and the matches are ranked according to the value of the match score function.
54 Experiments
- Place Recognition
- For all queries which have at least a given number of similar exemplars in the dataset, we compute how many of them appear among the top-ranked matches.
55 Outline
- Introduction
- Joint-Image Graph (JIG) Matching Model
- Optimization in the JIG
- Estimation of Dense Correspondences
- Implementation Details
- Experiments
- Conclusion
56 Conclusion
- We presented an algorithm that detects co-salient regions.
- These regions are obtained through a synchronization of the segmentations of the two images, guided by local feature matches.
- Dense correspondences between coherent segments are obtained as well.
- The approach has shown promising results for correspondence detection in the context of place recognition.