Title: OBJ CUT
1OBJ CUT
UNIVERSITY OF OXFORD
- M. Pawan Kumar
- Philip Torr
- Andrew Zisserman
2Aim
- Given an image, to segment the object
Object Category Model
Segmentation
Cow Image
Segmented Cow
- Segmentation should (ideally) be
- shaped like the object e.g. cow-like
- obtained efficiently in an unsupervised manner
- able to handle self-occlusion
3Challenges
Intra-Class Shape Variability
Intra-Class Appearance Variability
Self Occlusion
4Motivation
Magic Wand
- Current methods require user intervention
- Object and background seed pixels (Boykov and
Jolly, ICCV 01) - Bounding Box of object (Rother et al. SIGGRAPH
04)
Object Seed Pixels
Cow Image
5Motivation
Magic Wand
- Current methods require user intervention
- Object and background seed pixels (Boykov and
Jolly, ICCV 01) - Bounding Box of object (Rother et al. SIGGRAPH
04)
Object Seed Pixels
Background Seed Pixels
Cow Image
6Motivation
Magic Wand
- Current methods require user intervention
- Object and background seed pixels (Boykov and
Jolly, ICCV 01) - Bounding Box of object (Rother et al. SIGGRAPH
04)
Segmented Image
7Motivation
Magic Wand
- Current methods require user intervention
- Object and background seed pixels (Boykov and
Jolly, ICCV 01) - Bounding Box of object (Rother et al. SIGGRAPH
04)
Object Seed Pixels
Background Seed Pixels
Cow Image
8Motivation
Magic Wand
- Current methods require user intervention
- Object and background seed pixels (Boykov and
Jolly, ICCV 01) - Bounding Box of object (Rother et al. SIGGRAPH
04)
Segmented Image
9Motivation
- Problem
- Manually intensive
- Segmentation is not guaranteed to be
object-like
Non Object-like Segmentation
10Our Method
- Combine object detection with segmentation
- Borenstein and Ullman, ECCV 02
- Leibe and Schiele, BMVC 03
- Incorporate global shape priors in MRF
- Detection provides
- Object Localization
- Global shape priors
- Automatically segments the object
- Note our method is completely generic
- Applicable to any object category model
11Outline
- Problem Formulation
- Form of Shape Prior
- Optimization
- Results
12Problem
- Labelling m over the set of pixels D
- Shape prior provided by parameter ?
- Energy E (m, ?) ??x(Dmx)?x(mx ?)
??xy(mx,my) ?(Dmx,my) - Unary terms
- Likelihood based on colour
- Unary potential based on distance from ?
- Pairwise terms
- Prior
- Contrast term
- Find best labelling m arg min ? wi E (m, ? i)
- wi is the weight for sample ? i
Unary terms
Pairwise terms
13MRF
- Probability for a labelling consists of
- Likelihood
- Unary potential based on colour of pixel
- Prior which favours same labels for neighbours
(pairwise potentials)
mx
Pairwise Potential ?xy(mx, my)
m (labels)
my
Unary Potential ?x(Dmx)
x
y
D (pixels)
Image Plane
14Example
Cow Image
Object Seed Pixels
Background Seed Pixels
?x(Dobj)
x
x
? x(Dbkg)
? xy(mx,my)
y
y
Prior
Likelihood Ratio (Colour)
15Example
Cow Image
Object Seed Pixels
Background Seed Pixels
Prior
Likelihood Ratio (Colour)
16Contrast-Dependent MRF
- Probability of labelling in addition has
- Contrast term which favours boundaries to lie on
image edges
mx
m (labels)
my
x
Contrast Term ?(Dmx,my)
y
D (pixels)
Image Plane
17Example
Cow Image
Object Seed Pixels
Background Seed Pixels
?x(Dobj)
x
x
? x(Dbkg)
?xy(mx,my) ?xy(Dmx,my)
y
y
Prior Contrast
Likelihood Ratio (Colour)
18Example
Cow Image
Object Seed Pixels
Background Seed Pixels
Prior Contrast
Likelihood Ratio (Colour)
19Our Model
- Probability of labelling in addition has
- Unary potential which depend on distance from ?
(shape parameter)
? (shape parameter)
Unary Potential ?x(mx?)
mx
m (labels)
my
Object Category Specific MRF
x
y
D (pixels)
Image Plane
20Example
Cow Image
Object Seed Pixels
Background Seed Pixels
Shape Prior ?
Distance from ?
Prior Contrast
21Example
Cow Image
Object Seed Pixels
Background Seed Pixels
Shape Prior ?
Likelihood Distance from ?
Prior Contrast
22Example
Cow Image
Object Seed Pixels
Background Seed Pixels
Shape Prior ?
Likelihood Distance from ?
Prior Contrast
23Outline
- Problem Formulation
- Energy E (m, ?) ??x(Dmx)?x(mx ?)
??xy(mx,my) ?(Dmx,my) - Form of Shape Prior
- Optimization
- Results
24Layered Pictorial Structures (LPS)
- Generative model
- Composition of parts spatial layout
Layer 2
Spatial Layout (Pairwise Configuration)
Layer 1
Parts in Layer 2 can occlude parts in Layer 1
25Layered Pictorial Structures (LPS)
Cow Instance
Layer 2
Transformations
?1 P(?1) 0.9
Layer 1
26Layered Pictorial Structures (LPS)
Cow Instance
Layer 2
Transformations
?2 P(?2) 0.8
Layer 1
27Layered Pictorial Structures (LPS)
Unlikely Instance
Layer 2
Transformations
?3 P(?3) 0.01
Layer 1
28LPS for Detection
- Learning
- Learnt automatically using a set of videos
- Part correspondence using Shape Context
Shape Context Matching
Multiple Shape Exemplars
29LPS for Detection
- Detection
- Putative parts found using tree cascade of
classifiers
(x,y)
30LPS for Detection
- MRF over parts
- Labels represent putative poses
- Prior (pairwise potential) - Robust Truncated
Model -
- Match LPS by obtaining MAP configuration
Linear Model
Quadratic Model
Potts Model
31LPS for Detection
Efficient Belief Propagation
xi
- Likelihood ?i(xi)
- tree cascade of classifiers
- Prior ?ij(xi,xj)
- fij(xi,xj), if xi ? Ci(xj)
- ?ij , otherwise
- Pr(x) ? ? ?i(xi) ? ?ij(xi,xj)
i
xj
xk
j
k
mj-gti
ij
i
Messages
j
jk
k
ki
32LPS for Detection
Efficient Belief Propagation
xi
- Likelihood ?i(xi)
- tree cascade of classifiers
- Prior ?ij(xi,xj)
- fij(xi,xj), if xi ? Ci(xj)
- ?ij , otherwise
- Pr(x) ? ? ?i(xi) ? ?ij(xi,xj)
i
xj
xk
j
k
Messages calculated as
33LPS for Detection
Efficient Generalized Belief Propagation
xi
- Likelihood ?i(xi)
- tree cascade of classifiers
- Prior ?ij(xi,xj)
- fij(xi,xj), if xi ? Ci(xj)
- ?ij , otherwise
- Pr(x) ? ? ?i(xi) ? ?ij(xi,xj)
i
xj
xk
j
k
ij
i
mk-gtij
Messages
j
ijk
jk
k
ki
34LPS for Detection
Efficient Generalized Belief Propagation
xi
- Likelihood ?i(xi)
- tree cascade of classifiers
- Prior ?ij(xi,xj)
- fij(xi,xj), if xi ? Ci(xj)
- ?ij , otherwise
- Pr(x) ? ? ?i(xi) ? ?ij(xi,xj)
i
xj
xk
j
k
Messages calculated as
35LPS for Detection
Second Order Cone Programming Relaxations
xi
- Likelihood ?i(xi)
- tree cascade of classifiers
- Prior ?ij(xi,xj)
- fij(xi,xj), if xi ? Ci(xj)
- ?ij , otherwise
- Pr(x) ? ? ?i(xi) ? ?ij(xi,xj)
i
xj
xk
j
k
36LPS for Detection
Second Order Cone Programming Relaxations
1
- Likelihood ?i(xi)
- tree cascade of classifiers
- Prior ?ij(xi,xj)
- fij(xi,xj), if xi ? Ci(xj)
- ?ij , otherwise
- Pr(x) ? ? ?i(xi) ? ?ij(xi,xj)
0
0
0
0
i
1
0
0
1
j
k
m - Concatenation of all binary vectors l -
Likelihood vector P - Prior matrix
37LPS for Detection
Second Order Cone Programming Relaxations
1
0
0
0
0
i
1
0
0
1
j
k
38LPS for Detection
Second Order Cone Programming Relaxations
1
0
0
0
0
i
1
0
0
1
j
k
39LPS for Detection
Second Order Cone Programming Relaxations
1
0
0
0
0
i
1
0
0
1
j
k
40Outline
- Problem Formulation
- Form of Shape Prior
- Optimization
- Results
41Optimization
- Given image D, find best labelling as
m arg max p(mD) - Treat LPS parameter ? as a latent (hidden)
variable - EM framework
- E sample the distribution over ?
- M obtain the labelling m
42E-Step
- Given initial labelling m, determine p(? m,D)
- Problem
- Efficiently sampling from p(? m,D)
- Solution
- We develop efficient sum-product Loopy Belief
Propagation (LBP) for matching LPS. - Similar to efficient max-product LBP for MAP
estimate
43Results
- Different samples localize different parts well.
- We cannot use only the MAP estimate of the LPS.
44M-Step
- Given samples from p(? m,D), get new labelling
mnew - Sample ?i provides
- Object localization to learn RGB distributions of
object and background - Shape prior for segmentation
- Problem
- Maximize expected log likelihood using all
samples - To efficiently obtain the new labelling
45M-Step
w1 P(?1m,D)
Cow Image
Shape ?1
RGB Histogram for Background
RGB Histogram for Object
46M-Step
w1 P(?1m,D)
Cow Image
Shape ?1
?1
m (labels)
Image Plane
D (pixels)
- Best labelling found efficiently using a Single
Graph Cut
47Segmentation using Graph Cuts
Obj
Cut
?x(Dbkg) ?x(bkg?)
x
?xy(mx,my) ?xy(Dmx,my)
y
m
z
?z(Dobj) ?z(obj?)
Bkg
48Segmentation using Graph Cuts
Obj
x
y
m
z
Bkg
49M-Step
w2 P(?2m,D)
Cow Image
Shape ?2
RGB Histogram for Background
RGB Histogram for Object
50M-Step
w2 P(?2m,D)
Cow Image
Shape ?2
?2
m (labels)
Image Plane
D (pixels)
- Best labelling found efficiently using a Single
Graph Cut
51M-Step
?2
?1
w1
w2
.
Image Plane
Image Plane
m arg min ? wi E (m,?i)
- Best labelling found efficiently using a Single
Graph Cut
52Outline
- Problem Formulation
- Form of Shape Prior
- Optimization
- Results
53Results
Using LPS Model for Cow
Segmentation
Image
54Results
Using LPS Model for Cow
In the absence of a clear boundary between object
and background
Segmentation
Image
55Results
Using LPS Model for Cow
Segmentation
Image
56Results
Using LPS Model for Cow
Segmentation
Image
57Results
Using LPS Model for Horse
Segmentation
Image
58Results
Using LPS Model for Horse
Segmentation
Image
59Results
Our Method
Leibe and Schiele
Image
60Results
Shape
ShapeAppearance
Appearance
Without ? x(mx?)
Without ?x(Dmx)
61- Conclusions
- New model for introducing global shape prior in
MRF - Method of combining detection and segmentation
- Efficient LBP for detecting articulated objects
- Future Work
- Other shape parameters need to be explored
- Method needs to be extended to handle multiple
visual aspects