Efficient Image Retrieval Methods For Large Scale Dynamic Image Databases - PowerPoint PPT Presentation

1 / 66
About This Presentation
Title:

Efficient Image Retrieval Methods For Large Scale Dynamic Image Databases

Description:

Efficient Image Retrieval Methods For Large Scale Dynamic Image Databases – PowerPoint PPT presentation

Number of Views:201
Avg rating:3.0/5.0
Slides: 67
Provided by: mrph6
Category:

less

Transcript and Presenter's Notes

Title: Efficient Image Retrieval Methods For Large Scale Dynamic Image Databases


1
Efficient Image Retrieval Methods For Large Scale
Dynamic Image Databases
  • Suman Karthik
  • 200407013
  • Advisor Dr. C.V.Jawahar

2
Images
  • Cheap Imaging Hardware
  • Plummeting Storage costs
  • User Generated Content

3
Image Databases
  • Large Scale
  • Millions to billions of images
  • Dynamic
  • Highly dynamic in nature

Number of Images on Flickr from December 2005 to
November 2007 In millions
4
CBIR
  • Content Based IR
  • Uses image content
  • Pros
  • Good Quality
  • Annotation agnostic
  • Cons
  • Inefficient
  • Not scalable

shape
color
texture
5
Bag Of Words
Index
Feature Extraction
Vector Quantization
Semantic Indexing
Words
W
Compute SIFT descriptors Lowe99
D1
D2
D3
Inverted Index
PLSA, Hoffman, 2001
J Sivic Zisserman,2003 Nister Henrik,2006
Philbin,Sivic,Zisserman et la,2008
6
Dynamic Databases
  • Large scale
  • New images added continuously
  • High rate of change
  • Nature of data not known apriori

Internet
Videos
Images
7
Text vs Images Dynamic databases
  • Vocabulary known
  • Rate of change of vocabulary low
  • Stable vocabulary
  • Vocabulary unknown
  • Rate of change of vocabulary high
  • Unstable vocabulary

8
Quantization and Semantic indexingIn Dynamic
Databases
  • As DB changes vocabulary is outmoded
  • Updating vocabulary is too costly
  • Not incremental
  • Cannot keep up with rate of change
  • As DB changes semantic index is invalid
  • Updating semantic index is resource intensive
  • Not incremental
  • Cannot keep up with rate of change or scale

9
Dynamic Databases
Internet
Dynamic Database
Index
Feature Extraction
Vector Quantization
Semantic Indexing
Videos
Quantization and semantic indexing methods are a
bottleneck
Images
10
Objective 1
  • A. Motivation
  • CBIR is inefficient and not scalable
  • B. Objective
  • Develop methods to improve efficiency and
    scalability of CBIR
  • C. Contributions
  • C 1.1 Virtual Textual Representation
  • C 1.2 A new efficient indexing structure
  • C 1.3 Relevance feedback methods that improves
    performance

11
Objective 2
  • A. Motivation
  • Quantization is bottleneck for BoW when dealing
    with dynamic image databases
  • B. Objective
  • Develop incremental quantization method for
    BoW model to successfully deal with dynamic image
    databases
  • C. Contributions
  • C 2.1 Incremental Vector Quantization
  • C 2.2 Comparison of retrieval performance with
    existing methods
  • C 2.3 Comparison of incremental quantization
    with existing methods

12
Objective 3
  • A. Motivation
  • Semantic Indexing is not scalable for BoW when
    dealing with dynamic image databases
  • B. Objective
  • Develop incremental semantic indexing method
    for BoW model to successfully deal with dynamic
    image databases
  • C. Contributions
  • C 3.1 Bipartite Graph Model
  • C 3.2 An algorithm for semantic indexing on
    BGM
  • C 3.3 Search engines for images

13
CBIR
14
Literature
  • Global image retrieval
  • Region based image retrieval
  • Region Based Relevance feedback
  • Costly nearest neighbor based retrieval
  • Spatial Indexing
  • Relevance feedback heavily used

Image retrieval Past, present, and future,
Yong Rui, Thomas S. Huang, Shih F. Chang In
International Symposium on Multimedia Information
Processing 1997
Blobworld A System for Region-Based Image
Indexing and Retrieval, Chad Carson ,  Megan
Thomas ,  Serge Belongie ,  Joseph M. Hellerstein
,  Jitendra Malik In Third International
Conference on Visual Information Systems 1999
Region-Based Relevance Feedback In Image
Retrieval, Feng Jing ,  Mingjing Li ,  Hong-jiang
Zhang ,  Bo Zhang, Proc. IEEE International
Symposium on Circuits and Systems 2002
15
Search
16
Transformation
Feature Space
Bins represented by strings or words
Quantization
Compactness
Position
Color
17
Virtual Textual Representation
  • Quantization
  • Uniform quantization (grid)
  • Density based quantization(kmeans)
  • Each cell is a string

18
CBIR Indexing
  • Spatial Databases
  • Relevance feedback skews the feature space
    rendering spatial databases inefficient.

details
Indexing for Relevance Feedback Image
Retrieval, Jing Peng ,  Douglas R. Heisterkamp,
In Proceedings of the IEEE International
Conference on Image Processing (ICIP03)
19
Elastic Bucket Trie
Null
BBC
Insert
Query
CAB
CBA
A
C
B
Nodes
A
B
A
B
Overflow
Split
B
A
B
Buckets
Retrieved Bucket
20
Relevance Feedback
Retrieved
Query
Relevance Feedback
21
Region importance based relevance feedback
KEYWORDS
Pseudo Image for next iteration
Keyword Selection
Relevant Images
Extracted Words
Errors In Retrieval
22
Discriminative Relevance Feedback
  • Classification is given precedence over
    clustering.
  • Discriminative segments become the keywords.
  • Non-discriminative segments are ignored.

FLOWERS
ROSES
SURFERS
WAVES
23
Discriminative Relevance Feedback
KEYWORDS
Pseudo Image for next iteration
Keyword Selection
Relevant Images
Extracted Words
Irrelevant Images
No Errors In Retrieval
24
Performance
High Fscore
Low Fscore
  • Discriminative Relevance Feedback consistently
    out performs Region Based Importance method.

25
Region based Relevance feedback
Blobworld, (no indexing)
Our work
Non Spatial indexing
Simplicity (no indexing)
Global image retrieval
Local Image retrieval
Early CBIR
Spatial Indexing
Global relevance Feedback or No relevance feedback
26
Analysis
  • Relevance feedback algorithms need to be modified
    to work with text.
  • Keywords emerge with relevance feedback
    signifying association between key segments.
  • EBT can be used without any modifications with
    discriminative relevance feedback.
  • Advent of Bag of Words model for image retrieval

27
Quantization
28
Literature
  • Kmeans
  • Hierarchical Kmeans
  • Kmeans, Soft assignment
  • Time consuming offline quantization
  • Representative data available apriori
  • Quantization is not incremental

Video Google A Text Retrieval Approach to
Object Matching in Videos, Josef Sivic, Andrew
Zisserman, ICCV 2003
Scalable Recognition with a Vocabulary Tree, D.
Nistér and H. Stewénius, CVPR 2006
Lost in quantization Improving particular
object retrieval in large scale image databases,
James Philbin, Ondrej Chum, Michael Isard, Josef
Sivic, Andrew Zisserman, CVPR 2008
29
Losses
Quantization
  • Perceptual Loss
  • Under quantization
  • Synonymy
  • Poor precision
  • Binning Loss
  • Over quantization
  • Polysemy
  • Poor recall

30
Incremental Vector Quantization
  • Control perceptual loss
  • Minimize binning loss
  • Create quality code books
  • Data dependent
  • Incremental in nature

31
Algorithm
Puts a upper bound on perceptual loss
Soft BinAssignment Minimizes binning loss
Builds quality codebooks by ignoring noise
r
L 2 L minimum cardinality of a cell
32
(No Transcript)
33
An experiment
  • Given
  • All possible feature points in a feature space
    that could be generated by natural processes.
  • Quantize
  • K-means with apriori knowledge of entire data
  • IVQ with no apriori information.
  • Performance
  • F-score
  • Time taken for incremental quantization

Details
34
Fscore
IVQ outperforms Kmeans
IVQ 1115 bins Kmeans 1000 bins
35
Time
  • IVQ quantizes in 0.1 seconds
  • IVQ time complexity is linear
  • Kmeans takes 1000 seconds
  • Time complexity exponential

IVQ outperforms Kmeans
36
Holiday Dataset
  • Datasets
  • Holiday dataset
  • 1491 images
  • 500 categories
  • Pre-processing
  • sift feature extraction.
  • quantization using k-means.
  • quantization using ivq

37
Incremental Quantization
  • Datasets
  • ALOI dataset
  • 100,000 images
  • 1000 batches of 100 image each
  • Pre-processing
  • sift feature extraction.
  • quantization using k-means/online kmeans.
  • quantization using IVQ

S seconds, D Days
Batch 100 images of 100,000 image ALOI dataset
Added sequentially
38
Analysis
  • IVQ bins higher than Kmeans (constant perceptual
    loss)
  • IVQ efficient due to local changes
  • LSH used to accelerate IVQ
  • Semantic indexing can improve mAP

More
39
Incremental
IVQ (local)
Adaptive Vocabulary Tree (global)
Density Based
Online Kmeans
Kmeans
Offline quantization
Online quantization
Non density based
Regular Lattice
Non incremental
40
Semantic Indexing
41
Semantic Indexing
Words clustered around latent topics
Visual Words clustered around latent topics
Animal
Whippet
GSD
doberman
d
Whippet
daffodil
w
GSD
tulip
doberman
P(wd)
rose
daffodil
LSI, pLSA, LDA
tulip
rose
Flower
Hoffman 1999 Blei, Ng Jordan, 2004 R.
Lienhart and M. Slaney,2007
42
Literature
  • Visual pLSA
  • Visual LDA
  • Spatial semantic indexing
  • High space complexity due to large matrix
    operations.
  • Slow, resource intensive offline processing.

Discovering Objects and Their Location in
Images , Josef Sivic, Bryan Russell, Alexei A.
Efros, Andrew Zisserman, and Bill Freeman, ICCV
2005
Image Retrieval on Large-Scale Image Databases,
Eva Horster, Rainer Lienhart, Malcolm Slaney,
CIVR 2007
Spatial Latent Dirichlet Allocation, X. Wang
and E. Grimson, in Proceedings of Neural
Information Processing Systems Conference (NIPS)
2007
43
Bipartite Graph Model
Cash Flow Algorithm
w1
subprime
d1
Financial Crisis
  • Vector space model is encoded as bipartite graph
    of words and document.
  • TF values retained as edge weights.
  • IDF values retained as term weights

w2
reforms
d2
Bush Popularity
TF
11.7
w3
war
25
8.3
d3
50
Saddam Captured
5
100
50
12.5
w4
Iraq
25
d4
Iraq Pullout
IDF
12.5
w5
elections
d5
Obama Elected
w6
democrats
words
Documents
44
BGM with BoW
  • Feature extraction
  • Local detectors, SIFT
  • Vector quantization
  • K-means
  • BGM insertion
  • Words, Documents
  • TF
  • IDF

45
Why BGM is Superior ?
Query image
w1
w2
Inverted Index
Cash Flow
w5
w1
w2
w3
w4
46
Naïve vs BGM
  • Datasets
  • 9000 images of flickr.
  • 9 Sports Categories
  • 5 Animal Categories
  • Pre-processing
  • sift feature extraction.
  • quantization using k-means.
  • F-score
  • 2(pr)/(pr)

47
BGM vs pLSA, IpLSA
Number Of Concepts Known
mAP Time Space
pLSA 0.649 5144s 3267Mb
IpLSA 0.612 63s 3356Mb
BGM 0.594 42s 57Mb
  • pLSA
  • Cannot scale for large databases.
  • Cannot update incrementally.
  • Latent topic intialization difficult
  • Space complexity high
  • IpLSA
  • Cannot scale for large databases.
  • Cannot update new latent topics.
  • Latent topic intialization difficult
  • Space complexity high
  • BGMCashflow
  • Efficient
  • Low space complexity
  • Datasets
  • Holiday dataset
  • 1491 images
  • 500 categories
  • Pre-processing
  • sift feature extraction.
  • quantization using k-means.

Number Of Concepts unknown
mAP Time Space
pLSA 0.553 5062s 3267Mb
IpLSA 0.567 56s 3356Mb
BGM 0.594 42s 57Mb
48
Near Duplicate Retrieval
  • Dataset 500,000 movie frames
  • SIFT vectors
  • Kmeans quantization
  • Indexed using text search library Ferret.
  • Efficient Indexing and retrieval
  • Effectively scalable to large data.
  • Query frame given as query to Ferret index.

  • Cash propagated to every node until cut-off.


49
Sample Retrieval
Query
Retrieval
Fastest Indian
Fight Club
Harry Potter
50
Analysis
  • Low index insert time for new images
  • Less than 200 seconds to insert 1000 images in a
    million image index
  • Marginally higher retrieval time
  • Due to multiple levels of graph traversal
  • Memory usage minimal
  • Works without concept number apriori
  • BGM is a hybrid model
  • Generative
  • discriminative

51
Incremental
BGM IDF
Discriminative
IPLSA
BGM (generative discriminative)
BGM TF
Offline Semantic indexing
Online Semantic indexing
LDA
Generative
PLSA
Non incremental
52
Conclusion
  • Efficient methods for retrieval in large scale
    dynamic image databases
  • Scalability and adaptability have been addressed
  • A step closer to real world image retrieval
  • Features and their mixture, a long way to go

53
Future Work
  • Quality and quantity of features
  • Automatic feature modeling
  • Text search engines for image search
  • GPU based quantization methods
  • Multiple vocabularies for image retrieval
  • Multimodal semantic indexing with BGM

54
List of publications
  • Suman Karthik, C.V. Jawahar, "Incremental On-line
    semantic Indexing for Image Retrieval in Dynamic.
    Databases" 4th International Workshop on Semantic
    Learning and Applications, CVPR, 2008, Florida
  • Suman Karthik, C.V. Jawahar, "Analysis of
    Relevance Feedback in Content Based Image
    Retrieval", Proceedings of the 9th International
    Conference on Control, Automation, Robotics and
    Vision (ICARCV), 2006, Singapore.
  • Suman Karthik, C.V. Jawahar, Virtual Textual
    Representation for Efficient Image Retrieval.
    Proceedings of the 3rd International Conference
    on Visual Information Engineering (VIE), 26-28
    September 2006 in Bangalore, India.
  • Suman Karthik, C.V. Jawahar, Effecient Region
    Based Indexing and Retrieval for Images with
    Elastic Bucket Tries, Proceedings of the
    International Conference on Pattern Recognition
    (ICPR), 2006

55
The End
56
Intuitive way of learning content
  • Over segmentation and subsequent deduction of
    content through relevance feedback.

Document
Words
Transformation
Segmentation
Segments
Text
Image
Discriminative Relevance Feedback leverages this
advantage to achieve better performance than
standard techniques.
57
Kmeans
  • Pros
  • Simple
  • Efficient
  • Cons
  • Computationally expensive
  • Representative Training Set
  • Sensitive to parameter K

58
A naive quantization scheme
Quantization
F1
F3
F2
Advantages - High speed. No quantization
overhead - As dataset size grows precision
increases Disadvantages - Not data dependent,
no idea of visual concept - Information loss due
to hard assignment Suman karthik, C. V.
jawahar, Virtual Textual Representation for
Efficient Image Retrieval VIE 2006 Tuytelaars,
T. and Schmid, C. Vector Quantizing Feature
Space with a Regular Lattice ICCV 2007
59
C 2.1 Methodology
  • Data
  • 1000 Random feature vectors each generated from
    1000 normal distributions in a 2-d feature space.
    A total of 1 million feature points in the space.
  • 100,000 Virtual images falling into 100
    categories where each category image is generated
    by drawing random numbers from 10 normal
    distributions from the above data.
  • Algorithms
  • Kmeans (quantized with the entire data and ideal
    K1000)
  • IVQ
  • Kmeans with soft assignment
  • Measures
  • F-score for retrieval performance
  • Time estimates for incremental quantization

Back
60
Performance
Back
61
Performance
62
Image Retrieval
  • Contemporary approach
  • Uses textual cues
  • Pros
  • Simple
  • Efficient
  • Cons
  • Images are Subjective
  • Text cues unscalable
  • Quality Suffers

Love
Rose
Flower
Petals
Gift
Red
Bud
Green
63
Losses
  • High Perceptual Loss
  • High Binning Loss
  • Optimal Quantization

64
Image retrieval as Text retrieval
  • Can an image be indexed, queried for and
  • retrieved as a text document?

Can this become
this????????????
65
Relevance Feedback
  • Statistical
  • Delta mean algorithm
  • Query Point Movement
  • Inverse Variance
  • Membership Criterion
  • Kernel Based
  • Parzen Windows
  • SVM
  • Kernel BDA
  • Entropy Based
  • KL divergence

ltltBack
66
Semantic Indexing for Images
  • Objects and their location in images.  
  • Large Scale Image Databases
  • Web image selection
  • Spatial Latent Dirichlet Allocation
  • Image auto-annotation
  • High space complexity due to large matrix
    operations.
  • Slow, resource intensive offline processing.

Sivic, J. Russell, B.C. Efros, A.A. Zisserman, A.
Freeman
Lienhart, R.   Slaney, M
Keiji Yanai
Xianggang Wang, Eric Grimson
Monay, Florent and Gatica-Perez, Daniel,
Write a Comment
User Comments (0)
About PowerShow.com