TEMPORAL EVENT CLUSTERING FOR DIGITAL PHOTO COLLECTIONS - PowerPoint PPT Presentation

1 / 30
About This Presentation
Title:

TEMPORAL EVENT CLUSTERING FOR DIGITAL PHOTO COLLECTIONS

Description:

TEMPORAL EVENT CLUSTERING FOR DIGITAL PHOTO COLLECTIONS ... Wedding. BASIC CONCEPTS--- EXIF & CBIR. Exchangeable Image File (EXIF) ... – PowerPoint PPT presentation

Number of Views:84
Avg rating:3.0/5.0
Slides: 31
Provided by: ran68
Category:

less

Transcript and Presenter's Notes

Title: TEMPORAL EVENT CLUSTERING FOR DIGITAL PHOTO COLLECTIONS


1
TEMPORAL EVENT CLUSTERING FOR DIGITAL PHOTO
COLLECTIONS
  • Matthew Cooper, Jonathan Foote, Andreas
    Girgensohn, and Lynn Wilcox
  • ACM Multimedia
  • ACM Transactions on Multimedia Computing ,
    Communications and Application

2
OUTLINE
  • Introduction
  • Feature extraction
  • Clustering techniques
  • Supervised event clustering
  • Unsupervised event clustering
  • Clustering goodness criteria
  • Experimental result
  • Conclusion

3
Introduction
  • Users navigate their photos
  • Temporal order
  • Visual content
  • Associate time and content with the notion of a
    specific event
  • Photos associated with an event often exhibit
    little coherence in terms of either low-level
    image features or visual similarity
  • photographs from the same event are taken in
    relatively close proximity in time

4
Basic concepts--- Event
  • Events are naturally associated with specific
    times and places.
  • Birthday party
  • Vacation
  • Wedding

5
Basic concepts--- EXIF CBIR
Metadata
  • Exchangeable Image File (EXIF)
  • Time, Location, Focal length, Flash, etc.
  • gt Season, place, weather, indoor/outdoor,etc
  • Content-based Image Retrieval (CBIR)
  • Color, Texture, Shape, etc.
  • gt Face Fingerprint Recognition,etc

6
FEATURE EXTRACTION
  • EXIF headers are processed to extract the
    timestamp
  • The N photos in the collection are then ordered
    in time so the resulting timestamps,
  • tnn 1, . . . , N,satisfy t1 t2 tN
  • Time difference between indices (photos) is
    nonuniform

7
FEATURE EXTRACTION
  • Computing similarity matrices SK

temporal similarity matrix
8
FEATURE EXTRACTION
  • Computing similarity matrix
  • low-frequency discrete cosine transform (DCT)
    coefficients from each photo using the cosine
    distance measure

content-based similarity matrix
9
FEATURE EXTRACTION
  • computing novelty scores

peaks in the novelty scores cluster boundaries
between contiguous groups of similar photos
K1000
K10000
K100000
10
CLUSTERING TECHNIQUES
  • Supervised event clustering
  • Based on LVQ
  • Unsupervised event clustering
  • Scale-space analysis of the raw timestamp data
  • Temporal Similarity Analysis
  • Combining Time and Content-Based Similarity

11
Supervised event clustering
  • Let K take M values K K1, . . . , KM
  • Define the M N matrix N(j,i) ?Kj (i)
  • , where
  • Based on LVQ (Learning Vector Quantization)
  • Kohonen 1989
  • LVQ codebook discriminates between the two
    classes event boundary and event interior.
  • The codebook vectors for each class are used for
    nearest-neighbor classification of the novelty
    features for each photo in the test set.

12
Supervised event clustering
  • In the training phase, a codebook is calculated
    using an iterative procedure
  • Each step
  • Nearest codebook vector to each training sample
    is determined
  • shifted toward or away the training sample

If Nx and Mc are in the same class
If Nx and Mc arent in the same class
13
Supervised event clustering
  • ALGORITHM 1 (LVQ-BASED PHOTO CLUSTERING).
  • (1) Calculate novelty features from labeled
    sorted training data for each scale K
  • (i) compute the similarity matrix SK
  • (ii) compute the novelty score ?K
  • (2) Train LVQ using the iterative procedure
  • (3) Calculate novelty features for the testing
    data for each K
  • (i) compute the similarity matrix SK
  • (ii) compute the novelty score ?K
  • (4) Classify each test samples novelty features
    Ni using the LVQ codebook and the
    nearest-neighbor rule.

14
Unsupervised event clustering
  • scale-space analysis
  • operate on the raw timestamps
  • T0 t1, . . . , tN so that T0(i) ti
  • ALGORITHM 2 (SCALE-SPACE PHOTO CLUSTERING).
  • (1) Extract timestamp data from photo collection
    t1, . . . , tN.
  • (2) For each s in descending order
  • (i) compute Ts
  • (ii) detect peaks in Ts , tracing peaks from
    larger to smaller scales (decreasing s).

15
UNSUPERVISED EVENT CLUSTERING
  • Temporal Similarity Analysis
  • Locate peaks at each scale by analysis of the
    first difference of each novelty scores ?K ,
    proceeding from coarse scale to fine (decreasing
    K)
  • To build a hierarchical set of event boundaries,
    we include boundaries detected at coarse scales
    in the boundary lists for all finer scales.

checkerboard kernel used to compute the novelty
features
16
UNSUPERVISED EVENT CLUSTERING
  • Combining Time and Content-Based Similarity
  • constructed a content-based matrix SC using
    low-frequency DCT features and the cosine
    distance
  • if ti-tj gt
    48h
  • others
  • if ti-tj gt
    48h
  • others

17
CLUSTERING GOODNESS CRITERIA
  • Peak detection at each scale K results in a
    hierarchical set of candidate boundaries
  • Subset must be selected to define the final event
    clusters
  • Three different automatic approaches
  • Similarity-Based Confidence Score
  • Boundary Selection via Dynamic Programming
  • BIC-Based Boundary Selection

18
Similarity-Based Confidence Score
  • Detected boundaries at each level K,
  • BK b1, . . . , bnK ,
  • indexed by photo BK ? 1, . . . , N

average intercluster similarity between photos in
adjacent clusters
average intracluster similarity between the
photos within each cluster
19
Boundary Selection via Dynamic Programming
  • Reduced complexity
  • Begin with the set of peaks detected from the
    novelty features at all scales
  • Cost of the cluster between photos bi and bj

20
Boundary Selection via Dynamic Programming
  • Optimal partitions with m boundaries based on the
    optimal partition with m-1 boundaries
  • First, optimal partitions are computed with two
    clusters
  • EF (j,m) is the optimal partition of the photos
    with cardinality m

21
Boundary Selection via Dynamic Programming
  • Number of clusters increases, the total cost of
    the partition decreases monotonically
  • Selecting the optimal number of clusters, M,
    based on the total partition cost

22
BIC-Based Boundary Selection
  • This method is based on the Bayes information
    criterion (BIC) Schwarz 1978
  • Assumption
  • timestamps within an event are distributed
    normally around the event mean

Log-likelihood of the single segment model and
the penalty term
log-likelihood of the two segment model
? is 2 ,since we describe each segment using the
sample mean µ,and variance, s2
23
BIC-BASED BOUNDARY SELECTION
  • Employ the hierarchical coarse-to-fine approach
  • At each scale, we test only the newly detected
    boundaries (undetected at coarser scales)
  • Add the boundaries for which the left side
    exceeds the right side

24
ALGORITHM 3 (SIMILARITY-BASED PHOTO CLUSTERING)
  • (1) Extract and sort photo timestamps, t1, . . .
    , tn.
  • (2) For each K in decreasing order
  • (i) compute the similarity matrix Sk
  • (ii) compute the novelty score ?K
  • (iii) detect peaks in ?K
  • (iv) form event boundary list using event
    boundaries from previous iterations and newly
    detected peaks
  • (3) Determine a final boundary subset of
    collected boundaries over all scales considered
    according to one of the methods
  • (a) the confidence score
  • (b) the DP boundary selection approach
  • (c) the BIC boundary selection approach

25
EXPERIMENTAL RESULT
  • Run Times for Different Size Photo Collections
  • The times are in seconds
  • No Conf. indicates times for Steps 1 and 2
  • BIC peak selection (BIC)
  • Dynamic programming peak selection (DP)
  • similarity-based peak selection (Conf.)
  • Doubling the number of photos(N),the time for the
    segmentation step(No Conf.) increases linearly,
    while including the confidence measure (Conf.)
    incurs a polynomial cost.

26
EXPERIMENTAL RESULT
  • Compare the event clustering performance of
    eleven systems on two separate photo collections
  • Collection I consists of 1036 photos taken over
    15 months
  • Collection II consists of 413 photos taken over
    13 months
  • The first four algorithms in
  • the table are hand-tuned
  • to maximize performance.
  • The remaining algorithms
  • are fully automatic.

27
EXPERIMENTAL RESULT
  • Precision indicates the proportion of falsely
    labeled boundaries
  • Recall measures the proportion of true boundaries
    detected
  • The F-score is a composite of precision and
    recall

28
EXPERIMENTAL RESULT
29
EXPERIMENTAL RESULT
  • The adaptive-thresholding algorithms exhibited
    high recall and low precision on both test sets,
    even with manual tuning
  • Scale-space and the two similarity-based
    approaches demonstrated more consistent
    performance and traded off precision and recall
    more evenly

30
CONCLUSION
  • Employed the automatic temporal similarity-based
    method
  • Does not rely on preset thresholds or restrictive
    assumptions
  • As photo collections with location information
    become available, we hope to extend our system to
    combine temporal similarity, content-based
    similarity, and location-based similarity.
  • The automatic methods performance exceeded that
    of manually tuned alternatives in our testing,
    and have been well received by users of our photo
    management application.
Write a Comment
User Comments (0)
About PowerShow.com