Title: Contentbased Video Retrieval
1Content-based Video Retrieval
- Willem Jonker
- Philips Research / Twente University
- IFIP 2.6 meeting, Lausanne, May 15 17, 2002
2Contents
- Image Retrieval Current Practice
- Content Based Video Retrieval
- Retrieving events from Tennis Video
- Retrieving events from Formula-1 Video
3Video Content Management
4Image Retrieval Current Practice
5Image Retrieval Current Practice
6Image Retrieval Current Practice
7Precision Recall
8Multi-level Modeling
9Features
background
composition
color
texture
face recognition
motion
text rec. OCR
shape
closed caption
berg-etappe, etappe nr 17
speach rec.
10Query _at_ feature level
11QBIC
12Amore
13Video Retrieval
14Video Retrieval Modeling
15Video Retrieval Modeling
16Video Retrieval Modeling
17Video Retrieval Modeling
18Video Retrieval Query
19Video Retrieval Query
20Video Retrieval Query
21Video Retrieval Query
22The Semantic Gap
User
Concepts
Semantic gap
Raw Video Data
Data
23Tennis Goal
Video
Shot segmentation and classification
Shots (1) Playing (2) Close up
(3) Audience and other
24Tennis HMM History
- 50s - statisticians tried to characterize
processes with incomplete observations. - 60s - Baum et al. published HMM theory in a
series of papers. - 70s - HMMs were used at IBM and CMU for speech
recognition. - 90s - widespread applications (Gesture
recognition, human actions)
25Tennis HMM example
26Tennis HMM example
27Tennis Pre-processing
initial segmentation (robust M-estimator)
Step 1 Step 2 Step 3
locate player
fit 3D model
final segmentation
28Tennis Features
Shape Pie features Skeleton features
29Tennis HMM
Feature extraction
Time
Time
30Tennis HMM
. .
time
states
31Tennis Experiments
- Goals
- Determine the right feature set
- Investigate person independence
- Two series of experiments
- Same player in the training and evaluation
- Training with one player, evaluation of strokes
performed by various players - Six events Forehand, Backhand, Service, Smash,
Forehand volley, and Backhand volley
32Tennis Experiment 1 results
Recognition results ()
- HMM parameters
- Codebook size of 24 symbols
- HMMs with 8 states
- Comparison to related work
- Improvement of 15
- TV videos
33Tennis System Screen Dump
34Tennis Some Performance Figures
Dual Pentium II, 350 MHz Preprocessing 6.5 min.
per stroke of 45 frames initial segmentation
95sec fitting 3D models 150 sec final
segmentation 105 sec feature extraction 40
sec Training the model 20 sequences of 45
frames 50 iterations Baum-Welch 4 states 8
symbol codebook 12 sec 48 states 80 symbols 20
min. Inference between 1 and 30 sec depending
on model complexity in our case for 8 states
model with 20 symbols 2 sec
35Formula-1 Goal
36Formula-1 Bayesian Networks
A
B
C
D
E
P(X,Y) P(XY) P(Y) P(XY)P(Y)
P(YX)P(X) P(X1,X2,,Xn) P i1..n P(XiPa(Xi))
37Formula-1 Audio BN
Kword Key Words PRate Pause Rate AV Average
Values DR Dynamic Range MV Maximum Values STE
Short Time Energy MFCC Mel-Frequency Cepstral
Coefficients
38Formula-1 Audio DBNs
39Formula-1 Effect of Audio DBNs
40Formula-1 Example Videos
41Formula-1 Highlights on Audio
42Formula-1 Combined Audio-Video
43Formula-1 Experimental Results
44Formula-1 System Screen Dump
45Formula-1 Some Performance Figures
46Bridging the Gap
User
Concepts
Domain knowledge
Domain features
Features
Raw Video Data
Data
47DMW projecthttp//wwwhome.cs.utwente.nl/dmw/
48Some Papers
- M. Petkovic, W. Jonker, Content-Based Video
Retrieval by Integrating Spatio-Temporal and
Stochastic Recognition of Events, In Proc. IEEE
International Workshop on Detection and
Recognition of Events in Video, Vancouver,
Canada, July 2001. - M.Petkovic, W. Jonker, Z. Zivkovic, "Recognizing
Strokes in Tennis Videos Using Hidden Markov
Models", International Conference on
Visualization, Imaging and Image Processing,
Marbella, Spain, September 3-5, 2001. - H. E. Blok, M. Windhouwer, R. Zwol, M. Petkovic,
P. M. G. Apers, W. Jonker, M. Kersten, "Flexible
and Scalable Digital Library Search", 27th
International Conference on Very Large Databases,
Roma, Italy, September 11-14, 2001. - V. Mihajlovic, M. Petkovic, W. Jonker,
Content-Based Retrieval of TV Formula 1
Programs, 2nd International Workshop on
Multimedia Data Document Engineering (MDDE'02),
March 24 2002, Prague