Computer Assisted Visual InterActive Recognition (CAVIAR) - PowerPoint PPT Presentation

1 / 109
About This Presentation
Title:

Computer Assisted Visual InterActive Recognition (CAVIAR)

Description:

Computer Assisted Visual InterActive Recognition (CAVIAR) ... Atlantic Sturgeon. Blue Gill. April 16, 2004. Jie Zou, Doclab ECSE, RPI. 65. Other Applications ... – PowerPoint PPT presentation

Number of Views:63
Avg rating:3.0/5.0
Slides: 110
Provided by: Jie7
Category:

less

Transcript and Presenter's Notes

Title: Computer Assisted Visual InterActive Recognition (CAVIAR)


1
Computer Assisted Visual InterActive
Recognition(CAVIAR)
  • Jie Zou
  • RPI ECSE DocLab

Advisor Prof. George Nagy Committee Prof.
Qiang Ji Prof. Robert B. Kelley Prof. Mukkai
Krishnamoorthy
2
Agenda
  • Introduction
  • Related research
  • CAVIAR methodology
  • Interactive segmentation
  • CAVIAR flower recognition system
  • CAVIAR face recognition system
  • Conclusions

3
Agenda
  • Introduction
  • Related research
  • CAVIAR methodology
  • Interactive segmentation
  • CAVIAR flower recognition system
  • CAVIAR face recognition system
  • Conclusions

4
Motivation
  • All operational systems require human assistance
    (preprocessing, handling rejects).
  • CAVIAR makes parsimonious use of human visual
    talent throughout the process rather than only at
    the beginning or the end.

5
Scope of CAVIAR
  • Visual pattern recognition only
  • Each CAVIAR system addresses a specific domain
  • Many class classification
  • Low throughput

6
Research Goals
  • Allocation of human and machine responsibilities
  • Mathematical model
  • Framework and design principles
  • Prototype CAVIAR systems

7
Agenda
  • Introduction
  • Related research
  • CAVIAR methodology
  • Interactive segmentation
  • CAVIAR flower recognition system
  • CAVIAR face recognition system
  • Conclusions

8
Content-Based Image Retrieval
  • Typical search of CBIR system
  • Submit a query image.
  • Specify the relative importance of the features.
  • Relevance feedback (label the retrieved images as
    acceptable or not-acceptable).
  • Iterates until user finds the desired image.

9
CBIR vs. CAVIAR
CBIR
CAVIAR
Subjective retrieval
Objective classification
User judges retrieval results
Statistical decision boundary
Machine weights features
User weights features
Narrow domain
Broad domain
Relevance feedback
Relevance feedback
Model adjustment
10
Flower Recognition
  • Little research on automatic flower recognition
  • M. Das, R. Manmatha, and E.M. Riseman, Indexing
    flower patent images using domain knowledge,"
    IEEE Intelligent Systems, vol. 14, no. 5, pp.
    24-33, 1999.

11
Face Recognition
  • Started in 1960s. Now, most active pattern
    recognition application
  • Eigenface, dominant method
  • Geometrical feature models are appropriate for
    interactive recognition

12
Agenda
  • Introduction
  • Related research
  • CAVIAR methodology
  • Interactive segmentation
  • CAVIAR flower recognition system
  • CAVIAR face recognition system
  • Conclusions

13
Psychophysics
  • Attneave (1954) the nature of redundancy in
    visual stimulations, and information is
    concentrated along contours.
  • Miller (1956) plus or minus 7

14
Allocation of Human and Machine Responsibilities
Conventional System
CAVIAR
15
Notation
CAVIAR state
Model parameters
Features
Index vector
Training set
Label
16
Formal Description (1)
  • Finite state machine
  • Initial state created by
  • Model building
  • Feature extraction
  • Indexing

17
Formal Description (2)
  • Model manipulation leads to a state transition
    from state n to state n1
  • Model building ,
  • Feature extraction
  • Indexing

18
Formal Description (3)
  • The task can terminate at any state by
    identification.

19
Illustration (Video)
20
Agenda
  • Introduction
  • Related research
  • CAVIAR methodology
  • Interactive segmentation
  • CAVIAR flower recognition system
  • CAVIAR face recognition system
  • Conclusions

21
Notation
  • Parametric boundary
  • Exact boundary
  • Foreground region or
  • Background region or
  • Radius vectorintersectsat , andat .

22
Training Color Distributions
23
Training Circle Parameter Distributions
24
Training Deviation of Circular Model From Exact
Boundary
ß5.52
25
Automatic Segmentation Circle Partition
Use a circle to isolate a region, which contains
mostly flower colors.
26
Automatic Segmentation Generate Boundary
Likelihood Map
Distance to the circle
Magnitude of color gradient
Boundary pixels should be close to the circle,
and have high color gradient.
27
Automatic Segmentation Deformation on BLM
  • Circle center a foreground seed, and four
    corner pixels background seeds
  • Foreground and background regions compete with
    each other to expand.
  • Eventually, converge to the watershed of the seed
    pixels on the BLM.

28
Advantage of BLM over Gradient Map
29
Examples of the Result of Automatic Segmentation
30
Interactive Correction (Video)
31
Segment Flower Pictures with Interactive
Correction
  • 1078 flowers from 113 speciesBorjan Gagoski and
    Adam Callahan
  • 5.7 seed pixels, 15.2 seconds per picture Greenie
    Cheng

32
Agenda
  • Motivation
  • Related research
  • Interactive segmentation
  • CAVIAR methodology
  • CAVIAR flower recognition system
  • CAVIAR face recognition system
  • Conclusions

33
Flower Database
  • 320 by 240 resolution
  • Highly variable illumination, and complex
    background
  • 216 samples from 29 classes for development
  • 612 samples from 102 classes for evaluation

34
Rose Curve Model
  • Parametric curve withsix parameters.
  • Flowers are composed of petals, which
    havecircular symmetry.
  • When n0, rose curvereduces to circle.

35
Classification Features
number of petals.
the ratio of outer to inner radius.
first three order moments of the hue
andsaturation histograms
36
CAVIAR-Flower GUI
37
CAVIAR-Flower (Video)
38
Evaluation
  • CAVIAR compared to human-alone and machine-alone
  • Machine learning
  • Decision directed approximation
  • Finite state machine calibration

39
Experimental Protocol
  • Thanks to Borjan Gagoski

40
CAVIAR Compared to Human-Alone and Machine-Alone
  • Significantly reduce the recognition time
    compared to human-alone
  • Significantly increase the accuracy compared to
    machine-alone

41
Accuracy of Initial Automatic Recognition
T2 5 labeled
T3 1 labeled
T4 1 labeled 2 pseudo
T5 1 labeled 4 pseudo
42
Rank Order after Initial Automatic Recognition
T2 5 labeled
T3 1 labeled
T4 1 labeled 2 pseudo
T5 1 labeled 4 pseudo
43
Time of Interactive Recognition
T2 5 labeled
T3 1 labeled
T4 1 labeled 2 pseudo
T5 1 labeled 4 pseudo
44
Accuracy of Interactive Recognition
T2 5 labeled
T3 1 labeled
T4 1 labeled 2 pseudo
T5 1 labeled 4 pseudo
45
Observations about Machine Learning
  • Initialized with a single training samples per
    class.
  • Self-learning user classified pseudo-labeled
    samples improve the performance.
  • Performance of T5 is close to T2, suggesting that
    instead of initializing with many training
    samples, we can trust the systems self learning.

46
Calibration of Finite State Machine
  • 52 samples are immediately confirmed.
  • 90 samples are identified by 3 adjustments.
  • The probability of success on each adjustment is
    just over one half.

47
Summary of CAVIAR-Flower
  • Parameterized rose curve to model the flowers.
  • Display the rose curve and let user adjust it if
    necessary.
  • The evaluation of the system shows advantages of
    CAVIAR system.

48
Agenda
  • Motivation
  • Related research
  • Interactive segmentation
  • CAVIAR methodology
  • CAVIAR flower recognition system
  • CAVIAR face recognition system
  • Conclusions

49
CAVIAR - Face
  • Not to implement a state-of-the-art face
    recognition system.
  • To demonstrate the wider applicability of CAVIAR
    methodology.
  • 400 FERET pictures of 200 subjects (ba and
    bk series)

50
Face model and features
  • Face model contains only two pupils.
  • An automatic facial feature detection program
    locates the other 26 points.
  • Thanks to Yan Tong, Zhiwei Zhu, and Dr. Qiang
    Ji.

51
Face model and features
  • Similarity transform all 28 points to place two
    pupils at (-1,0) and (1,0).
  • The normalized coordinates of the 26 points are
    the features for classification.

52
Interactive Recognition
53
Interactive Recognition
54
Experiments
  • Three types of experiments human browsing only,
    BAAR, and CAVIAR
  • 200 training samples, one from each person
  • 200 test samples, one from each person
  • 15 subjects, each classifies 40 faces

55
CAVIAR Compared to Human-Alone, Machine-Alone,
and BAAR
56
Summary of CAVIAR-Face
  • A simple face recognition system
  • Still clearly shows the advantages of CAVIAR
    approach

57
Agenda
  • Motivation
  • Related research
  • Interactive segmentation
  • CAVIAR methodology
  • CAVIAR flower recognition system
  • CAVIAR face recognition system
  • Conclusions

58
Contributions
  • Domain specific geometrical models to mediate
    between human and computer
  • Allocation of human and machine in interactive
    visual recognition task
  • Model building (primarily human)
  • Feature extraction (primarily machine)
  • Classification (collaborative)
  • Finite state machine
  • Two experimental CAVIAR systems

59
CAVIAR Design Principles
  • Machine tries its best to infer the model and the
    candidates.
  • Machine should display its results, and let the
    user correct any errors.
  • Machine always accepts human correction.

60
Observations
  • CAVIAR system can significantly reduce the time
    compared to unaided human, and significantly
    increase the accuracy compared to unaided machine
    (without years of RD).
  • CAVIAR system can be initialized with a single
    training sample per class.
  • CAVIAR system demonstrates self-learning ability.
  • CAVIAR is better than BAAR, and provides more
    opportunities for machine learning from human
    intervention.

61
Directions For Future Work
  • Other applications
  • Mobile CAVIAR
  • Collaborative learning
  • Machine suggestion to human

62
Other applications Face
  • Face under head rotation, occlusion, and changes
    of illumination and facial expression
  • Elastic bunch graph model, and others

63
Other applications Skin Diseases
  • Nearly 1000 diagnoses
  • Big image atlases available
  • John Hopkins dermatology image atlas
  • University of Erlangen

64
Other applications Fish
Alabama Shad
Black Crappie
Atlantic Sturgeon
Blue Gill
  • U.S. Fish wild life service

65
Other Applications
  • Only a few successful visual recognition
    applications OCR, fingerprint, ??.
  • CAVIAR may make others viable.
  • Education

66
MobileCAVIAR
Courtesy to Abhishek Gattani
67
Mobile CAVIAR
  • Client-server architecture, with a hand-held
    computer as a client, connecting to an Internet
    server.
  • The easy accessibility of multiple pictures poses
    an interesting information fusion problem.

68
Collaborative Learning
  • When using CAVIAR, also collect training samples
  • Pattern identified by some users can benefit peer
    users
  • Extension of the concept of Open Mind Initiative

69
Machine Suggestion to Human
  • Currently, machine suggestions only annoy users.
  • Eventually may be able to make useful suggestions
    to acquire or refine particular features.

70
Thank you!
  • Questions and Comments

71
Motivation
  • Successful visual pattern recognition systems
    require several years RD.
  • All operational systems require human assistance
    (preprocessing, handling rejects).
  • Pronounced difference between human and machine.
  • Human gestalt tasks, object-background
    separation, etc.
  • Machine computing complex features, evaluating
    conditional probabilities, etc.

72
CAVIAR Introduction
It may be more effective to make parsimonious use
of human visual talent throughout the process
rather than at beginning or end.
73
Human and Machine Visual Perception
  • Visual perception is a kind of computation.
  • Two theories of human visual perception.
  • Recognition by components (RBC).
  • View-based recognition.
  • Machine visual perception is still in its infancy.

74
Human Computer Interaction
  • HCI is a wide topic.
  • Graphic user interface
  • Attentive user interface
  • Exploratory data analysis
  • Graphically display the data.
  • For designing a classifier, rather than actual
    classification.
  • Pattern recognition systems with human in the
    loop
  • Preprocessing
  • Handling rejects

75
CAVIAR vs. CBIR
  • CAVIAR is objective classification, CBIR is
    subjective retrieval.
  • Users arbitrarily emphasizing particular features
    is not a good idea in CAVIAR.
  • CAVIAR is on narrow domain, CBIR is usually on
    broad domain.
  • Relevance feedback is useful in CAVIAR, but more
    precise human-computer communication can be
    established through domain-specific models.

76
Unsupervised Decision-directed Approximation
  • Assume that the decision of the classifier is the
    true label of the unknown.
  • Works well if
  • Little overlap among the class-conditional
    densities.
  • Initial classifier is reasonably good.
  • Used in our experiment.
  • Benefits from human intervention.

77
Allocation of Human and Machine Responsibilities
78
Allocation Human and Machine Responsibilities
  • We give up strong segmentation for parametric
    model fitting.
  • Model building is primarily a human
    responsibility.
  • Feature extraction is done primarily by machine.
  • Classification requires human-machine
    collaboration.
  • Human adjusts the model instance.
  • Final human confirmation.

79
CAVIAR Compared to Machine-alone and Human-alone
(1)
  • Cost function is a convex combination of error,
    E, and time, T.
  • Machine-alone EEM, T0.
  • Human-alone E0, TTH.
  • CAVIAR Ei, Ti.

80
CAVIAR Compared to Machine-alone and Human-alone
(2)
81
CAVIAR Compared to Browsing-After-Automatic (BAA)
  • The user browses to find the correct candidate.

82
CAVIAR Compared to Browsing-After-Automatic (BAA)
  • In principle, CAVIAR can never be worse than BAA.
  • The practical scenarios are much more
    complicated, and very difficult to model.
  • CAVIAR provides more opportunity for machine
    learning from human intervention.

83
Automatic Image Segmentation
  • Intensively studied for decades, no
    off-the-shelf solution.
  • Segmentation methods includes
  • Clustering.
  • Edge-linking.
  • Region splitting and merging.
  • Hybrid optimization.
  • General image segmentation is extremely
    difficult.

84
Automatic Segmentation Circle Partition
a0.025.
Use a circle to isolate a region, which contains
mostly flower colors.
85
Interactive Image Segmentation
  • From human initialization, deform automatically
    to the object boundary.
  • Snakes or active contours.
  • Seeded region growing.
  • Human and computer collaborate step by step.
  • Intelligent scissors.
  • Intelligent paint.

86
Image Segmentation Evaluation
  • Analytical methods.
  • Analyze the principles and properties.
  • Empirical methods.
  • Compare the results to the ground-truth
    segmentation.
  • We believe that segmentation can only be
    evaluated according to its purpose.

87
Strong Segmentation Vs. Weak Segmentation
  • Strong segmentation
  • Division of the image into regions region F
    contains the pixels of object O, and nothing
    else. FO.
  • Weak segmentation
  • Partition of the image into conspicuous object
    region F without locating the precise boundary.
    FO.
  • Strong segmentation may not be necessary for
    pattern recognition.

88
Outline of the Interactive Segmentation Procedure
  • Training
  • Color distributions.
  • Circle parameter distributions.
  • Statistics of the deviation of the exact boundary
    from the circle.
  • Automatic segmentation
  • Fit circle by maximizing posterior.
  • Generate boundary likelihood map.
  • Watershed of seeds is the boundary.
  • Interactive correction
  • Shriek or expand the foreground region by
    introducing more seeds.

89
Interactive Correction
90
Evaluation of Automatic Segmentation
  • 174 test samples, 187 training samples from 29
    classes.
  • Region discrepancy (RD) 25.
  • Boundary discrepancy (BD) 8.7 pixels.
  • Classification rank order (RO) 3.84.

91
Rank Order is Not Correlated with RD and BD
92
Experiments Comparing Strong and Weak Segmentation
  • We compared classification results of strong
    segmentation and two (rose curve and circle) weak
    segmentations.
  • Test 612 unknown samples, with 510 training
    samples.

93
Automatic Segmentation is Not Reliable.
The classification result of each kind of
manualsegmentation is much better than that of
the corresponding automatic segmentation.
94
With Reliable Segmentation
The improvement in classification from
strongsegmentation is limited even if reliable
segmentationcan be achieved.
95
When Reliable Segmentation cannot be Achieved
The simpler weak segmentation is preferred if
reliable segmentation can not be achieved.
96
Strong Segmentation may not be necessary for
Pattern Recognition
  • Strong segmentation is generally difficult to
    achieve either automatically or interactively.
  • Strong segmentation is obviously over-fitted to a
    particular sample, not the class that the sample
    belongs to.
  • Our experiments empirically show this.

97
Summary of Interactive Image Segmentation
  • A procedure for model-based interactive
    segmentation.
  • The segmentation results approaches the exact
    boundary with little human efforts.
  • Extensions.
  • For highly textured image.
  • Adjusting circle after human clicking.
  • More elaborate models for both parametric
    partition and boundary likelihood map.

98
Analytical Solution to Fitting a Rose Curve
99
Training
100
Recognition
101
Accuracy of Initial Automatic Recognition
102
Rank Order after Initial Automatic Recognition
103
Top-3 Accuracy of Initial Automatic Recognition
104
Time of Interactive Recognition
105
Accuracy of Interactive Recognition
106
Rank Order After Automatic and Interactive
Recognition
  • Subjects do adjust the rose curves, which reduces
    the median rank orders.

107
CAVIAR Compared to BAA
  • Initial automatic classification is good. The
    average rank order is 6.6 (2 clicks).
  • The subjects are first-time user, not able to
    always find the right strategy.

108
Training
  • Manually entered the positions of 28 points for
    each of 200 training pictures.

The difficulty of face recognition with a
geometrical face model is due to unreliable
automatic feature point detection.
Human correction of eye centers is already very
helpful.
109
New Human-Computer Interfaces
  • Quality of model parameters.
  • Associate confidence measures to model
    parameters, which relates to classification
    features.
  • Assume the parameters adjusted by human have high
    confidence.
  • Occlusion can be communicated to the computer.
  • Categorical features.
  • Divide a family into clusters. (male/female).
  • Category-specific classifiers.
Write a Comment
User Comments (0)
About PowerShow.com