Title: Computer Assisted Visual InterActive Recognition (CAVIAR)
1Computer Assisted Visual InterActive
Recognition(CAVIAR)
Advisor Prof. George Nagy Committee Prof.
Qiang Ji Prof. Robert B. Kelley Prof. Mukkai
Krishnamoorthy
2Agenda
- Introduction
- Related research
- CAVIAR methodology
- Interactive segmentation
- CAVIAR flower recognition system
- CAVIAR face recognition system
- Conclusions
3Agenda
- Introduction
- Related research
- CAVIAR methodology
- Interactive segmentation
- CAVIAR flower recognition system
- CAVIAR face recognition system
- Conclusions
4Motivation
- All operational systems require human assistance
(preprocessing, handling rejects). - CAVIAR makes parsimonious use of human visual
talent throughout the process rather than only at
the beginning or the end.
5Scope of CAVIAR
- Visual pattern recognition only
- Each CAVIAR system addresses a specific domain
- Many class classification
- Low throughput
6Research Goals
- Allocation of human and machine responsibilities
- Mathematical model
- Framework and design principles
- Prototype CAVIAR systems
7Agenda
- Introduction
- Related research
- CAVIAR methodology
- Interactive segmentation
- CAVIAR flower recognition system
- CAVIAR face recognition system
- Conclusions
8Content-Based Image Retrieval
- Typical search of CBIR system
- Submit a query image.
- Specify the relative importance of the features.
- Relevance feedback (label the retrieved images as
acceptable or not-acceptable). - Iterates until user finds the desired image.
9CBIR vs. CAVIAR
CBIR
CAVIAR
Subjective retrieval
Objective classification
User judges retrieval results
Statistical decision boundary
Machine weights features
User weights features
Narrow domain
Broad domain
Relevance feedback
Relevance feedback
Model adjustment
10Flower Recognition
- Little research on automatic flower recognition
- M. Das, R. Manmatha, and E.M. Riseman, Indexing
flower patent images using domain knowledge,"
IEEE Intelligent Systems, vol. 14, no. 5, pp.
24-33, 1999.
11Face Recognition
- Started in 1960s. Now, most active pattern
recognition application - Eigenface, dominant method
- Geometrical feature models are appropriate for
interactive recognition
12Agenda
- Introduction
- Related research
- CAVIAR methodology
- Interactive segmentation
- CAVIAR flower recognition system
- CAVIAR face recognition system
- Conclusions
13Psychophysics
- Attneave (1954) the nature of redundancy in
visual stimulations, and information is
concentrated along contours. - Miller (1956) plus or minus 7
14Allocation of Human and Machine Responsibilities
Conventional System
CAVIAR
15Notation
CAVIAR state
Model parameters
Features
Index vector
Training set
Label
16Formal Description (1)
- Finite state machine
- Initial state created by
- Model building
- Feature extraction
- Indexing
17Formal Description (2)
- Model manipulation leads to a state transition
from state n to state n1 - Model building ,
- Feature extraction
- Indexing
18Formal Description (3)
- The task can terminate at any state by
identification.
19Illustration (Video)
20Agenda
- Introduction
- Related research
- CAVIAR methodology
- Interactive segmentation
- CAVIAR flower recognition system
- CAVIAR face recognition system
- Conclusions
21Notation
- Parametric boundary
- Exact boundary
- Foreground region or
- Background region or
- Radius vectorintersectsat , andat .
22Training Color Distributions
23Training Circle Parameter Distributions
24Training Deviation of Circular Model From Exact
Boundary
ß5.52
25Automatic Segmentation Circle Partition
Use a circle to isolate a region, which contains
mostly flower colors.
26Automatic Segmentation Generate Boundary
Likelihood Map
Distance to the circle
Magnitude of color gradient
Boundary pixels should be close to the circle,
and have high color gradient.
27Automatic Segmentation Deformation on BLM
- Circle center a foreground seed, and four
corner pixels background seeds - Foreground and background regions compete with
each other to expand. - Eventually, converge to the watershed of the seed
pixels on the BLM.
28Advantage of BLM over Gradient Map
29Examples of the Result of Automatic Segmentation
30Interactive Correction (Video)
31Segment Flower Pictures with Interactive
Correction
- 1078 flowers from 113 speciesBorjan Gagoski and
Adam Callahan - 5.7 seed pixels, 15.2 seconds per picture Greenie
Cheng
32Agenda
- Motivation
- Related research
- Interactive segmentation
- CAVIAR methodology
- CAVIAR flower recognition system
- CAVIAR face recognition system
- Conclusions
33Flower Database
- 320 by 240 resolution
- Highly variable illumination, and complex
background - 216 samples from 29 classes for development
- 612 samples from 102 classes for evaluation
34Rose Curve Model
- Parametric curve withsix parameters.
- Flowers are composed of petals, which
havecircular symmetry. - When n0, rose curvereduces to circle.
35Classification Features
number of petals.
the ratio of outer to inner radius.
first three order moments of the hue
andsaturation histograms
36CAVIAR-Flower GUI
37CAVIAR-Flower (Video)
38Evaluation
- CAVIAR compared to human-alone and machine-alone
- Machine learning
- Decision directed approximation
- Finite state machine calibration
39Experimental Protocol
40CAVIAR Compared to Human-Alone and Machine-Alone
- Significantly reduce the recognition time
compared to human-alone - Significantly increase the accuracy compared to
machine-alone
41Accuracy of Initial Automatic Recognition
T2 5 labeled
T3 1 labeled
T4 1 labeled 2 pseudo
T5 1 labeled 4 pseudo
42Rank Order after Initial Automatic Recognition
T2 5 labeled
T3 1 labeled
T4 1 labeled 2 pseudo
T5 1 labeled 4 pseudo
43Time of Interactive Recognition
T2 5 labeled
T3 1 labeled
T4 1 labeled 2 pseudo
T5 1 labeled 4 pseudo
44Accuracy of Interactive Recognition
T2 5 labeled
T3 1 labeled
T4 1 labeled 2 pseudo
T5 1 labeled 4 pseudo
45Observations about Machine Learning
- Initialized with a single training samples per
class. - Self-learning user classified pseudo-labeled
samples improve the performance. - Performance of T5 is close to T2, suggesting that
instead of initializing with many training
samples, we can trust the systems self learning.
46Calibration of Finite State Machine
- 52 samples are immediately confirmed.
- 90 samples are identified by 3 adjustments.
- The probability of success on each adjustment is
just over one half.
47Summary of CAVIAR-Flower
- Parameterized rose curve to model the flowers.
- Display the rose curve and let user adjust it if
necessary. - The evaluation of the system shows advantages of
CAVIAR system.
48Agenda
- Motivation
- Related research
- Interactive segmentation
- CAVIAR methodology
- CAVIAR flower recognition system
- CAVIAR face recognition system
- Conclusions
49CAVIAR - Face
- Not to implement a state-of-the-art face
recognition system. - To demonstrate the wider applicability of CAVIAR
methodology. - 400 FERET pictures of 200 subjects (ba and
bk series)
50Face model and features
- Face model contains only two pupils.
- An automatic facial feature detection program
locates the other 26 points. -
- Thanks to Yan Tong, Zhiwei Zhu, and Dr. Qiang
Ji.
51Face model and features
- Similarity transform all 28 points to place two
pupils at (-1,0) and (1,0). - The normalized coordinates of the 26 points are
the features for classification.
52Interactive Recognition
53Interactive Recognition
54Experiments
- Three types of experiments human browsing only,
BAAR, and CAVIAR - 200 training samples, one from each person
- 200 test samples, one from each person
- 15 subjects, each classifies 40 faces
55CAVIAR Compared to Human-Alone, Machine-Alone,
and BAAR
56Summary of CAVIAR-Face
- A simple face recognition system
- Still clearly shows the advantages of CAVIAR
approach
57Agenda
- Motivation
- Related research
- Interactive segmentation
- CAVIAR methodology
- CAVIAR flower recognition system
- CAVIAR face recognition system
- Conclusions
58Contributions
- Domain specific geometrical models to mediate
between human and computer - Allocation of human and machine in interactive
visual recognition task - Model building (primarily human)
- Feature extraction (primarily machine)
- Classification (collaborative)
- Finite state machine
- Two experimental CAVIAR systems
59CAVIAR Design Principles
- Machine tries its best to infer the model and the
candidates. - Machine should display its results, and let the
user correct any errors. - Machine always accepts human correction.
60Observations
- CAVIAR system can significantly reduce the time
compared to unaided human, and significantly
increase the accuracy compared to unaided machine
(without years of RD). - CAVIAR system can be initialized with a single
training sample per class. - CAVIAR system demonstrates self-learning ability.
- CAVIAR is better than BAAR, and provides more
opportunities for machine learning from human
intervention.
61Directions For Future Work
- Other applications
- Mobile CAVIAR
- Collaborative learning
- Machine suggestion to human
62Other applications Face
- Face under head rotation, occlusion, and changes
of illumination and facial expression - Elastic bunch graph model, and others
63Other applications Skin Diseases
- Nearly 1000 diagnoses
- Big image atlases available
- John Hopkins dermatology image atlas
- University of Erlangen
64Other applications Fish
Alabama Shad
Black Crappie
Atlantic Sturgeon
Blue Gill
- U.S. Fish wild life service
65Other Applications
- Only a few successful visual recognition
applications OCR, fingerprint, ??. - CAVIAR may make others viable.
- Education
66MobileCAVIAR
Courtesy to Abhishek Gattani
67Mobile CAVIAR
- Client-server architecture, with a hand-held
computer as a client, connecting to an Internet
server. - The easy accessibility of multiple pictures poses
an interesting information fusion problem.
68Collaborative Learning
- When using CAVIAR, also collect training samples
- Pattern identified by some users can benefit peer
users - Extension of the concept of Open Mind Initiative
69Machine Suggestion to Human
- Currently, machine suggestions only annoy users.
- Eventually may be able to make useful suggestions
to acquire or refine particular features.
70Thank you!
71Motivation
- Successful visual pattern recognition systems
require several years RD. - All operational systems require human assistance
(preprocessing, handling rejects). - Pronounced difference between human and machine.
- Human gestalt tasks, object-background
separation, etc. - Machine computing complex features, evaluating
conditional probabilities, etc.
72CAVIAR Introduction
It may be more effective to make parsimonious use
of human visual talent throughout the process
rather than at beginning or end.
73Human and Machine Visual Perception
- Visual perception is a kind of computation.
- Two theories of human visual perception.
- Recognition by components (RBC).
- View-based recognition.
- Machine visual perception is still in its infancy.
74Human Computer Interaction
- HCI is a wide topic.
- Graphic user interface
- Attentive user interface
- Exploratory data analysis
- Graphically display the data.
- For designing a classifier, rather than actual
classification. - Pattern recognition systems with human in the
loop - Preprocessing
- Handling rejects
75CAVIAR vs. CBIR
- CAVIAR is objective classification, CBIR is
subjective retrieval. - Users arbitrarily emphasizing particular features
is not a good idea in CAVIAR. - CAVIAR is on narrow domain, CBIR is usually on
broad domain. - Relevance feedback is useful in CAVIAR, but more
precise human-computer communication can be
established through domain-specific models.
76Unsupervised Decision-directed Approximation
- Assume that the decision of the classifier is the
true label of the unknown. - Works well if
- Little overlap among the class-conditional
densities. - Initial classifier is reasonably good.
- Used in our experiment.
- Benefits from human intervention.
77Allocation of Human and Machine Responsibilities
78Allocation Human and Machine Responsibilities
- We give up strong segmentation for parametric
model fitting. - Model building is primarily a human
responsibility. - Feature extraction is done primarily by machine.
- Classification requires human-machine
collaboration. - Human adjusts the model instance.
- Final human confirmation.
79CAVIAR Compared to Machine-alone and Human-alone
(1)
- Cost function is a convex combination of error,
E, and time, T. - Machine-alone EEM, T0.
- Human-alone E0, TTH.
- CAVIAR Ei, Ti.
80CAVIAR Compared to Machine-alone and Human-alone
(2)
81CAVIAR Compared to Browsing-After-Automatic (BAA)
- The user browses to find the correct candidate.
82CAVIAR Compared to Browsing-After-Automatic (BAA)
- In principle, CAVIAR can never be worse than BAA.
- The practical scenarios are much more
complicated, and very difficult to model. - CAVIAR provides more opportunity for machine
learning from human intervention.
83Automatic Image Segmentation
- Intensively studied for decades, no
off-the-shelf solution. - Segmentation methods includes
- Clustering.
- Edge-linking.
- Region splitting and merging.
- Hybrid optimization.
- General image segmentation is extremely
difficult.
84Automatic Segmentation Circle Partition
a0.025.
Use a circle to isolate a region, which contains
mostly flower colors.
85Interactive Image Segmentation
- From human initialization, deform automatically
to the object boundary. - Snakes or active contours.
- Seeded region growing.
- Human and computer collaborate step by step.
- Intelligent scissors.
- Intelligent paint.
86Image Segmentation Evaluation
- Analytical methods.
- Analyze the principles and properties.
- Empirical methods.
- Compare the results to the ground-truth
segmentation. - We believe that segmentation can only be
evaluated according to its purpose.
87Strong Segmentation Vs. Weak Segmentation
- Strong segmentation
- Division of the image into regions region F
contains the pixels of object O, and nothing
else. FO. - Weak segmentation
- Partition of the image into conspicuous object
region F without locating the precise boundary.
FO. - Strong segmentation may not be necessary for
pattern recognition.
88Outline of the Interactive Segmentation Procedure
- Training
- Color distributions.
- Circle parameter distributions.
- Statistics of the deviation of the exact boundary
from the circle. - Automatic segmentation
- Fit circle by maximizing posterior.
- Generate boundary likelihood map.
- Watershed of seeds is the boundary.
- Interactive correction
- Shriek or expand the foreground region by
introducing more seeds.
89Interactive Correction
90Evaluation of Automatic Segmentation
- 174 test samples, 187 training samples from 29
classes. - Region discrepancy (RD) 25.
- Boundary discrepancy (BD) 8.7 pixels.
- Classification rank order (RO) 3.84.
91Rank Order is Not Correlated with RD and BD
92Experiments Comparing Strong and Weak Segmentation
- We compared classification results of strong
segmentation and two (rose curve and circle) weak
segmentations. - Test 612 unknown samples, with 510 training
samples.
93Automatic Segmentation is Not Reliable.
The classification result of each kind of
manualsegmentation is much better than that of
the corresponding automatic segmentation.
94With Reliable Segmentation
The improvement in classification from
strongsegmentation is limited even if reliable
segmentationcan be achieved.
95When Reliable Segmentation cannot be Achieved
The simpler weak segmentation is preferred if
reliable segmentation can not be achieved.
96Strong Segmentation may not be necessary for
Pattern Recognition
- Strong segmentation is generally difficult to
achieve either automatically or interactively. - Strong segmentation is obviously over-fitted to a
particular sample, not the class that the sample
belongs to. - Our experiments empirically show this.
97Summary of Interactive Image Segmentation
- A procedure for model-based interactive
segmentation. - The segmentation results approaches the exact
boundary with little human efforts. - Extensions.
- For highly textured image.
- Adjusting circle after human clicking.
- More elaborate models for both parametric
partition and boundary likelihood map.
98Analytical Solution to Fitting a Rose Curve
99Training
100Recognition
101Accuracy of Initial Automatic Recognition
102Rank Order after Initial Automatic Recognition
103Top-3 Accuracy of Initial Automatic Recognition
104Time of Interactive Recognition
105Accuracy of Interactive Recognition
106Rank Order After Automatic and Interactive
Recognition
- Subjects do adjust the rose curves, which reduces
the median rank orders.
107CAVIAR Compared to BAA
- Initial automatic classification is good. The
average rank order is 6.6 (2 clicks). - The subjects are first-time user, not able to
always find the right strategy.
108Training
- Manually entered the positions of 28 points for
each of 200 training pictures.
The difficulty of face recognition with a
geometrical face model is due to unreliable
automatic feature point detection.
Human correction of eye centers is already very
helpful.
109New Human-Computer Interfaces
- Quality of model parameters.
- Associate confidence measures to model
parameters, which relates to classification
features. - Assume the parameters adjusted by human have high
confidence. - Occlusion can be communicated to the computer.
- Categorical features.
- Divide a family into clusters. (male/female).
- Category-specific classifiers.