Title: Handdrawn graphics recognition with Bayes Conditional Networks
1Hand-drawn graphics recognition with Bayes
Conditional Networks
- Martin Szummer
- CHIRP Project
- Microsoft Research Cambridge
2Background
- There is more to ink understanding than
handwritten text recognition!
3Introduction
- Bayes Conditional Networks a framework for joint
pattern classification - illustrated on container / connector recognition
in organization charts. - Motivation from product groups
- Tablet PC platform Sashi Raghupathy
- Infografix (IGX) Dan Albertson
4Challenges
5Previous Generative Approach
- Shape recognition using generative models
Krishnapuram, Bishop, Szummer
Bishop 03
6Discriminative Approaches
- For classification, model only the labels given
the ink data avoid modelling the ink itself - e.g. support vector machines but these assume
that all labels are independent! - Conditional Random Fields
- model dependence of labels
- applied to image classification Kumar and Hebert
03, but for 2D problems, parameter estimation is
problematic
7Bayes Conditional Networks (BCN)
- BCNs are undirected graphical models (CRFs)
trained in a Bayesian way - BCN model the probability of the labels
(connector/container) conditioned on features
computed from all ink on the page - Flexible features overlapping, correlated
8Approach
Input drawing with text removed TAB
demo Svensen, Bishop, Gangnet
91. Divide ink strokes into fragments
102. Construct a neighborhood graph on fragments
Potential functions measure compatibility of the
labels with the ink
ti
tj
q - trained parameters
113. Calculate features
box-full feature (strongest feature)
12t-junction feature
13Independent Classification
144. Jointly classify fragments as being part of a
container or connector
time 4 seconds (Matlab prototype)
15BCN training/testing
- Bayesian training of parameters q
- uses the Expectation Propagation (EP) framework
Minka 01 - approximates factors term-by-term
- combined with automatic relevance determination
for feature selection - Inference (testing)
- find most likely joint labeling (MAP)using
junction tree algorithm
16Ink Features
- Unary features depend on the fragment and
typically on its neighboring fragments - length, angle
- histograms of distances and angles
- templates t-junction, full-box
- Pairwise features
- combinations of unary features (, -, x)
- features defined on two neighboring fragments
relative angle, same-stroke
17Feature Selection
fullbox
- Unary
- feature
- weights
- After
- automatic
- relevance
- determination
tjunct
angle
length
18TAB members
19TAB members
20TAB members
21TAB members
22TAB members
23Benchmark
- Data 17 subjects draw given organization charts
on a TabletPC device
24Future Directions
- Recognition of complex objects
- segmentation
- hierarchical classification
- Design an architecture for a complex ink parsing
system - representing uncertainty and multiple hypotheses
between different modules of the system - Other applications of BCNs
- Image classification and segmentation
- Text analysis, information extraction
- Fax and scanned document analysis
25Acknowledgements
- Yuan Qi
- Michel Gangnet
- Christopher Bishop
- Geoffrey Hinton