Document Image Matching Based on Component Blocks - PowerPoint PPT Presentation

1 / 28

About This Presentation

Title:

Document Image Matching Based on Component Blocks

Description:

Document Image Matching Based on Component Blocks ... Department of Electronic & Information Eng., The Hong Kong Polytechnic Univ. ... – PowerPoint PPT presentation

Number of Views:62

Avg rating:3.0/5.0

Slides: 29

Provided by: hanchu5

Category:

more less

Transcript and Presenter's Notes

Title: Document Image Matching Based on Component Blocks

1
Document Image Matching Based on Component Blocks

Fuhui Long, Hanchuan Peng, Zheru Chi, and Wanchi
Siu
Center for Multimedia Signal Processing,
Department of Electronic Information Eng.,
The Hong Kong Polytechnic Univ.,
Email fhlong, phc, enzheru_at_eie.polyu.edu.hk
http//pandora.rad.jhu.edu/phc/publications.html

2
Outline

Introduction
Component Block Data Structure
Matching Algorithm
Experiments
Discussion Conclusion

3
Document Image Matching

Key technique for document image registration
retrieval
Can be applied widely for office automation,
digital library, video-conferencing, etc.

4
Current Techniques

Existing methods are mainly based on local
features of document page image
Cesarinis form-reader system (attributed
relational graphs)
Shimotsujis cell struct based 2-dimensional hash
table
Watanabes blank form structure of repetitions
and positions of cells
Tseng and Chens line segments based method
Fan and Changs line crossing relationship matrix
Watanabe and Huangs predefined logical structure
for business cards
Safaris projective geometry method
etc

5
Our Approach

Decompose a document page image into local
component blocks
Propose measurements to combine
------local block information
------global page layout information
It is closely related to our e-Doc technique,
which is developed for document databases

6
e-Doc

The Block-oriented e-Doc technique can be very
useful for document databases related
applications, including the document page image
retrieval, etc.

7
Preprocessing

Noise removing
Region based binarization and foreground
extraction
Correlation based skew correction
Image Blocking
Scan from bottom to top and from left to right
Use the simplest region growing method (not
pixel-by-pixel, but line-by-line, i.e. if there
is a pixel on the out boundary of current block,
then grow out one more line.)

8
Component Block List
We only make use of the block location size
information for matching.
9
Matching Algorithm
(Notation TBL Template Block List, CBL
Component Block List)
Procedure CBL-MA Input a CBL for the input
document image, a handle to a
template image database of K TBLs Output the
TBL with the minimum distance D to
CBL Preprocessing for k1,,K, do begin sort
the kth TBL by block size (from small to large)
end. Note the preprocessing is not a part of
this CBL-MA and needs to be done only once
beforehand begin sort the CBL by block size
(from small to large) for k1,,K, do begin
compute Dk, which is the distance between
CBL and the kth TBL end select the TBL
with the minimum Dk as output end.
10
Distance Definitions
Size Matching
Location Matching
Total Distance
11
Illustration of the Algorithm
Component block list

Sequencing with size
Sequenced block list

B
A
2. Matching with location
1. Matching with size
Template block list
BT
12
Experimental Data

A large document template data set of 1350
templates. Define 5 subsets with sizes
50 templates
100 templates
200 templates
500 templates
1350 templates
Use computer to generate all test data
(deformation images according to these templates)

13
Deformation Types

Detection Error
Block misdetection rate Pm
Block misaddition rate Pa
Block Size Variation
Block size variation rate Ps
Block size variation scale Ss
Block Location Displacement
Block displacement rate Pd
Block displacement scale Sd
Block Rotation
Block rotation rate Pr
Block rotation angle Dr

14
Data Examples
Template image
Deformation image
15
Results for Detection Errors
(1) CBL-MA can perform well (rc gt 85)
even when 50 blocks in the block list are lost
or wrongly added (see the column of Pm 0.5 and
Pa 0.5). Even when 80 blocks are lost or
added, this algorithm can still produce matching
accuracy nearly 60 (see the column of Pm 0.8
and Pa 0.8). (2) CBL-MA is more
insensitive to block misaddition than to block
misdetection. This is reasonable because when
additional blocks are wrongly put into CBL, the
original blocks still play, although weaker,
roles. On the contrary, the lost information due
to block misdetection is non-recoverable.
16
Results for Detection Errors
Pm Block misdetection rate Pa Block
misaddition rate
17
Results for Block Size Variation
For block size variation, the influence of
parameters Ps and Ss (here we set the same scale
factor for both block width and height) on rc is
given in Table 2. When blocks expand or shrink
greatly, CBL-MA can keep rc above 90 (the fourth
row of Table 2). At the same time, even when all
blocks have size variations (Ps 1.0), our
algorithm can produce a very high matching
accuracy of 95. Note that the latter corresponds
to many office automation applications, where Ss
is not very large, however, most blocks are
subject to some degree of size variation, i.e. Ps
is close to 1.
18
Results for Block Size Variation
Ps Block size variation rate Ss Block size
variation scale
19
Results for Block Displacement
For block location displacement, the influence of
parameters Pd and Sd on rc is given in Table 3.
Evidently CBL-MA is robust to block location
variation (rc always larger than 95).
20
Results for Block Displacement
Pd Block displacement rate Sd Block
displacement scale
21
Results for Block Rotation
For block rotation, the influence of parameters
Pr and Dr on rc is given in Table 4. For both
cases Dr15?, Pr varies from 0.2 to 1.0 and
Pr0.5, Dr varies from 5? to 45?, CBL-MA
produces satisfying classification, even when the
test images contain strong deformation, e.g. 50
component blocks have at most 45? rotation, or
all component blocks have at most 15? rotation.
Notice that block rotation will directly lead to
the significant change of block sizes.
22
Results for Block Rotation
Pr Block rotation rate Dr Block rotation
angle
23
Results for Template Set Size
Here under a general setting of parameters Pa
0.2, Pm0.2, Ps 0.2, Ss0.2, Pd 0.5, Sd 0.5,
Pr 0.5, Dr15?, we examine the influence of the
template image set size on the matching accuracy.
All the five template sets are used. For each
template set, we independently generate at least
2000 images for testing. The results are listed
in Table 5. It is clear that even when the
template set size grows to 500, the matching
accuracy is satisfying (gt80). For the
1350-template set (Set-E), rc is still around
70.
24
Results for Template Set Size
25
Comparison to Other Algorithms

It is noticed that the failure in detecting local
features (e.g. line-segments) usually immediately
results in bad performance of several other
algorithms.
However, our experiments demonstrate that even
when the block information is partially lost or
inaccurate, there is no significant performance
reduction of CBL-MA.

26
Computational Complexity

Denote n as the number of blocks in a CBL, m as
the number of blocks in a TBL, K as the number of
template images in a document image database.
When we use quicksort and binary search
algorithms, the typical computational complexity
is
O(nlogn) for CBL sorting
O(Kmlogn) for CBL/TBL size matching
O(Km(2TC1)) for CBL/TBL location matching
O(K) for CBL/TBL distance calculation
O(logK) for finding the minimum distance
Totally O((Kmn)logn)

27
Application Column Table Data Extraction
This page matching algorithm is then refined and
applied to automatic data extraction of column
forms.
---All fields in input image differ from each
other a lot and the local-feature based approach
can not work well. ---Our algorithm is a powerful
tool to find out the correct image template,
which is used to annotate the image data fields
to accomplish the data extraction successfully.
28
Conclusions

Based on block list (and tree), our algorithm can
effectively make use of the local information of
each page block and the global information of
page layout.
We present a method for effective document image
matching. The algorithm gives satisfying
performance for various image deformations.
The algorithm is robust to image distortion,
filled-in text, and noises.
We report an successful application of our
algorithm in column table data auto-reading.