Recognition of Multi-Fonts Character in Early-Modern Printed Books - PowerPoint PPT Presentation

About This Presentation

Title:

Recognition of Multi-Fonts Character in Early-Modern Printed Books

Description:

Tsukasa Kimesawa(2) and Kazuki Joe(1) (1) Nara Women's University, Japan ... Monochrome or 256-grayscale. 14. PDC. feature. Experiments Description(1/2) ... – PowerPoint PPT presentation

Number of Views:49

Avg rating:3.0/5.0

Slides: 28

Provided by: nara3

Category:

more less

Transcript and Presenter's Notes

Title: Recognition of Multi-Fonts Character in Early-Modern Printed Books

1
Recognition of Multi-Fonts Character in
Early-ModernPrinted Books

Chisato Ishikawa(1), Naomi Ashida(1),
Yurie Enomoto(1), Masami Takata(1),
Tsukasa Kimesawa(2) and Kazuki Joe(1)
(1) Nara Womens University, Japan
(2) National Diet Library, Japan
Currently work for Mitsubishi Electric co

2
Contents

Introduction
Multi-fonts character recognition
Feature extraction from character images
Learning method for feature
Experiments
Improvement of pre-process
Conclusions and future work

3
Introduction

The Digital Library from the Meiji Era
(Supported by the National Diet Library in
Japan)
Digital archive Books published in the Meiji and
Taisho eras

1868-1926
The digital data are opened at the project Web
site
Search box
Top page
Data Viewer
4
Introduction
Full text search, text function Not supported

Main bodies
of books

Image data
5
Flow of OCR
Character image
Input image data
Character image data X
Pre-process
Preprocessed image data X
Feature extraction
Feature vector v
Recognition
Recognized class no. n
6
Flow of our OCRPre-process
Character image

Noise reduction
Normalization
Removing margin
Normalizing size
Normalizing position

Input image data
Character image data X
Pre-process
Preprocessed image data X
Feature extraction
Feature vector v
Recognition
Recognized class no. n
7
Flow of our OCRFeature Extraction
Character image
Extraction of a PDC feature
Peripheral Direction Contributivity Reflects
four statuses of character-lines
Direction Connectivity Relative
position Complexion
Input image data
Character image data X
Preprocessing
Preprocessed image data X
Feature extraction
Feature vector v
Recognition
Recognized class no. n
8
PDC Feature
Scanning from 8 directions
Reflecting the position of character-lines
Target character image
Reflecting the direction and the connectivity of
character-lines
9
PDC Feature
Reflecting the complexity of character-lines
Scanning-line
Direction contributivity
Direction contributivity
Direction contributivity
Scanning-line
Base image
10
PDC Feature

PDC feature vector Direction contributivities set

Direction contributivity element 4
11
Flow of our OCRRecognition
Character image
Input image data

Recognition by an SVM

A character image data X

Support Vector Machine
High generalization capability
Independence of the number of target vector
dimension
Low calculation cost

Preprocessing
Preprocessed image data X
Feature extraction
Feature vector v
Recognition
Recognized class no. n
12
Experiments

Experimental sample data
Character images obtained from The Digital
Library from the Meiji era
Target characters

13
Examples of Sample Images
No.1 (?)
No.2 (?)
No.3 (?)
No.4 (?)
No.5 (?)
No.6 (?)
No.7 (?)
No.8 (?)
No.9 (?)
No.10 (?)
Monochrome or 256-grayscale
14
Experiments Description(1/2)

Conversion of character images to feature
vectors
Pre-process
Binarization Threshold 128
Noise Reduction Median filter (Filter size33)
Normalization Removing margin and scaling to
128128
Extraction of PDC features
Vector dimension 1536

Pre-process
Extraction of PDC features
PDC feature
3.
1.
2.
PDC feature
15
Experiments Description(2/2)

Learning and evaluation of a recognition model
Learning recognition model with training samples
to SVM
Used SVM LIB-SVM
Parameters of SVM Tweaked by grid search
Evaluation of the recognition model by using test
samples

Tweaked by grid-search
50 samples for each character
Training samples
SVM (LIB-SVM)
Learning
Parameters
PDC feature
Test samples
PDC feature
PDC feature
16
Result of Recognition Model Evaluation
?We have shown this result at 73th
Mathematical Modeling and Problem Solving (MPS)
in March, 2009.

Recognition rate 97.8

cf. Recognition rate by neural network(NN)??
77.6 Computation time ?? SVM NN 1 7.7
17
Recognition Error in Result

Some images are not recognized because of

or
18
Improvement of Pre-process

Pre-process
Binarization
Thresholdt128
First noise reduction
Median filter, Filter size33
Normalization

Second noise reduction
Based on estimated width of character-line
Normalization

19
Noise Reduction based on Estimation of
Character-line Width
Target image

Estimation of line width by using the largest
connected component X
lpn Length of the shortest connected line
pass through pixel pn (pn?X)
Elimination of connected component whose area is
smaller than

Estimated width of character-line bmedian
value of lpn
The largest component X
20
Noise Reduction based on Estimation of
Character-line Width
Target image

Estimation of line width by using the largest
connected component X
lpn Length of the shortest connected line
pass through pixel pn (pn?X)
Elimination of connected components whose area
are smaller than

Estimated width of character-line bmedian
value of lpn
21
Noise Reduction based on Estimation of
Character-line Width
Target image

Estimation of line width by using the largest
connected component X
lpn Length of the shortest connected line
pass through pixel pn (pn?X)
Elimination of connected components whose area
are smaller than

Estimated width of character-line bmedian
value of lpn
22
Result of Improved Pre-process Adoption

Recognition rate 97.8?99.0

23
DiscussionCase better recognition(Error?Correct)
Previous pre-process Error
Improved pre-process Correct
Quality of test samples are improved
Quality of training samples are
improved More efficient recognition
model
24
DiscussionCase unchanged(Error?Error)
Previous Error
Improved Error
Connected to character-line
Residual noise
Error
Similar form of character no.5(?)
Error
Shorter than major form ?Similar with one
horizontal line
Major form of no.8
25
DiscussionCase worse recognition (Correct?Error)
Pre-processed images
Previous Correct
Improved Error
Previous
Improved
Training samples with lack of line are reduced
Recognition rate of data with lack of line
becomes low
26
Conclusions and Future work