Henry S. Baird - PowerPoint PPT Presentation

About This Presentation

Title:

Henry S. Baird

Description:

Apply aggressive OCR to isolated chars. If it's known (or guessed) that the word is ... vert. Change intercharacter space. Pattern Recognition. Research Lab ... – PowerPoint PPT presentation

Number of Views:47

Avg rating:3.0/5.0

Slides: 27

Provided by: cseLe

Learn more at: https://www.cse.lehigh.edu

Category:

more less

Transcript and Presenter's Notes

Title: Henry S. Baird

1
A Highly Legible CAPTCHAthat Resists
Segmentation Attacks

Henry S. Baird
Michael A. Moll
Sui-Yu Wang

2
Some Typical CAPTCHAs

AltaVista
eBay/PayPal
Yahoo!
PARCs PessimalPrint
3
All These Are Vulnerable to Segment-then-Recogniz
e Attack

Effective strategy of attack
Segment image into characters
Apply aggressive OCR to isolated chars
If its known (or guessed) that the word is
spellable (e.g. legal
English), use the lexicon to constrain
interpretations
Patrice Simard (MS Research) et al report that
this breaks many
widely used CAPTCHAs

4
We try to generate word-imagesthat will be hard
to segment into characters

Slice characters up -vertical
cuts then -horizontal cuts
Set size of cuts to constant within a word
Choose positions of cuts randomly
Force pieces to drift apart scatter horiz.
vert.
Change intercharacter space

5
Character fragments can interpenetrate

Not only is it hard to segment the word into
characters, .
it can be hard to recombine characters
fragments into characters

6
Character fragments can interpenetrate

Not only is it hard to segment the word into
characters, .
it can be hard to recombine characters
fragments into characters

7
Nonsense Words

We use nonsense (but English-like) words (as in
BaffleText)
generated pseudorandomly by a stochastic
variable-length character n-gram model
trained on the Brown corpus
this
protects against lexicon-driven attacks
Why not use random strings?
We want to help human readers feel confident they
have made a plausible choice, so theyll put up
with severe image degradations (Cf. research in
psychophysics of reading.)

M. Chew H. S. Baird, BaffleText a Human
Interactive Proof, Proc., 10th SPIE/IST
Document Recognition and Retrieval Conf.,
(DRR2003), Santa Clara, CA, January 23-24, 2003.
8
How Well Can People Read These?

We carried out a human legibility trial with the
help of
60 volunteers students, faculty, staff
at Lehigh Univ.
plus colleagues at Avaya Labs Research

9
Subjects were told they got it right/wrong
after they rated its difficulty
10
Subjective difficulty ratingswere correlated
with objective difficulty

People often know when theyve done well
This can be used to ensure that challenges arent
too hard (frustrating, angering)

11
The same data, graphically
1 Easy 2 3 4 5 Impossible
Right Wrong
12
People Rated These Easy (1/5)

aferatic
memmari
heiwho
nampaign

13
Rated Medium Hard (3/5)

overch / ovorch
wouwould
atlager / adager
weland / wejund

14
Rated Impossible (5/5)

acchown / echaeva
gualing / gealthas
bothere / beadave
caquired / engaberse

15
Why is ScatterType legible?

Does it surprise you that this is legible?
I speculate that we can read it because
we exploit typeface consistency
the evidence is small details of
local shape
this ability seems largely unconscious

16
Ensuring that ScatterType is Legible

We mapped the domain of legibility
as a function of engineering
choices
typefaces
characters in the alphabet
cutting scattering parameters

cut fraction expansion fraction horizontal
scatter mean vertical scatter mean h v scatter
variance character separation
17
Some typefaces remain legiblewhile others
degrade quickly
18
Raising Legibility byPruning Typefaces
19
Some Characters QuicklyBecome Confusable
overch o e c confusions
20
Raising Accuracy byOmitting Characters
21
Ensuring Legibility

Pruning characters typefaces
raised legibility in the top two difficulty
levels to 90
Next step
restrict the range of cutting scatter parameters

22
Mean Horizontal Scattervs Mean Vertical Scatter
1 Easy 2 3 4 5 Impossible
Right Wrong
Mirage data analysis tool, Tin Kam Ho, Bell Labs.
23
Cut Fraction Histogram
1 Easy 2 3 4 5 Impossible
Right Wrong
24
Character Separation Histogram
1 Easy 2 3 4 5 Impossible
Right Wrong
25
Finding Parameter Rangesfor High Legibility
d Euclidean distance from origin of Mean Horiz
Scatter vs Mean Vertical Scatter
26
Guided by this Analysis, We Can Define
Legibility Regimes
Trivial large cut fraction and small expansion
Simple character separation also decreases
Easy in original trial, correct 81 of time
Medium Hard larger scatter distances degrades
legibility noticeably
27
Other Examples - Easy
wexped - difficult to segment e, x and p.
Shows difficulty of achieving 100 legibility
veral - same parameters as above but different
font. Not as difficult to segment
28
Other Examples - Too Hard
thern difficult to read, but easier than most
with the same parameter values. Font makes a big
difference.
wezre satisfactorily illegible, though probably
segmentable
29
Next Steps

By judicious restrictions on engineering
parameters, attempt to ensure human legibility
better than 99.5
Similarly, attempt to ensure 90 of challenges
have low subjective difficulty ratings (e.g. 1-3
out of 5)
You are welcome to try out ScatterType
arcturus.cse.lehigh.edu/CAPTCHAs
Also, we invite you to attack it
Well send you large batches, with ground-truth
Try to train a classifier to break it!

30
Future Work

We have exhausted the experimental data from the
1st trial
How can we automatically create images with given
difficulty?
We have generated many images that seem difficult
to segment automatically, but we dont understand
how to guarantee this
We need to understand the effects of typefaces on
ScatterType legibility
We want to study character-confusion pairs more
Attacking ScatterType
Testing on best OCR systems
Invite attacks from other researchers
Is it credible if we attack it ourselves, and
fail?