Parsing Images with Context/Content Sensitive Grammars Eran Borenstein, Stuart Geman, Ya Jin, Wei Zhang - PowerPoint PPT Presentation

1 / 40
About This Presentation
Title:

Parsing Images with Context/Content Sensitive Grammars Eran Borenstein, Stuart Geman, Ya Jin, Wei Zhang

Description:

Parsing Images with Context/Content Sensitive Grammars Eran Borenstein, Stuart Geman, Ya Jin, Wei Zhang – PowerPoint PPT presentation

Number of Views:131
Avg rating:3.0/5.0
Slides: 41
Provided by: Stuart253
Category:

less

Transcript and Presenter's Notes

Title: Parsing Images with Context/Content Sensitive Grammars Eran Borenstein, Stuart Geman, Ya Jin, Wei Zhang


1
Parsing Images with Context/Content Sensitive
Grammars Eran Borenstein, Stuart Geman, Ya
Jin, Wei Zhang
2
  1. Structured Representation in Neural Systems
  2. Vision is Hard
  3. Why is Vision Hard?
  4. Hierarchies of Reusable Parts
  5. Demonstration System Reading License Plates
  6. Generalization Face Detection

3
Artificial Intelligence
  • Knowledge Engineering

engineer everything, learn nothing
  • Learning Theory

engineer nothing, learn everything
  • Both Lack Model

4
Natural Intelligence
  • Strong Representation

simulation and semantics
  • Hierarchy and Reusability

ventral visual pathway, linguistics,
compositionality
5
  1. Structured Representation in Neural Systems
  2. Vision is Hard
  3. Why is Vision Hard?
  4. Hierarchies of Reusable Parts
  5. Demonstration System Reading License Plates
  6. Generalization Face Detection

6
License plate images from Logan Airport
Machines still cant reliably read license plates
7
Wafer IDs
Machines cant read fixed-font fixed-scale
characters as well as humans
8
Super Bowl
Machines cant find the bad guys at the Super Bowl
9
  1. Structured Representation in Neural Systems
  2. Vision is Hard
  3. Why is Vision Hard?
  4. Hierarchies of Reusable Parts
  5. Demonstration System Reading License Plates
  6. Generalization Face Detection

10
Instantiation
Vision is content sensitive
11
Clutter
Background is structured, and made of the same
stuff!
12
  1. Structured Representation in Neural Systems
  2. Vision is Hard
  3. Why is Vision Hard?
  4. Hierarchies of Reusable Parts
  5. Demonstration System Reading License Plates
  6. Generalization Face Detection

13
Hierarchical of Reusable Parts
e.g. animals, trees, rocks
e.g. contours, intermediate objects
Bricks
e.g. linelets, curvelets, T-junctions
e.g. discontinuities, gradient
14
Hierarchy of Disjunctions of Conjunctions
15
Hierarchy of Disjunctions of Conjunctions
16
Hierarchy of Disjunctions of Conjunctions
17
Hierarchy of Disjunctions of Conjunctions
18
Hierarchy of Disjunctions of Conjunctions
19
Hierarchy of Disjunctions of Conjunctions
20
Hierarchy of Disjunctions of Conjunctions
21
Interpretations and Probabilities
Interpretation
22
Interpretations and Probabilities
Interpretation
23
Interpretations and Probabilities
GRAPHICAL MODEL (Markov)
LIKELIHOOD RATIO (non-Markov)
X
24
Generative (Bayesian) Model
25
  1. Structured Representation in Neural Systems
  2. Vision is Hard
  3. Why is Vision Hard?
  4. Hierarchies of Reusable Parts
  5. Demonstration System Reading License Plates
  6. Generalization Face Detection

26
Test set 385 images, mostly from Logan Airport
Courtesy of Visics Corporation
27
Architecture
license plates
license numbers (3 digits 3 letters, 4 digits
2 letters)
plate boundaries, strings (2 letters, 3 digits, 3
letters, 4 digits)
generic letter, generic number, L-junctions of
sides
characters, plate sides
parts of characters, parts of plate sides
28
Image interpretation
Original Image
Top object
Top 10 objects
Top 25 objects
29
Image interpretation
Test image
30
Performance
  • 385 images
  • Six plates read with mistakes (gt98)
  • Approx. 99.5 characters read correctly
  • Zero false positives

31
Efficient discrimination Markov versus
Content-Sensitive dist.
Original image
Zoomed license region
Top object under Markov distribution
Top object under content-sensitive distribution
32
Efficient discrimination testing objects
against their parts
Test image
9 active 8 bricks under whole model
1 active 8 brick under parts model
33
Summary
Vision is Content Sensitive
Non-Markovian probability models
Background is Structured, and Made of the Same
Stuff
Objects come equipped with their own background
models
34
  1. Structured Representation in Neural Systems
  2. Vision is Hard
  3. Why is Vision Hard?
  4. Hierarchies of Reusable Parts
  5. Demonstration System Reading License Plates
  6. Generalization Face Detection

35
Plates Face Detection
36
Face Hierarchy
37
(No Transcript)
38
Sampling from Data Model
39
Sampling faces from the distribution
40
  • PATTERN SYNTHESIS
  • PATTERN RECOGNITION

  • Ulf Grenander
Write a Comment
User Comments (0)
About PowerShow.com