Dynamics of Learning VQ and Neural Gas - PowerPoint PPT Presentation

1 / 30
About This Presentation
Title:

Dynamics of Learning VQ and Neural Gas

Description:

Example: Winner Takes All (WTA) initialize K prototype vectors. present a single example ... most': update according to 'rank' sensitive to initialization ... – PowerPoint PPT presentation

Number of Views:53
Avg rating:3.0/5.0
Slides: 31
Provided by: Owne1028
Category:

less

Transcript and Presenter's Notes

Title: Dynamics of Learning VQ and Neural Gas


1
Dynamics of Learning VQand Neural Gas
  • Aree Witoelar, Michael Biehl
  • Mathematics and Computing Science
  • University of Groningen, Netherlands
  • in collaboration with Barbara Hammer (Clausthal),
    Anarta Ghosh (Groningen)

2
Outline
  • Vector Quantization (VQ)
  • Analysis of (L)VQ Dynamics
  • Results
  • Multiple prototypes
  • Neural Gas
  • Learning Vector Quantization
  • Summary

3
Vector Quantization
4
Vector Quantization
  • Objective
  • representation of (many) data with (few)
    prototype vectors

Find optimal set W for lowest quantization error
distance to nearest prototype
data
5
Example Winner Takes All (WTA)
  • prototypes at areas with high density data
  • (stochastic) on-line gradient descent with
    respect to a cost function

6
Problems
  • Winner Takes All

7
Dynamics of VQ Analysis
8
Modeltwo Gaussian clusters of high dimensional
data with class s 1,-1
Random vectors ? ? RN according to
classes 1, -1
prior prob. p, p- p p- 1
center vectors B, B- ? RN
variance ?, ?-
separation l
only separable in 2 dimensions ? simple model,
but not trivial
9
Online learning
sequence of independent random data
acc. to
? ? RN
ws ? RN
update of prototype vector
learning rate, step size
strength, direction of update etc.
prototypeclass
data class
fs describes the algorithm used
winner
10
1. Define few characteristic quantities of the
system

projections tocluster centers
length and overlapof prototypes
11
average over examples
In the thermodynamic limit N?8 ...
12
3. Derive ordinary differential equations
  • 4. Solve for Rss(t), Qst(t)
  • dynamics/asymptotic behavior (t ? 8)
  • quantization/generalization error
  • sensitivity to initial conditions, learning
    rates, structure of data

13
Results
14
Vector Quantization2 prototypes
characteristic quantities
Numerical integration of the ODEs (ws(0)0
p0.6, l1.0, ?1.5, ?- 1.0, ?0.01)
quantization error
15
2 prototypes
Projections of prototypes on the B,B- plane at
t50
Asymptotic positions (t?8) prototypes are on the
B,B- plane
16
Neural Gas a winner take most algorithm3
prototypes
?(t) decreased over time ?(t)?0 identical to WTA
?i2 ?f10-2
17
Sensitivity to initialization
Neural Gas
WTA
RS-
RS-
RS
RS
at t50
at t50
?HVQ0
E(W)
plateau
t
18
Summary of VQ and NG
  • WTA
  • (eventually) reaches minimum E(W)
  • depends on initialization possible large
    learning time
  • Neural Gas
  • more robust w.r.t. initialization

19
Learning Vector Quantization
20
Learning Vector Quantization (LVQ)
  • Objective
  • classification of data using prototype vectors

21
LVQ1
update winner towards/ away from data
two prototypes
22
Generalization error
class
misclassified data
eg
t
p0.6, p- 0.4 ?1.5, ?-1.0
23
Optimal decision boundary
(hyper)plane where
equal variance (??-) linear decision boundary
unequal variance ?gt?-
K2
24
LVQ1, three prototypes
? gt?- (?0.81, ?- 0.25)
?1,1,-1
?1,1,-1
eg
eg
p
p
  • LVQ1 K3 better
  • Optimal K3 better
  • LVQ1 K3 worse
  • Optimal K3 equal to K2
  • more prototypes not always better for LVQ1
  • best more prototypes on the class with the
    larger variance

25
Summary
  • dynamics of (Learning) Vector Quantization for
    high dimensional data
  • Neural Gas
  • more robust w.r.t. initialization than WTA
  • LVQ1
  • more prototypes not always better
  • variances matter

Outlook
  • study other algorithms e.g. LVQ/-, LFM, RSLVQ
  • more complex models
  • multi-prototype, multi-class problems

26
Questions
?
27
(No Transcript)
28
example LVQ1
29
Self averaging
Fluctuations decreases with larger degree of
freedom N
At N?8, fluctuations vanish (variance becomes
zero)
Monte Carlo simulations over 100 independent runs
30
LVQ /-
update correct and incorrect winners
ds min dk with ?s sµ
dt min dk with ?t ?sµ
31
Comparison LVQ1 and LVQ 2.1
LVQ2.1 ? performance depends on initial conditions
32
LVQ1, three prototypes
l1.0, ?0.81, ?- 0.25
?1,1,-1
? gt ?-
? lt ?-
eg
eg
p
p
3 prototypes better
3 prototypes worse
Write a Comment
User Comments (0)
About PowerShow.com