Additional NN Models - PowerPoint PPT Presentation

About This Presentation

Title:

Additional NN Models


Slides: 23

Provided by: qxu

Learn more at: https://userpages.cs.umbc.edu


Transcript and Presenter's Notes

Additional NN Models

Reinforcement learning (RL) exists in many places
Originated from psychology (animal training)
Machine learning community: different theories and algorithms
Major difficulty: credit/blame distribution
chess playing: W/L (multi-step)
soccer playing: W/L (multi-player)
In many applications, it is much easier to determine good/bad, right/wrong, or acceptable/unacceptable than to provide a precise correct answer/error.
It is up to the learning process to improve the system's performance based on the critic's signal.

ARP: the associative reward-penalty algorithm for NN (Barto and Anandan, 1985)
Architecture

input: x(k); output: y(k); stochastic units: z(k), for random search

Random search by stochastic units zi
or let zi obey a continuous probability distribution function,
or let zi be perturbed by a random noise term that obeys a certain distribution.
Key: z is not a deterministic function of x; this gives z a chance to be a good output.
Prepare desired output (temporary)
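
The slide's own equations did not survive in the transcript, so the following is only a minimal sketch of a reward-penalty step for a single stochastic unit, written in the commonly cited textbook form. The ±1 output coding, the logistic firing probability, the temperature, and the names stochastic_output, expected_output, arp_update, eta, and lam are all assumptions for illustration, not taken from the slides. The "temporary desired output" is the unit's own output when the critic rewards it and the opposite output when the critic penalizes it.

```python
import math
import random


def stochastic_output(net, temperature=1.0, rng=random):
    """Sample z in {-1, +1} with P(z = +1) = 1 / (1 + exp(-2 * net / T)).

    The logistic form is an assumed choice of distribution.
    """
    p_plus = 1.0 / (1.0 + math.exp(-2.0 * net / temperature))
    return 1 if rng.random() < p_plus else -1


def expected_output(net, temperature=1.0):
    """Mean of the stochastic unit above: E[z] = tanh(net / T)."""
    return math.tanh(net / temperature)


def arp_update(w, x, z, reward, eta=0.1, lam=0.01, temperature=1.0):
    """One reward-penalty step for a single unit with weights w and input x.

    reward is +1 (critic approves of z) or -1 (critic disapproves).
    eta and lam are hypothetical learning-rate parameters; penalties use
    the smaller rate eta * lam.
    """
    net = sum(wi * xi for wi, xi in zip(w, x))
    mean_z = expected_output(net, temperature)
    if reward > 0:
        target, rate = z, eta          # temporary desired output: what was done
        # (the critic liked it, so make it more likely)
    else:
        target, rate = -z, eta * lam   # temporary desired output: the opposite
    return [wi + rate * (target - mean_z) * xi for wi, xi in zip(w, x)]
```

Because z is sampled rather than computed deterministically from x, repeated presentations of the same input can try different outputs, and the critic's signal decides which ones are reinforced.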

(III) Recurrent BP
Recurrent networks: networks with feedback links
- state (output) of the network evolves along the time.
- may or may not have hidden nodes.
- may or may not stabilize when t → ∞
- how to learn w so that an initial state (input) will lead to a stable state with the desired output.
2. Unfolding
for any recurrent network with finite evolution time, there is an equivalent feedforward network.
problems:
too many repetitions, and so too many layers, when the network needs a long time to reach a stable state.
standard BP needs to be revised to handle duplicate weights.
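
The unfolding idea can be illustrated with a small sketch, assuming a discrete-time network with synchronous updates and tanh units (the slides do not specify either). Running the recurrent network for T steps gives the same result as a feedforward pass through T layers that all hold copies of the same weight matrix. The names step, run_recurrent, and run_unfolded are hypothetical.

```python
import math


def step(w, state, x):
    """One synchronous update: s_i <- tanh(sum_j w[i][j] * s_j + x_i)."""
    return [math.tanh(sum(wij * sj for wij, sj in zip(row, state)) + xi)
            for row, xi in zip(w, x)]


def run_recurrent(w, x, steps):
    """Evolve the recurrent network for a fixed number of time steps."""
    state = [0.0] * len(x)
    for _ in range(steps):
        state = step(w, state, x)
    return state


def run_unfolded(layers, x):
    """Feedforward pass through a stack of layers, one layer per time step."""
    state = [0.0] * len(x)
    for w_layer in layers:
        state = step(w_layer, state, x)
    return state


w = [[0.0, 0.5], [-0.3, 0.0]]
x = [1.0, -1.0]
T = 4
layers = [w] * T  # every layer duplicates the same weights
```

The duplicated weights are what standard BP has to account for: a gradient computed per layer must be combined across all T copies so that they stay identical.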
3. Recurrent BP (1987)
system
assume at least one fixed point exists for the
system with the given initial state
when a fixed point is reached, the stable state can be obtained.
error


The complete learning algorithm
incremental/sequential
W is updated on the presentation of each learning pair using the weight-update procedure.
to ensure the learned network is stable, the learning rate must be small (much smaller than the rate for standard BP learning)
time consuming: two relaxation processes are involved for each step of weight update
better performance than BP in some applications
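
As a rough illustration of why each weight update involves two relaxation processes, here is a minimal sketch of one recurrent BP step in a commonly cited textbook form. The slide's equations are not in the transcript, so the tanh units, the Euler relaxation, and the names relax, recurrent_bp_step, eta, and dt are assumptions. The first relaxation settles the network state v; the second settles an error-propagation state y on the same weights, transposed.

```python
import math


def relax(update, state, dt=0.5, steps=300):
    """Euler-integrate ds/dt = -s + update(s) toward a fixed point."""
    for _ in range(steps):
        target = update(state)
        state = [s + dt * (t - s) for s, t in zip(state, target)]
    return state


def recurrent_bp_step(w, x, desired, eta=0.05):
    """One incremental update for one learning pair (x, desired).

    desired[i] is None for units without a target (hidden units).
    Returns the updated weights and the error measured before the update.
    """
    n = len(w)

    def net(v):
        return [sum(w[i][j] * v[j] for j in range(n)) + x[i] for i in range(n)]

    # Relaxation 1: settle the state, v_i = tanh(sum_j w_ij v_j + x_i).
    v = relax(lambda s: [math.tanh(h) for h in net(s)], [0.0] * n)
    g_prime = [1.0 - math.tanh(h) ** 2 for h in net(v)]
    err = [0.0 if d is None else d - vi for d, vi in zip(desired, v)]

    # Relaxation 2: settle y_i = sum_p g'(h_p) w_pi y_p + err_i.
    def back(y):
        return [sum(g_prime[p] * w[p][i] * y[p] for p in range(n)) + err[i]
                for i in range(n)]

    y = relax(back, [0.0] * n)

    # Weight update: delta_p = g'(h_p) * y_p, then w_pq += eta * delta_p * v_q.
    new_w = [[w[p][q] + eta * g_prime[p] * y[p] * v[q] for q in range(n)]
             for p in range(n)]
    return new_w, 0.5 * sum(e * e for e in err)


# Hypothetical 3-unit network; only the last unit has a desired output.
w = [[0.0, 0.2, -0.1], [0.1, 0.0, 0.3], [-0.2, 0.1, 0.0]]
x = [0.5, -0.3, 0.0]
desired = [None, None, 0.4]
errors = []
for _ in range(100):
    w, e = recurrent_bp_step(w, x, desired)
    errors.append(e)
```

The small eta reflects the stability concern noted above: a large step can move the weights to a region where the relaxation no longer settles to a fixed point.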

III network of radial basis functions
Motivations
better function approximation
BP network (hidden units are sigmoid)
training time is very long
generalization (with non-training input) is not always good
Counter Propagation (hidden units are WTA)
poor approximation, especially with interpolation
any input is forced to be classified into one class, and in turn produces that class's output as its function value.

Radial basis function
input vectors with equal distance to Ci will have the same output.
Each hidden unit i has a receptive field with Ci as its center
if x = Ci, unit i has the largest output
if x is far from Ci, unit i has the smallest output
the size of the receptive field is determined by the width parameter of the function
During computation, hidden units are not WTA (no lateral inhibition): with an input x, usually more than one hidden unit can have non-zero output.
These outputs can be combined at the output layer to produce a better approximation.
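
The properties listed above can be seen in a small sketch, assuming a Gaussian radial basis function (the slide's formula is not in the transcript) and a linear output layer. The names rbf_hidden, rbf_output, and sigma, and the example centers and weights, are hypothetical.

```python
import math


def rbf_hidden(x, centers, sigma=1.0):
    """Hidden outputs exp(-||x - Ci||^2 / (2 * sigma^2)), one per center Ci."""
    out = []
    for c in centers:
        d2 = sum((xi - ci) ** 2 for xi, ci in zip(x, c))
        out.append(math.exp(-d2 / (2.0 * sigma ** 2)))
    return out


def rbf_output(x, centers, weights, sigma=1.0):
    """Combine all hidden outputs linearly; no winner-take-all step."""
    h = rbf_hidden(x, centers, sigma)
    return sum(w * hi for w, hi in zip(weights, h))


centers = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
weights = [1.0, 2.0, 3.0]
```

With this choice, sigma sets the size of each receptive field: a larger sigma lets more hidden units respond to the same input, which gives smoother interpolation between centers.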