3. Applications - PowerPoint PPT Presentation

1 / 37
About This Presentation
Title:

3. Applications

Description:

Title: CMAC Author: MFYeh Last modified by: Bee Created Date: 7/5/2002 12:31:07 PM Document presentation format: Company: LHUST Other titles – PowerPoint PPT presentation

Number of Views:53
Avg rating:3.0/5.0
Slides: 38
Provided by: MFYeh
Category:

less

Transcript and Presenter's Notes

Title: 3. Applications


1
3. Applications
  • Function approximation t ? yt target, y
    actual output
  • Neural controlr referenceu control efforty
    system output r ? y

2
Neural Control Schemes
  • Supervised control
  • Hybrid control
  • Model reference control
  • Internal model control
  • Adaptive control
  • Reference C.L. Lin H.W. Su, Intelligent
    control theory in guidance and control system
    design an overview, Proc. Natl. Sci. Counc. ROC
    (A), pp. 15-30.

3
Supervisory Control
  • Neural network ? inverse NN controller
  • The neural controller in the system is utilized
    as an inverse system model.

4
Hybrid Control
  • Generalized learning (off-line learning)A rough
    approximation to the desired control law? Drive
    the plant over the operating range and
    without instability
  • Specialized learning (on-line learning)Improve
    the control provided by the NN controller

5
Model Reference Control
  • Must define its input-output pair r(t), yR(t)
    in advance
  • Attempts to make the plant output y(t) match the
    reference model output asymptotically.
  • The error e(t) is used to adjust the weights of
    an neural controller.

6
Internal Model Control
  • The NN plant model is first trained off-line to
    emulate the controller plant dynamics directly.
    During on-line operation, the error is used as a
    feedback signal and passed to the NN controller.
  • The effect of the NN controller is to subtract
    the effect of the control signal from the plant
    output, i.e., disturbances.
  • The IMC plays a role as a feedforward controller
    and can cancel the influence due to unmeasured
    disturbances.

7
Adaptive Control
  • The tracking error cost is evaluated according to
    some performance index.
  • The result is then used as a basis for adjusting
    the connection weights of the neural networks.
  • The weights are adjusted on-line using basic
    backpropagation rather than off-line.

8
Paper Study 1
  • Practical Stability Issues in CMAC Neural Network
    Control Systems
  • Fu-Chuang Chen Chih-Horng Chang
  • IEEE Trans. on Control Systems Technology,
  • Vol. 4, No. 1, pp. 86-91, 1996

9
Abstract
  • CMAC is a practical tool for improving existing
    nonlinear control systems.
  • CMAC can effectively reduce tracking error, but
    can also destabilize a control system which is
    otherwise stable.
  • Quantitative studies are presented to search for
    the cause of instability in the CMAC control
    system.

10
I. Introduction
  • CMAC is basically a look-up table method, very
    easy to implement, and at the same time it is a
    powerful and practical tool for nonlinear
    control.
  • There has been convergence result on the CMAC
    learning.

11
Main Purpose of This Paper
  • To introduce the CMAC control system from an
    industrial point of view.
  • To describe the unstable phenomenon.
  • To quantitatively study how the system parameters
    such as control gain, quantization,
    generalization, learning rate, etc., are related
    to the instability of the system.
  • To suggest ways to improve system stability.
  • To provide some experience evidence.

12
II. CMAC Control System
Use a workable traditional controller to
stabilize the plant and to help the CMAC learn to
provide precise control.
13
Functioning of CMAC
  • Initially the CMAC table is empty.
  • In each time step k, the CMAC involves a recall
    and a learning process.

14
Recall Process
  • Uses Yd(k1) and Y(k) as the address to generate
    the control signal from CMAC table, where Yd(k1)
    is the desired system output for the next time
    step.
  • CMAC has two inputs and one output.

15
Leaning Process
  • U(k) is treated as the desired output to modified
    the content stored at location Y(k) and Y(k1),
    where Y(k1) is the actual system output for the
    next time step k1.
  • To speed up the initial learning and to achieve
    better generalization, the generalization
    technique is employed, i.e., each input vector to
    CMAC for recall and learning will map to a number
    of memory locations instead of only one memory
    location.

16
Function Approximation
  • How precisely the CMAC can approximate a function
    is mainly determined by the quantization in each
    dimension of the input vector.
  • Reducing quantization would quickly increase the
    memory demand for storing the CMAC table.

17
Table Update Mechanism
  • Gradient-type learning rule
  • Wi(k1) Wi(k) ? ? U(k) ? Uc(k) / g
  • g the size of generation
  • Wi the content of the ith memory location,
    there being q locations to be updated
  • ? the learning rule
  • U the correct (desired) data
  • Uc the current (actual) data

18
III. A Typical Simulation Study
  • Proportional gain P 1.4
  • Learning rate ? 0.1
  • Generalization 50
  • Quantization 5/500(meaning five units divided
    into 500 divisions)
  • Reference command sin(2?k/200) with each
    sinusoidal cycle consisting of 400 time steps

Tracking error
Number of cycles
19
A Typical Simulation Study
  • In the first five cycles, the system is solely
    controlled by the P controller.
  • The CMAC is added at the 6th cycle, and then the
    error reduces quickly and significantly.
  • The error remains small for some time, and then
    it diverges around the 143th cycle.

Control output
Number of time steps
20
Discussion
  • The CMAC can significantly reduce the tracking
    error.
  • CMAC can destabilize a control system which is
    otherwise stable. The unstable phenomenon
    certainly comes from the interactions between the
    proportional controller and the CMAC network.

21
Discussion
  • The proportional controller can not be removed
    even when the magnitude of the proportional
    control is very small compared with that of the
    CMAC (i.e., when the system output error has been
    significantly reduced). Otherwise, the good
    tracking can not be maintained.

22
Growth of Oscillation
90th cycle
8th cycle
155th cycle
40th cycle
23
V. Method for Improving
System Stability
  • The continued learning of CMAC after the tracking
    error has reduced is the major cause of the
    instability.
  • Stopping the CMAC learning has two drawbacks.
    First, it can be difficult to determine when to
    stop the CMAC learning.Second, if the CMAC stops
    learning, then the CMAC control system cannot
    respond to any change in the reference command.

24
Modified Learning Rule
  • To effectively stop the CMAC learning when the
    tracking error is small, but at the same time
    allow the system to respond to any change in the
    reference command, a deadzone is added to the
    CMAC updating rule.Wi(k1) Wi(k) ? ? D U(k)
    ? Uc(k) / gwhere

25
VI. Experiment
26
Paper Study 2
  • Intelligent Controller Using CMACs with
    Self-Organized Structure and Its Application for
    a Process System
  • T. Yamamoto, H. Yanagino M. Kaneda
  • Proceedings of 1997 IEEE , pp. 76-81

27
Abstract
  • This paper describes a design scheme of
    intelligent system consists of some CMACs.
  • Each of CMACs is trained for the specified
    command signal.
  • A new CMAC is generated for unspecificed command
    signals, and the CMAC whose command is nearest
    for the new command signal, is eliminated.
  • The proposed intelligent controller can be
    designed with relatively small memories.

28
1. Introduction
  • The CMACs included in the intelligent controller
    are trained in both off-line and on-line learning
    process for each of the specified command
    signals.
  • For a unspecified command, a new CMAC is
    generated.
  • The initial weights are set by employing the
    linear interpolation to the trained weights
    included in two CMACs whose command signals are
    nearest for the new command signal.
  • The CMAC corresponding to the nearest command
    signal is eliminated.
  • The proposed intelligent controller can be
    designed with relatively small memories.

29
3. Intelligent Controller Design
controller
30
Outline
  • The input signals to each CMAC are the control
    error signal and the difference, that is, the
    two- dimensional CMACs are equipped in the
    intelligent control system.
  • By including the command signal as one of input
    signals in the CMAC, the intelligent control
    system can be constructed by using only a
    three-dimensional CMAC.

31
Off-line Learning Process
  • w(t) command signalu(t) teacher signal
  • Updated ruleh 1, 2, , K total number of
    the selected weightsg1(t) the gradient to
    update the weights

a1, b1, c1 positive cont.
32
On-line Learning Process
  • The last weights obtained in the off-line
    learning are used as initial ones in the on-line
    learning.
  • Updated rule k the time-delay of the
    systemg2(t) the gradient to update the weights

a2, b2, c2 positive cont.
33
Off-line vs On-line
  • In the off-line learning process, the teacher
    signal u(t) is generated by a certain control
    law, e.g., PID control law and human experts.
  • u(t) is utilized in order to determine the
    initial weights in the on-line learning of the
    CMAC.
  • In the on-line learning process, the teaching
    signal u(t) can not be obtained.
  • The desired reference model output ym(t) is
    introduced, and the on-line learning is performed
    so that the system output y(t) approaches to
    ym(t).

34
Self-organized Structure
  • A new CMAC is generated for a new command signal
  • The initial weights includes in the new CMAC are
    set by employing the linear interpolation to the
    trained weights included two CMACs which are
    nearest for the new command signal
  • The CMAC whose command signal is nearest for the
    new one is eliminated.

35
4. Experimental Results
  • Air pressure control system
  • Control object regulate the air pressure y to
    any desired values by manipulating the control
    value angle u.
  • In order to obtain u(t), PID control law is
    employed for this control system.

36
Control Result
Off-line learning(20 iterations)
Conventional PID control
37
Control Result (cont.)
On-line learning(after 5 more iterations)
Unspecified command signal
Write a Comment
User Comments (0)
About PowerShow.com