Title: Advances in WP2
Slide 1: Advances in WP2
Nancy Meeting, 6-7 July 2006
www.loquendo.com
Slide 2: Recent Work on NN Adaptation in WP2
- State-of-the-art LIN adaptation method implemented and evaluated on the benchmarks (m12)
- Innovative LHN adaptation method implemented and evaluated on the benchmarks (m21)
- Experimental results on the benchmark corpora and the Hiwire database with LIN and LHN (m21)
- Further advances on new adaptation methods (m24)
Slide 3: LIN Adaptation
[Diagram: speaker-independent MLP (SI-MLP) with an input layer fed by the speech signal parameters, 1st and 2nd hidden layers, and an output layer giving the emission probabilities of the acoustic-phonetic units; LIN adds a trainable linear layer in front of the input layer.]
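LIN (Linear Input Network) adaptation trains a single linear transform placed in front of the frozen SI-MLP. A minimal PyTorch sketch of this idea follows; the module names and layer sizes are illustrative assumptions, not the configuration used in the deck:

```python
import torch.nn as nn

class LINAdapter(nn.Module):
    """SI-MLP with a trainable Linear Input Network (LIN) prepended."""

    def __init__(self, si_mlp: nn.Module, feat_dim: int):
        super().__init__()
        # LIN: square linear transform of the input features,
        # initialized to the identity so adaptation starts from the SI model.
        self.lin = nn.Linear(feat_dim, feat_dim)
        nn.init.eye_(self.lin.weight)
        nn.init.zeros_(self.lin.bias)
        # Freeze the SI-MLP: only the LIN weights are updated
        # on the adaptation data.
        self.si_mlp = si_mlp
        for p in self.si_mlp.parameters():
            p.requires_grad_(False)

    def forward(self, x):
        return self.si_mlp(self.lin(x))

# Hypothetical two-hidden-layer SI-MLP, as in the diagram (sizes made up):
si_mlp = nn.Sequential(
    nn.Linear(39, 512), nn.Sigmoid(),
    nn.Linear(512, 512), nn.Sigmoid(),
    nn.Linear(512, 683), nn.Softmax(dim=-1))
model = LINAdapter(si_mlp, feat_dim=39)
```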
Slide 4: LHN Adaptation
[Diagram: the same SI-MLP stack; LHN instead inserts a trainable linear layer after a hidden layer, between the hidden layers and the output layer.]
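LHN (Linear Hidden Network) places the identity-initialized linear layer after a hidden layer of the frozen SI-MLP rather than before the input. A hedged sketch, reusing the hypothetical `nn.Sequential` SI-MLP from the LIN example:

```python
import torch.nn as nn

def insert_lhn(si_mlp: nn.Sequential, hidden_dim: int,
               position: int) -> nn.Sequential:
    """Return the SI-MLP with a trainable Linear Hidden Network (LHN)
    inserted at `position` (e.g., after a hidden layer's activation)."""
    lhn = nn.Linear(hidden_dim, hidden_dim)
    nn.init.eye_(lhn.weight)          # start as the identity, so the
    nn.init.zeros_(lhn.bias)          # SI model is unchanged before training
    for p in si_mlp.parameters():     # freeze all original SI weights;
        p.requires_grad_(False)       # only the LHN is adapted
    layers = list(si_mlp.children())
    layers.insert(position, lhn)
    return nn.Sequential(*layers)

# e.g., after the 2nd hidden layer's activation (index 4 in the toy SI-MLP):
# adapted = insert_lhn(si_mlp, hidden_dim=512, position=4)
```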
Slide 5: Results Summary (WER, %)

Test set                       | Baseline | LIN adapted | Rel. E.R. | LHN adapted | Rel. E.R.
WSJ0 16 kHz, bigram LM         | 10.5     | 9.4         | 10.5%     | 8.4         | 20.0%
WSJ1 Spoke-3 16 kHz, bigram LM | 54.2     | 46.5        | 14.2%     | 30.6        | 43.5%
HIWIRE 8 kHz                   | 11.6     | 7.8         | 32.1%     | 7.2         | 37.9%

(Rel. E.R. = relative error reduction with respect to the baseline.)
Slide 6: Papers Presented
- Roberto Gemello, Franco Mana, Stefano Scanzio, Pietro Laface, Renato De Mori, "Adaptation of Hybrid ANN/HMM Models Using Hidden Linear Transformations and Conservative Training", Proc. of ICASSP 2006, Toulouse, France, May 2006.
- Dario Albesano, Roberto Gemello, Pietro Laface, Franco Mana, Stefano Scanzio, "Adaptation of Artificial Neural Networks Avoiding Catastrophic Forgetting", Proc. of IJCNN 2006, Vancouver, Canada, July 2006.
Slide 7: The Forgetting Problem in ANN Adaptation
- It is well known in connectionist learning that acquiring new information during adaptation can damage previously learned information (catastrophic forgetting).
- This effect must be taken into account when adapting an ANN with a limited amount of data, which does not include enough samples for all the classes.
- The absent classes may be forgotten during adaptation, because discriminative training (error back-propagation) always assigns zero targets to absent classes.
Slide 8: Forgetting in ANNs for ASR
- When adapting an ANN/HMM model for ASR, this problem can arise when the adaptation set contains no examples of some phonemes, due to the limited amount of adaptation data or the limited vocabulary.
- ANN training is discriminative, unlike that of GMM-HMMs, so absent phonemes are penalized by assigning them a zero target during adaptation.
- This induces in the ANN a forgetting of the capability to classify the absent phonemes: while the HMM models for phonemes with no observations remain un-adapted, the ANN output units corresponding to those phonemes lose their characterization rather than simply staying un-adapted. A minimal sketch of this effect is given below.
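A small illustration (my own, not from the deck) of why zero targets erode absent classes: with a softmax output layer and cross-entropy loss, the gradient with respect to each logit is softmax(z) - t, so every class not present in the adaptation data receives a positive gradient on every frame, which keeps pushing its output down:

```python
import torch
import torch.nn.functional as F

classes = ["A", "E", "I", "O", "U"]        # toy 5-vowel output layer
z = torch.randn(1, 5, requires_grad=True)  # logits for one frame
target = torch.tensor([1])                 # correct class: "E"
F.cross_entropy(z, target).backward()
for c, g in zip(classes, z.grad[0]):
    # Gradient is softmax(z) - one_hot(target): positive for every
    # class other than "E", including the absent vowels A and I.
    print(f"{c}: grad {g.item():+.3f}")
```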
Slide 9: Example of Forgetting
Adaptation examples only of E, U, O (e.g., from the words "uno", "due", "tre"); no examples of the other vowels (A, I, …). The classes with examples adapt, but tend to invade the classes with no examples, which are partially forgotten.
[Figure: vowel chart in the F1/F2 (kHz) formant plane with the vowel classes (A, E, I, O, U); the adapted classes E, U, O expand into the regions of the absent classes A and I.]
Slide 10: Conservative Training
- We have introduced Conservative Training to avoid the forgetting of absent phonemes.
- The idea is to avoid a zero target for the absent phonemes, using for them the output of the original NN as target.
- Let FP be the set of phonemes present in the adaptation set and FA the set of absent ones. The targets are assigned according to the following equations.
Slide 11: Conservative Training Target Assignment Policy

For output class $i$, with $c$ the class of the correct phoneme for the current frame:

$$
t_i =
\begin{cases}
o_i^{SI} & \text{if } i \in F_A \quad \text{(absent class)} \\
0 & \text{if } i \in F_P,\ i \neq c \quad \text{(present class, not the correct one)} \\
1 - \sum_{j \in F_A} o_j^{SI} & \text{if } i = c \quad \text{(correct class)}
\end{cases}
$$

where $o_i^{SI}$ is the output of the original speaker-independent NN for class $i$.
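A minimal sketch of this target assignment (PyTorch; the function name and tensor layout are my own, not from the papers):

```python
import torch

def conservative_targets(si_output: torch.Tensor,
                         correct: int,
                         absent: torch.Tensor) -> torch.Tensor:
    """Conservative Training targets for one adaptation frame.

    si_output : posteriors of the original (pre-adaptation) SI network
    correct   : index of the correct phoneme class for this frame
    absent    : boolean mask, True for phonemes absent from the
                adaptation set (the set FA)
    """
    t = torch.zeros_like(si_output)             # present, non-correct: 0
    t[absent] = si_output[absent]               # absent classes keep SI response
    t[correct] = 1.0 - si_output[absent].sum()  # remaining mass to correct class
    return t

si_out = torch.tensor([0.10, 0.55, 0.05, 0.20, 0.10])    # toy posteriors A E I O U
absent = torch.tensor([True, False, True, False, False])  # A and I unseen
print(conservative_targets(si_out, correct=1, absent=absent))
# tensor([0.1000, 0.8500, 0.0500, 0.0000, 0.0000])
```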
Slide 12: Conservative Training
- In this way, the phonemes absent from the adaptation set are represented by the response of the original NN.
- Thus, the absent phonemes are not absorbed by the neighboring present phonemes.
- The results of adaptation with Conservative Training are:
  - Comparable performance on the target environment
  - Preservation of performance on the generalist environment
  - A large improvement in speaker adaptation performance when only a few sentences are available
Slide 13: Adaptation Tasks
- Application-data adaptation: Directory Assistance
  - 9325 Italian city names
  - 53713 training / 3917 test utterances
- Vocabulary adaptation: Command Words
  - 30 command words
  - 6189 training / 3094 test utterances
- Channel/environment adaptation: Aurora-3
  - 2951 training / 654 test utterances
Slide 14: Adaptation Results on Different Tasks (WER, %)

Adaptation Method | Directory Assistance (application) | Command Words (vocabulary) | Aurora-3 CH1 (channel/environment)
No adaptation     | 14.6 | 3.8 | 24.0
LIN               | 11.2 | 3.4 | 11.0
LIN + CT          | 12.4 | 3.4 | 15.3
LHN               | 9.6  | 2.1 | 9.8
LHN + CT          | 10.1 | 2.3 | 10.4
Slide 15: Mitigation of Catastrophic Forgetting Using Conservative Training

Tests of the adapted models on Italian continuous speech (WER, %):

Adaptation Method | Adapted on Directory Assistance | Adapted on Command Words | Adapted on Aurora-3 CH1
LIN               | 36.3 | 42.7 | 108.6
LIN + CT          | 36.5 | 35.2 | 42.1
LHN               | 40.6 | 63.7 | 152.1
LHN + CT          | 40.7 | 45.3 | 44.2
No adaptation     | 29.3 | 29.3 | 29.3
Slide 16: Conclusions
- The new LHN adaptation method, developed within the project, outperforms standard LIN adaptation.
- In adaptation tasks with missing classes, Conservative Training reduces the catastrophic forgetting effect, preserving performance on another, generic task.
Slide 17: Workplan
- Selection of suitable benchmark databases (m6)
- Baseline set-up for the selected databases (m8)
- LIN adaptation method implemented and evaluated on the benchmarks (m12)
- Experimental results on the Hiwire database with LIN (m18)
- Innovative NN adaptation methods and algorithms for acoustic modeling, with experimental results (m21)
- Further advances on new adaptation methods (m24)
- Unsupervised adaptation algorithms and experimentation (m33)