Gaussian Mixture Language Models for Speech Recognition
1
Gaussian Mixture Language Models for Speech
Recognition
  • Mohamed Afify, Olivier Siohan and Ruhi Sarikaya

2
Introduction
  • Two issues for n-gram models
  • Generalizability and adaptability
  • Generalizability
  • Word classes / parsing
  • Measure similarity in a continuous space
  • Adaptability
  • n-gram LMs require a large number of parameters
  • Use the continuous space to reduce the number of parameters

3
Approach
  • Word
  • Word vector of M dimensions
  • New word vector of L dimensions
  • History concatenation of the previous N-1 words
  • History vector N-1 words, so M(N-1) dimensions
  • New history vector of L dimensions

4
Approach (cont.)
  • Probability density for history y given the word w: p(y|w) = Σ_k c_{w,k} N(y; μ_{w,k}, Σ_{w,k})
  • Probability of word w given history y (Bayes rule): P(w|y) = p(y|w) P(w) / Σ_{w'} p(y|w') P(w')
  • P(w) is a smoothed n-gram or smoothed clustered n-gram
  • Exponents can be used to control the dynamic ranges of the n-gram and Gaussian mixture probabilities
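The scoring rule above can be sketched in a few lines (a hypothetical toy setup: diagonal-covariance GMMs, and `alpha`/`beta` standing in for the dynamic-range exponents):

```python
import numpy as np

def gmm_logpdf(y, weights, means, variances):
    # log N(y; mu_k, diag(var_k)) for each mixture component k,
    # then log-sum-exp over components weighted by the mixture weights
    log_norm = -0.5 * (np.log(2 * np.pi * variances)
                       + (y - means) ** 2 / variances).sum(axis=1)
    return np.logaddexp.reduce(np.log(weights) + log_norm)

def word_posterior(y, gmms, ngram_prob, alpha=1.0, beta=1.0):
    # P(w | y) proportional to p(y | w)^alpha * P_ngram(w)^beta,
    # normalized over the vocabulary (Bayes rule); the exponents
    # control the dynamic ranges of the two scores
    scores = np.array([alpha * gmm_logpdf(y, *gmms[w])
                       + beta * np.log(ngram_prob[w])
                       for w in range(len(gmms))])
    scores -= scores.max()  # numerical stability before exponentiating
    probs = np.exp(scores)
    return probs / probs.sum()
```

Each vocabulary entry `w` carries its own GMM `(weights, means, variances)` over the reduced history space, and the smoothed n-gram supplies the prior `ngram_prob[w]`.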
5
Implementation
  • Word co-occurrence matrix E entry (i, j) counts how often word i follows word j
  • SVD of E, keeping 100 dimensions
  • To create a trigram history
  • The two preceding word vectors are stacked to form a 200-d vector
  • LDA + MLLT
  • Reduce dimensionality to 50
  • GMM training
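The first two steps of this pipeline can be sketched as follows (a toy corpus and k=2 stand in for the real training data and the 100-dim SVD; the LDA+MLLT reduction to 50 dimensions is omitted):

```python
import numpy as np

# Toy corpus standing in for the real training data
corpus = ["the cat sat", "the dog sat", "a cat ran"]
vocab = sorted({w for s in corpus for w in s.split()})
idx = {w: i for i, w in enumerate(vocab)}

# Co-occurrence matrix E: E[i, j] counts word i following word j
E = np.zeros((len(vocab), len(vocab)))
for s in corpus:
    words = s.split()
    for prev, cur in zip(words, words[1:]):
        E[idx[cur], idx[prev]] += 1

# SVD of E; keep the top-k left-singular directions as word vectors
k = 2  # the slides keep 100
U, S, _ = np.linalg.svd(E, full_matrices=False)
word_vecs = U[:, :k] * S[:k]  # one k-dim vector per word

# Trigram history: stack the two preceding word vectors (2k dims,
# i.e. 200-d for k = 100); LDA+MLLT would then reduce this to 50
history = np.concatenate([word_vecs[idx["the"]], word_vecs[idx["cat"]]])
```

The GMMs from the previous slide are then trained on these reduced history vectors, one mixture per vocabulary word.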

6
Experimental results
  • 5-best rescoring

7
A discriminative training framework using n-best
speech recognition transcriptions and scores for
spoken utterance classification
  • Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex
    Acero

8
Introduction
  • Conventionally, a two-phase approach is adopted for the SUC (spoken utterance classification) task
  • ASR transcription
  • Semantic classification
  • It has been reported that reductions in WER (word error rate) do not necessarily translate into reductions in CER (classification error rate)
  • A novel discriminative training framework for jointly learning the language model and the classification model is proposed

9
DT framework Using the N-best Lists
  • As long as enough words are recognized to trigger the correct salient phrase, the correct meaning is assigned to the utterance
  • A maximum-entropy (ME) classifier is used
  • A joint association score combines the ASR recognition score with the classifier score
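A sketch of how such a joint score could be formed over an N-best list (a hypothetical representation: each hypothesis is a feature vector plus an ASR log score, and `kappa` weights the ASR evidence against the classifier posterior):

```python
import numpy as np

def me_posteriors(features, lam):
    # Maximum-entropy classifier: P(c | s) = softmax of per-class
    # linear scores lam[c] . features
    scores = lam @ features
    scores -= scores.max()  # numerical stability
    p = np.exp(scores)
    return p / p.sum()

def joint_association_scores(nbest, lam, kappa=1.0):
    # For each hypothesis (feature vector, ASR log score) in the
    # N-best list, combine recognition and classification evidence:
    #   g(s, c) = kappa * log p_ASR(s) + log P(c | s)
    return np.array([kappa * asr_log + np.log(me_posteriors(f, lam))
                     for f, asr_log in nbest])

def pick(nbest, lam):
    # The (sentence, class) pair with the highest joint score wins
    g = joint_association_scores(nbest, lam)
    s, c = np.unravel_index(np.argmax(g), g.shape)
    return s, c
```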

10
DT framework Using the N-best Lists (cont.)
  • The sentence most likely to yield the correct class is first extracted from the N-best list based on its joint association score
  • The remaining sentences in the N-best list are then assigned to the competing classes
  • This assignment of sentences to classes is an effective mechanism for discriminating the sentence most likely to yield the correct class from those more likely to yield the wrong classes

11
DT framework Using the N-best Lists (cont.)
  • Discriminant function and loss function
  • A smooth approximation of the loss
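One common way to realize these two pieces is an MCE-style formulation (a sketch under that assumption, not necessarily the paper's exact functional forms):

```python
import numpy as np

def misclassification_measure(g_correct, g_competitors, eta=1.0):
    # Discriminant: correct-class joint score versus a soft maximum
    # over the competing-class scores; negative when the correct
    # class wins, positive when it loses
    comp = np.log(np.mean(np.exp(eta * np.asarray(g_competitors)))) / eta
    return -g_correct + comp

def smooth_loss(d, alpha=1.0):
    # Sigmoid approximation of the 0-1 loss: near 0 when the correct
    # class wins by a margin, near 1 when it loses, and differentiable
    # so the LM and classifier parameters can be trained by gradients
    return 1.0 / (1.0 + np.exp(-alpha * d))
```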

12
DT framework Using the N-best Lists (cont.)
  • Assignment of class

13
DT framework Using the N-best Lists (cont.)
  • Discriminative training (DT) of the LM parameters
  • DT of the classifier parameters

14
Experimental Results

15
Conclusions
  • A new discriminative training framework for spoken utterance classification was proposed
  • The use of N-best transcriptions is motivated by the fact that the same class is often associated with many variants of spoken utterances