Title: Juicer: A weighted finite-state transducer speech decoder
1. Juicer: A weighted finite-state transducer speech decoder
- D. Moore (1), J. Dines (1), M. Magimai Doss (1), J. Vepa (1), O. Cheng (1) and T. Hain (2)
- (1) IDIAP Research Institute
- (2) Department of Computer Science, University of Sheffield
2. Overview
- The speech decoding problem
- Why develop another decoder?
- WFST theory and practice
- What is Juicer?
- Benchmarking experiments
- The future of Juicer
3. The speech decoding problem
- Given a recording and models of speech and language, generate a text transcription of what was said
[Diagram: the recording and the models feed into the decoder, which outputs the transcription "She had your dark suit."]
6. The speech decoding problem
- ASR system building blocks
  - Grammar: N-gram language model
  - Lexical knowledge: pronunciation dictionary
  - Phonetic knowledge: context dependency, phonological rules
  - Acoustic knowledge: state distributions
Naive combination of these knowledge sources leads to a large, inefficient representation of the search space
7. The speech decoding problem
- The main issue in decoding is carrying out an efficient search of the space defined by the knowledge sources
- Two ways we can do this:
  - Avoid performing redundant search
  - Don't pursue unpromising hypotheses
- An additional issue: flexibility of the decoder
8. Why develop another decoder?
- Need for a state-of-the-art speech decoder that is also suitable for ongoing research
- At present, such software is not freely available to the research community
- Open-source development and distribution framework
9. WFST theory and practice
- A WFST maps sequences of input symbols to sequences of output symbols
- Each transition's input:output label pair has an associated weight
- In the example (transducer diagram not shown): input sequence I = a b c d maps to output sequence O = X Y Z W, with the path weight a function of all transition weights along that path, f(0.1, 0.2, 0.5, 0.1)
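To make this concrete, here is a minimal sketch in Python (illustrative only, not Juicer code) of a WFST stored as a transition table. It assumes the tropical semiring, so the path weight f is simply the sum of transition weights; states, labels and weights are taken from the example above.

```python
# Minimal WFST sketch: states are integers, and each (state, input
# label) pair indexes one transition. Weights combine by addition
# (tropical semiring); this mirrors the a b c d -> X Y Z W example.
arcs = {
    (0, "a"): ("X", 0.1, 1),  # (output label, weight, next state)
    (1, "b"): ("Y", 0.2, 2),
    (2, "c"): ("Z", 0.5, 3),
    (3, "d"): ("W", 0.1, 4),
}

def transduce(arcs, inputs, start=0):
    """Map an input sequence to its output sequence and path weight."""
    state, outputs, weight = start, [], 0.0
    for symbol in inputs:
        out, w, state = arcs[(state, symbol)]
        outputs.append(out)
        weight += w
    return outputs, weight

print(transduce(arcs, ["a", "b", "c", "d"]))
# -> (['X', 'Y', 'Z', 'W'], 0.9) up to floating-point rounding
```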
10. WFST theory and practice: WFST operations
- Composition: combination of transducers (see the sketch after this list)
- Determinisation: at most one transition per input label leaving each state
- Minimisation: least number of states and transitions
- Weight pushing to aid in minimisation
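As a rough sketch of the first of these operations (my own illustration, not how the 3rd-party toolkits implement it): composing two epsilon-free WFSTs pairs up their states and matches the output labels of the first against the input labels of the second, adding the weights.

```python
def compose(arcs_a, arcs_b, start=(0, 0)):
    """Naive composition of two epsilon-free WFSTs.

    Each transducer is a list of arcs (src, in, out, weight, dst).
    A composed state is a pair (state in A, state in B); an arc of
    the result reads A's input label and emits B's output label.
    """
    result, frontier, seen = [], [start], {start}
    while frontier:
        sa, sb = frontier.pop()
        for pa, ia, oa, wa, qa in arcs_a:
            if pa != sa:
                continue
            for pb, ib, ob, wb, qb in arcs_b:
                if pb != sb or ib != oa:
                    continue
                dst = (qa, qb)
                result.append(((sa, sb), ia, ob, wa + wb, dst))
                if dst not in seen:
                    seen.add(dst)
                    frontier.append(dst)
    return result
```

Real toolkits add epsilon handling, lazy expansion and semiring generality on top of this core loop.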
11. WFST theory and practice: Composition
12. WFST theory and practice: Determinisation
13. WFST theory and practice: Weight pushing and minimisation
14. WFST theory and practice: WFSTs and speech decoding
- ASR system building blocks
  - Grammar
  - Lexical knowledge
  - Phonetic knowledge
  - Acoustic knowledge
- Each of these knowledge sources has a WFST representation
15. WFST theory and practice: WFSTs and speech decoding
- Requires some special considerations
- The composition of lexicon and grammar cannot be determinised as-is, nor can the context dependency transducer; auxiliary disambiguation symbols are needed, for example when two distinct words share the same pronunciation
- The network is built as C ∘ det(L ∘ G), where L, G and C are the WFSTs for the lexicon, grammar and context dependency
16. WFST theory and practice: WFSTs and speech decoding
- Pros
  - Flexibility
  - Simple decoder architecture
  - Optimised search space
- Cons
  - Transducer size
  - Knowledge sources are fixed during composition
  - Only knowledge sources expressible as WFSTs can be used
17. What is Juicer?
- A time-synchronous Viterbi decoder (sketched below)
- Tools for WFST construction
- An interface between 3rd-party FSM toolkits
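A rough illustration of the first bullet (a sketch under simplifying assumptions, not Juicer's actual implementation): time-synchronous Viterbi search advances every active network state by one frame at a time, combining each arc's transducer weight with the acoustic score of the state distribution attached to it. The arc structure and pdf_id naming below are hypothetical.

```python
import math

def viterbi_step(active, arcs, frame_scores):
    """Advance all active hypotheses by one frame.

    active:       dict state -> best cost (negative log score) so far
    arcs:         dict state -> list of (weight, next_state, pdf_id)
    frame_scores: dict pdf_id -> acoustic cost for the current frame
    """
    nxt = {}
    for state, cost in active.items():
        for weight, dst, pdf_id in arcs.get(state, []):
            new_cost = cost + weight + frame_scores[pdf_id]
            if new_cost < nxt.get(dst, math.inf):  # keep the best path
                nxt[dst] = new_cost
    return nxt
```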
18. What is Juicer? Decoder
- Pruning: beam search, histogram pruning (see the sketch below)
- 1-best output: word and model timing information
- Lattice generation: phone-level lattice output
- The state-to-phone transducer is not optimised; it is incorporated at run time
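To make the pruning bullet concrete, here is a minimal sketch of beam and histogram pruning applied to the active hypotheses between frames (again illustrative; the beam width and max_active defaults are placeholder values, not Juicer's):

```python
import heapq

def prune(active, beam=200.0, max_active=7000):
    """Prune active hypotheses (dict state -> cost, lower is better).

    Beam pruning discards states whose cost exceeds the best cost
    plus the beam; histogram pruning then caps how many states
    survive, keeping only the best-scoring ones.
    """
    if not active:
        return active
    best = min(active.values())
    survivors = {s: c for s, c in active.items() if c <= best + beam}
    if len(survivors) > max_active:
        kept = heapq.nsmallest(max_active, survivors.items(),
                               key=lambda item: item[1])
        survivors = dict(kept)
    return survivors
```

In a time-synchronous loop this would be applied to the hypothesis set returned by each viterbi_step call.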
19. What is Juicer? WFST tools
- gramgen: word-loop, word-pair and N-gram language models
- lexgen: multiple pronunciations
- cdgen: monophone, word-internal n-phone and cross-word triphone models; HTK CDHMM and hybrid HMM/ANN model support
- build-wfst: composition, determinisation and minimisation using 3rd-party tools (AT&T, MIT)
20. Benchmarking experiments
- Experiments were conducted in order to:
  - compare with existing state-of-the-art decoders
  - assess the current capabilities and limitations of the decoder
  - guide future development and research directions
21. Benchmarking experiments: 20k Wall Street Journal task
- Equivalent performance at wide beam settings
- HDecode wins out at narrow beam widths
- This is only part of the story
22. Benchmarking experiments: but what's the catch?
- Composition of large static networks:
  - can be practically infeasible due to memory limitations
  - is slow
  - may not always be necessary
23. Benchmarking experiments: AMI Meeting Room Recogniser
- Decoding for the NIST Rich Transcription evaluations
- Juicer uses pruned LMs
- Good trade-off between RTF and WER at the chosen operating point
24. The future of Juicer
- Further benchmarking
  - Testing against HDecode
  - Trade-off between pruned LMs and performance
- Added capabilities
  - On-the-fly network expansion
  - Word lattice generation
  - Support for MLLR transforms, feature transforms
- Distribution and support
  - Currently only available to AMI and IM2 partners
25. Summary
- Today I have presented:
  - WFST theory and practice
  - the Juicer tools and decoder
  - preliminary experiments
- But more importantly, we hope to have generated interest in Juicer