ICASSP 2005 - Poster M. Chibani - PowerPoint PPT Presentation

About This Presentation
Title:

ICASSP 2005 - Poster M. Chibani

Description:

The dependency of the ACB on the history of the signal makes it vulnerable to frame erasures. ... Listening using binaural headphones. AMR-WB in mode 2 (12.65 kb/s) ... – PowerPoint PPT presentation

Number of Views:97
Avg rating:3.0/5.0
Slides: 2
Provided by: mchibanipg
Category:

less

Transcript and Presenter's Notes

Title: ICASSP 2005 - Poster M. Chibani


1
INCREASING THE ROBUSTNESS OF CELP-BASED CODERS BY
CONSTRAINED OPTIMIZATION
Mohamed Chibani, Philippe Gournay and Roch
Lefebvre
University of Sherbrooke Sherbrooke, Quebec,
Canada
  • INTRODUCTION
  • The use of the adaptive codebook (ACB) in
    CELP-like speech coders allows the achievement of
    high quality speech, especially for voiced
    segments.
  • The problem
  • The dependency of the ACB on the history of the
    signal makes it vulnerable to frame erasures.
    When a frame is lost, it affects not only the
    corresponding segment, but also the following
    frames due to error propagation, and the decoder
    may take several frames to recover.
  • The proposed solution
  • To reduce the error propagation when a frame is
    lost, we propose to lessen the inter-frame
    dependency by limiting the energy of the ACB
    contribution to the synthesized speech. The ACB
    gain by itself is not limited.

4. EXPERIMENTAL RESULTS Constrained encoding
improves the recovery time of the decoder after a
frame erasure
6. CONSTRAINED JOINT QUANTIZATION OF THE ACB AND
ICB GAINS The ACB contribution can be controlled
more precisely during the gains quantization
process, since at this stage both ACB and ICB
excitation shapes are known. The
constrained optimal solution can be found
analytically using the Kuhn-Tucker conditions.
3. CONSTRAINED SEARCH ALGORITHM If the ACB
energy contribution is high, a fraction of this
contribution is forced to be left in the new
target. The ACB contribution is estimated by
comparing, in the perceptual domain, the energy
due to the ACB excitation to the target energy.
The quadratic error relative to the quantized ACB
and ICB gains
  • 2. EXCITATION SEARCH IN THE AMR-WB CODEC
  • The standard search for the excitation parameters
    is done sequentially. It consists first in
    finding the gain and the delay for the ACB, then
    the shape (pulse positions) for the innovative
    codebook (ICB). Finally, the ACB and ICB gains
    are jointly quantized. The procedure may be
    summarized as follows
  • Perform the LP analysis on the speech signal,
    extract the residual signal and build the target.
  • Search for the ACB parameters that match the
    target (in an analysis-by-synthesis loop),
    according to the error criterion
  • Update the target by subtracting the ACB
    contribution.
  • Search for the ICB parameters using the updated
    target.
  • Jointly quantize the ACB and ICB gains (quadratic
    error minimization in the perceptual domain).

A value of Rth1 as low as 0.55 preserves the
overall quality in clear channel while
significantly improving the robustness against
frame erasures.
NO PREFERENCE IN CLEAR CHANNEL
STRONG PREFERENCE WITH A FRAME LOSS RATE OF 10
  • 7. CONCLUSION
  • Limiting the ACB contribution, significantly
    improves the recovery time of the decoder after a
    frame loss.
  • In clear channel, the quality degradation due to
    the constraint is very small.
  • The modified coder is completely interoperable
    with the standard, it requires no extra bits and
    adds no extra delay.
  • Although the ACB contribution can be controlled
    more precisely at the gains quantization stage,
    modifying only the target for the ICB search is
    the simplest and most effective approach.
  • 5. SUBJECTIVE TESTS
  • Two subjective tests were carried out in order to
    compare the standard and modified codecs in the
    clear channel condition (no frame erasures), and
    with a simulated frame loss rate of 10.
  • Test features
  • 20 sentences for each condition.
  • 12 listeners participated in the tests.
  • Listening using binaural headphones.
  • AMR-WB in mode 2 (12.65 kb/s).
  • Audio files on http//www.gel.usherb.ca/gournay/
Write a Comment
User Comments (0)
About PowerShow.com