Audio and Speech Processing Topic3 Noise Reduction - PowerPoint PPT Presentation

1 / 40
About This Presentation
Title:

Audio and Speech Processing Topic3 Noise Reduction

Description:

p. 1. DSP-II. Audio and Speech Processing. Topic-3. Noise Reduction. Marc Moonen/Ann Spriet ... modified Bessel functions. previous frame. ignore formulas ... – PowerPoint PPT presentation

Number of Views:338
Avg rating:3.0/5.0
Slides: 41
Provided by: marcm52
Category:

less

Transcript and Presenter's Notes

Title: Audio and Speech Processing Topic3 Noise Reduction


1
Audio and Speech ProcessingTopic-3 Noise
Reduction
  • Marc Moonen/Ann Spriet
  • Dept. E.E./ESAT, K.U.Leuven
  • marc.moonen_at_esat.kuleuven.be
  • homes.esat.kuleuven.be/moonen/

2
Overview
  • Spectral subtraction for single-micr. noise
    reduction
  • Single-microphone noise reduction problem
  • Spectral subtraction basics (spectral filtering)
  • Features gain functions, implementation, musical
    noise,
  • Iterative Wiener filter based on speech modeling
  • Multi-channel Wiener filter for multi-micr. noise
    red.
  • Multi-microphone noise reduction problem
  • Multi-channel Wiener filter (spectralspatial
    filtering)
  • Kalman filter based on speech noise modeling
  • Kalman filters
  • Kalman filters for noise reduction

3
Single-Microphone Noise Reduction Problem
  • Microphone signal is
  • Goal Estimate sk based on yk
  • Applications
  • Speech enhancement in conferencing, handsfree
    telephony, hearing aids,
  • Digital audio restoration
  • Will consider speech applications sk speech
    signal


desired signal contribution
noise contribution
4
Spectral Subtraction Methods Basics
  • Signal chopped into frames (e.g. 10..20msec),
    for each frame a frequency domain representation
    is

  • (i-th frame)
  • However, speech signal is an on/off signal, hence
    some frames have speech noise, i.e.
  • some frames have noise only, i.e.
  • A speech detection algorithm is needed to
    distinguish between these 2 types of frames
    (based on energy/dynamic range/statistical
    properties,)

5
Spectral Subtraction Methods Basics
  • Definition ?(?) average amplitude of noise
    spectrum
  • Assumption noise characteristics change slowly,
    hence estimate ?(?) by (long-time) averaging over
    (M) noise-only frames
  • Estimate clean speech spectrum Si(?) (for each
    frame), using corrupted speech spectrum Yi(?)
    (for each frame, i.e. short-time estimate)
    estimated ?(?)
  • based on gain function

6
Spectral Subtraction Gain Functions
Magnitude Subtraction
Spectral Subtraction
Wiener Estimation
Maximum Likelihood
Non-linear Estimation
Ephraim-Malah Suppr. Rule most frequently
used in practice
see next slide
7
Spectral Subtraction Gain Functions
  • Ephraim-Malah Suppression Rule (EMSR)
  • with

ignore formulas
modified Bessel functions
previous frame
8
Spectral Subtraction Gain Functions
  • Example 1 Magnitude Subtraction
  • Signal model
  • Estimation of clean speech spectrum
  • PS half-wave rectification

9
Spectral Subtraction Gain Functions
  • Example 2 Wiener Estimation
  • Goal find linear filter Gi(?) such that MSE

  • is minimized
  • Solution
  • Assume speech sk and noise nk are
    uncorrelated, then...
  • PS half-wave rectification

10
Spectral Subtraction Implementation
Yn,i
yk
  • Short-time Fourier Transform (uniform
    DFT-modulated analysis filter bank)

  • estimate for Y(?n ) at time i
    (i-th frame)
  • Nnumber of frequency bins
    (channels) n0..N-1
  • Mdownsampling factor
  • Kframe length
    hk length-K analysis window (prototype
    filter)
  • frames with 50...66 overlap (i.e. 2-, 3-fold
    oversampling, N2M..3M)
  • subband processing
  • synthesis bank matched to analysis bank (see
    DSP-II)

Short-time analysis
Short-time synthesis
Gain functions
11
Spectral Subtraction Musical Noise
  • Audio demo car noise
  • Artifact musical noise
  • What?
  • Short-time estimates of Yi(?) fluctuate
    randomly in noise-
  • only frames, resulting in random gains Gi(?)
  • statistical analysis shows that broadband noise
    is transformed into signal composed of
    short-lived tones with randomly distributed
    frequencies (musical noise)

12
Spectral Subtraction Musical Noise
  • Solutions?
  • Magnitude averaging replace Yi(?) in calculation
    of Gi(?) by a local average over frames
  • EMSR (p7)
  • augment Gi(?) with soft-decision VAD
  • Gi(?) ? P(H1 Yi(?)). Gi(?)

13
Spectral Subtraction Iterative Wiener Filter
  • Basic
  • Wiener filtering based spectral
    subtraction (p9), with (improved) spectra
    estimation based on parametric models
  • Procedure
  • Estimate parameters of a speech model from noisy
    signal yk
  • Using estimated speech parameters, perform noise
    reduction (e.g. Wiener estimation, p. 9)
  • Re-estimate parameters of speech model from the
    speech signal estimate
  • Iterate 2 3

14
Spectral Subtraction Iterative Wiener Filter
pulse train


all-pole filter
voiced
pitch period
speech signal
uk
x
white noise generator
unvoiced
frequency domain time
domain
linear prediction parameters
15
Spectral Subtraction Iterative Wiener Filter
  • For each frame (vector) ym (iiteration
    nr.)
  • Estimate and
  • Construct Wiener Filter (p.9)
  • with
  • estimated during noise-only periods
  • 3. Filter speech frame ym

repeat until some error criterion is satisfied
16
Overview
  • Spectral subtraction for single-micr. noise
    reduction
  • Single-microphone noise reduction problem
  • Spectral subtraction basics (spectral filtering)
  • Features gain functions, implementation, musical
    noise,
  • Iterative Wiener filter based on speech modeling
  • Multi-channel Wiener filter for multi-micr. noise
    red.
  • Multi-microphone noise reduction problem
  • Multi-channel Wiener filter (spectralspatial
    filtering)
  • Kalman filter based on speech noise modeling
  • Kalman filters
  • Kalman filters for noise reduction

17
Multi-Microphone Noise Reduction Problem
speech source
?
(some) speech estimate
microphone signals
noise source(s)
speech part
noise part
18
Multi-Microphone Noise Reduction Problem
Will estimate speech part in microphone 1 ()
()
?
() Estimating sk is more difficult, would
include dereverberation (topic 6), etc..
() This is similar to single-microphone model
(p.3), where additional microphones
(m2..M) help to get a better estimate
19
Multi-Microphone Noise Reduction Problem
  • Data model
  • See Topic-2 on multi-path propagation, with q
    left out for conciseness.
  • Hm(?) is complete transfer function from
    speech source position to m-the microphone

20
Multi-Channel Wiener Filter (MWF)
  • Data model
  • Will use linear filters to obtain speech estimate
    (as in Topic-2)
  • Wiener filter (MMSE approach)
  • Note that (unlike in DSP-II) desired
    response signal S1(w) is unknown here (!), hence
    solution will be unusual

21
Multi-Channel Wiener Filter (MWF)
  • Wiener filter solution is (see DSP-II)
  • All quantities can be computed !
  • Special case of this is single-channel Wiener
    filter formulae on p.9 !

22
Multi-Channel Wiener Filter (MWF)
  • MWF combines spatial filtering (as in Topic-2)
    with single-channel spectral filtering (as in
    Topic-3/single-channel noise reduction)
  • if
  • then

23
Multi-Channel Wiener Filter (MWF)
  • then it can be shown that
  • represents a
    spatial filtering ()
  • Compare to formulae for superdirective
    delay-and-sum beamf. (Topic-2)
  • Delay-and-sum beamf. maximizes array gain in
    white noise field
  • Superdirective beamf. maximizes array gain in
    diffuse noise field
  • MWF maximizes array gain in unknown (!) noise
    field.
  • MWF is operated without invoking any prior
    knowledge (steering
  • vector/noise field) ! (the secret is in the
    voice activity detection (explain))
  • () Note that spatial filtering can
    improve SNR, spectral filtering never improves
    SNR
  • (at one frequency)

24
Multi-Channel Wiener Filter (MWF)
  • then it can be shown that (continued)
  • represents an additional
    spectral post-filter
  • i.e. single-channel Wiener estimate
    (p.9), applied to output signal
  • of spatial filter

  • (prove
    it!)

25
Multi-Channel Wiener Filter Implementation
  • Implementation with short-time Fourier
    transform see p.10
  • Implementation with time-domain linear
    filtering

26
Multi-Channel Wiener Filter Implementation
  • Solution is
  • Implementation with time-domain linear
    filtering

27
Multi-Channel Wiener Filter Implementation
  • Implementation with time-domain linear
    filtering
  • Block algorithm
  • For each block
  • -Apply voice activity detection
  • -Update correlation matrices
  • -Recompute filter coefficients
  • -Apply filters
  • Cheaper algorithms with stochastic gradients
  • Frequency domain algorithms
  • Details omitted here (see literature)

28
Speech Distortion Weighted MWF
  • SDW-MWF is MWF with additional tuning parameter
  • Design criterion for can be re-written as
  • i.e. speech distortionresidual noise is
    minimized

29
Speech Distortion Weighted MWF
  • Design criterion may now be modified to trade-off
    noise reduction against speech distortion
  • Then optimal solution is
  • i.e. (rather) straightforward
    modification
  • By increasing mu, more noise is reduced, at the
    expense of more speech distortion (which is
    acceptable to a certain level) .
  • means all emphasis is on noise
    reduction, speech distortion is ignored ( and
    then ! )

30
Overview
  • Spectral subtraction for single-micr. noise
    reduction
  • Single-microphone noise reduction problem
  • Spectral subtraction basics (spectral filtering)
  • Features gain functions, implementation, musical
    noise,
  • Iterative Wiener filter based on speech modeling
  • Multi-channel Wiener filter for multi-micr. noise
    red.
  • Multi-microphone noise reduction problem
  • Multi-channel Wiener filter (spectralspatial
    filtering)
  • Kalman filter based on speech noise modeling
  • Kalman filters
  • Kalman filters for noise reduction

31
Kalman Filter
  • Given state space model of a discrete-time MIMO
    system
  • with vk and wk mutually uncorrelated,
    white noises
  • Then
  • given A, B, C, D and input/output-observatio
    ns uk,yk, k1,2,... Kalman filter produces
    MMSE estimates of internal states xk, k1,2,...
    (Wiener filter for
    dynamic systems)

32
Kalman Filter
  • Definition MMSE-estimate of xk
    using all
  • available data
    up until time l
  • FILTERING estimate
  • PREDICTION estimate
  • SMOOTHING estimate

33
Kalman Filter filtering and 1-step prediction
  • Given together with error
    covariance matrix Pkk-1
  • Then obtain and
    using uk, yk
  • Step 1 Measurement Update
  • Step 2 Time Update


Kalman Gain
34
Kalman Filter smoothing
  • Estimate states x1, x2,, xN
  • based on data uk, yk, k 1, 2, N
  • How?
  • 1. forward run apply previous equations for k
    1, 2, N
  • Result estimates
  • 2. backward run apply following equations for k
    N, N -1, 1
  • Result (better) estimates

35
Kalman filter for Speech Enhancement
  • Assume AR model of speech and noise
  • Equivalent state-space model is

  • ymicrophone signal

uk, wk zero mean, unit variance,white noise
36
Kalman filter for Speech Enhancement
  • with

37
Kalman filter for Speech Enhancement
  • PS This was single-microphone case.
  • How can this be extended to
    multi-microphone case ?
  • Same A, x, v
  • C?

38
Kalman filter for Speech Enhancement
Iterative algorithm
iterations
yk
split signal in frames
estimate parameters
Kalman Smoother or Kalman Filter
reconstruct signal
  • Disadvantages iterative approach
  • complexity
  • delay

39
Kalman filter for Speech Enhancement
  • iteration index time index (no
    iterations)

Sequential algorithm
D
State Estimator Kalman Filter
Parameters Estimator (Kalman Filter)
D
40
CONCLUSIONS
  • Single-channel noise reduction
  • Basic system is spectral subtraction
  • Only spectral filtering, not easily extended to
    multi-channel case for additional spatial
    filtering
  • Hence can only exploit differences in spectra
    between noise and speech signal
  • noise reduction at expense of speech distortion
  • achievable noise reduction may be limited
  • Multi-channel noise reduction
  • Basic system is MWF, possibly extended with
    speech distortion regularization
  • Provides spectral spatial filtering (links
    with beamforming!)
  • Kalman filtering
  • alternative approach (though not easily applied
    in practice)
Write a Comment
User Comments (0)
About PowerShow.com