Title: Discovering Auditory Objects Through NonNegativity Constraint
1Discovering Auditory Objects Through
Non-Negativity Constraint
Paris Smaragdis SAPA 2004
- Minje Kim
- Intelligent Multimedia Lab.
- CSE, POSTECH
- Nov. 11th
2Outline
- Concepts of NMF
- Static-spectrum Object (Using NMF)
- Concepts of NMD (Non-negative Matrix
Deconvolution) - Time-varying Object (Using NMD)
3Concepts of NMF
4Non-negative Matrix Factorization
- Psychological and physiological evidence for
parts-based representations in the brain. - When NMF was first introduced it was used to
learn parts of faces. - It has been adopted as a very useful technique
for linear decomposition and dimensionality
reduction of non-negative data sets. - Given a non-negative matrix
, the goal is to approximate it as a product of
two non-negative matrices and
where , such that we
minimize the reconstruction error of by - Distinguished from other methods by its use of
non-negativity constraints.
5NMF vs PCA
6Cost Functions Update Rules
- Euclidean distance
- Kullback-Leibler divergence
7Static-spectrum Object (Using NMF)
8Experiments (Wedding March)
9Experiments (Drums)
10Experiments (Drums)
11Concepts of NMD (Non-negative Matrix
Deconvolution)
12Non-negative Matrix Deconvolution
- NMD allows us to deal with objects that have time
varying spectra. - Sequence of successive spectra and its
corresponding energy across time.
Where - operator is a shift operator that moves the
columns of its argument by i spots to the right.
13Cost Function Update Rule
- We can use the already existing framework of
NMF. Where - Update rule
andfirst update all Wt then update H using the
average result of its updates from all Wt. - If T1, then it reduces to the NMF updates.
14Experiments (Two Repeating Objects)
15Experiments (Ha-Yo, Using NMF)
16Experiments (Ha-Yo, Using NMD)
17Extraction and Reconstruction of Objects
Discussion
18Extraction and Reconstruction of Objects
- We can construct the input spectrogram using an
arbitrary number of objects to make a selective
approximation. - NMF
- NMD
19Discussion
- What is an Object?
- The data provided to this algorithm needs to
expose the individuality of each objects in a
disassociated way. - It seems inflexible to define objects as
repeating spectra, but actually not. - Real world sounds are the case with musical
signals where repeating patterns. - Despite the lack of statistical rigor and
simplistic cost function, performance of NMF and
NMD is considerably better than other
methods(i.e. PCA, ICA) in discovering auditory
objects. - Strict Non-negativity constraint fits naturally
to finding components of magnitude spectra.
20(No Transcript)