מושגי ליבה
Non-negative matrix factorization (NMF) can be extended to irregularly-sampled time-frequency representations by formulating it in terms of continuous functions instead of fixed vectors, enabling the use of implicit neural representations to model the underlying basis templates and activations.
תקציר
The paper introduces a new framework for linearly processing signals that are not regularly sampled, demonstrating how non-negative matrix factorization (NMF) can be extended to such representations.
Key highlights:
- Conventional NMF is limited to regularly-sampled data that can be stored in a matrix form, such as the short-time Fourier transform (STFT) magnitude spectrogram.
- The authors propose a formulation of NMF in terms of continuous functions, rather than fixed vectors, allowing the use of implicit neural representations to model the underlying basis templates and activations.
- This enables the application of NMF to a wider variety of signal representations that are not regularly sampled, such as the constant-Q transform (CQT), wavelets, and sinusoidal analysis models.
- The authors demonstrate that the proposed implicit neural NMF (iN-NMF) model performs comparably to standard matrix-based NMF on tasks like magnitude spectrogram reconstruction and monophonic source separation, while offering greater flexibility in handling different time-frequency representations.
- iN-NMF can generalize to different spectrogram resolutions without the need to retrain the model, unlike standard NMF which is constrained to a single window size.
סטטיסטיקה
The paper does not provide any specific numerical data or statistics. The key results are presented through qualitative comparisons and illustrations.
ציטוטים
"Instead of forcing these representations into a matrix form, we can think of time-frequency (T-F) representations as points in T-F space [10], with the points being magnitudes indexed in terms of their underlying time-frequency coordinates."
"Feeding these into our proposed model we obtain the functions shown in Figure 2. We note that the learned functions, as with NMF, do indeed reveal the spectrum of the two notes and when each note was active. They do so in a functional form that allows us to use these in alignment with any T-F decomposition."