Exploiting Nonnegative Matrix Factorization with Mixed Group Sparsity Constraint to Separate Speech Signal from Single-channel Mixture with Unknown Ambient Noise

Authors

  • Thanh Thi Hien Duong Hanoi University of Science and Technology image/svg+xml
  • Phuong Cong Nguyen Hanoi University of Science and Technology image/svg+xml
  • Cuong Quoc Nguyen Hanoi University of Science and Technology image/svg+xml

DOI:

https://doi.org/10.4108/eai.14-3-2018.154342

Keywords:

Speech enhancement, source separation, nonnegative matrix factorizarion (NMF), sparsity constraint, generic source spectral model

Abstract

This paper focuses on solving a challenging speech enhancement problem: improving the desired speech from a single-channel audio signal containing high-level unspecified noise (possibly environmental noise, music, other sounds, etc.). Using source separation technique, we investigate a solution combining nonnegative matrix factorization (NMF) with mixed group sparsity constraint that allows exploiting generic noise spectral model to guide the separation process. The experiment performed on a set of benchmarked audio signals with different types of real-world noise shows that the proposed algorithm yields better quantitative results in term of the signal-to-distortion ratio than the previously published algorithms.

Downloads

Published

14-03-2018

How to Cite

1.
Thi Hien Duong T, Cong Nguyen P, Quoc Nguyen C. Exploiting Nonnegative Matrix Factorization with Mixed Group Sparsity Constraint to Separate Speech Signal from Single-channel Mixture with Unknown Ambient Noise. EAI Endorsed Trans Context Aware Syst App [Internet]. 2018 Mar. 14 [cited 2025 Jan. 4];4(13):e5. Available from: https://publications.eai.eu/index.php/casa/article/view/1960