Sparse Representations for Speech Enhancement

ZHAO Nan, XU Xin, YANG Yi. Sparse Representations for Speech Enhancement[J]. Chinese Journal of Electronics, 2011, 20(2): 268-272.

Abstract

This paper applies sparse and redundant representation techniques to the problem of speech enhancement. Specifically, the K-SVD algorithm is used to train a data-driven overcomplete dictionary that captures the sparsity of speech, and orthogonal matching pursuit (OMP) is employed to reconstruct the clean speech by direct sparse decomposition over the redundant dictionary. Furthermore, iteration is introduced into the denoising process: when training is performed directly on the noisy speech, the overall training-reconstructing algorithm fuses into a single iterative procedure. Simulations show that the proposed approach outperforms conventional methods in terms of spectrogram analysis and both objective and subjective measures.
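The two building blocks named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `omp` and `ksvd_update`, the layout of speech frames as columns of `Y`, and the fixed sparsity level are all assumptions made for illustration. `omp` performs greedy sparse coding of one frame over a dictionary with unit-norm atoms, and `ksvd_update` performs one textbook K-SVD sweep that refits each atom by a rank-1 SVD of the residual of the frames that use it.

```python
import numpy as np

def omp(D, y, n_nonzero):
    """Orthogonal matching pursuit (illustrative sketch).
    D: (n, K) dictionary with unit-norm columns; y: (n,) signal frame.
    Returns a length-K coefficient vector with at most n_nonzero non-zeros."""
    residual = np.asarray(y, dtype=float).copy()
    support = []
    sol = np.zeros(0)
    coef = np.zeros(D.shape[1])
    for _ in range(n_nonzero):
        # Select the atom most correlated with the current residual.
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break
        support.append(k)
        # Least-squares refit on all selected atoms (the "orthogonal" step).
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    if support:
        coef[support] = sol
    return coef

def ksvd_update(D, X, Y):
    """One K-SVD dictionary sweep (illustrative sketch), in place.
    Y: (n, N) training frames as columns; X: (K, N) sparse codes."""
    for k in range(D.shape[1]):
        users = np.nonzero(X[k, :])[0]  # frames that use atom k
        if users.size == 0:
            continue  # unused atom is left unchanged in this sketch
        X[k, users] = 0.0
        # Residual of those frames with atom k's contribution removed.
        E = Y[:, users] - D @ X[:, users]
        U, s, Vt = np.linalg.svd(E, full_matrices=False)
        D[:, k] = U[:, 0]                # new unit-norm atom
        X[k, users] = s[0] * Vt[0, :]    # matching coefficients
    return D, X
```

Under these assumptions, an iterative denoiser in the spirit of the abstract would alternate `omp` over every noisy frame with `ksvd_update`, then reconstruct each frame as `D @ X[:, j]`. How the frames are extracted and how the stopping criterion is chosen are not specified in the abstract and are left open here.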
