WANG Xiaofei, GUO Yanmeng, FU Qiang, YAN Yonghong. Speech Enhancement Using Multi-channel Post-Filtering with Modified Signal Presence Probability in Reverberant Environment[J]. Chinese Journal of Electronics, 2016, 25(3): 512-519. doi: 10.1049/cje.2016.05.017

Speech Enhancement Using Multi-channel Post-Filtering with Modified Signal Presence Probability in Reverberant Environment

doi: 10.1049/cje.2016.05.017
Funds: This work is supported by the National Natural Science Foundation of China (No. 11161140319, No. 91120001, No. 61271426), the Strategic Priority Research Program of the Chinese Academy of Sciences (No. XDA06030100, No. XDA06030500), the National High Technology Research and Development Program of China (863 Program) (No. 2012AA012503), and the Chinese Academy of Sciences Priority Deployment Project (No. KGZD-EW-103-2).
More Information
  • Corresponding author: FU Qiang received the Ph.D. degree in electronic engineering from Xidian University, Xi'an, in 2000. From 2001 to 2002, he was a senior research associate at the Center for Spoken Language Understanding (CSLU), OGI School of Science and Engineering, Oregon Health & Science University, Oregon, USA. From 2002 to 2004, he was a senior postdoctoral research fellow in the Department of Electric and Computer Engineering, University of Limerick, Ireland. He is currently a professor at the Institute of Acoustics, Chinese Academy of Sciences, China. His research interests include speech analysis, microphone array processing, far-distant speech recognition, audio-visual signal processing, and machine learning for signal processing. Dr. Qiang Fu is a member of the IEEE Signal Processing Society.
  • Received Date: 2014-04-15
  • Rev Recd Date: 2014-09-09
  • Publish Date: 2016-05-10
  • In this paper, a multi-channel post-filtering approach for reverberant environments, based on a detection and estimation scheme, is presented. A modified Signal presence probability (SPP) that accounts for reverberation is proposed, together with a novel estimator, the Direct-to-reverberate ratio (DRR), to adapt to distant-talking scenes. SPP is known to be a key estimator: it controls the updating of the transient noise or residual directional interference estimate and forms the gain function in the time-frequency domain. Consequently, a new desired-signal detection scheme is proposed to improve its accuracy. An appropriate spectral enhancement technique that takes advantage of the modified SPP estimator is then applied to the noisy speech signal. The proposed multi-channel post-filter is tested in different nonstationary noisy and reverberant environments. Experimental results show that it preserves the desired speech considerably better, with more noise reduction, than the comparative algorithms.
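
The role the abstract assigns to SPP (controlling the noise update and forming the gain function) can be illustrated with a generic single-frame sketch. This is not the authors' estimator: the function name `spp_controlled_update`, the smoothing constant `alpha`, the Wiener-like speech-present gain, and the `gain_floor` value are all assumptions chosen for illustration, loosely following SPP-controlled recursive averaging, and the SPP itself is simply taken as an input.

```python
import numpy as np

def spp_controlled_update(noisy_psd, noise_psd, spp, alpha=0.85, gain_floor=0.1):
    """One time-frame update over all frequency bins (illustrative only).

    noisy_psd : |Y(k)|^2 of the current frame
    noise_psd : noise PSD estimate from the previous frame
    spp       : signal presence probability per bin, in [0, 1]
    """
    # Smoothing factor moves toward 1 where speech is likely present,
    # so the noise estimate is frozen there and updated elsewhere.
    alpha_eff = alpha + (1.0 - alpha) * spp
    new_noise_psd = alpha_eff * noise_psd + (1.0 - alpha_eff) * noisy_psd

    # Assumed speech-present gain: a Wiener-like rule from the
    # instantaneous SNR, clamped to [gain_floor, 1).
    snr = np.maximum(noisy_psd / np.maximum(new_noise_psd, 1e-12) - 1.0, 0.0)
    speech_gain = np.maximum(snr / (1.0 + snr), gain_floor)

    # Geometric weighting by SPP: full speech-present gain when spp = 1,
    # the floor when spp = 0.
    gain = speech_gain ** spp * gain_floor ** (1.0 - spp)
    return new_noise_psd, gain
```

Calling the function once per frame with the previous `new_noise_psd` as `noise_psd` gives a running noise estimate. A more accurate SPP directly reduces both speech leakage into the noise estimate and over-suppression in the gain, which is why the abstract focuses on improving it.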