JIANG Ye, TANG Zhenmin, WANG Longbiao. Identification of a Distant Speaker and Its Robustness[J]. Chinese Journal of Electronics, 2011, 20(2): 278-282.

Identification of a Distant Speaker and Its Robustness

  • Received Date: 2010-08-01
  • Revision Received Date: 2010-10-01
  • Publish Date: 2011-04-25
  • This paper presents robust speaker identification for speech recorded by distant microphones. Three compensation approaches are investigated to improve robustness in such environments. The first applies spectral subtraction before feature extraction to reduce the effect of late reverberation. The second uses feature warping as feature compensation for distant speaker identification under mismatched training and testing conditions. The third employs a novel method of initializing Gaussian mixture model parameters: combined division and k-means clustering. Experimental results show that, relative to a baseline system based on cepstral mean normalization (CMN), the channel-average recognition rates of the compensated system were 11.4%, 15.4%, 17.0%, and 17.8% higher on the TIMIT database and 6.8%, 6.4%, 9.3%, and 14.0% higher on the JNAS database for four different environments. The results also show that combining the three approaches outperforms any single compensation method.
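To make the contrast between the CMN baseline and feature warping in the abstract concrete, the following is a minimal sketch of both in their commonly described forms, not the authors' implementation. It assumes features arrive as a list of frames, each a list of cepstral coefficients. The function names, the 301-frame window, and the mid-rank quantile formula are illustrative assumptions; the abstract does not state the settings used in the paper.

```python
from statistics import NormalDist, fmean


def cepstral_mean_normalization(frames):
    """CMN baseline: subtract each dimension's mean over the utterance."""
    means = [fmean(column) for column in zip(*frames)]
    return [[value - mean for value, mean in zip(frame, means)] for frame in frames]


def feature_warping(frames, window=301):
    """Warp every feature dimension toward a standard normal distribution.

    Each value is ranked among the frames of a sliding window centred on it,
    and that rank is mapped through the inverse standard normal CDF.
    The window length is an assumed value, not taken from the paper.
    """
    inv_cdf = NormalDist().inv_cdf
    half = window // 2
    warped = []
    for t, frame in enumerate(frames):
        # The window is truncated at the start and end of the utterance.
        block = frames[max(0, t - half): t + half + 1]
        n = len(block)
        row = []
        for d, value in enumerate(frame):
            # Rank 1 means the largest value in the window.
            rank = 1 + sum(other[d] > value for other in block)
            # Mid-rank quantile, always strictly inside (0, 1).
            row.append(inv_cdf((n + 0.5 - rank) / n))
        warped.append(row)
    return warped
```

CMN only removes a constant offset per dimension, whereas warping reshapes the whole short-term distribution of each coefficient, which is one reason it is often tried when training and testing channels differ.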
