ZHONG Jiang, SUN Qigan, LI Xue, WEN Luosheng. A Novel Feature Selection Method Based on Probability Latent Semantic Analysis for Chinese Text Classification[J]. Chinese Journal of Electronics, 2011, 20(2): 228-232.

A Novel Feature Selection Method Based on Probability Latent Semantic Analysis for Chinese Text Classification

  • Received Date: 2010-05-01
  • Revision Received Date: 2010-10-01
  • Publish Date: 2011-04-25
  • This paper presents a novel feature selection algorithm for Chinese text classification based on Probability latent semantic analysis (PLSA). The algorithm first employs the Expectation-maximization (EM) method to calculate the correlations between words and latent topics for the documents of each category. It then selects feature words for each latent topic and merges those words to describe the documents of the corresponding category. Finally, it merges the feature words of all categories into the set of classification feature words. An empirical comparison with four other effective feature selection methods on a benchmark dataset is presented. The results show that the proposed method achieved the best classification performance among the methods compared.
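The pipeline described in the abstract (fit PLSA with EM per category, pick feature words per latent topic, merge across topics and categories) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not define the word-topic "correlation", so the sketch assumes it is the PLSA conditional probability P(w|z). The function names `plsa` and `select_features`, the parameters `n_topics` and `top_k`, and the toy count matrices are all hypothetical and chosen only for illustration.

```python
import random


def normalize(row):
    """Scale a list of positive numbers so that it sums to 1."""
    total = sum(row)
    return [v / total for v in row]


def plsa(counts, n_topics, n_iter=50, seed=0):
    """Fit PLSA with EM on a document-term count matrix (list of lists).

    Returns p_w_z[z][w] = P(w|z) and p_z_d[d][z] = P(z|d).
    """
    rng = random.Random(seed)
    n_docs, n_words = len(counts), len(counts[0])
    # Random positive initialisation of both distributions.
    p_w_z = [normalize([rng.random() + 1e-3 for _ in range(n_words)])
             for _ in range(n_topics)]
    p_z_d = [normalize([rng.random() + 1e-3 for _ in range(n_topics)])
             for _ in range(n_docs)]

    for _ in range(n_iter):
        # Small constant keeps every probability strictly positive.
        new_w_z = [[1e-12] * n_words for _ in range(n_topics)]
        new_z_d = [[1e-12] * n_topics for _ in range(n_docs)]
        for d in range(n_docs):
            for w in range(n_words):
                n = counts[d][w]
                if n == 0:
                    continue
                # E-step: posterior P(z|d,w) is proportional to P(w|z) * P(z|d).
                post = normalize([p_w_z[z][w] * p_z_d[d][z]
                                  for z in range(n_topics)])
                # M-step: accumulate expected counts.
                for z in range(n_topics):
                    new_w_z[z][w] += n * post[z]
                    new_z_d[d][z] += n * post[z]
        p_w_z = [normalize(row) for row in new_w_z]
        p_z_d = [normalize(row) for row in new_z_d]
    return p_w_z, p_z_d


def select_features(docs_by_category, vocab, n_topics=2, top_k=2):
    """Merge the top_k words of every latent topic of every category."""
    selected = set()
    for counts in docs_by_category.values():
        p_w_z, _ = plsa(counts, n_topics)
        for topic in p_w_z:
            # Assumption: rank words by P(w|z) as the word-topic correlation.
            ranked = sorted(range(len(vocab)), key=lambda w: topic[w],
                            reverse=True)
            selected.update(vocab[w] for w in ranked[:top_k])
    return selected


# Toy, made-up counts: rows are documents, columns follow `vocab`.
vocab = ["goal", "match", "team", "stock", "market", "bank"]
docs_by_category = {
    "sports": [[3, 2, 1, 0, 0, 0], [1, 4, 2, 0, 0, 0]],
    "finance": [[0, 0, 0, 2, 3, 1], [0, 0, 0, 4, 1, 2]],
}
features = select_features(docs_by_category, vocab)
print(sorted(features))
```

In this sketch each category is modelled separately, so the merged set can contain up to `n_topics * top_k` words per category; how many topics and words per topic the paper actually uses is not stated in the abstract.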
