WEI Chuyuan, LI Fangfang, ZHAN Qiang, FAN Xiongzhong. Coupled Matrix Factorization for Question Similarity[J]. Chinese Journal of Electronics, 2016, 25(4): 665-671. doi: 10.1049/cje.2016.06.034
Citation: WEI Chuyuan, LI Fangfang, ZHAN Qiang, FAN Xiongzhong. Coupled Matrix Factorization for Question Similarity[J]. Chinese Journal of Electronics, 2016, 25(4): 665-671. doi: 10.1049/cje.2016.06.034

Coupled Matrix Factorization for Question Similarity

doi: 10.1049/cje.2016.06.034
Funds:  This work is supported by the National Natural Science Foundation of China (No.61371194, No.61462004), and the National Basic Research Program of China (973 Program) (No.2013CB329303).
  • Received Date: 2015-07-06
  • Rev Recd Date: 2015-10-19
  • Publish Date: 2016-07-10
  • Community question answering (CQA) has provided an increasingly popular service where users ask and answer questions and access historical question-answer pairs. As a fundamental task in CQA, question similarity measure is to compute the similarity between the queried question and the historical questions which have been solved by other users. We mine and use the most important semantic features as the semantic representation of questions, and try to incorporate the couplings of semantic features into vector space model. We propose Coupled question similarity (CQS) model, and compute the similarity in matrix factorization framework. Experiments conducted on real CQA data sets demonstrate that with the incorporation of such couplings, the performance of sentence similarity is improved compared to a variety of baseline methods significantly.
  • loading
  • H. Duan, Y. Cao, Y. Lin and Y. Yu, "Searching questions by identifying questions topics and question focus", Proc. of ACL-08:HLT, Ohio, USA, pp.156-164, 2008.
    W. Zhang, T. Liu, Y. Yang, L. Cao, Y. Zhang and R. Ji, "A topic clustering approach to finding similar questions from large question and answer archives", PLoS ONE, Vol.9, No.3, pp.1-8, 2014.
    F. Mandreoli, R. Martoglia and P. Tiberio, "A syntactic approach for searching similarities within sentences", Proc. of the Eleventh International Conference on Information and Knowledge Management, VA, USA, pp.635-637, 2002.
    K. Wang, Z. Ming and T.S. Chua, "A syntactic tree matching approach to finding similar questions in community-based qa services", Proc. of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, USA, pp.187-194, 2009.
    P. Achananuparp, X. Hu and X. Shen, "The evaluation of sentence similarity measures", Data Warehousing and Knowledge Discovery, Lecture Notes in Computer Science, Vol.5182, pp.305-316, 2008.
    W. Guo and M. Diab, "Modelling sentences in the latent space", Proc. of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Korea, pp.864-872, 2012.
    T.K. Landauer, D. Laham, B. Rehder, et al., "How well can passage meaning be derived without using word order? A comparison of latent semantic analysis and humans", Proc. of 19th Annual Meeting of the Cognitive Science Society, Lawrence Erlbaum, Mawhwah, NJ, pp.412-417, 1997.
    T. Hofmann, "Probabilistic latent semantic analysis", Proc. of the 15th Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, pp.289-296, 1999.
    D.M. Blei, A. Ng and M. Jordan, "Latent dirichlet allocation", Journal of Machine Learning Research, Vol.3, pp.993-1022, 2003.
    A. Celikyilmaz, D. Hakkani-Tur and G. Tur, "LDA based similarity modeling for question answering", Proc. of the NAACL HLT 2010 Workshop on Semantic Search, Los Angeles, USA, pp.1-9, 2010.
    N. Srebro and T. Jaakkola, "Weighted low-rank approximations", Proc. of the 20th International Conference on Machine Learning, Washington, DC, USA, 2003.
    J. Jeon, W.B. Croft and J.H. Lee, "Finding similar questions in large question and answer archives", Proc. of the 14th ACM International Conference on Information and Knowledge Management, Bremen, Germany, pp.84-90, 2005.
    V. Jijkoun and D.E. Rijke, "Retrieving answers from frequently asked questions pages on the web", Proc. of the 14th ACM International Conference on Information and Knowledge Management, Bremen, Germany, pp.76-83, 2005.
    G. Salton, "Theory of Indexing", Society for Industrial and Applied Mathematics, Philadelphia, PA, 1975.
    X. Hu, N. Sun, C. Zhang and T. Chua, "Exploiting internal and external semantics for the clustering of short texts using world knowledge", Proc. of CIKM'09, Hong Kong, China, pp.919-928, 2009.
    Z. Ji, F. Xu, B. Wang and B. He, "Question-answer topic model for question retrieval in community question answering", Proc. of the 21st ACM International Conference on Information and Knowledge Management, HI, USA, pp.2471-2474, 2012.
    A. Celikyilmaz and D. Hakkani-Tur, "A graph-based semi-supervised learning for question semantic labeling", Proc. of the NAACL HLT 2010 Workshop on Semantic Search, Los Angeles, USA, pp.27-35, 2010.
    A. Hotho, S. Staab and G. Stumme, "Wordnet improves text document clustering", Proc. of the Semantic Web Workshop at SIGIR-2003, 26th Annual International ACM SIGIR Conference, Toronto, Canada, 2003.
    E. Gabrilovich and S. Markovitch, "Overcoming the brittleness bottleneck using wikipedia:Enhancing text categorization with encyclopedic knowledge", Proc. of AAAI, Boston, MA, pp.1301-1306, 2006.
    X. Cao, G. Cong, B. Cui and Jensen, "A generalized framework of exploring category information for question retrieval in community question answer archives", Proc. of the 18th ACM Conference on Information and Knowledge Management, Hongkong, China, pp.265-274, 2009.
    G. Zhou, L. Cai, L. Zhao and K. Liu, "Phrase-based translation model for question retrieval in community question answer archives", Proc. of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, USA, pp.653-662, 2011.
    B. Qu, G. Cong, C. Li, A. Sun and H. Chen, "An evaluation of classification models for question topic categorization", Journal of the American Society for Information Science and Technology, Vol.63, No.5, pp.889-903, 2012.
    G. Zhou, Y. Chen, D. Zeng and J. Zhao, "Group non-negative matrix factorization with natural categories for question retrieval in community question answer archives", Proc. of the 25th International Conference on Computational Linguistics, Dublin, Ireland, pp.89-98, 2014.
    H. Wen, G. Ding, C. Liu and J. Wang, "Matrix factorization meets cosine similarity:Addressing sparsity problem in collaborative filtering recommender system", Web Technologies and Applications Lecture Notes in Computer Science, Vol.8709, pp.306-317, 2014.
    P. Achananuparp, X. Hu, X. Zhou and X. Zhang, "Utilizing sentence similarity and question type similarity to response to similar questions in knowledge-sharing community", Proc. of QA Web 2008 Workshop, Beijing, China, 2008.
    Y. Li, M.D. McLean, Z.A. Bandar, "Sentence similarity based on semantic nets and corpus statistics", IEEE Transactions on Knowledge and Data Engineering, Vol.18, No.8, pp.1138-1150, 2006.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (159) PDF downloads(505) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return