WANG Junhua, ZUO Wanli, PENG Tao. Hyponymy Graph Model for Word Semantic Similarity Measurement[J]. Chinese Journal of Electronics, 2015, 24(1): 96-101.
Citation: WANG Junhua, ZUO Wanli, PENG Tao. Hyponymy Graph Model for Word Semantic Similarity Measurement[J]. Chinese Journal of Electronics, 2015, 24(1): 96-101.

Hyponymy Graph Model for Word Semantic Similarity Measurement

Funds:  This work is supported by the National Natural Science Foundation of China (No.60973040, No.61300148, No.60903098), and the Key Scientific and Technological Break-through Program of Jilin Province (No.20130206051GX).
More Information
  • Corresponding author: ZUO Wanli was born in 1957, received the B.E., M.S. and Ph.D. degrees from Jilin University in 1982, 1985, and 2005, respectively. He is currently a professor and doctor supervisor at the College of Computer Science and Technology, Jilin University. He is also an ACM professional member, CCF distinguished member, member of System Software Disciplinary Committee of CCF. His research interest include database, Web mining, information retrieval, machine learning, and natural language processing. (Email: wanli@jlu.edu.cn)
  • Received Date: 2013-05-01
  • Rev Recd Date: 2013-09-01
  • Publish Date: 2015-01-10
  • Measuring word semantic similarity is a generic problem with a broad range of applications such as ontology mapping, computational linguistics and artificial intelligence. Previous approaches to computing word semantic similarity did not consider concept occurrence frequency and word's sense number. This paper introduced Hyponymy graph, and based on which proposed a novel word semantic similarity model. For two words to be compared, we first retrieve their related concepts; then produce lowest common ancestor matrix and distance matrix between concepts; finally calculate distance-based similarity and information-based similarity, which are integrated to get final semantic similarity. The main contribution of our method is that both concept occurrence frequency and word's sense number are taken into account. This similarity measurement more closely fits with human rating and effectively simulates human thinking process. Our experimental results on benchmark dataset M&C and R&G with WordNet2.1 as platform demonstrate roughly 0.9%-1.2% improvements over existing best approaches.
  • loading
  • Yang Yuehua, Du Junping and He Bowei, "A novel ontologybased semantic retrieval model for food safety domain", Chinese Journal of Electronics, Vol.22, No.2, pp.247-252, 2013.
    Wang Xiaoyan, Bao Tie, Y. V. Ramana Reddy, Liu Shufen and Wang Luyi, "An ontology centric approach for building collaborative tagging-based systems to manage personal knowledge in KAM", Chinese Journal of Electronics, Vol.22, No.3, pp.442- 448, 2013.
    Yang Yuehua, Du Junping and Zi Lingling, "Bootstrappingbased automatic acquisition of domain concepts for ontology construction", Chinese Journal of Electronics, Vol.22, No.2, pp.313-318, 2013.
    Philip Resnik, "Using information content to evaluate semantic similarity in a taxonomy", Proc. of the 14th International Joint Conference for Artificial Intelligence, Montreal, Canada, pp.448-453, 1995.
    Zhibiao Wu and Martha Palmer, "Verbs semantics and lexical selection", Proc. of the 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, pp.133-138, 1994.
    Eneko Agirre and German Rigau, "A proposal for word sense disambiguation using conceptual distance", Proc. of 1st International Conference on Recent Advances in Natural Language Processing, Bulgaria, pp.35-43, 1995.
    Jay J. Jiang and David W. Conrath, "Semantic similarity based on corpus statistics and lexical taxonomy", Proc. of International Conference on Research in Computational Linguistics, Taiwan, pp.1-15, 1997.
    Dekang Lin, "An information-theoretic definition of similarity", Proc. of the 15th International Conference on Machine Learning, San Francisco, CA, USA, pp.296-304, 1998.
    Claudia Leacock and Martin Chodorow, "Combining local context and WordNet similarity for word sense identification", in book: WordNet: An Electronic Lexical Database, C. Fellbaum Ed., MIT Press, pp.265-283, 1998.
    Graeme Hirst and David St-Onge, "Lexical chains as representations of context for the detection and correction of malapropisms", WordNet: An Electronic Lexical Database, C. Fellbaum Ed., MIT Press, pp.305-332, 1998.
    Yuhua Li, Zuhair A. Bandar and David McLean, "An approach for measuring semantic similarity between words using multiple information sources", IEEE Transactions on Knowledge and Data Engineering, Vol.15, No.4, pp.871-882, 2003.
    Dongqiang Yang and David M.W. Powers, "Measuring semantic similarity in the taxonomy of WordNet", Proc. of the 28th Australasian Computer Science Conference, Newcastle, Australia, pp.315-322, 2005.
    Alexander Budanitsky and Graeme Hirst, "Evaluating WordNet-based measures of lexical semantic relatedness", Computational Linguistics, Vol.32, No.1, pp.13-47, 2006.
    Marco A. Alvarez and SeungJin Lim, "A graph modeling of semantic similarity between words", Proc. of International Conference on Semantic Computing, Irvine, California, USA, pp.355-362, 2007.
    Peng Qin, Zhao Lu, Yu Yan and Fang Wu, "A new measure of word semantic similarity based on WordNet hierarchy and DAG theory", Proc. of International Conference on Web Information Systems and Mining, Shanghai, China, pp.181-185, 2009.
    Giuseppe Pirrò, "A semantic similarity metric combining features and intrinsic information content", Data & Knowledge Engineering, Vol.68, No.11, pp.1289-1308, 2009.
    Songmei Cai and Zhao Lu, "An improved semantic similarity measure for word pairs", Proc. of 2010 International Conference on e-Education, e-Business, e-Management and e- Learning, Sanya, China, pp.212-216, 2010.
    David Sánchez, Montserrat Batet and David Isern, "Ontologybased information content computation", Knowledge-Based Systems, Vol.24, No.2, pp.297-303, 2011.
    David Sánchez, Montserrat Batet, David Isern and Aïda Valls, "Ontology-based semantic similarity: A new feature-based approach", Expert Systems with Applications, Vol.39, No.9, pp.7718-7728, 2012.
    Hongzhe Liu, Hong Bao and De Xu, "Concept vector for semantic similarity and relatedness based on WordNet structure", Journal of Systems and Software, Vol.85, No.2, pp.370-381, 2012.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (212) PDF downloads(1350) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return