YANG Yuehua, DU Junping, ZI Lingling. Bootstrapping-based Automatic Acquisition of Domain Concepts for Ontology Construction[J]. Chinese Journal of Electronics, 2013, 22(2): 313-318.
Citation: YANG Yuehua, DU Junping, ZI Lingling. Bootstrapping-based Automatic Acquisition of Domain Concepts for Ontology Construction[J]. Chinese Journal of Electronics, 2013, 22(2): 313-318.

Bootstrapping-based Automatic Acquisition of Domain Concepts for Ontology Construction

Funds:  This work is supported by the National Basic Research Program of China (973 Program) (No.2012CB821200 No.2012CB821206), the National Natural Science Foundation of China (No.91024001, No.61070142), and Beijing Natural Science Foundation (No.4111002).
  • Received Date: 2012-04-01
  • Rev Recd Date: 2012-05-01
  • Publish Date: 2013-04-25
  • The domain concept is one of the key elements of ontology. In order to automatically acquire the domain concepts during the domain ontology construction, we proposed a novel Bootstrapping-based automatic acquisition algorithm of domain concepts. In our work, the compound words are extracted according to the combination conditions of the mutual information and the information entropy. The candidate domain concepts determinant conditions based on the co-occurrence sentences frequency are presented. Besides, to avoid omitting the domain concepts with the lower frequency or semantically similar to other domain concepts, the semantic factor is introduced. The experiments results have demonstrated that the compound domain concepts and the semantically similar domain concepts with the lower sentences frequency can also be extracted by using the proposed algorithm. And the proposed algorithm has obtained higher precision and recall, so it is effective and feasible.
  • loading
  • Hou Xin, Zhang Xutang, Jin Tianguo, Peng Gaoliang, LiuWenjian, "Automatic construction of domain ontology oriented to knowledge and information management", Computer Integrated Manufacturing Systems, Vol.17, No.1, pp.159-170, 2011.
    Ji Peipei, Yan Xiaoyan, Cen Yonghua, "A survey of term recognition and extraction for domain-specific Chinese text information processing", Library and Information Service, Vol.54, No.16, pp.124-129, 2010.
    Li Weigang, Liu Ting, Li Sheng, "Automated entity relation tuple extraction using Web mining", Acta Electronica Sinica, Vol.35, No.11, pp.2111-2116, 2007. (in Chinese)
    Gu Jun, Wang Hao, "Study on term extraction on the basis of chinese domain texts", New Technology of Library and Information Service, Vol.27, No.4, pp.29-34, 2011.
    Hazen Rebecca, Esbroeck Alex, Mongkolwat Pat, Channin David, "Automatic extraction of concepts to extend RadLex", Journal of Digital Imaging, Vol.24, No.1, pp.165-169, 2011.
    MohammadSyafrullah, NaomieSalim, "Improving term extraction using particle swarm optimization techniques", Journal of Computing, Vol.2, No.2, pp.116-120, 2010.
    José Luis Ochoa, ′Agela Almela, Maria Luisa Hern′andez- Alcaraz, Rafael Valencia-García, "Learning morphosyntactic patterns for multiword term extraction", Scientific Research and Essays, Vol.6, No.26, pp.5563-5578, 2011.
    Shams, R., Shahnawaz Chowdhury, M.S.A., Abu Saleh Shawon, S.M., "Domain-specific textual commonsense concept acquisition using a corpus", Proc. of 2011 International Conference on Communications, Computing and Control Applications (CCCA), Hong Kong, China, pp.1-6, 2011.
    Manabu Torii, Kavishwar Wagholikar, Hongfang Liu, "Using machine learning for concept extraction on clinical documents from multiple data sources", J. Am. Med. Inform. Assoc., Vol.18, No.5, pp.580-587, 2011.
    J.D. Cohen, "Highlights: Language domain-independent automatic indexing terms for abstracting", Journal of the American Soeiety Information Science, Vol.46, No.3, pp.162-174, 2007.
    K.T. Frantzi, S. Ananinadou, "The C-Value/NC-Value domain independent method for multi-word term extraction", Journal of Natural Language Processing, Vol.6, No.3, pp.145-179, 2008.
    Wei Xiaoli, Sun Yong, Zhang Shu, Miao Yan, "Ontological concept extraction method based on maximum entropy model", Computer Engineering, Vol.35, No.24, pp.114-116, 2009.
    Wang Hongbin, Liu Daxin, Wang Nianbin, "Research on sieving algorithm of domain-specific concept from ontology learning", Systems Engineering and Electronics, Vol.32, No.1, pp.175-178, 2010.
    Steven Abney, "Bootstrapping", Proc. of the 40th Annual Meeting of the Association for Computational Linguistics (ACL-02), Philadelphia, PA, USA, pp.360-367, 2002.
    Aviv Segev, Quan Z. Sheng, "Bootstrapping ontologies for web services", IEEE Transactions on Services Computing, Vol.5, No.1, pp.33-44, 2012.
    Chen Wenliang, Zhu Jingbo, Yao Tianshun, Zhang Yuxin, "Automatic learning field words by bootstrapping", Proc. of JSCL- 2003, Harbin, China, pp.67-72, 2003.
    Michael Thelen and Ellen Riloff, "A bootstrapping method for learning semantic lexicons using extraction pattern contexts", Proc. of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, pp.214-221, 2002.
    Zhang Yufang, Yang Fen, Xiong Zhongyang, Chen Xiaoli, "Study on context based domain ontology concept extraction and relation extraction", Application Research of Computers, Vol.27, No.1, pp.74-76, 2010.
  • 加载中


    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (367) PDF downloads(1081) Cited by()
    Proportional views


    DownLoad:  Full-Size Img  PowerPoint