WANG Xuesong, CHENG Yuhu, LI Lijing, “Protein Function Prediction Based on Active Semi-supervised Learning,” Chinese Journal of Electronics, vol. 25, no. 4, pp. 595-600, 2016, doi: 10.1049/cje.2016.07.005
Citation: WANG Xuesong, CHENG Yuhu, LI Lijing, “Protein Function Prediction Based on Active Semi-supervised Learning,” Chinese Journal of Electronics, vol. 25, no. 4, pp. 595-600, 2016, doi: 10.1049/cje.2016.07.005

Protein Function Prediction Based on Active Semi-supervised Learning

doi: 10.1049/cje.2016.07.005
Funds:  This work is supported by the Fundamental Research Funds for the Central Universities (No.2013RC10, No.2013RC12, No.2014YC07).
More Information
  • Corresponding author: CHENG Yuhu (corresponding author) received the Ph.D. degree from the Institute of Automation, Chinese Academy of Sciences in 2005. He is currently a professor in the School of Information and Electrical Engineering, China University of Mining and Technology. His main research interests include machine learning and intelligent system. (Email:chengyuhu@163.com)
  • Received Date: 2014-05-04
  • Rev Recd Date: 2014-06-30
  • Publish Date: 2016-07-10
  • In our study, the active learning and semi-supervised learning methods are comprehensively used for label delivery of proteins with known functions in Protein-protein interaction (PPI) network so as to predict the functions of unknown proteins. Because the real PPI network is generally observed with overlapping protein nodes with multiple functions, the mislabeling of overlapping protein may result in accumulation of prediction errors. For this reason, prior to executing the label delivery process of semi-supervised learning, the adjacency matrix is used to detect overlapping proteins. As the topological structure description of interactive relation between proteins, PPI network is observed with party hub protein nodes that play an important role, in co-expression with its neighborhood. Therefore, to reduce the manual labeling cost, party hub proteins most beneficial for improvement of prediction accuracy are selected for class labeling and the labeled party hub proteins are added into the labeled sample set for semi-supervised learning later. As the experimental results of real yeast PPI network show, the proposed algorithm can achieve high prediction accuracy with few labeled samples.
  • loading
  • Z.R. Zhou, X.F. Song and M.H. Wang, "Predicting protein-protein interactions based on ensemble classifiers", Acta Electronica Sinica, Vol.38, No.6, pp.1464-1467, 2010. (in Chinese)
    B. Titz, M. Schlesner and P. Uetz, "What do we learn from high-throughput protein interaction data", Expert Review of Proteomics, Vol.1, No.1, pp.111-121, 2004.
    B. Schwikowski, P. Uetz and S. Fields, "A network of interacting proteins in yeast", Nature Biotechnology, Vol.18, No.12, pp.1257-1261, 2000.
    H.N. Chua, W.K. Sung and L. Wong, "Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions", Lecture Notes in Computer Science, Vol.3916, pp.1-9, 2006.
    M.P. Samanta and S.D. Liang, "Predicting protein functions from redundancies in large-scale protein interaction networks", Proceedings of the National Academy of Sciences of the United States of American, Vol.100, No.22, pp.12579-12583, 2003.
    A. Vazquez, A. Flammini, A. Maritan, et al., "Global protein function prediction from protein-protein interaction networks", Nature Biotechnology, Vol.21, No.6, pp.697-700, 2003.
    U. Karaoz, T.M. Murali, S. Letovsky, et al., "Whole-genome annotation by using evidence integration in functional-linkage networks", Proceedings of the National Academy of Sciences of the United States of American, Vol.101, No.9, pp.2888-2893, 2004.
    S. Sun, Y. Zhao, Y. Jiao, et al., "Faster and more accurate global protein function assignment from protein interaction networks using the MFGO algorithm", FEBS Letters, Vol.580, No.7, pp.1891-1896, 2006.
    E. Nabieva, K. Jim, A. Agarwal, et al., "Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps", Bioinformatics, Vol.21, No.s1, pp.302-310, 2005.
    X.J. Lei, S. Wu, L. Ge, et al., "Clustering PPI data based on ant colony optimization algorithm", Chinese Journal of Electronics, Vol.22, No.1, pp.118-123, 2013.
    L.S. Li, D.G. Huang, M. Wang, et al., "Integrating active learning strategy to the ensemble kernel-based method for protein-protein interaction extraction", Chinese Journal of Electronics, Vol.22, No.1, pp.41-45, 2013.
    D.Y. Zhou, O. Bousquet, T.N. Lal, et al., "Learning with local and global consistency", Proceedings of Advances in Neural Information Processing Systems, Vancouver, British Columbia, pp.321-328, 2003.
    X.S. Wang, Y.H. Cheng and Jie Ji, "Semi-supervised regression algorithm based on an optimal combined graph", Chinese Journal of Electronics, Vol.22, No.4, pp.724-728, 2013.
    X.S. Wang, X.L. Zhang and Y.H. Cheng, "Barebones learning with local and global consistency", Control and Decision, Vol.26, No.11, pp.1726-1730, 2011. (in Chinese)
    I.K. Jordan, Y.I. Wolf and E.V. Koonin, "No simple dependence between protein evolution rate and the number of protein-protein interactions:Only the most prolific interactors tend to evolve slowly", BMC Evolutionary Biology, Vol.3, No.1, pp.1-8, 2003.
    I.W. Taylor, R. Linding, D.W. Farley, et al., "Dynamic modularity in protein interaction networks predicts breast cancer outcome", Nature Biotechnology, Vol.27, No.2, pp.199-204, 2009.
    J.D.J. Han, N. Bertin, T. Hao, et al., "Evidence for dynamically organized modularity in the yeast protein-protein interaction network", Nature, Vol.430, No.6995, pp.88-93, 2004.
    S. Mostafavi and Q. Morris, "Fast integration of heterogeneous data sources for predicting gene function with limited annotation", Bioinformatics, Vol.26, No.14, pp.1759-1765, 2010.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (482) PDF downloads(884) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return