Volume 33 Issue 1
Jan.  2024
Turn off MathJax
Article Contents
Yajing GUO, Xiujuan LEI, Yi PAN, “An Encoding-Decoding Framework Based on CNN for circRNA-RBP Binding Sites Prediction,” Chinese Journal of Electronics, vol. 33, no. 1, pp. 256–263, 2024 doi: 10.23919/cje.2022.00.361
Citation: Yajing GUO, Xiujuan LEI, Yi PAN, “An Encoding-Decoding Framework Based on CNN for circRNA-RBP Binding Sites Prediction,” Chinese Journal of Electronics, vol. 33, no. 1, pp. 256–263, 2024 doi: 10.23919/cje.2022.00.361

An Encoding-Decoding Framework Based on CNN for circRNA-RBP Binding Sites Prediction

doi: 10.23919/cje.2022.00.361
More Information
  • Author Bio:

    Yajing GUO is currently pursuing the Ph.D. degree in the School of Computer Science at Shaanxi Normal University, Xi’an, China. Her research interests include bioinformatics and deep learning. (Email: guoyajing@snnu.edu.cn)

    Xiujuan LEI is a Professor in the School of Computer Science at Shaanxi Normal University, Xi’an, China. Her research interests include intelligent computing and bioinformatics. (Email: xjlei@snnu.edu.cn)

    Yi PAN is currently a Professor of the Faculty of Computer Science and Control Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China. He has served as Chair of Computer Science Department at Georgia State University during 2005–2020. His current research interests mainly include bioinformatics and health informatics using big data analytics, cloud computing, and machine learning technologies. (Email:yi.pan@siat.ac.cn)

  • Corresponding author: Email: xjlei@snnu.edu.cn
  • Received Date: 2022-11-01
  • Accepted Date: 2023-04-17
  • Available Online: 2023-08-08
  • Publish Date: 2024-01-05
  • Predicting RNA binding protein (RBP) binding sites on circular RNAs (circRNAs) is a fundamental step to understand their interaction mechanism. Numerous computational methods are developed to solve this problem, but they cannot fully learn the features. Therefore, we propose circ-CNNED, a convolutional neural network (CNN)-based encoding and decoding framework. We first adopt two encoding methods to obtain two original matrices. We preprocess them using CNN before fusion. To capture the feature dependencies, we utilize temporal convolutional network (TCN) and CNN to construct encoding and decoding blocks, respectively. Then we introduce global expectation pooling to learn latent information and enhance the robustness of circ-CNNED. We perform circ-CNNED across 37 datasets to evaluate its effect. The comparison and ablation experiments demonstrate that our method is superior. In addition, motif enrichment analysis on four datasets helps us to explore the reason for performance improvement of circ-CNNED.
  • loading
  • [1]
    C. X. Liu and L. L. Chen, “Circular RNAs: Characterization, cellular roles, and applications,” Cell, vol. 185, no. 13, article no. 2390, 2022. doi: 10.1016/j.cell.2022.06.001
    [2]
    Z. Y. Wu, X. Yu, S. J. Zhang, et al., “Mechanism underlying circRNA dysregulation in the TME of digestive system cancer,” Frontiers in Immunology, vol. 13, article no. 951561, 2022. doi: 10.3389/fimmu.2022.951561
    [3]
    X. X. Zeng, Y. Zhong, W. Lin, et al., “Predicting disease-associated circular RNAs using deep forests combined with positive-unlabeled learning methods,” Briefings in Bioinformatics, vol. 21, no. 4, pp. 1425–1436, 2020. doi: 10.1093/bib/bbz080
    [4]
    Y. Pan, X. J. Lei, and Y. C. Zhang, “Association predictions of genomics, proteinomics, transcriptomics, microbiome, metabolomics, pathomics, radiomics, drug, symptoms, environment factor, and disease networks: A comprehensive approach,” Medicinal Research Reviews, vol. 42, no. 1, pp. 441–461, 2022. doi: 10.1002/med.21847
    [5]
    C. C. Wang, C. D. Han, Q. Zhao, et al., “Circular RNAs and complex diseases: From experimental results to computational models,” Briefings in Bioinformatics, vol. 22, no. 6, article no. bbab286, 2021. doi: 10.1093/bib/bbab286
    [6]
    L. S. Kristensen, T. Jakobsen, H. Hager, et al., “The emerging roles of circRNAs in cancer and oncology,” Nature reviews Clinical oncology, vol. 19, no. 3, pp. 188–206, 2022. doi: 10.1038/s41571-021-00585-y
    [7]
    X. Dong, K. Chen, W. B. Chen, et al., “circRIP: An accurate tool for identifying circRNA-RBP interactions,” Briefings in Bioinformatics, vol. 23, no. 4, article no. bbac186, 2022. doi: 10.1093/bib/bbac186
    [8]
    W. W. Du, C. Zhang, W. N. Yang, et al., “Identifying and characterizing circRNA-protein interaction,” Theranostics, vol. 7, no. 17, pp. 4183–4191, 2017. doi: 10.7150/thno.21299
    [9]
    W. W. Du, W. N. Yang, E. Liu, et al., “Foxo3 circular RNA retards cell cycle progression via forming ternary complexes with p21 and CDK2,” Nucleic Acids Research, vol. 44, no. 6, pp. 2846–2858, 2016. doi: 10.1093/nar/gkw027
    [10]
    S. B. Qu, X. S. Yang, X. L. Li, et al., “Circular RNA: A new star of noncoding RNAs,” Cancer Letters, vol. 365, no. 2, pp. 141–148, 2015. doi: 10.1016/j.canlet.2015.06.003
    [11]
    M. Gagliardi and M. R. Matarazzo, “RIP: RNA immunoprecipitation,” in Polycomb Group Proteins, C. Lanzuolo and B. Bodega, Eds. Humana Press, New York, NY, USA, pp. 73–86, 2016.
    [12]
    C. Barnes and A. Kanhere, “Identification of RNA-protein interactions through in vitro RNA pull-down assays,” in Polycomb Group Proteins, C. Lanzuolo and B. Bodega, Eds. Humana Press, New York, NY, USA, pp. 99–113, 2016.
    [13]
    D. D. Licatalosi, A. Mele, J. J. Fak, et al., “HITS-CLIP yields genome-wide insights into brain alternative RNA processing,” Nature, vol. 456, no. 7221, pp. 464–469, 2008. doi: 10.1038/nature07488
    [14]
    P. Glažar, P. Papavasileiou, and N. Rajewsky, “circBase: A database for circular RNAs,” RNA, vol. 20, no. 11, pp. 1666–1670, 2014. doi: 10.1261/rna.043687.113
    [15]
    D. B. Dudekula, A. C. Panda, I. Grammatikakis, et al., “CircInteractome: A web tool for exploring circular RNAs and their interacting proteins and microRNAs,” RNA Biology, vol. 13, no. 1, pp. 34–42, 2016. doi: 10.1080/15476286.2015.1128065
    [16]
    S. Ghosal, S. Das, R. Sen, et al., “Circ2Traits: A comprehensive database for circular RNA potentially associated with disease and traits,” Frontiers in Genetics, vol. 4, article no. 283, 2013. doi: 10.3389/fgene.2013.00283
    [17]
    X. P. Chen, P. Han, T. Zhou, et al., “circRNADb: A comprehensive database for human circular RNAs with protein-coding annotations,” Scientific Reports, vol. 6, no. 1, article no. 34985, 2016. doi: 10.1038/srep34985
    [18]
    M. Liu, Q. Wang, J. Shen, et al., “Circbank: A comprehensive database for circRNA with standard nomenclature,” RNA Biology, vol. 16, no. 7, pp. 899–905, 2019. doi: 10.1080/15476286.2019.1600395
    [19]
    Y. J. Guo, X. J. Lei, L. Liu, et al., “circ2CBA: Prediction of circRNA-RBP binding sites combining deep learning and attention mechanism,” Frontiers of Computer Science, vol. 17, article no. 175904, 2022. doi: 10.1007/s11704-022-2151-0
    [20]
    P. P. Li and Z. P. Liu, “PST-PRNA: Prediction of RNA-binding sites using protein surface topography and deep learning,” Bioinformatics, vol. 38, no. 8, pp. 2162–2168, 2022. doi: 10.1093/bioinformatics/btac078
    [21]
    X. J. Lei, T. B. Mudiyanselage, Y. C. Zhang, et al., “A comprehensive survey on computational methods of non-coding RNA and disease association prediction,” Briefings in bioinformatics , vol. 22, no. 4, article no. bbaa350, 2021. doi: 10.1093/bib/bbaa350
    [22]
    M. T. Niu, Q. Zou, and C. Y. Wang, “GMNN2CD: Identification of circRNA-disease associations based on variational inference and graph Markov neural networks,” Bioinformatics, vol. 38, no. 8, pp. 2246–2253, 2022. doi: 10.1093/bioinformatics/btac079
    [23]
    Z. F. Wang and X. J. Lei, “Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture,” BMC Bioinformatics, vol. 22, no. 1, article no. 19, 2021. doi: 10.1186/s12859-020-03942-3
    [24]
    K. M. Zhang, X. Y. Pan, Y. Yang, et al., “CRIP: Predicting circRNA-RBP-binding sites using a codon-based encoding and hybrid deep neural networks,” RNA, vol. 25, no. 12, pp. 1604–1615, 2019. doi: 10.1261/rna.070565.119
    [25]
    Z. F. Wang and X. J. Lei, “Prediction of RBP binding sites on circRNAs using an LSTM-based deep sequence learning architecture,” Briefings in Bioinformatics, vol. 22, no. 6, article no. bbab342, 2021. doi: 10.1093/bib/bbab342
    [26]
    Y. N. Yang, Z. L. Hou, Y. S. Wang, et al., “HCRNet: High-throughput circRNA-binding event identification from CLIP-seq data using deep temporal convolutional network,” Briefings in Bioinformatics, vol. 23, no. 2, article no. bbac027, 2022. doi: 10.1093/bib/bbac027
    [27]
    S. J. Bai, J. Z. Kolter, and V. Koltun, “An empirical evaluation of generic convolutional and recurrent networks for sequence modeling,” arXiv preprint, arXiv: 1803.01271, 2018.
    [28]
    Z. F. Wang and X. J. Lei, “A web server for identifying circRNA-RBP variable-length binding sites based on stacked generalization ensemble deep learning network,” Methods, vol. 205, pp. 179–190, 2022. doi: 10.1016/j.ymeth.2022.06.014
    [29]
    Y. Huang, B. F. Niu, Y. Gao, et al., “CD-HIT Suite: A web server for clustering and comparing biological sequences,” Bioinformatics, vol. 26, no. 5, pp. 680–682, 2010. doi: 10.1093/bioinformatics/btq003
    [30]
    Y. J. Guo and X. J. Lei, “A pseudo-Siamese framework for circRNA-RBP binding sites prediction integrating BiLSTM and soft attention mechanism,” Methods, vol. 207, pp. 57–64, 2022. doi: 10.1016/J.YMETH.2022.09.003
    [31]
    X. Luo, X. M. Tu, Y. Ding, et al., “Expectation pooling: An effective and interpretable pooling method for predicting DNA-protein binding,” Bioinformatics, vol. 36, no. 5, pp. 1405–1412, 2020. doi: 10.1093/bioinformatics/btz768
    [32]
    C. Z. Jia, Y. Bi, J. X. Chen, et al., “PASSION: An ensemble neural network approach for identifying the binding sites of RBPs on circRNAs,” Bioinformatics, vol. 36, no. 15, pp. 4276–4282, 2020. doi: 10.1093/bioinformatics/btaa522
    [33]
    M. T. Niu, Q. Zou, and C. Lin, “CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach,” PLoS Computational Biology, vol. 18, no. 1, article no. e1009798, 2022. doi: 10.1371/journal.pcbi.1009798
    [34]
    Y. N. Yang, Z. L. Hou, Z. Q. Ma, et al., “iCircRBP-DHN: Identification of circRNA-RBP interaction sites using deep hierarchical network,” Briefings in Bioinformatics, vol. 22, no. 4, article no. bbaa274, 2021. doi: 10.1093/bib/bbaa274
    [35]
    R. C. McLeay and T. L. Bailey, “Motif Enrichment Analysis: A unified framework and an evaluation on ChIP data,” BMC Bioinformatics, vol. 11, article no. 165, 2010. doi: 10.1186/1471-2105-11-165
    [36]
    T. L. Bailey and P. Machanick, “Inferring direct DNA binding from ChIP-seq,” Nucleic Acids Research, vol. 40, no. 17, article no. e128, 2012. doi: 10.1093/nar/gks433
    [37]
    P. Machanick and T. L. Bailey, “MEME-ChIP: Motif analysis of large DNA datasets,” Bioinformatics, vol. 27, no. 12, pp. 1696–1697, 2011. doi: 10.1093/bioinformatics/btr189
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(11)  / Tables(2)

    Article Metrics

    Article views (350) PDF downloads(40) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return