Volume 31 Issue 1
Jan.  2022
Turn off MathJax
Article Contents
DUAN Hua, FENG Tong, LIU Songning, ZHANG Yulin, SU Jionglong. Tumor Classification of Gene Expression Data by Fuzzy Hybrid Twin SVM[J]. Chinese Journal of Electronics, 2022, 31(1): 99-106. doi: 10.1049/cje.2020.00.260
Citation: DUAN Hua, FENG Tong, LIU Songning, ZHANG Yulin, SU Jionglong. Tumor Classification of Gene Expression Data by Fuzzy Hybrid Twin SVM[J]. Chinese Journal of Electronics, 2022, 31(1): 99-106. doi: 10.1049/cje.2020.00.260

Tumor Classification of Gene Expression Data by Fuzzy Hybrid Twin SVM

doi: 10.1049/cje.2020.00.260
Funds:  This work was supported by the National Natural Science Foundation of China (U1931207, 61702306), Sci. & Tech. Development Fund of Shandong Province of China (ZR2017BF015, ZR2017MF027), the Humanities and Social Science Research Project of the Ministry of Education (18YJAZH017), the Taishan Scholar Program of Shandong Province, SDUST Research Fund (2015TDJH102, 2019KJN024), and National Statistical Science Research Project in 2019 (2019LY49)
More Information
  • Author Bio:

    received the Ph.D. degree in applied mathematics from Shanghai Jiaotong University, Shanghai, China, in 2008. She is currently a Professor with Shandong University of Science and Technology, Qingdao, China. Her current research interests include Petri nets, process mining, and machine learning. (Email: huaduan59@163.com)

    is currently a master candidate of mathematics and systems science in Shandong University of Science and Technology, Qingdao, China. His research interests include bioinformatics and deep learning. (Email: fengtong_666@163.com)

    is studying for a bachelor’s degree at Shandong University of Science and Technology, Qingdao, China. His current research interests include big data and machine learning. (Email: lsongning@163.com)

    (corresponding author) received the Ph.D. degree in computer software and theory from Shandong University of Science and Technology, Qingdao, China. He is currently an Associate Professor at College of Mathematics and Systems Science, Shandong University of Science and Technology. His research interests include bioinformatics and system biology. (Email: zhangyulin@sdust.edu.cn)

    holds a Ph.D. degree in statistics (Warwick) and a Ph.D. degree in automatic control and systems engineering (Sheffield). He is currently the Deputy Dean of School of Artificial Intelligence and Advanced Computing, XJTLU Entrepreneur College (Taicang). His research interest include bioinformatics, artificial intelligence, and medical image processing. (Email: Jionglong.Su@xjtlu.edu.cn)

  • Received Date: 2020-08-21
  • Accepted Date: 2021-03-31
  • Available Online: 2021-10-19
  • Publish Date: 2022-01-05
  • A new classification model, the fuzzy hybrid twin support vector machine (TWSVM), namely FHTWSVM, is proposed by combining the fuzzy TWSVM and the hypersphere support vector machine (SVM). The hypersphere SVM is utilized for generating the hyperspheres for the positive and negative class with the smallest possible radius, so that the hyperspheres can contain as many samples as possible. The samples which the hyperspheres cover form a new sample set. Furthermore a distance-based fuzzy function is utilized to calculate the fuzzy factors for the samples. Finally FHTWSVM is used to train all samples with the parameters optimized by grid search. This method can maximize intra-class clustering for noise removal and reduce the influence of outliers. To demonstrate the superiority of the performance of FHTWSVM over other classifiers, e.g., KNN, RF, Bayesian, TWSVM, AdaBoost and XGBoost, a series of experiments is conducted using eight gene expression datasets. The evaluation results show that the proposed approach can improve the classification performance as well as reduce prediction errors for the datasets.
  • loading
  • [1]
    C. Corinna and V. Vladimir, “Support vector network,” Mach Learn, vol.20, no.3, pp.273–97, 1995.
    V.N. Vapnik, “An overview of statistical learning theory,” IEEE Transactions on Neural Networks, vol.10, no.5, pp.988–999, 1999. doi: 10.1109/72.788640
    N. Cristianini, John, and Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernel-based Learning Methods, Cambridge: Cambridge University Press, 2000.
    King-Shy Goh, Edward Chang, and Kwang-Ting Cheng, “SVM binary classifier ensembles for image classification,” in Proc. of the Tenth International Conference on Information and Knowledge Management, Atlanta Georgia, pp.395−402, 2001.
    S. Koda, A. Zeggada, F. Meigani, et al., “Spatial and structured SVM for multilabel image classification,” IEEE Transactions on Geoscience & Remote Sensing, vol.56, no.10, pp.5948–5960, 2018.
    J. Liu, Z. Wang, and X. Xiao, “A hybrid SVM decision fusion modeling for robust continuous digital speech recognition,” Pattern Recognition Letters, vol.28, no.8, pp.912–920, 2007. doi: 10.1016/j.patrec.2006.12.007
    D. H. Al-Nuaimi, I. A. Hashim, I. Abidin, et al., “Performance of feature-based techniques for automatic digital modulation recognition and classification—A review,” Electronics, vol.8, no.12, article no.1407, 2019.
    M. Sheikhan, M. Bejani, and D. Gharavian, “Modular neural-SVM scheme for speech emotion recognition using anova feature selection method,” Neural Computing & Applications, vol.23, no.1, pp.215–227, 2013.
    X. Zhang, X. Liu, and Z. J. Wang, “Evaluation of a set of new ORF kernel functions of SVM for speech recognition,” Engineering Applications of Artificial Intelligence, vol.26, no.10, pp.2574–2580, 2013. doi: 10.1016/j.engappai.2013.04.008
    S. Wang, J. Wang, H. Chen et al., “SVM-based tumor classification with gene expression data,” in Proc. of the Second International Conference on Advanced Data Mining and Applications, Xi’an, pp.864−870, 2006.
    J. Sachdeva, V. Kumar, I. Gupta, et al., “Multiclass brain tumor classification using GA-SVM”, IEEE 4th International Conference on Developments in E-systems Engineering, Dubai, pp.182−187, 2011.
    O. L. Mangasarian and E. W. Wild, “Multisurface proximal support vector machine classfication via generalized eigenvalues,” IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.28, no.1, pp.69–74, 2006.
    R. Khemchandani, Jayadeva, and S. Chandra, “Twin support vector machine for pattern classification,” IEEE Trans. on Pattern Analysis and Machine Intelligence, vol.29, no.5, pp.905–910, 2007. doi: 10.1109/TPAMI.2007.1068
    M. A. Kumar and M. Gopal, “Application of smoothing technique on twin support vector machines,” Pattern Recognition Letters, vol.29, pp.1842–1848, 2008. doi: 10.1016/j.patrec.2008.05.016
    M. A. Kumar and M. Gopal, “Least squares twin support vector machines for pattern classification,” Expert Systems with Applications, vol.36, no.4, pp.7535–7543, 2009. doi: 10.1016/j.eswa.2008.09.066
    Y. Shao, C. Zhang, X. Wang, et al., “Improvements on twin support vector machines,” IEEE Transactions on Neural Networks, vol.22, no.6, pp.962–968, 2011. doi: 10.1109/TNN.2011.2130540
    Q. Ye, C. Zhao, N. Ye, et al., “Localized twin svm via convex minimization,” Neurocomputing, vol.74, no.4, pp.580–587, 2011. doi: 10.1016/j.neucom.2010.09.015
    D. Tomar, S. Singhal, and S. Agarwal, “Weighted least square twin support vector machine for imbalanced dataset,” International Journal of Database Theory & Application, vol.7, no.2, pp.25–36, 2014.
    T. Divya and A. Sonali, “Feature selection based least square twin support vector machine for diagnosis of heart disease,” International Journal of Bio-Science and Bio-Technology, vol.6, no.2, pp.69–82, 2014. doi: 10.14257/ijbsbt.2014.6.2.07
    J. A. Nasiri, N. M. Charkari, and K. Mozafari, “Energy-based model of least squares twin support vector machines for human action recognition,” Signal Processing, vol.104, no.6, pp.248–257, 2014.
    Tomar, Divya, D. Ojha, and S. Agarwal, “An emotion detection system based on multi least squares twin support vector machine,” Advances in Artificial Intelligence, vol.7, no.4, pp.197–208, 2014.
    J. A. Nasiri, N. M. Charkari, and S. Jalili, “Least squares twin multi-class classification support vector machine,” Pattern Recognition, vol.48, no.3, pp.984–992, 2015. doi: 10.1016/j.patcog.2014.09.020
    D. Tomar and S. Agarwal, “Twin support vector machine: A review from 2007 to 2014,” Egyptian Informatics Journal, vol.16, no.1, pp.55–69, 2015. doi: 10.1016/j.eij.2014.12.003
    K. B. Duan, J. C. Rajapakse, H. Wang, et al., “Multiple SVM-RFE for gene selection in cancer classification with expression data,” IEEE Trans. on Nanobioscience, vol.4, no.3, pp.228–234, 2005. doi: 10.1109/TNB.2005.853657
    A. Brazma, P. Hingamp, J. Quackenbush, et al., “Minimum information about a microarray experiment (MIAME)—Toward standards for microarray data,” Nature Genetic, vol.29, pp.365–371, 2001. doi: 10.1038/ng1201-365
    M. A. Shipp, K. N. Ross, P. Tamayo, et al., “Diffuse large b-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning,” Nature Medicine, vol.8, no.1, pp.68–74, 2002. doi: 10.1038/nm0102-68
    T. R. Golub, D. K. Slonim, P. Tamayo, et al., “Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring,” Science, vol.286, no.5439, pp.531–537, 1999. doi: 10.1126/science.286.5439.531
    D. A. Wigle, I. Jurisica, N. Radulovich, et al., “Molecular profiling of non-small cell lung cancer and correlation with disease-free survival,” Cancer Research, vol.62, no.11, pp.3005–3008, 2002.
    D. Singh, “Gene expression correlates of clinical prostate cancer behavior,” Cancer Cell, vol.1, no.2, pp.203–209, 2002. doi: 10.1016/S1535-6108(02)00030-2
    P. A. Northcott, A. Korshunov, H. Witt, et al., “Medulloblastoma comprises four distinct molecular variants,” J. of Clinical Oncology, vol.29, no.11, pp.1408–1414, 2011. doi: 10.1200/JCO.2009.27.4324
    S. L. Pomeroy, Pablo, and Tamayo, “Prediction of central nervous system embryonal tumour outcome based on gene expression,” Nature, vol.415, no.6870, pp.436–442, 2002. doi: 10.1038/415436a
    C. L. Nutt, D. R. Mani, R. A. Betensky, et al., “Gene expression-based classification of malignant gliomas correlates better with survival than histological classification,” Cancer Research, vol.63, no.7, pp.1602–1607, 2003.
    G. Roffo, S. Melzi, Castellani, et al., “Infinite latent feature selection: A probabilistic latent graph-based ranking approach”, Proceedings of the IEEE International Conference on Computer Vision, Venice, pp.1398−1406, 2017.
    R. Shang, W. Wang, R. Stolkin, et al., “Non-negative spectral learning and sparse regression-based dual-graph regularized feature selection,” IEEE Transactions on Cybernetics, vol.48, no.2, pp.793–806, 2018.
    R. Shang, Y. Meng, W. Wang, et al., “Local discriminative based sparse subspace learning for feature selection,” Pattern Recognition, vol.92, pp.219–230, 2019.
    R. Shang, K. Xu, F. Shang, et al., “Sparse and low-redundant subspace learning-based dual-graph regularized robust feature selection,” Knowledge-Based Systems, vol.187, article no.104830, 2020.
    J. X. Liu and Y. L. Zhang, “An attribute-weighted bayes classifier based on asymmetric correlation coefficient,” Int. Journal of Pattern Recognition and Artificial Intelligence, vol.34, no.10, article no.2050025, 2020. doi: 10.1142/S0218001420500251
    Y. Zhang, T. Feng, S. Wang, et al., “A novel xgboost method to identify cancer tissue-of-origin based on copy number variations,” Frontiers in Genetics, vol.11, article no.585029, 2020.
  • 加载中


    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(2)  / Tables(3)

    Article Metrics

    Article views (432) PDF downloads(32) Cited by()
    Proportional views


    DownLoad:  Full-Size Img  PowerPoint