Liu Linjing, Chen Xingjian, Wong Ka-Chun
Department of Computer Science, City University of Hong Kong, Hong Kong, China.
Hong Kong Institute for Data Science, City University of Hong Kong, Hong Kong, China.
Bioinformatics. 2021 Oct 11;37(19):3099-3105. doi: 10.1093/bioinformatics/btab236.
Early cancer detection is significant for patient mortality rate reduction. Although machine learning has been widely employed in that context, there are still deficiencies. In this work, we studied different machine learning algorithms for early cancer detection and proposed an Adaptive Support Vector Machine (ASVM) method by synergizing Shuffled Frog Leaping Algorithm and Support Vector Machine (SVM) in this study.
Since ASVM regulates SVM for parameter adaption based on data characteristics, the experimental results reflected the robust generalization capability of ASVM on different datasets under different settings; for instance, ASVM can enhance the sensitivity by over 10% for early cancer detection compared with SVM. Besides, our proposed ASVM outperformed Grid Search + SVM and Random Search + SVM by significant margins in terms of the area under the ROC curve (AUC) (0.938 versus 0.922 versus 0.921).
The proposed algorithm and dataset are available at https://github.com/ElaineLIU-920/ASVM-for-Early-Cancer-Detection.
Supplementary data are available at Bioinformatics online.
早期癌症检测对于降低患者死亡率具有重要意义。尽管机器学习已在该领域广泛应用,但仍存在不足。在这项工作中,我们研究了用于早期癌症检测的不同机器学习算法,并通过将洗牌蛙跳算法与支持向量机(SVM)相结合,提出了一种自适应支持向量机(ASVM)方法。
由于ASVM基于数据特征对SVM进行参数自适应调节,实验结果反映了ASVM在不同设置下对不同数据集具有强大的泛化能力;例如,与SVM相比,ASVM在早期癌症检测中可将灵敏度提高10%以上。此外,我们提出的ASVM在ROC曲线下面积(AUC)方面显著优于网格搜索+SVM和随机搜索+SVM(分别为0.938、0.922和0.921)。
所提出的算法和数据集可在https://github.com/ElaineLIU-920/ASVM-for-Early-Cancer-Detection获取。
补充数据可在《生物信息学》在线获取。