支持向量机在基于癌症生物标志物对患者群体进行分层中发挥作用吗？

Do Support Vector Machines Play a Role in Stratifying Patient Population Based on Cancer Biomarkers?

作者信息

Lanza Ben, Parashar Deepak

机构信息

Statistics and Epidemiology Unit, Warwick Medical School, University of Warwick, Coventry, UK.

Warwick Cancer Research Centre, University of Warwick, Coventry, UK.

出版信息

Arch Proteom Bioinform. 2021;2(1):20-38.

PMID:34778890

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7611982/

Abstract

Biomarkers are known to be the key driver behind targeted cancer therapies by either stratifying the patients into risk categories or identifying patient subgroups most likely to benefit. However, the ability of a biomarker to stratify patients relies heavily on the type of clinical endpoint data being collected. Of particular interest is the scenario when the biomarker involved is a continuous one where the challenge is often to identify cut-offs or thresholds that would stratify the population according to the level of clinical outcome or treatment benefit. On the other hand, there are well-established Machine Learning (ML) methods such as the Support Vector Machines (SVM) that classify data, both linear as well as non-linear, into subgroups in an optimal way. SVMs have proven to be immensely useful in data-centric engineering and recently researchers have also sought its applications in healthcare. Despite their wide applicability, SVMs are not yet in the mainstream of toolkits to be utilised in observational clinical studies or in clinical trials. This research investigates the very role of SVMs in stratifying the patient population based on a continuous biomarker across a variety of datasets. Based on the mathematical framework underlying SVMs, we formulate and fit algorithms in the context of biomarker stratified cancer datasets to evaluate their merits. The analysis reveals their superior performance for certain data-types when compared to other ML methods suggesting that SVMs may have the potential to provide a robust yet simplistic solution to stratify real cancer patients based on continuous biomarkers, and hence accelerate the identification of subgroups for improved clinical outcomes or guide targeted cancer therapies.

摘要

生物标志物被认为是靶向癌症治疗背后的关键驱动因素，它可以将患者分层到不同风险类别中，或者识别出最有可能受益的患者亚组。然而，生物标志物对患者进行分层的能力在很大程度上依赖于所收集的临床终点数据的类型。特别值得关注的是这样一种情况，即所涉及的生物标志物是连续型的，此时面临的挑战通常是确定能够根据临床结果水平或治疗获益程度对人群进行分层的临界值或阈值。另一方面，有一些成熟的机器学习（ML）方法，如支持向量机（SVM），它能够以最优方式将线性和非线性数据分类到不同子组中。支持向量机已被证明在以数据为中心的工程领域非常有用，最近研究人员也在探索其在医疗保健领域的应用。尽管其适用性广泛，但支持向量机尚未成为用于观察性临床研究或临床试验的主流工具包。本研究调查了支持向量机在基于连续生物标志物对不同数据集的患者群体进行分层方面所起的作用。基于支持向量机的数学框架，我们在生物标志物分层的癌症数据集背景下制定并拟合算法，以评估它们的优点。分析表明，与其他机器学习方法相比，支持向量机在某些数据类型上具有卓越的性能，这表明支持向量机有可能提供一个强大而简单的解决方案，用于基于连续生物标志物对真实癌症患者进行分层，从而加速亚组的识别以改善临床结果或指导靶向癌症治疗。

相似文献

Do Support Vector Machines Play a Role in Stratifying Patient Population Based on Cancer Biomarkers?

Arch Proteom Bioinform. 2021;2(1):20-38.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Semi-supervised clinical text classification with Laplacian SVMs: an application to cancer case management.

J Biomed Inform. 2013 Oct;46(5):869-75. doi: 10.1016/j.jbi.2013.06.014. Epub 2013 Jul 8.

New Machine Learning Applications to Accelerate Personalized Medicine in Breast Cancer: Rise of the Support Vector Machines.

OMICS. 2020 May;24(5):241-246. doi: 10.1089/omi.2020.0001. Epub 2020 Mar 31.

Condensed vector machines: learning fast machine for large data.

IEEE Trans Neural Netw. 2010 Dec;21(12):1903-14. doi: 10.1109/TNN.2010.2079947. Epub 2010 Oct 18.

Advances with support vector machines for novel drug discovery.

Expert Opin Drug Discov. 2019 Jan;14(1):23-33. doi: 10.1080/17460441.2019.1549033. Epub 2018 Nov 29.

(Machine-)Learning to analyze in vivo microscopy: Support vector machines.

Biochim Biophys Acta Proteins Proteom. 2017 Nov;1865(11 Pt B):1719-1727. doi: 10.1016/j.bbapap.2017.09.013. Epub 2017 Sep 30.

Deep Support Vector Machines for the Identification of Stress Condition from Electrodermal Activity.

Int J Neural Syst. 2020 Jul;30(7):2050031. doi: 10.1142/S0129065720500318. Epub 2020 Jun 5.

The future of Cochrane Neonatal.

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

Vicinal support vector classifier using supervised kernel-based clustering.

Artif Intell Med. 2014 Mar;60(3):189-96. doi: 10.1016/j.artmed.2014.01.003. Epub 2014 Feb 7.

引用本文的文献

Design and analysis of umbrella trials: Where do we stand?

Front Med (Lausanne). 2022 Oct 12;9:1037439. doi: 10.3389/fmed.2022.1037439. eCollection 2022.

本文引用的文献

Applications of Support Vector Machine (SVM) Learning in Cancer Genomics.

Cancer Genomics Proteomics. 2018 Jan-Feb;15(1):41-51. doi: 10.21873/cgp.20063.

Prognostic and predictive biomarkers in prostate cancer: latest evidence and clinical implications.

Ther Adv Med Oncol. 2017 Aug;9(8):565-573. doi: 10.1177/1758834017719215. Epub 2017 Jul 5.

p53 mutations in cancer.

Nat Cell Biol. 2013 Jan;15(1):2-8. doi: 10.1038/ncb2641.

Problems with risk reclassification methods for evaluating prediction models.

Am J Epidemiol. 2011 Jun 1;173(11):1327-35. doi: 10.1093/aje/kwr013. Epub 2011 May 9.

Integrating biomarkers in clinical trials.

Expert Rev Mol Diagn. 2011 Mar;11(2):171-82. doi: 10.1586/erm.10.120.

Are random forests better than support vector machines for microarray-based cancer classification?

AMIA Annu Symp Proc. 2007 Oct 11;2007:686-90.

Prognostic versus predictive value of biomarkers in oncology.

Eur J Cancer. 2008 May;44(7):946-53. doi: 10.1016/j.ejca.2008.03.006. Epub 2008 Apr 7.

Selecting differentially expressed genes from microarray experiments.

Biometrics. 2003 Mar;59(1):133-42. doi: 10.1111/1541-0420.00016.

Free/total PSA ratio is a powerful predictor of future prostate cancer morbidity in men with initial PSA levels of 4.1 to 10.0 ng/mL.

Urology. 2003 Apr;61(4):760-4. doi: 10.1016/s0090-4295(02)02427-5.

Drug design by machine learning: support vector machines for pharmaceutical data analysis.

Comput Chem. 2001 Dec;26(1):5-14. doi: 10.1016/s0097-8485(01)00094-8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

支持向量机在基于癌症生物标志物对患者群体进行分层中发挥作用吗？

Do Support Vector Machines Play a Role in Stratifying Patient Population Based on Cancer Biomarkers?

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献