Suppr超能文献

用于预测未知原发灶转移性宫颈癌原发部位的机器学习模型

Machine Learning Models to Predict Primary Sites of Metastatic Cervical Carcinoma From Unknown Primary.

作者信息

Lu Di, Jiang Jianjun, Liu Xiguang, Wang He, Feng Siyang, Shi Xiaoshun, Wang Zhizhi, Chen Zhiming, Yan Xuebin, Wu Hua, Cai Kaican

机构信息

Department of Thoracic Surgery, Nanfang Hospital, Southern Medical University, Guangzhou, China.

Department of Thoracic Surgery, Peking University Shenzhen Hospital, Shenzhen, China.

出版信息

Front Genet. 2020 Dec 21;11:614823. doi: 10.3389/fgene.2020.614823. eCollection 2020.

Abstract

Metastatic cervical carcinoma from unknown primary (MCCUP) accounts for 1-4% of all head and neck tumors, and identifying the primary site in MCCUP is challenging. The most common histopathological type of MCCUP is squamous cell carcinoma (SCC), and it remains difficult to identify the primary site pathologically. Therefore, it seems necessary and urgent to develop novel and effective methods to determine the primary site in MCCUP. In the present study, the RNA sequencing data of four types of SCC and Pan-Cancer from the cancer genome atlas (TCGA) were obtained. And after data pre-processing, their differentially expressed genes (DEGs) were identified, respectively. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis indicated that these significantly changed genes of four types of SCC share lots of similar molecular functions and histological features. Then three machine learning models, [Random Forest (RF), support vector machine (SVM), and neural network (NN)] which consisted of ten genes to distinguish these four types of SCC were developed. Among the three models with prediction tests, the RF model worked best in the external validation set, with an overall predictive accuracy of 88.2%, sensitivity of 88.71%, and specificity of 95.42%. The NN model is the second in efficacy, with an overall accuracy of 82.02%, sensitivity of 81.23%, and specificity of 93.04%. The SVM model is the last, with an overall accuracy of 76.69%, sensitivity of 74.81%, and specificity of 90.84%. The present analysis of similarities and differences among the four types of SCC, and novel models developments for distinguishing four types of SCC with informatics methods shed lights on precision MCCUP diagnosis in the future.

摘要

原发灶不明的转移性宫颈癌(MCCUP)占所有头颈肿瘤的1%-4%,确定MCCUP的原发部位具有挑战性。MCCUP最常见的组织病理学类型是鳞状细胞癌(SCC),从病理学上确定原发部位仍然困难。因此,开发新的有效方法来确定MCCUP的原发部位似乎是必要且紧迫的。在本研究中,获取了癌症基因组图谱(TCGA)中四种SCC和泛癌的RNA测序数据。经过数据预处理后,分别鉴定了它们的差异表达基因(DEG)。基因本体(GO)和京都基因与基因组百科全书(KEGG)通路分析表明,这四种SCC的这些显著变化的基因具有许多相似的分子功能和组织学特征。然后开发了由十个基因组成的三种机器学习模型[随机森林(RF)、支持向量机(SVM)和神经网络(NN)]来区分这四种SCC。在进行预测测试的三种模型中,RF模型在外部验证集中表现最佳,总体预测准确率为88.2%,灵敏度为88.71%,特异性为95.42%。NN模型的效果次之,总体准确率为82.02%,灵敏度为81.23%,特异性为93.04%。SVM模型排在最后,总体准确率为76.69%,灵敏度为74.81%,特异性为90.84%。目前对四种SCC异同的分析以及用信息学方法区分四种SCC的新模型开发为未来精准的MCCUP诊断提供了思路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/345750d45709/fgene-11-614823-g001.jpg

相似文献

1
Machine Learning Models to Predict Primary Sites of Metastatic Cervical Carcinoma From Unknown Primary.
Front Genet. 2020 Dec 21;11:614823. doi: 10.3389/fgene.2020.614823. eCollection 2020.
4
Identification of genes and pathways involved in kidney renal clear cell carcinoma.
BMC Bioinformatics. 2014;15 Suppl 17(Suppl 17):S2. doi: 10.1186/1471-2105-15-S17-S2. Epub 2014 Dec 16.
5
PreMSIm: An R package for predicting microsatellite instability from the expression profiling of a gene panel in cancer.
Comput Struct Biotechnol J. 2020 Mar 19;18:668-675. doi: 10.1016/j.csbj.2020.03.007. eCollection 2020.
7
Identification of the functional alteration signatures across different cancer types with support vector machine and feature analysis.
Biochim Biophys Acta Mol Basis Dis. 2018 Jun;1864(6 Pt B):2218-2227. doi: 10.1016/j.bbadis.2017.12.026. Epub 2017 Dec 19.
8
Glioma stages prediction based on machine learning algorithm combined with protein-protein interaction networks.
Genomics. 2020 Jan;112(1):837-847. doi: 10.1016/j.ygeno.2019.05.024. Epub 2019 May 29.
9
Machine Learning Models of Survival Prediction in Trauma Patients.
J Clin Med. 2019 Jun 5;8(6):799. doi: 10.3390/jcm8060799.
10
Outcome prediction of intracranial aneurysm treatment by flow diverters using machine learning.
Neurosurg Focus. 2018 Nov 1;45(5):E7. doi: 10.3171/2018.8.FOCUS18332.

引用本文的文献

1
An improved random forest algorithm for tracing the origin of metastatic renal cancer tissues.
Arch Med Sci. 2023 Jul 11;21(3):789-801. doi: 10.5114/aoms/168973. eCollection 2025.
2
Recent advances in applications of machine learning in cervical cancer research: a focus on prediction models.
Obstet Gynecol Sci. 2025 Jul;68(4):247-259. doi: 10.5468/ogs.25041. Epub 2025 May 29.
3
Enhanced Immunohistochemistry Interpretation with a Machine Learning-Based Expert System.
Diagnostics (Basel). 2024 Aug 24;14(17):1853. doi: 10.3390/diagnostics14171853.

本文引用的文献

2
Cyclin B2 Overexpression in Human Hepatocellular Carcinoma is Associated with Poor Prognosis.
Arch Med Res. 2019 Jan;50(1):10-17. doi: 10.1016/j.arcmed.2019.03.003. Epub 2019 Apr 4.
4
6
Inhibition of BUB1 Kinase by BAY 1816032 Sensitizes Tumor Cells toward Taxanes, ATR, and PARP Inhibitors and .
Clin Cancer Res. 2019 Feb 15;25(4):1404-1414. doi: 10.1158/1078-0432.CCR-18-0628. Epub 2018 Nov 14.
9
Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas.
Cell Rep. 2018 Apr 3;23(1):194-212.e6. doi: 10.1016/j.celrep.2018.03.063.
10
Serum and urine metabolomics study reveals a distinct diagnostic model for cancer cachexia.
J Cachexia Sarcopenia Muscle. 2018 Feb;9(1):71-85. doi: 10.1002/jcsm.12246. Epub 2017 Nov 19.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验