• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测未知原发灶转移性宫颈癌原发部位的机器学习模型

Machine Learning Models to Predict Primary Sites of Metastatic Cervical Carcinoma From Unknown Primary.

作者信息

Lu Di, Jiang Jianjun, Liu Xiguang, Wang He, Feng Siyang, Shi Xiaoshun, Wang Zhizhi, Chen Zhiming, Yan Xuebin, Wu Hua, Cai Kaican

机构信息

Department of Thoracic Surgery, Nanfang Hospital, Southern Medical University, Guangzhou, China.

Department of Thoracic Surgery, Peking University Shenzhen Hospital, Shenzhen, China.

出版信息

Front Genet. 2020 Dec 21;11:614823. doi: 10.3389/fgene.2020.614823. eCollection 2020.

DOI:10.3389/fgene.2020.614823
PMID:33408743
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7779672/
Abstract

Metastatic cervical carcinoma from unknown primary (MCCUP) accounts for 1-4% of all head and neck tumors, and identifying the primary site in MCCUP is challenging. The most common histopathological type of MCCUP is squamous cell carcinoma (SCC), and it remains difficult to identify the primary site pathologically. Therefore, it seems necessary and urgent to develop novel and effective methods to determine the primary site in MCCUP. In the present study, the RNA sequencing data of four types of SCC and Pan-Cancer from the cancer genome atlas (TCGA) were obtained. And after data pre-processing, their differentially expressed genes (DEGs) were identified, respectively. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis indicated that these significantly changed genes of four types of SCC share lots of similar molecular functions and histological features. Then three machine learning models, [Random Forest (RF), support vector machine (SVM), and neural network (NN)] which consisted of ten genes to distinguish these four types of SCC were developed. Among the three models with prediction tests, the RF model worked best in the external validation set, with an overall predictive accuracy of 88.2%, sensitivity of 88.71%, and specificity of 95.42%. The NN model is the second in efficacy, with an overall accuracy of 82.02%, sensitivity of 81.23%, and specificity of 93.04%. The SVM model is the last, with an overall accuracy of 76.69%, sensitivity of 74.81%, and specificity of 90.84%. The present analysis of similarities and differences among the four types of SCC, and novel models developments for distinguishing four types of SCC with informatics methods shed lights on precision MCCUP diagnosis in the future.

摘要

原发灶不明的转移性宫颈癌(MCCUP)占所有头颈肿瘤的1%-4%,确定MCCUP的原发部位具有挑战性。MCCUP最常见的组织病理学类型是鳞状细胞癌(SCC),从病理学上确定原发部位仍然困难。因此,开发新的有效方法来确定MCCUP的原发部位似乎是必要且紧迫的。在本研究中,获取了癌症基因组图谱(TCGA)中四种SCC和泛癌的RNA测序数据。经过数据预处理后,分别鉴定了它们的差异表达基因(DEG)。基因本体(GO)和京都基因与基因组百科全书(KEGG)通路分析表明,这四种SCC的这些显著变化的基因具有许多相似的分子功能和组织学特征。然后开发了由十个基因组成的三种机器学习模型[随机森林(RF)、支持向量机(SVM)和神经网络(NN)]来区分这四种SCC。在进行预测测试的三种模型中,RF模型在外部验证集中表现最佳,总体预测准确率为88.2%,灵敏度为88.71%,特异性为95.42%。NN模型的效果次之,总体准确率为82.02%,灵敏度为81.23%,特异性为93.04%。SVM模型排在最后,总体准确率为76.69%,灵敏度为74.81%,特异性为90.84%。目前对四种SCC异同的分析以及用信息学方法区分四种SCC的新模型开发为未来精准的MCCUP诊断提供了思路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/88f88096e0a1/fgene-11-614823-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/345750d45709/fgene-11-614823-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/55ca4ef72885/fgene-11-614823-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/fea55cd1462d/fgene-11-614823-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/88f88096e0a1/fgene-11-614823-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/345750d45709/fgene-11-614823-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/55ca4ef72885/fgene-11-614823-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/fea55cd1462d/fgene-11-614823-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/7779672/88f88096e0a1/fgene-11-614823-g004.jpg

相似文献

1
Machine Learning Models to Predict Primary Sites of Metastatic Cervical Carcinoma From Unknown Primary.用于预测未知原发灶转移性宫颈癌原发部位的机器学习模型
Front Genet. 2020 Dec 21;11:614823. doi: 10.3389/fgene.2020.614823. eCollection 2020.
2
Machine learning models predict the primary sites of head and neck squamous cell carcinoma metastases based on DNA methylation.机器学习模型基于DNA甲基化预测头颈部鳞状细胞癌转移的原发部位。
J Pathol. 2022 Apr;256(4):378-387. doi: 10.1002/path.5845. Epub 2022 Jan 20.
3
Personal Health Information Inference Using Machine Learning on RNA Expression Data from Patients With Cancer: Algorithm Validation Study.利用癌症患者 RNA 表达数据进行机器学习的个人健康信息推断:算法验证研究。
J Med Internet Res. 2020 Aug 10;22(8):e18387. doi: 10.2196/18387.
4
Identification of genes and pathways involved in kidney renal clear cell carcinoma.肾透明细胞癌相关基因和通路的鉴定
BMC Bioinformatics. 2014;15 Suppl 17(Suppl 17):S2. doi: 10.1186/1471-2105-15-S17-S2. Epub 2014 Dec 16.
5
PreMSIm: An R package for predicting microsatellite instability from the expression profiling of a gene panel in cancer.PreMSIm:一个用于通过癌症中基因面板的表达谱预测微卫星不稳定性的R包。
Comput Struct Biotechnol J. 2020 Mar 19;18:668-675. doi: 10.1016/j.csbj.2020.03.007. eCollection 2020.
6
Textural differences between renal cell carcinoma subtypes: Machine learning-based quantitative computed tomography texture analysis with independent external validation.肾细胞癌亚型的纹理差异:基于机器学习的定量 CT 纹理分析及独立外部验证。
Eur J Radiol. 2018 Oct;107:149-157. doi: 10.1016/j.ejrad.2018.08.014. Epub 2018 Aug 16.
7
Identification of the functional alteration signatures across different cancer types with support vector machine and feature analysis.基于支持向量机和特征分析鉴定不同癌症类型中的功能改变特征。
Biochim Biophys Acta Mol Basis Dis. 2018 Jun;1864(6 Pt B):2218-2227. doi: 10.1016/j.bbadis.2017.12.026. Epub 2017 Dec 19.
8
Glioma stages prediction based on machine learning algorithm combined with protein-protein interaction networks.基于机器学习算法与蛋白质相互作用网络的脑胶质瘤分期预测。
Genomics. 2020 Jan;112(1):837-847. doi: 10.1016/j.ygeno.2019.05.024. Epub 2019 May 29.
9
Machine Learning Models of Survival Prediction in Trauma Patients.创伤患者生存预测的机器学习模型
J Clin Med. 2019 Jun 5;8(6):799. doi: 10.3390/jcm8060799.
10
Outcome prediction of intracranial aneurysm treatment by flow diverters using machine learning.使用机器学习对颅内动脉瘤血管内治疗的结果预测。
Neurosurg Focus. 2018 Nov 1;45(5):E7. doi: 10.3171/2018.8.FOCUS18332.

引用本文的文献

1
An improved random forest algorithm for tracing the origin of metastatic renal cancer tissues.一种用于追踪转移性肾癌组织起源的改进随机森林算法。
Arch Med Sci. 2023 Jul 11;21(3):789-801. doi: 10.5114/aoms/168973. eCollection 2025.
2
Recent advances in applications of machine learning in cervical cancer research: a focus on prediction models.机器学习在宫颈癌研究中的应用新进展:聚焦预测模型
Obstet Gynecol Sci. 2025 Jul;68(4):247-259. doi: 10.5468/ogs.25041. Epub 2025 May 29.
3
Enhanced Immunohistochemistry Interpretation with a Machine Learning-Based Expert System.

本文引用的文献

1
Random forest can accurately predict the development of end-stage renal disease in immunoglobulin a nephropathy patients.随机森林可以准确预测免疫球蛋白A肾病患者终末期肾病的发展。
Ann Transl Med. 2019 Jun;7(11):234. doi: 10.21037/atm.2018.12.11.
2
Cyclin B2 Overexpression in Human Hepatocellular Carcinoma is Associated with Poor Prognosis.Cyclin B2 在人肝癌中的过表达与不良预后相关。
Arch Med Res. 2019 Jan;50(1):10-17. doi: 10.1016/j.arcmed.2019.03.003. Epub 2019 Apr 4.
3
Application of an Artificial Neural Network in the Diagnosis of Chronic Lymphocytic Leukemia.
基于机器学习的专家系统增强免疫组织化学解读
Diagnostics (Basel). 2024 Aug 24;14(17):1853. doi: 10.3390/diagnostics14171853.
4
Identification of novel biomarkers to distinguish clear cell and non-clear cell renal cell carcinoma using bioinformatics and machine learning.利用生物信息学和机器学习鉴定新型生物标志物以区分肾透明细胞癌和非透明细胞癌。
PLoS One. 2024 Jun 10;19(6):e0305252. doi: 10.1371/journal.pone.0305252. eCollection 2024.
5
Classification of tumor types using XGBoost machine learning model: a vector space transformation of genomic alterations.使用 XGBoost 机器学习模型对肿瘤类型进行分类:基因组改变的向量空间变换。
J Transl Med. 2023 Nov 21;21(1):836. doi: 10.1186/s12967-023-04720-4.
人工神经网络在慢性淋巴细胞白血病诊断中的应用。
Cureus. 2019 Feb 4;11(2):e4004. doi: 10.7759/cureus.4004.
4
CDC20 contributes to the development of human cutaneous squamous cell carcinoma through the Wnt/β‑catenin signaling pathway.CDC20 通过 Wnt/β-连环蛋白信号通路促进人类皮肤鳞状细胞癌的发展。
Int J Oncol. 2019 May;54(5):1534-1544. doi: 10.3892/ijo.2019.4727. Epub 2019 Feb 27.
5
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.STRING v11:具有增强覆盖范围的蛋白质-蛋白质相互作用网络,支持在全基因组实验数据集的功能发现。
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613. doi: 10.1093/nar/gky1131.
6
Inhibition of BUB1 Kinase by BAY 1816032 Sensitizes Tumor Cells toward Taxanes, ATR, and PARP Inhibitors and .BAY 1816032 抑制 BUB1 激酶可增强肿瘤细胞对紫杉烷类、ATR 和 PARP 抑制剂的敏感性。
Clin Cancer Res. 2019 Feb 15;25(4):1404-1414. doi: 10.1158/1078-0432.CCR-18-0628. Epub 2018 Nov 14.
7
MAD2 Combined with Mitotic Spindle Apparatus (MSA) and Anticentromere Antibody (ACA) for Diagnosis of Small Cell Lung Cancer (SCLC).MAD2 联合有丝分裂纺锤体装置(MSA)和着丝粒抗体(ACA)用于诊断小细胞肺癌(SCLC)。
Med Sci Monit. 2018 Oct 22;24:7541-7547. doi: 10.12659/MSM.909772.
8
Identification of target gene and prognostic evaluation for lung adenocarcinoma using gene expression meta-analysis, network analysis and neural network algorithms.基于基因表达荟萃分析、网络分析和神经网络算法鉴定肺腺癌的靶基因和预后评估。
J Biomed Inform. 2018 Oct;86:120-134. doi: 10.1016/j.jbi.2018.09.004. Epub 2018 Sep 6.
9
Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas.基因组、通路网络和免疫特征区分鳞状细胞癌。
Cell Rep. 2018 Apr 3;23(1):194-212.e6. doi: 10.1016/j.celrep.2018.03.063.
10
Serum and urine metabolomics study reveals a distinct diagnostic model for cancer cachexia.血清和尿液代谢组学研究揭示了癌症恶病质的独特诊断模型。
J Cachexia Sarcopenia Muscle. 2018 Feb;9(1):71-85. doi: 10.1002/jcsm.12246. Epub 2017 Nov 19.