• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合转录组学和混合机器学习可实现鼻咽癌的高精度诊断建模。

Integrating transcriptomics and hybrid machine learning enables high-accuracy diagnostic modeling for nasopharyngeal carcinoma.

作者信息

Wang Hehe, Zhang Junge, Cheng Peng, Yu Lujie, Li Chunlin, Wang Yaowen

机构信息

Department of Otolaryngology, Head and Neck Surgery, The First Affiliated Hospital of Ningbo University, Ningbo, China.

Department of Anesthesiology, The First Affiliated Hospital of Ningbo University, Ningbo, China.

出版信息

Discov Oncol. 2025 Jun 12;16(1):1067. doi: 10.1007/s12672-025-02932-2.

DOI:10.1007/s12672-025-02932-2
PMID:40504346
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12162433/
Abstract

BACKGROUND

Nasopharyngeal carcinoma (NPC) lacks biomarkers demonstrating both high specificity and sensitivity for early diagnosis. This study aimed to develop robust machine learning (ML)-driven diagnostic models and identify key biomarkers through integrated analysis of multi-cohort transcriptomic data.

METHODS

Seven NPC transcriptomic datasets (GSE12452, GSE40290, GSE53819, and GSE64634 were merged to form the training cohort, while GSE13597, GSE34573, and GSE61218 served as independent external validation sets) were integrated and preprocessed using ComBat for batch effect correction. Differential expression analysis identified 293 differentially expressed genes (DEGs). Twelve ML algorithms (including Stepglm, glmBoost, and RF) were systematically combined into 113 distinct models to classify NPC versus normal tissues. Top-performing models underwent external validation. Immune infiltration patterns and functional enrichment were analyzed using CIBERSORT and GSEA/GSVA, respectively.

RESULTS

The Stepglm[both]-RF hybrid model demonstrated exceptional performance with AUCs of 0.999 (training set; 95% CI: 0.997-1.000), 1.000 (GSE61218/GSE34573 validation), and 0.960 (GSE13597 validation). The glmBoost-RF model showed comparable efficacy, achieving AUCs of 1.000 (training), 0.950 (GSE61218), 1.000 (GSE34573), and 0.947 (GSE13597). Single-gene analysis identified RCN1 as a promising diagnostic marker (AUC = 0.953), with elevated expression levels correlating with poor prognosis in head and neck squamous cell carcinoma (HNSCC; p < 0.05). Immune profiling revealed significant enrichment of M1 macrophages and concomitant reduction of memory B cells in NPC. Functional enrichment analysis associated RCN1 with cell cycle regulation and immune-related pathways.

CONCLUSION

This study establishes two high-performance ML models (Stepglm[both]-RF and glmBoost-RF) with low variability for NPC diagnosis and identifies RCN1 as a dual-function biomarker with diagnostic and prognostic potential. The findings provide a scalable framework for early NPC detection and novel insights into immune microenvironment dysregulation.

摘要

背景

鼻咽癌(NPC)缺乏对早期诊断具有高特异性和敏感性的生物标志物。本研究旨在通过对多队列转录组数据的综合分析,开发强大的机器学习(ML)驱动的诊断模型并识别关键生物标志物。

方法

将七个NPC转录组数据集(GSE12452、GSE40290、GSE53819和GSE64634合并形成训练队列,而GSE13597、GSE34573和GSE61218作为独立的外部验证集)进行整合,并使用ComBat进行预处理以校正批次效应。差异表达分析确定了293个差异表达基因(DEG)。将十二种ML算法(包括Stepglm、glmBoost和RF)系统地组合成113个不同的模型,以区分NPC组织与正常组织。对表现最佳的模型进行外部验证。分别使用CIBERSORT和GSEA/GSVA分析免疫浸润模式和功能富集情况。

结果

Stepglm[两者]-RF混合模型表现卓越,训练集的AUC为0.999(95%CI:0.997 - 1.000),在GSE61218/GSE34573验证集中为1.000,在GSE13597验证集中为0.960。glmBoost - RF模型显示出相当的效能,训练集的AUC为1.000,在GSE61218中为0.950,在GSE34573中为1.000,在GSE13597中为0.947。单基因分析确定RCN1为一个有前景的诊断标志物(AUC = 0.953),其表达水平升高与头颈部鳞状细胞癌(HNSCC)的不良预后相关(p < 0.05)。免疫图谱显示NPC中M1巨噬细胞显著富集,记忆B细胞随之减少。功能富集分析将RCN1与细胞周期调控和免疫相关途径联系起来。

结论

本研究建立了两个用于NPC诊断的高性能ML模型(Stepglm[两者]-RF和glmBoost - RF),变异性低,并确定RCN1为具有诊断和预后潜力的双功能生物标志物。这些发现为早期NPC检测提供了一个可扩展的框架,并为免疫微环境失调提供了新的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/7f4f45051b3b/12672_2025_2932_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/6e336c618590/12672_2025_2932_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/e48cd14ccedb/12672_2025_2932_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/6fdf133ef7a1/12672_2025_2932_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/987bcb70d4b8/12672_2025_2932_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/c1e9eb3be5eb/12672_2025_2932_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/dae8c4572433/12672_2025_2932_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/7f4f45051b3b/12672_2025_2932_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/6e336c618590/12672_2025_2932_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/e48cd14ccedb/12672_2025_2932_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/6fdf133ef7a1/12672_2025_2932_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/987bcb70d4b8/12672_2025_2932_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/c1e9eb3be5eb/12672_2025_2932_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/dae8c4572433/12672_2025_2932_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ba1/12162433/7f4f45051b3b/12672_2025_2932_Fig7_HTML.jpg

相似文献

1
Integrating transcriptomics and hybrid machine learning enables high-accuracy diagnostic modeling for nasopharyngeal carcinoma.整合转录组学和混合机器学习可实现鼻咽癌的高精度诊断建模。
Discov Oncol. 2025 Jun 12;16(1):1067. doi: 10.1007/s12672-025-02932-2.
2
[Screen of key characteristic genes of nasopharyngeal carcinoma (NPC) base on machine learning and analysis of their correlation with immune cells].基于机器学习的鼻咽癌关键特征基因筛选及其与免疫细胞的相关性分析
Xi Bao Yu Fen Zi Mian Yi Xue Za Zhi. 2023;39(11):988-995.
3
Lactate-related gene signatures as prognostic predictors and comprehensive analysis of immune profiles in nasopharyngeal carcinoma.乳酸相关基因特征作为鼻咽癌预后预测指标及免疫图谱的综合分析
J Transl Med. 2024 Dec 20;22(1):1116. doi: 10.1186/s12967-024-05935-9.
4
Machine learning-driven identification of exosome- related biomarkers in head and neck squamous cell carcinoma.机器学习驱动的头颈部鳞状细胞癌中外泌体相关生物标志物的识别
Front Immunol. 2025 May 22;16:1590331. doi: 10.3389/fimmu.2025.1590331. eCollection 2025.
5
Identification of DTL as Related Biomarker and Immune Infiltration Characteristics of Nasopharyngeal Carcinoma via Comprehensive Strategies.通过综合策略鉴定DTL作为鼻咽癌相关生物标志物及免疫浸润特征
Int J Gen Med. 2022 Mar 2;15:2329-2345. doi: 10.2147/IJGM.S352330. eCollection 2022.
6
Identification of Key Genes in Nasopharyngeal Carcinoma Based on Bioinformatics Analysis.基于生物信息学分析的鼻咽癌关键基因鉴定。
Comput Intell Neurosci. 2022 Jun 7;2022:9022700. doi: 10.1155/2022/9022700. eCollection 2022.
7
Construction of diagnostic and prognostic models based on gene signatures of nasopharyngeal carcinoma by machine learning methods.基于机器学习方法的鼻咽癌基因特征构建诊断和预后模型。
Transl Cancer Res. 2023 May 31;12(5):1254-1269. doi: 10.21037/tcr-22-2700. Epub 2023 Apr 10.
8
Exploring nasopharyngeal carcinoma genetics: Bioinformatics insights into pathways and gene associations.探索鼻咽癌遗传学:生物信息学对途径和基因关联的见解。
Med J Malaysia. 2024 Sep;79(5):615-625.
9
Screening and identification of key biomarkers in nasopharyngeal carcinoma: Evidence from bioinformatic analysis.鼻咽癌关键生物标志物的筛选与鉴定:来自生物信息学分析的证据
Medicine (Baltimore). 2019 Nov;98(48):e17997. doi: 10.1097/MD.0000000000017997.
10
Identification of key pathways and genes in nasopharyngeal carcinoma using bioinformatics analysis.利用生物信息学分析鉴定鼻咽癌中的关键通路和基因。
Oncol Lett. 2019 May;17(5):4683-4694. doi: 10.3892/ol.2019.10133. Epub 2019 Mar 8.

本文引用的文献

1
Downregulation of RCN1 inhibits esophageal squamous cell carcinoma progression and M2 macrophage polarization.下调 RCN1 抑制食管鳞癌细胞的进展和 M2 巨噬细胞极化。
PLoS One. 2024 May 7;19(5):e0302780. doi: 10.1371/journal.pone.0302780. eCollection 2024.
2
Celloscope: a probabilistic model for marker-gene-driven cell type deconvolution in spatial transcriptomics data.Celloscope:一种用于空间转录组学数据中基于标记基因驱动的细胞类型去卷积的概率模型。
Genome Biol. 2023 May 17;24(1):120. doi: 10.1186/s13059-023-02951-8.
3
GeneRanger and TargetRanger: processed gene and protein expression levels across cells and tissues for target discovery.
GeneRanger 和 TargetRanger:用于目标发现的跨细胞和组织的基因和蛋白质表达水平的处理。
Nucleic Acids Res. 2023 Jul 5;51(W1):W213-W224. doi: 10.1093/nar/gkad399.
4
Nasopharyngeal carcinoma cells promote regulatory T cell development and suppressive activity via CD70-CD27 interaction.鼻咽癌细胞通过 CD70-CD27 相互作用促进调节性 T 细胞的发育和抑制活性。
Nat Commun. 2023 Apr 6;14(1):1912. doi: 10.1038/s41467-023-37614-6.
5
RCN2 promotes Nasopharyngeal carcinoma progression by curbing Calcium flow and Mitochondrial apoptosis.RCN2 通过抑制钙离子流动和线粒体凋亡促进鼻咽癌的进展。
Cell Oncol (Dordr). 2023 Aug;46(4):1031-1048. doi: 10.1007/s13402-023-00796-8. Epub 2023 Mar 23.
6
The formation of HuR/YB1 complex is required for the stabilization of target mRNA to promote myogenesis.HuR/YB1 复合物的形成对于稳定靶 mRNA 以促进成肌作用是必需的。
Nucleic Acids Res. 2023 Feb 22;51(3):1375-1392. doi: 10.1093/nar/gkac1245.
7
Therapy sculpts the complex interplay between cancer and the immune system during tumour evolution.治疗在肿瘤进化过程中塑造了癌症和免疫系统之间的复杂相互作用。
Genome Med. 2022 Dec 7;14(1):137. doi: 10.1186/s13073-022-01138-3.
8
HDAC I/IIb selective inhibitor Purinostat Mesylate combined with GLS1 inhibition effectively eliminates CML stem cells.HDAC I/IIb选择性抑制剂甲磺酸盐嘌呤司他丁与GLS1抑制联合使用可有效消除慢性粒细胞白血病干细胞。
Bioact Mater. 2022 Sep 18;21:483-498. doi: 10.1016/j.bioactmat.2022.08.006. eCollection 2023 Mar.
9
Identification of LARS as an essential gene for osteosarcoma proliferation through large-Scale CRISPR-Cas9 screening database and experimental verification.通过大规模 CRISPR-Cas9 筛选数据库和实验验证鉴定 LARS 为骨肉瘤增殖的必需基因。
J Transl Med. 2022 Aug 12;20(1):355. doi: 10.1186/s12967-022-03571-9.
10
Construction of a lncRNA-mRNA Co-Expression Network for Nasopharyngeal Carcinoma.鼻咽癌长链非编码RNA-信使核糖核酸共表达网络的构建
Front Oncol. 2022 Jul 7;12:809760. doi: 10.3389/fonc.2022.809760. eCollection 2022.