• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于加权基因相关网络分析和机器学习的早期肺部节点分类预测。

Classification prediction of early pulmonary nodes based on weighted gene correlation network analysis and machine learning.

机构信息

Department of Radiotherapy, Chongqing University Cancer Hospital, Chongqing, China.

Department of Equipment, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China.

出版信息

J Cancer Res Clin Oncol. 2023 Jul;149(7):3915-3924. doi: 10.1007/s00432-022-04312-7. Epub 2022 Aug 26.

DOI:10.1007/s00432-022-04312-7
PMID:36018512
Abstract

OBJECTIVE

To use weighted gene correlation network analysis (WGCNA) and machine learning algorithm to predict classification of early pulmonary nodes with public databases.

METHODS

The expression data and clinical data of lung cancer patients were firstly extracted from public database (GTEx and TCGA) to study the differentially expressed genes (DEGs) of lung adenocarcinoma (LUAD). The intersection of three R packages (Dseq2, Limma, EdgeR) methods were selected as candidate DEGs for further study. WGCNA was used to obtain relevant modules and key genes of lung cancer classification, GO and KEGG enrichment analysis was performed. The model was built using two machine learning methods, Least Absolute Shrinkage and Selection Operator (LASSO) regression and tumor classification was also predicted with extreme Gradient Boosting (XGBoost) algorithm.

RESULTS

DEGs analysis revealed that there were 1306 LUAD genes. WGCNA module analysis showed that a total of 116 genes were significantly related to classification, and module genes were mainly related to 14 KEGG pathways. The machine learning algorithm identified 10 target genes by LASSO regression analysis of differential genes, and 18 genes were identified by XGBoost model. A total of 6 genes were found from the intersection of the above methods as classification signatures of early pulmonary nodules, including "HMGB3" "ARHGAP6" "TCF21" "FCN3" "COL6A6" "GOLM1".

CONCLUSION

Using DEGs analysis, WGCNA method and machine learning algorithm, six gene signatures related to early stage of LUAD, which can assist clinicians in disease classification prediction.

摘要

目的

利用加权基因共相关网络分析(WGCNA)和机器学习算法,基于公共数据库预测早期肺结节的分类。

方法

首先从公共数据库(GTEx 和 TCGA)中提取肺癌患者的表达数据和临床数据,以研究肺腺癌(LUAD)的差异表达基因(DEGs)。使用三个 R 包(Dseq2、Limma、EdgeR)的交集方法筛选候选 DEGs 进行进一步研究。使用 WGCNA 获得与肺癌分类相关的模块和关键基因,进行 GO 和 KEGG 富集分析。使用两种机器学习方法(最小绝对值收缩和选择算子(LASSO)回归和肿瘤分类)构建模型,并使用极端梯度提升(XGBoost)算法进行肿瘤分类预测。

结果

DEGs 分析显示有 1306 个 LUAD 基因。WGCNA 模块分析表明,共有 116 个基因与分类显著相关,模块基因主要与 14 个 KEGG 通路相关。通过差异基因的 LASSO 回归分析,机器学习算法鉴定了 10 个靶基因,XGBoost 模型鉴定了 18 个基因。通过上述方法的交集,共发现 6 个基因作为早期肺结节分类的特征,包括“HMGB3”“ARHGAP6”“TCF21”“FCN3”“COL6A6”“GOLM1”。

结论

本研究通过 DEGs 分析、WGCNA 方法和机器学习算法,确定了 6 个与 LUAD 早期阶段相关的基因特征,可辅助临床医生进行疾病分类预测。

相似文献

1
Classification prediction of early pulmonary nodes based on weighted gene correlation network analysis and machine learning.基于加权基因相关网络分析和机器学习的早期肺部节点分类预测。
J Cancer Res Clin Oncol. 2023 Jul;149(7):3915-3924. doi: 10.1007/s00432-022-04312-7. Epub 2022 Aug 26.
2
Molecular subtypes of lung adenocarcinoma patients for prognosis and therapeutic response prediction with machine learning on 13 programmed cell death patterns.基于 13 种程序性细胞死亡模式的机器学习对肺腺癌患者预后和治疗反应预测的分子亚型。
J Cancer Res Clin Oncol. 2023 Oct;149(13):11351-11368. doi: 10.1007/s00432-023-05000-w. Epub 2023 Jun 28.
3
Deciphering Shared Gene Signatures and Immune Infiltration Characteristics Between Gestational Diabetes Mellitus and Preeclampsia by Integrated Bioinformatics Analysis and Machine Learning.通过综合生物信息学分析和机器学习破译妊娠期糖尿病和子痫前期之间共享的基因特征及免疫浸润特征
Reprod Sci. 2025 May 15. doi: 10.1007/s43032-025-01847-1.
4
Caveolin-1 inhibits the proliferation and invasion of lung adenocarcinoma via EGFR degradation.小窝蛋白-1通过表皮生长因子受体(EGFR)降解抑制肺腺癌的增殖和侵袭。
Sci Rep. 2025 Jul 1;15(1):21654. doi: 10.1038/s41598-025-05259-8.
5
Developing a Panel of Shared Susceptibility Genes as Diagnostic Biomarkers for chronic obstructive pulmonary disease and Heart Failure.开发一组共享易感性基因作为慢性阻塞性肺疾病和心力衰竭的诊断生物标志物。
Comput Biol Med. 2025 Sep;196(Pt A):110657. doi: 10.1016/j.compbiomed.2025.110657. Epub 2025 Jul 4.
6
Exploring the role of ferroptosis in pemphigus: identification of diagnostic markers and regulatory mechanisms.探索铁死亡在天疱疮中的作用:诊断标志物和调控机制的鉴定
Front Med (Lausanne). 2025 Jun 19;12:1615865. doi: 10.3389/fmed.2025.1615865. eCollection 2025.
7
Identification and verification of immune and oxidative stress-related diagnostic indicators for malignant lung nodules through WGCNA and machine learning.通过加权基因共表达网络分析(WGCNA)和机器学习识别并验证恶性肺结节的免疫和氧化应激相关诊断指标
Sci Rep. 2025 Jul 1;15(1):22449. doi: 10.1038/s41598-025-04639-4.
8
Integrated multi-omics and machine learning reveals immune-metabolic signatures in osteoarthritis: from bulk RNA-seq to single-cell resolution.综合多组学和机器学习揭示骨关节炎中的免疫代谢特征:从批量RNA测序到单细胞分辨率
Front Immunol. 2025 Jun 16;16:1599930. doi: 10.3389/fimmu.2025.1599930. eCollection 2025.
9
Machine learning-based radiomics for differentiating lung cancer subtypes in brain metastases using CE-T1WI.基于机器学习的影像组学在使用对比增强T1加权成像鉴别脑转移瘤中肺癌亚型的应用
Front Oncol. 2025 Jun 19;15:1599882. doi: 10.3389/fonc.2025.1599882. eCollection 2025.
10
Integrated analysis of uterine leiomyosarcoma and leiomyoma utilizing TCGA and GEO data: a WGCNA and machine learning approach.利用TCGA和GEO数据对子宫平滑肌肉瘤和平滑肌瘤进行综合分析:一种加权基因共表达网络分析和机器学习方法。
Transl Cancer Res. 2025 May 30;14(5):2999-3016. doi: 10.21037/tcr-2024-2465. Epub 2025 May 13.

引用本文的文献

1
Multifaceted bioinformatic analysis of m6A-related ferroptosis and its link with gene signatures and tumour-infiltrating immune cells in gliomas.多方面的 m6A 相关铁死亡的生物信息学分析及其与胶质瘤基因特征和肿瘤浸润免疫细胞的关系。
J Cell Mol Med. 2024 Sep;28(17):e70060. doi: 10.1111/jcmm.70060.

本文引用的文献

1
Metabolic and Evolutionary Engineering of Diploid Yeast for the Production of First- and Second-Generation Ethanol.用于生产第一代和第二代乙醇的二倍体酵母的代谢与进化工程
Front Bioeng Biotechnol. 2022 Jan 28;9:835928. doi: 10.3389/fbioe.2021.835928. eCollection 2021.
2
Identification of Key mRNAs as Prediction Models for Early Metastasis of Pancreatic Cancer Based on LASSO.基于LASSO的关键mRNA作为胰腺癌早期转移预测模型的鉴定
Front Bioeng Biotechnol. 2021 Aug 17;9:701039. doi: 10.3389/fbioe.2021.701039. eCollection 2021.
3
The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML.
与 AML 预后相关的基因共表达模块的景观。
J Transl Med. 2021 May 29;19(1):228. doi: 10.1186/s12967-021-02914-2.
4
A risk prediction model of gene signatures in ovarian cancer through bagging of GA-XGBoost models.通过集成 GA-XGBoost 模型的套袋算法构建卵巢癌基因特征风险预测模型。
J Adv Res. 2020 Nov 11;30:113-122. doi: 10.1016/j.jare.2020.11.006. eCollection 2021 May.
5
Comprehensive Analysis of the Immune and Prognostic Implication of COL6A6 in Lung Adenocarcinoma.COL6A6在肺腺癌中的免疫及预后意义的综合分析
Front Oncol. 2021 Feb 26;11:633420. doi: 10.3389/fonc.2021.633420. eCollection 2021.
6
Integrative Genomic and Transcriptomic Analyses of Tumor Suppressor Genes and Their Role on Tumor Microenvironment and Immunity in Lung Squamous Cell Carcinoma.肺鳞状细胞癌中肿瘤抑制基因的综合基因组和转录组分析及其对肿瘤微环境和免疫的作用
Front Immunol. 2021 Feb 25;12:598671. doi: 10.3389/fimmu.2021.598671. eCollection 2021.
7
Novel gene signatures for stage classification of the squamous cell carcinoma of the lung.新型基因标志物用于肺鳞癌的分期分类。
Sci Rep. 2021 Mar 1;11(1):4835. doi: 10.1038/s41598-021-83668-1.
8
CRISPR/Cas9-mediated knock-in of alligator cathelicidin gene in a non-coding region of channel catfish genome.CRISPR/Cas9 介导的鳄鲶抗菌肽基因在斑点叉尾鮰基因组非编码区的敲入。
Sci Rep. 2020 Dec 17;10(1):22271. doi: 10.1038/s41598-020-79409-5.
9
A genomic signature for accurate classification and prediction of clinical outcomes in cancer patients treated with immune checkpoint blockade immunotherapy.一种用于准确分类和预测接受免疫检查点阻断免疫疗法治疗的癌症患者临床结局的基因组特征。
Sci Rep. 2020 Nov 25;10(1):20575. doi: 10.1038/s41598-020-77653-3.
10
LncRNA LINC00163 upregulation suppresses lung cancer development though transcriptionally increasing TCF21 expression.长链非编码RNA LINC00163的上调通过转录增加TCF21表达来抑制肺癌发展。
Am J Cancer Res. 2018 Dec 1;8(12):2494-2506. eCollection 2018.