• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于诊断肺结核的随机森林和人工神经网络联合模型。

A united model for diagnosing pulmonary tuberculosis with random forest and artificial neural network.

作者信息

Zhu Qingqing, Liu Jie

机构信息

Anhui Provincial Tuberculosis Institute, Hefei, Anhui, China.

出版信息

Front Genet. 2023 Mar 9;14:1094099. doi: 10.3389/fgene.2023.1094099. eCollection 2023.

DOI:10.3389/fgene.2023.1094099
PMID:36968608
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10033863/
Abstract

Pulmonary tuberculosis (PTB) is a chronic infectious disease and is the most common type of TB. Although the sputum smear test is a gold standard for diagnosing PTB, the method has numerous limitations, including low sensitivity, low specificity, and insufficient samples. The present study aimed to identify specific biomarkers of PTB and construct a model for diagnosing PTB by combining random forest (RF) and artificial neural network (ANN) algorithms. Two publicly available cohorts of TB, namely, the GSE83456 (training) and GSE42834 (validation) cohorts, were retrieved from the Gene Expression Omnibus (GEO) database. A total of 45 and 61 differentially expressed genes (DEGs) were identified between the PTB and control samples, respectively, by screening the GSE83456 cohort. An RF classifier was used for identifying specific biomarkers, following which an ANN-based classification model was constructed for identifying PTB samples. The accuracy of the ANN model was validated using the receiver operating characteristic (ROC) curve. The proportion of 22 types of immunocytes in the PTB samples was measured using the CIBERSORT algorithm, and the correlations between the immunocytes were determined. Differential analysis revealed that 11 and 22 DEGs were upregulated and downregulated, respectively, and 11 biomarkers specific to PTB were identified by the RF classifier. The weights of these biomarkers were determined and an ANN-based classification model was subsequently constructed. The model exhibited outstanding performance, as revealed by the area under the curve (AUC), which was 1.000 for the training cohort. The AUC of the validation cohort was 0.946, which further confirmed the accuracy of the model. Altogether, the present study successfully identified specific genetic biomarkers of PTB and constructed a highly accurate model for the diagnosis of PTB based on blood samples. The model developed herein can serve as a reliable reference for the early detection of PTB and provide novel perspectives into the pathogenesis of PTB.

摘要

肺结核(PTB)是一种慢性传染病,也是最常见的结核病类型。尽管痰涂片检查是诊断PTB的金标准,但该方法存在诸多局限性,包括敏感性低、特异性低和样本不足。本研究旨在识别PTB的特异性生物标志物,并通过结合随机森林(RF)和人工神经网络(ANN)算法构建PTB诊断模型。从基因表达综合数据库(GEO)中检索了两个公开可用的结核病队列,即GSE83456(训练)和GSE42834(验证)队列。通过筛选GSE83456队列,分别在PTB样本和对照样本之间鉴定出总共45个和61个差异表达基因(DEG)。使用RF分类器识别特异性生物标志物,随后构建基于ANN的分类模型以识别PTB样本。使用受试者工作特征(ROC)曲线验证ANN模型的准确性。使用CIBERSORT算法测量PTB样本中22种免疫细胞的比例,并确定免疫细胞之间的相关性。差异分析显示,分别有11个和22个DEG上调和下调,RF分类器鉴定出11个PTB特异性生物标志物。确定这些生物标志物的权重,随后构建基于ANN的分类模型。该模型表现出出色的性能,训练队列的曲线下面积(AUC)为1.000。验证队列的AUC为0.946,进一步证实了模型的准确性。总之,本研究成功识别了PTB的特异性遗传生物标志物,并构建了基于血液样本的高度准确的PTB诊断模型。本文开发的模型可为PTB的早期检测提供可靠参考,并为PTB的发病机制提供新的视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/38b79fc50ee8/fgene-14-1094099-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/87d8aab79585/fgene-14-1094099-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/a60ffe2103ad/fgene-14-1094099-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/14f772c6f704/fgene-14-1094099-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/e1e2a6d0242b/fgene-14-1094099-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/483c384bdeb3/fgene-14-1094099-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/09a0c41705aa/fgene-14-1094099-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/38b79fc50ee8/fgene-14-1094099-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/87d8aab79585/fgene-14-1094099-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/a60ffe2103ad/fgene-14-1094099-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/14f772c6f704/fgene-14-1094099-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/e1e2a6d0242b/fgene-14-1094099-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/483c384bdeb3/fgene-14-1094099-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/09a0c41705aa/fgene-14-1094099-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1abd/10033863/38b79fc50ee8/fgene-14-1094099-g007.jpg

相似文献

1
A united model for diagnosing pulmonary tuberculosis with random forest and artificial neural network.一种用于诊断肺结核的随机森林和人工神经网络联合模型。
Front Genet. 2023 Mar 9;14:1094099. doi: 10.3389/fgene.2023.1094099. eCollection 2023.
2
Construction of Novel Gene Signature-Based Predictive Model for the Diagnosis of Acute Myocardial Infarction by Combining Random Forest With Artificial Neural Network.基于随机森林与人工神经网络相结合构建新型基因特征预测模型用于急性心肌梗死的诊断
Front Cardiovasc Med. 2022 May 25;9:876543. doi: 10.3389/fcvm.2022.876543. eCollection 2022.
3
Development and Verification of a Combined Diagnostic Model for Sarcopenia with Random Forest and Artificial Neural Network.基于随机森林和人工神经网络的肌少症联合诊断模型的建立与验证。
Comput Math Methods Med. 2022 Aug 23;2022:2957731. doi: 10.1155/2022/2957731. eCollection 2022.
4
Construction and analysis of a conjunctive diagnostic model of HNSCC with random forest and artificial neural network.基于随机森林和人工神经网络构建并分析头颈部鳞状细胞癌联合诊断模型。
Sci Rep. 2023 Apr 25;13(1):6736. doi: 10.1038/s41598-023-32620-6.
5
RTP4, a Biomarker Associated with Diagnosing Pulmonary Tuberculosis and Pan-Cancer Analysis.RTP4,一种与诊断肺结核和泛癌分析相关的生物标志物。
Mediators Inflamm. 2023 Apr 26;2023:2318473. doi: 10.1155/2023/2318473. eCollection 2023.
6
Construction and validation of a joint diagnosis model based on random forest and artificial intelligence network for hepatitis B-related hepatocellular carcinoma.基于随机森林和人工智能网络的乙型肝炎相关肝细胞癌联合诊断模型的构建与验证
Transl Cancer Res. 2024 Feb 29;13(2):1068-1082. doi: 10.21037/tcr-23-1197. Epub 2024 Feb 26.
7
Development of a joint diagnostic model of thyroid papillary carcinoma with artificial neural network and random forest.基于人工神经网络和随机森林的甲状腺乳头状癌联合诊断模型的开发
Front Genet. 2022 Oct 7;13:957718. doi: 10.3389/fgene.2022.957718. eCollection 2022.
8
MiRNA-155 and miRNA-132 as potential diagnostic biomarkers for pulmonary tuberculosis: A preliminary study.MiRNA - 155和miRNA - 132作为肺结核潜在诊断生物标志物的初步研究。
Microb Pathog. 2016 Nov;100:78-83. doi: 10.1016/j.micpath.2016.09.005. Epub 2016 Sep 8.
9
Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network.多囊卵巢综合征联合诊断模型的建立与分析:随机森林和人工神经网络方法。
Biomed Res Int. 2020 Aug 20;2020:2613091. doi: 10.1155/2020/2613091. eCollection 2020.
10
Gene network in pulmonary tuberculosis based on bioinformatic analysis.基于生物信息学分析的肺结核基因网络
BMC Infect Dis. 2020 Aug 18;20(1):612. doi: 10.1186/s12879-020-05335-6.

引用本文的文献

1
Predictive modelling of air pollution affecting human tuberculosis risk on Mainland China.中国大陆空气污染对人类结核病风险影响的预测模型
Sci Rep. 2025 Jul 2;15(1):23633. doi: 10.1038/s41598-025-08078-z.
2
Predicting the risk of pulmonary embolism in patients with tuberculosis using machine learning algorithms.使用机器学习算法预测肺结核患者发生肺栓塞的风险。
Eur J Med Res. 2024 Dec 23;29(1):618. doi: 10.1186/s40001-024-02218-3.
3
Identification of diagnostic biomarkers correlate with immune infiltration in extra-pulmonary tuberculosis by integrating bioinformatics and machine learning.

本文引用的文献

1
Diagnostic biomarkers for active tuberculosis: progress and challenges.活动性结核病的诊断生物标志物:进展与挑战。
EMBO Mol Med. 2022 Dec 7;14(12):e14088. doi: 10.15252/emmm.202114088. Epub 2022 Oct 31.
2
The Establishment of Hypertrophic Cardiomyopathy Diagnosis Model via Artificial Neural Network and Random Decision Forest Method.基于人工神经网络和随机决策森林方法的肥厚型心肌病诊断模型的建立。
Mediators Inflamm. 2022 Sep 15;2022:2024974. doi: 10.1155/2022/2024974. eCollection 2022.
3
Machine learning in the loop for tuberculosis diagnosis support.
通过整合生物信息学和机器学习鉴定与肺外结核免疫浸润相关的诊断生物标志物。
Front Microbiol. 2024 Feb 7;15:1349374. doi: 10.3389/fmicb.2024.1349374. eCollection 2024.
基于机器学习的结核病诊断辅助系统。
Front Public Health. 2022 Jul 26;10:876949. doi: 10.3389/fpubh.2022.876949. eCollection 2022.
4
Urine biomarkers of pulmonary tuberculosis.尿生物标志物与肺结核。
Expert Rev Respir Med. 2022 Jun;16(6):615-621. doi: 10.1080/17476348.2022.2090341. Epub 2022 Jun 21.
5
Transcriptional Profiling and Deriving a Seven-Gene Signature That Discriminates Active and Latent Tuberculosis: An Integrative Bioinformatics Approach.转录谱分析和构建区分活动性和潜伏性结核病的七基因特征:一种综合生物信息学方法。
Genes (Basel). 2022 Mar 29;13(4):616. doi: 10.3390/genes13040616.
6
The WHO Global Tuberculosis 2021 Report - not so good news and turning the tide back to End TB.世界卫生组织 2021 年全球结核病报告——不容乐观的消息与遏制结核病蔓延的努力
Int J Infect Dis. 2022 Nov;124 Suppl 1:S26-S29. doi: 10.1016/j.ijid.2022.03.011. Epub 2022 Mar 20.
7
Using biological information to analyze potential miRNA-mRNA regulatory networks in the plasma of patients with non-small cell lung cancer.利用生物信息学分析非小细胞肺癌患者血浆中潜在的 miRNA-mRNA 调控网络。
BMC Cancer. 2022 Mar 21;22(1):299. doi: 10.1186/s12885-022-09281-1.
8
Construction and evaluation of an integrated predictive model for chronic kidney disease based on the random forest and artificial neural network approaches.基于随机森林和人工神经网络方法构建和评估慢性肾脏病的综合预测模型。
Biochem Biophys Res Commun. 2022 May 7;603:21-28. doi: 10.1016/j.bbrc.2022.02.099. Epub 2022 Feb 25.
9
Pyridoxal phosphate, pyridoxamine phosphate, and folic acid based on ceRNA regulatory network as potential biomarkers for the diagnosis of pulmonary tuberculosis.基于ceRNA调控网络的磷酸吡哆醛、磷酸吡哆胺和叶酸作为肺结核诊断的潜在生物标志物。
Infect Genet Evol. 2022 Apr;99:105240. doi: 10.1016/j.meegid.2022.105240. Epub 2022 Feb 9.
10
Low cerebrospinal fluid Amyloid-βeta 1-42 in patients with tuberculous meningitis.结核性脑膜炎患者脑脊液中β淀粉样蛋白 1-42 含量低。
BMC Neurol. 2021 Nov 16;21(1):449. doi: 10.1186/s12883-021-02468-2.