• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测……中的耐药性:一种用于基因组突变分析的机器学习方法

Predicting Drug Resistance in : A Machine Learning Approach to Genomic Mutation Analysis.

作者信息

Paredes-Gutierrez Guillermo, Perea-Jacobo Ricardo, Acosta-Mesa Héctor-Gabriel, Mezura-Montes Efren, Morales Reyes José Luis, Zenteno-Cuevas Roberto, Guerrero-Chevannier Miguel-Ángel, Muñiz-Salazar Raquel, Flores Dora-Luz

机构信息

Facultad de Ingeniería Arquitectura y Diseño, Universidad Autónoma de Baja California, Campus Ensenada, Ensenada 22860, Mexico.

Escuela de Ciencias de la Salud, Universidad Autónoma de Baja California, Campus Ensenada, Ensenada 22890, Mexico.

出版信息

Diagnostics (Basel). 2025 Jan 24;15(3):279. doi: 10.3390/diagnostics15030279.

DOI:10.3390/diagnostics15030279
PMID:39941209
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11817661/
Abstract

Tuberculosis (TB), caused by (), remains a leading cause of death from infectious diseases globally. The treatment of active TB relies on first- and second-line drugs, however, the emergence of drug resistance poses a significant challenge to global TB control efforts. Recent advances in whole-genome sequencing combined with machine learning have shown promise in predicting drug resistance. This study aimed to evaluate the performance of four machine learning models in classifying resistance to ethambutol, isoniazid, and rifampicin in isolates. Four machine learning models-Extreme Gradient Boosting Classifier (XGBC), Logistic Gradient Boosting Classifier (LGBC), Gradient Boosting Classifier (GBC), and an Artificial Neural Network (ANN)-were trained using a Variant Call Format (VCF) dataset preprocessed by the CRyPTIC consortium. Three datasets were used: the original dataset, a principal component analysis (PCA)-reduced dataset, and a dataset prioritizing significant mutations identified by the XGBC model. The models were trained and tested across these datasets, and their performance was compared using sensitivity, specificity, Precision, F1-scores and Accuracy. All models were applied to the PCA-reduced dataset, while the XGBC model was also evaluated using the mutation-prioritized dataset. The XGBC model trained on the original dataset outperformed the others, achieving sensitivity values of 0.97, 0.90, and 0.94; specificity values of 0.97, 0.99, and 0.96; and F1-scores of 0.93, 0.94, and 0.92 for ethambutol, isoniazid, and rifampicin, respectively. These results demonstrate the superior accuracy of the XGBC model in classifying drug resistance. The study highlights the effectiveness of using a binary representation of mutations to train the XGBC model for predicting resistance and susceptibility to key TB drugs. The XGBC model trained on the original dataset demonstrated the highest performance among the evaluated models, suggesting its potential for clinical application in combating drug-resistant tuberculosis. Further research is needed to validate and expand these findings for broader implementation in TB diagnostics.

摘要

由()引起的结核病(TB)仍是全球传染病死亡的主要原因。活动性结核病的治疗依赖于一线和二线药物,然而,耐药性的出现对全球结核病控制工作构成了重大挑战。全基因组测序与机器学习相结合的最新进展在预测耐药性方面显示出了前景。本研究旨在评估四种机器学习模型对分离株中乙胺丁醇、异烟肼和利福平耐药性进行分类的性能。使用由CRYPTIC联盟预处理的变异调用格式(VCF)数据集训练了四种机器学习模型——极端梯度提升分类器(XGBC)、逻辑梯度提升分类器(LGBC)、梯度提升分类器(GBC)和人工神经网络(ANN)。使用了三个数据集:原始数据集、主成分分析(PCA)降维数据集以及优先考虑XGBC模型识别出的显著突变的数据集。在这些数据集上对模型进行训练和测试,并使用敏感性、特异性、精确率、F1分数和准确率比较它们的性能。所有模型都应用于PCA降维数据集,同时还使用突变优先数据集对XGBC模型进行了评估。在原始数据集上训练的XGBC模型表现优于其他模型,乙胺丁醇、异烟肼和利福平的敏感性值分别为0.97、0.90和0.94;特异性值分别为0.97、0.99和0.96;F1分数分别为0.93、0.94和0.92。这些结果证明了XGBC模型在分类耐药性方面具有更高的准确性。该研究突出了使用突变的二元表示来训练XGBC模型以预测对关键结核病药物的耐药性和敏感性的有效性。在原始数据集上训练的XGBC模型在评估模型中表现出最高性能,表明其在对抗耐药结核病的临床应用中的潜力。需要进一步研究来验证和扩展这些发现,以便在结核病诊断中更广泛地应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/6f3f068be7fe/diagnostics-15-00279-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/89c60b07968d/diagnostics-15-00279-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/874dafc7c3a0/diagnostics-15-00279-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/3782f16e25e8/diagnostics-15-00279-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/6f3f068be7fe/diagnostics-15-00279-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/89c60b07968d/diagnostics-15-00279-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/874dafc7c3a0/diagnostics-15-00279-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/3782f16e25e8/diagnostics-15-00279-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2768/11817661/6f3f068be7fe/diagnostics-15-00279-g004.jpg

相似文献

1
Predicting Drug Resistance in : A Machine Learning Approach to Genomic Mutation Analysis.预测……中的耐药性:一种用于基因组突变分析的机器学习方法
Diagnostics (Basel). 2025 Jan 24;15(3):279. doi: 10.3390/diagnostics15030279.
2
Machine learning-based prediction of antibiotic resistance in Mycobacterium tuberculosis clinical isolates from Uganda.基于机器学习对乌干达结核分枝杆菌临床分离株抗生素耐药性的预测
BMC Infect Dis. 2024 Dec 5;24(1):1391. doi: 10.1186/s12879-024-10282-7.
3
Evolution of Extensively Drug-Resistant Tuberculosis over Four Decades: Whole Genome Sequencing and Dating Analysis of Mycobacterium tuberculosis Isolates from KwaZulu-Natal.四十年间广泛耐药结核病的演变:来自夸祖鲁 - 纳塔尔省结核分枝杆菌分离株的全基因组测序与年代分析
PLoS Med. 2015 Sep 29;12(9):e1001880. doi: 10.1371/journal.pmed.1001880. eCollection 2015 Sep.
4
Co-resistance to isoniazid and second-line anti-tuberculosis drugs in isoniazid-resistant tuberculosis at a tertiary care hospital in Thailand.泰国一家三级保健医院耐异烟肼结核病中的异烟肼和二线抗结核药物的共同耐药性。
Microbiol Spectr. 2024 Mar 5;12(3):e0346223. doi: 10.1128/spectrum.03462-23. Epub 2024 Feb 7.
5
Beyond multidrug resistance: Leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction.超越多药耐药性:利用机器和统计学习模型在结核分枝杆菌耐药性预测中的罕见变异。
EBioMedicine. 2019 May;43:356-369. doi: 10.1016/j.ebiom.2019.04.016. Epub 2019 Apr 29.
6
Whole genome sequencing of drug resistant Mycobacterium tuberculosis isolates from a high burden tuberculosis region of North West Pakistan.对来自巴基斯坦西北部高负担结核病地区的耐药结核分枝杆菌分离株进行全基因组测序。
Sci Rep. 2019 Oct 18;9(1):14996. doi: 10.1038/s41598-019-51562-6.
7
A modified decision tree approach to improve the prediction and mutation discovery for drug resistance in Mycobacterium tuberculosis.一种改进的决策树方法,用于提高对结核分枝杆菌耐药性的预测和突变发现。
BMC Genomics. 2022 Jan 11;23(1):46. doi: 10.1186/s12864-022-08291-4.
8
Advantages of updated WHO mutation catalog combined with existing whole-genome sequencing-based approaches for Mycobacterium tuberculosis resistance prediction.更新后的世界卫生组织突变目录与现有的基于全基因组测序的方法相结合用于预测结核分枝杆菌耐药性的优势。
Genome Med. 2025 Mar 26;17(1):31. doi: 10.1186/s13073-025-01458-0.
9
GenTB: A user-friendly genome-based predictor for tuberculosis resistance powered by machine learning.GenTB:一款基于基因组的用户友好型结核耐药预测器,由机器学习驱动。
Genome Med. 2021 Aug 30;13(1):138. doi: 10.1186/s13073-021-00953-4.
10
Whole-genome sequencing of presumptive MDR-TB isolates from a tertiary healthcare setting in Mumbai.对来自孟买一家三级保健医疗机构的疑似耐多药结核分枝杆菌分离株进行全基因组测序。
J Glob Antimicrob Resist. 2022 Dec;31:256-262. doi: 10.1016/j.jgar.2022.10.004. Epub 2022 Oct 20.

本文引用的文献

1
Evaluation of Xpert MTB/XDR test for susceptibility testing of Mycobacterium tuberculosis to first and second-line drugs in Uganda.评价 Xpert MTB/XDR 检测在乌干达用于检测结核分枝杆菌对一线和二线药物敏感性的效果。
PLoS One. 2023 Aug 17;18(8):e0284545. doi: 10.1371/journal.pone.0284545. eCollection 2023.
2
Direct detection of drug-resistant using targeted next generation sequencing.利用靶向下一代测序直接检测耐药性。
Front Public Health. 2023 Jun 29;11:1206056. doi: 10.3389/fpubh.2023.1206056. eCollection 2023.
3
A data compendium associating the genomes of 12,289 Mycobacterium tuberculosis isolates with quantitative resistance phenotypes to 13 antibiotics.
一个关联了 12289 株结核分枝杆菌分离株基因组与 13 种抗生素定量耐药表型的数据集。
PLoS Biol. 2022 Aug 9;20(8):e3001721. doi: 10.1371/journal.pbio.3001721. eCollection 2022 Aug.
4
A modified decision tree approach to improve the prediction and mutation discovery for drug resistance in Mycobacterium tuberculosis.一种改进的决策树方法,用于提高对结核分枝杆菌耐药性的预测和突变发现。
BMC Genomics. 2022 Jan 11;23(1):46. doi: 10.1186/s12864-022-08291-4.
5
GenTB: A user-friendly genome-based predictor for tuberculosis resistance powered by machine learning.GenTB:一款基于基因组的用户友好型结核耐药预测器,由机器学习驱动。
Genome Med. 2021 Aug 30;13(1):138. doi: 10.1186/s13073-021-00953-4.
6
Machine Learning Predicts Accurately Drug Resistance From Whole Genome Sequencing Data.机器学习可根据全基因组测序数据准确预测耐药性。
Front Genet. 2019 Sep 26;10:922. doi: 10.3389/fgene.2019.00922. eCollection 2019.
7
A large scale evaluation of TBProfiler and Mykrobe for antibiotic resistance prediction in .对TBProfiler和Mykrobe在……中预测抗生素耐药性的大规模评估。 (原句结尾不完整,翻译可能存在一定局限性)
PeerJ. 2019 May 1;7:e6857. doi: 10.7717/peerj.6857. eCollection 2019.
8
Beyond multidrug resistance: Leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction.超越多药耐药性:利用机器和统计学习模型在结核分枝杆菌耐药性预测中的罕见变异。
EBioMedicine. 2019 May;43:356-369. doi: 10.1016/j.ebiom.2019.04.016. Epub 2019 Apr 29.
9
Genome-wide association studies and polygenic risk scores for skin cancer: clinically useful yet?皮肤癌的全基因组关联研究和多基因风险评分:目前在临床上有用吗?
Br J Dermatol. 2019 Dec;181(6):1146-1155. doi: 10.1111/bjd.17917. Epub 2019 Jul 7.
10
Lung Cancer Risk Prediction Using Common SNPs Located in GWAS-Identified Susceptibility Regions.利用位于全基因组关联研究(GWAS)确定的易感区域的常见单核苷酸多态性(SNPs)预测肺癌风险
J Thorac Oncol. 2015 Nov;10(11):1538-45. doi: 10.1097/JTO.0000000000000666.