• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习技术在结核病耐药性分析中的应用。

Application of machine learning techniques to tuberculosis drug resistance analysis.

机构信息

Department of Engineering Science, Institute of Biomedical Engineering.

Nuffield Department of Medicine, University of Oxford.

出版信息

Bioinformatics. 2019 Jul 1;35(13):2276-2282. doi: 10.1093/bioinformatics/bty949.

DOI:10.1093/bioinformatics/bty949
PMID:30462147
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6596891/
Abstract

MOTIVATION

Timely identification of Mycobacterium tuberculosis (MTB) resistance to existing drugs is vital to decrease mortality and prevent the amplification of existing antibiotic resistance. Machine learning methods have been widely applied for timely predicting resistance of MTB given a specific drug and identifying resistance markers. However, they have been not validated on a large cohort of MTB samples from multi-centers across the world in terms of resistance prediction and resistance marker identification. Several machine learning classifiers and linear dimension reduction techniques were developed and compared for a cohort of 13 402 isolates collected from 16 countries across 6 continents and tested 11 drugs.

RESULTS

Compared to conventional molecular diagnostic test, area under curve of the best machine learning classifier increased for all drugs especially by 23.11%, 15.22% and 10.14% for pyrazinamide, ciprofloxacin and ofloxacin, respectively (P < 0.01). Logistic regression and gradient tree boosting found to perform better than other techniques. Moreover, logistic regression/gradient tree boosting with a sparse principal component analysis/non-negative matrix factorization step compared with the classifier alone enhanced the best performance in terms of F1-score by 12.54%, 4.61%, 7.45% and 9.58% for amikacin, moxifloxacin, ofloxacin and capreomycin, respectively, as well increasing area under curve for amikacin and capreomycin. Results provided a comprehensive comparison of various techniques and confirmed the application of machine learning for better prediction of the large diverse tuberculosis data. Furthermore, mutation ranking showed the possibility of finding new resistance/susceptible markers.

AVAILABILITY AND IMPLEMENTATION

The source code can be found at http://www.robots.ox.ac.uk/ davidc/code.php.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

及时识别结核分枝杆菌(MTB)对现有药物的耐药性对于降低死亡率和防止现有抗生素耐药性的扩大至关重要。机器学习方法已广泛应用于及时预测 MTB 对特定药物的耐药性,并识别耐药标记物。然而,它们尚未在全球多个中心的大量 MTB 样本中进行耐药性预测和耐药标记物识别的验证。我们开发并比较了几种机器学习分类器和线性降维技术,用于来自六大洲 16 个国家的 13402 个分离株的队列,这些分离株测试了 11 种药物。

结果

与传统的分子诊断测试相比,最佳机器学习分类器的曲线下面积(AUC)增加了所有药物,尤其是吡嗪酰胺、环丙沙星和氧氟沙星的 AUC 分别增加了 23.11%、15.22%和 10.14%(P<0.01)。逻辑回归和梯度提升树被发现比其他技术表现更好。此外,与仅使用分类器相比,逻辑回归/梯度提升树加上稀疏主成分分析/非负矩阵分解步骤可以分别将阿米卡星、莫西沙星、氧氟沙星和卷曲霉素的 F1 分数最佳性能提高 12.54%、4.61%、7.45%和 9.58%,并提高阿米卡星和卷曲霉素的 AUC。结果提供了对各种技术的全面比较,并证实了机器学习在更好地预测大型多样化结核病数据方面的应用。此外,突变排名显示了发现新的耐药/敏感标记物的可能性。

可用性和实现

源代码可在 http://www.robots.ox.ac.uk/davidc/code.php 找到。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/191e/6596891/aca00565cca6/bty949f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/191e/6596891/c01eaca3914a/bty949f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/191e/6596891/aca00565cca6/bty949f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/191e/6596891/c01eaca3914a/bty949f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/191e/6596891/aca00565cca6/bty949f2.jpg

相似文献

1
Application of machine learning techniques to tuberculosis drug resistance analysis.机器学习技术在结核病耐药性分析中的应用。
Bioinformatics. 2019 Jul 1;35(13):2276-2282. doi: 10.1093/bioinformatics/bty949.
2
Machine learning for classifying tuberculosis drug-resistance from DNA sequencing data.基于 DNA 测序数据的机器学习方法用于结核分枝杆菌耐药性分类。
Bioinformatics. 2018 May 15;34(10):1666-1671. doi: 10.1093/bioinformatics/btx801.
3
DeepAMR for predicting co-occurrent resistance of Mycobacterium tuberculosis.用于预测结核分枝杆菌共现耐药性的 DeepAMR。
Bioinformatics. 2019 Sep 15;35(18):3240-3249. doi: 10.1093/bioinformatics/btz067.
4
Accurate and rapid prediction of tuberculosis drug resistance from genome sequence data using traditional machine learning algorithms and CNN.利用传统机器学习算法和 CNN 从基因组序列数据中准确快速预测结核病耐药性。
Sci Rep. 2022 Feb 14;12(1):2427. doi: 10.1038/s41598-022-06449-4.
5
Machine learning-based prediction of antibiotic resistance in Mycobacterium tuberculosis clinical isolates from Uganda.基于机器学习对乌干达结核分枝杆菌临床分离株抗生素耐药性的预测
BMC Infect Dis. 2024 Dec 5;24(1):1391. doi: 10.1186/s12879-024-10282-7.
6
Beyond multidrug resistance: Leveraging rare variants with machine and statistical learning models in Mycobacterium tuberculosis resistance prediction.超越多药耐药性:利用机器和统计学习模型在结核分枝杆菌耐药性预测中的罕见变异。
EBioMedicine. 2019 May;43:356-369. doi: 10.1016/j.ebiom.2019.04.016. Epub 2019 Apr 29.
7
Advantages of updated WHO mutation catalog combined with existing whole-genome sequencing-based approaches for Mycobacterium tuberculosis resistance prediction.更新后的世界卫生组织突变目录与现有的基于全基因组测序的方法相结合用于预测结核分枝杆菌耐药性的优势。
Genome Med. 2025 Mar 26;17(1):31. doi: 10.1186/s13073-025-01458-0.
8
Machine Learning Predicts Accurately Drug Resistance From Whole Genome Sequencing Data.机器学习可根据全基因组测序数据准确预测耐药性。
Front Genet. 2019 Sep 26;10:922. doi: 10.3389/fgene.2019.00922. eCollection 2019.
9
Rapid, antibiotic incubation-free determination of tuberculosis drug resistance using machine learning and Raman spectroscopy.利用机器学习和拉曼光谱技术快速、无需抗生素孵育即可检测结核药物耐药性。
Proc Natl Acad Sci U S A. 2024 Jun 18;121(25):e2315670121. doi: 10.1073/pnas.2315670121. Epub 2024 Jun 11.
10
Genotypic and phenotypic comparison of drug resistance profiles of clinical multidrug-resistant Mycobacterium tuberculosis isolates using whole genome sequencing in Latvia.使用全基因组测序技术在拉脱维亚对临床耐多药结核分枝杆菌分离株的耐药表型和基因型进行比较。
BMC Infect Dis. 2023 Sep 28;23(1):638. doi: 10.1186/s12879-023-08629-7.

引用本文的文献

1
A comprehensive machine learning for high throughput Tuberculosis sequence analysis, functional annotation, and visualization.一种用于高通量结核病序列分析、功能注释和可视化的综合机器学习方法。
Sci Rep. 2025 Jul 16;15(1):25866. doi: 10.1038/s41598-025-98654-0.
2
Feature selection and aggregation for antibiotic resistance GWAS in : a comparative study.抗生素耐药性全基因组关联研究中的特征选择与聚合:一项比较研究
Front Microbiol. 2025 Jun 18;16:1586476. doi: 10.3389/fmicb.2025.1586476. eCollection 2025.
3
Predicting rifampicin resistance in using machine learning informed by protein structural and chemical features.

本文引用的文献

1
Prediction of Susceptibility to First-Line Tuberculosis Drugs by DNA Sequencing.基于 DNA 测序的一线抗结核药物敏感性预测。
N Engl J Med. 2018 Oct 11;379(15):1403-1415. doi: 10.1056/NEJMoa1800474. Epub 2018 Sep 26.
2
Validation of Novel Mycobacterium tuberculosis Isoniazid Resistance Mutations Not Detectable by Common Molecular Tests.新型结核分枝杆菌异烟肼耐药突变的验证,这些突变不能通过常见的分子检测方法检出。
Antimicrob Agents Chemother. 2018 Sep 24;62(10). doi: 10.1128/AAC.00974-18. Print 2018 Oct.
3
Machine learning for classifying tuberculosis drug-resistance from DNA sequencing data.
利用蛋白质结构和化学特征通过机器学习预测利福平耐药性。
ERJ Open Res. 2025 Jun 30;11(3). doi: 10.1183/23120541.00952-2024. eCollection 2025 May.
4
Whole-genome phenotype prediction with machine learning: open problems in bacterial genomics.利用机器学习进行全基因组表型预测:细菌基因组学中的开放性问题
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf206.
5
Machine learning-based approach for identification of new resistance associated mutations from whole genome sequences of .基于机器学习的方法从……的全基因组序列中鉴定新的耐药相关突变
Bioinform Adv. 2025 Mar 11;5(1):vbaf050. doi: 10.1093/bioadv/vbaf050. eCollection 2025.
6
Machine Learning-Based High-Throughput Screening, Molecular Modeling and Quantum Chemical Analysis to Investigate Mycobacterium tuberculosis MetRS Inhibitors.基于机器学习的高通量筛选、分子建模和量子化学分析以研究结核分枝杆菌甲硫氨酰-tRNA合成酶抑制剂
ChemistryOpen. 2025 Jul;14(7):e202400460. doi: 10.1002/open.202400460. Epub 2025 Feb 25.
7
Machine learning detection of heteroresistance in Escherichia coli.机器学习检测大肠杆菌中的异质性耐药
EBioMedicine. 2025 Mar;113:105618. doi: 10.1016/j.ebiom.2025.105618. Epub 2025 Feb 21.
8
Model for predicting drug resistance based on the clinical profile of tuberculosis patients using machine learning techniques.基于机器学习技术,利用结核病患者临床特征预测耐药性的模型。
PeerJ Comput Sci. 2024 Oct 14;10:e2246. doi: 10.7717/peerj-cs.2246. eCollection 2024.
9
Machine learning-based prediction of antibiotic resistance in Mycobacterium tuberculosis clinical isolates from Uganda.基于机器学习对乌干达结核分枝杆菌临床分离株抗生素耐药性的预测
BMC Infect Dis. 2024 Dec 5;24(1):1391. doi: 10.1186/s12879-024-10282-7.
10
Leveraging large-scale Mycobacterium tuberculosis whole genome sequence data to characterise drug-resistant mutations using machine learning and statistical approaches.利用大规模结核分枝杆菌全基因组序列数据,通过机器学习和统计方法来描述耐药突变。
Sci Rep. 2024 Nov 7;14(1):27091. doi: 10.1038/s41598-024-77947-w.
基于 DNA 测序数据的机器学习方法用于结核分枝杆菌耐药性分类。
Bioinformatics. 2018 May 15;34(10):1666-1671. doi: 10.1093/bioinformatics/btx801.
4
Mycobacterium tuberculosis resistance prediction and lineage classification from genome sequencing: comparison of automated analysis tools.结核分枝杆菌耐药性预测和基因组测序谱系分类:自动化分析工具比较。
Sci Rep. 2017 Apr 20;7:46327. doi: 10.1038/srep46327.
5
HyCLASSS: A Hybrid Classifier for Automatic Sleep Stage Scoring.HyCLASSS:一种用于自动睡眠阶段评分的混合分类器。
IEEE J Biomed Health Inform. 2018 Mar;22(2):375-385. doi: 10.1109/JBHI.2017.2668993. Epub 2017 Feb 17.
6
Detecting hospital-acquired infections: A document classification approach using support vector machines and gradient tree boosting.利用支持向量机和梯度提升树进行医院获得性感染检测:一种文档分类方法。
Health Informatics J. 2018 Mar;24(1):24-42. doi: 10.1177/1460458216656471. Epub 2016 Aug 4.
7
Genetic Determinants of Drug Resistance in Mycobacterium tuberculosis and Their Diagnostic Value.结核分枝杆菌耐药性的遗传决定因素及其诊断价值。
Am J Respir Crit Care Med. 2016 Sep 1;194(5):621-30. doi: 10.1164/rccm.201510-2091OC.
8
Whole-genome sequencing for prediction of Mycobacterium tuberculosis drug susceptibility and resistance: a retrospective cohort study.全基因组测序用于预测结核分枝杆菌药物敏感性和耐药性:一项回顾性队列研究。
Lancet Infect Dis. 2015 Oct;15(10):1193-1202. doi: 10.1016/S1473-3099(15)00062-6. Epub 2015 Jun 23.
9
Rapid determination of anti-tuberculosis drug resistance from whole-genome sequences.通过全基因组序列快速测定抗结核药物耐药性。
Genome Med. 2015 May 27;7(1):51. doi: 10.1186/s13073-015-0164-0. eCollection 2015.
10
Genome sequencing of 161 Mycobacterium tuberculosis isolates from China identifies genes and intergenic regions associated with drug resistance.中国 161 株结核分枝杆菌分离株的基因组测序鉴定出与耐药性相关的基因和基因间区域。
Nat Genet. 2013 Oct;45(10):1255-60. doi: 10.1038/ng.2735. Epub 2013 Sep 1.