Department of Statistics, ICMR - National Institute for Research in Tuberculosis, Chennai, India.
University of Madras, Chennai, India.
PLoS One. 2024 Oct 16;19(10):e0309151. doi: 10.1371/journal.pone.0309151. eCollection 2024.
Despite advancements in detection and treatment, tuberculosis (TB), an infectious illness caused by the Mycobacterium TB bacteria, continues to pose a serious threat to world health. The TB diagnosis phase includes a patient's medical history, physical examination, chest X-rays, and laboratory procedures, such as molecular testing and sputum culture. In artificial intelligence (AI), machine learning (ML) is an advanced study of statistical algorithms that can learn from historical data and generalize the results to unseen data. There are not many studies done on the ML algorithm that enables the prediction of treatment success for patients with pulmonary TB (PTB). The objective of this study is to identify an effective and predictive ML algorithm to evaluate the detection of treatment success in PTB patients and to compare the predictive performance of the ML models. In this retrospective study, a total of 1236 PTB patients who were given treatment under a randomized controlled clinical trial at the ICMR-National Institute for Research in Tuberculosis, Chennai, India were considered for data analysis. The multiple ML models were developed and tested to identify the best algorithm to predict the sputum culture conversion of TB patients during the treatment period. In this study, decision tree (DT), random forest (RF), support vector machine (SVM) and naïve bayes (NB) models were validated with high performance by achieving an area under the curve (AUC) of receiver operating characteristic (ROC) greater than 80%. The salient finding of the study is that the DT model was produced as a better algorithm with the highest accuracy (92.72%), an AUC (0.909), precision (95.90%), recall (95.60%) and F1-score (95.75%) among the ML models. This methodology may be used to study the precise ML model classification for predicting the treatment success of TB patients during the treatment period.
尽管在检测和治疗方面取得了进展,但由结核分枝杆菌引起的传染病肺结核 (TB) 仍然对世界健康构成严重威胁。TB 诊断阶段包括患者的病史、体检、胸部 X 光检查和实验室程序,如分子检测和痰培养。在人工智能 (AI) 中,机器学习 (ML) 是对统计算法的高级研究,这些算法可以从历史数据中学习,并将结果推广到未见数据。在用于预测肺结核 (PTB) 患者治疗成功的 ML 算法方面,研究并不多。本研究的目的是确定一种有效的、可预测的 ML 算法,以评估 PTB 患者治疗成功的检测,并比较 ML 模型的预测性能。在这项回顾性研究中,总共考虑了 1236 名在印度钦奈 ICMR-国家结核病研究所进行随机对照临床试验的 PTB 患者进行数据分析。开发和测试了多种 ML 模型,以确定最佳算法来预测治疗期间 TB 患者的痰培养转换。在这项研究中,决策树 (DT)、随机森林 (RF)、支持向量机 (SVM) 和朴素贝叶斯 (NB) 模型通过获得大于 80%的接收器工作特性 (ROC) 曲线下面积 (AUC) 来验证具有高性能。该研究的突出发现是,DT 模型作为一种更好的算法,具有最高的准确性 (92.72%)、AUC(0.909)、精度 (95.90%)、召回率 (95.60%) 和 F1 分数 (95.75%),在 ML 模型中。这种方法可用于研究用于预测治疗期间 TB 患者治疗成功的精确 ML 模型分类。