Department of Statistics, Abdul Wali Khan University, Mardan, Pakistan.
Programmatic Management of Drug-Resistant Tuberculosis, Saidu Teaching Hospital, Swat, Pakistan.
J Healthc Eng. 2021 Aug 31;2021:2567080. doi: 10.1155/2021/2567080. eCollection 2021.
In this paper, we have focused on machine learning (ML) feature selection (FS) algorithms for identifying and diagnosing multidrug-resistant (MDR) tuberculosis (TB). MDR-TB is a universal public health problem, and its early detection has been one of the burning issues. The present study has been conducted in the Malakand Division of Khyber Pakhtunkhwa, Pakistan, to further add to the knowledge on the disease and to deal with the issues of identification and early detection of MDR-TB by ML algorithms. These models also identify the most important factors causing MDR-TB infection whose study gives additional insights into the matter. ML algorithms such as random forest, k-nearest neighbors, support vector machine, logistic regression, leaset absolute shrinkage and selection operator (LASSO), artificial neural networks (ANNs), and decision trees are applied to analyse the case-control dataset. This study reveals that close contacts of MDR-TB patients, smoking, depression, previous TB history, improper treatment, and interruption in first-line TB treatment have a great impact on the status of MDR. Accordingly, weight loss, chest pain, hemoptysis, and fatigue are important symptoms. Based on accuracy, sensitivity, and specificity, SVM and RF are the suggested models to be used for patients' classifications.
本文专注于机器学习 (ML) 特征选择 (FS) 算法,以识别和诊断耐多药结核病 (TB)。耐多药-TB 是一个普遍的公共卫生问题,其早期检测一直是一个热点问题。本研究在巴基斯坦开伯尔-普赫图赫瓦省的马尔坎德地区进行,旨在进一步增加对该疾病的了解,并通过 ML 算法解决耐多药-TB 的识别和早期检测问题。这些模型还确定了导致耐多药-TB 感染的最重要因素,对这些因素的研究提供了更多的见解。随机森林、k-最近邻、支持向量机、逻辑回归、最小绝对收缩和选择算子 (LASSO)、人工神经网络 (ANNs) 和决策树等 ML 算法被应用于分析病例对照数据集。这项研究表明,耐多药-TB 患者的密切接触者、吸烟、抑郁、既往 TB 病史、不适当的治疗以及一线 TB 治疗中断对耐多药状态有很大影响。相应地,体重减轻、胸痛、咯血和疲劳是重要症状。基于准确性、敏感性和特异性,SVM 和 RF 是建议用于患者分类的模型。