Multinational Organization Network Sponsoring Translational and Epidemiological Research (MONSTER) Initiative, Salvador, Brazil.
Laboratório de Análise e Visualização de Dados, Fundação Oswaldo Cruz, Porto Velho, Brazil.
BMC Public Health. 2024 May 23;24(1):1385. doi: 10.1186/s12889-024-18815-0.
Identifying patients at increased risk of loss to follow-up (LTFU) is key to developing strategies to optimize the clinical management of tuberculosis (TB). The use of national registry data in prediction models may be a useful tool to inform healthcare workers about risk of LTFU. Here we developed a score to predict the risk of LTFU during anti-TB treatment (ATT) in a nationwide cohort of cases using clinical data reported to the Brazilian Notifiable Disease Information System (SINAN).
We performed a retrospective study of all TB cases reported to SINAN between 2015 and 2022; excluding children (< 18 years-old), vulnerable groups or drug-resistant TB. For the score, data before treatment initiation were used. We trained and internally validated three different prediction scoring systems, based on Logistic Regression, Random Forest, and Light Gradient Boosting. Before applying our models we splitted our data into training (~ 80% data) and test (~ 20%) sets, and then compared the model metrics using the test data set.
Of the 243,726 cases included, 41,373 experienced LTFU whereas 202,353 were successfully treated. The groups were different with regards to several clinical and sociodemographic characteristics. The directly observed treatment (DOT) was unbalanced between the groups with lower prevalence in those who were LTFU. Three models were developed to predict LTFU using 8 features (prior TB, drug use, age, sex, HIV infection and schooling level) with different score composition approaches. Those prediction scoring systems exhibited an area under the curve (AUC) ranging between 0.71 and 0.72. The Light Gradient Boosting technique resulted in the best prediction performance, weighting specificity and sensitivity. A user-friendly web calculator app was developed ( https://tbprediction.herokuapp.com/ ) to facilitate implementation.
Our nationwide risk score predicts the risk of LTFU during ATT in Brazilian adults prior to treatment commencement utilizing schooling level, sex, age, prior TB status, and substance use (drug, alcohol, and/or tobacco). This is a potential tool to assist in decision-making strategies to guide resource allocation, DOT indications, and improve TB treatment adherence.
识别失访(LTFU)风险增加的患者是制定策略优化结核病(TB)临床管理的关键。利用国家登记数据建立预测模型可能是向卫生保健工作者告知 LTFU 风险的有用工具。在这里,我们使用向巴西传染病报告系统(SINAN)报告的临床数据,在全国范围内的病例队列中开发了一种预测抗结核治疗(ATT)期间 LTFU 风险的评分。
我们对 2015 年至 2022 年期间向 SINAN 报告的所有结核病病例进行了回顾性研究;排除儿童(<18 岁)、弱势群体或耐多药结核病。对于评分,使用治疗开始前的数据。我们基于逻辑回归、随机森林和轻梯度提升训练和内部验证了三种不同的预测评分系统。在应用我们的模型之前,我们将数据分为训练(80%的数据)和测试(20%)集,然后使用测试数据集比较模型指标。
在纳入的 243726 例病例中,41373 例发生 LTFU,202353 例成功治疗。这两组在几个临床和社会人口学特征方面存在差异。在失访组中,直接观察治疗(DOT)的比例较低。使用 8 个特征(既往结核病、药物使用、年龄、性别、HIV 感染和受教育程度)和不同的评分组成方法开发了三种预测 LTFU 的模型。这些预测评分系统的曲线下面积(AUC)在 0.71 至 0.72 之间。Light Gradient Boosting 技术产生了最佳的预测性能,权衡了特异性和敏感性。开发了一个用户友好的网络计算器应用程序(https://tbprediction.herokuapp.com/),以方便实施。
我们的全国风险评分利用受教育程度、性别、年龄、既往结核病状态和药物使用(药物、酒精和/或烟草),预测巴西成年人 ATT 前开始治疗时的 LTFU 风险。这是一种潜在的工具,可以帮助制定决策策略,指导资源分配、DOT 指示和提高结核病治疗依从性。