Rodrigues Moreno M S, Barreto-Duarte Beatriz, Vinhaes Caian L, Araújo-Pereira Mariana, Fukutani Eduardo R, Bergamaschi Keityane Bone, Kristki Afrânio, Cordeiro-Santos Marcelo, Rolla Valeria C, Sterling Timothy R, Queiroz Artur T L, Andrade Bruno B
Fundação Oswaldo Cruz.
Universidade Salvador.
Res Sq. 2023 Dec 11:rs.3.rs-3706875. doi: 10.21203/rs.3.rs-3706875/v1.
Identifying patients at increased risk of loss to follow-up (LTFU) is key to developing strategies to optimize the clinical management of tuberculosis (TB). The use of national registry data in prediction models may be a useful tool to inform healthcare workers about risk of LTFU. Here we developed a score to predict the risk of LTFU during anti-TB treatment (ATT) in a nationwide cohort of cases using clinical data reported to the Brazilian Notifiable Disease Information System (SINAN).
We performed a retrospective study of all TB cases reported to SINAN between 2015-2022; excluding children (<18 years-old), vulnerable groups or drug-resistant TB. For the score, data before treatment initiation were used. We trained and internally validated three different prediction scoring systems, based on Logistic Regression, Random Forest, and Light Gradient Boosting. Before applying our models we split our data into train (80% data) and test (20%), and then we compare model metrics using a test data set.
Of the 243,726 cases included, 41,373 experienced LTFU whereas 202,353 were successfully treated and cured. The groups were different with regards to several clinical and sociodemographic characteristics. The directly observed treatment (DOT) was unbalanced between the groups with lower prevalence in those who were LTFU. Three models were developed to predict LTFU using 8 features (prior TB, drug use, age, sex, HIV infection and schooling level) with different score composition approaches. Those prediction scoring system exhibited an area under the curve (AUC) ranging between 0.71 and 0.72. The Light Gradient Boosting technique resulted in the best prediction performance, weighting specificity, and sensibility. A user-friendly web calculator app was created (https://tbprediction.herokuapp.com/) to facilitate implementation.
Our nationwide risk score predicts the risk of LTFU during ATT in Brazilian adults prior to treatment commencement. This is a potential tool to assist in decision-making strategies to guide resource allocation, DOT indications, and improve TB treatment adherence.
识别失访风险增加的患者是制定优化结核病(TB)临床管理策略的关键。在预测模型中使用国家登记数据可能是一种有用的工具,可让医护人员了解失访风险。在此,我们利用报告给巴西法定传染病信息系统(SINAN)的临床数据,开发了一个评分系统,以预测全国范围内结核病病例队列在抗结核治疗(ATT)期间的失访风险。
我们对2015年至2022年间报告给SINAN的所有结核病病例进行了回顾性研究;排除儿童(<18岁)、弱势群体或耐药结核病患者。对于该评分系统,使用治疗开始前的数据。我们基于逻辑回归、随机森林和轻梯度提升训练并内部验证了三种不同的预测评分系统。在应用我们的模型之前,我们将数据分为训练集(约80%的数据)和测试集(约20%),然后使用测试数据集比较模型指标。
在纳入的243,726例病例中,41,373例失访,而有202,353例成功治疗并治愈。两组在几个临床和社会人口学特征方面存在差异。直接观察治疗(DOT)在两组之间不均衡,失访者中患病率较低。利用8个特征(既往结核病、药物使用、年龄、性别、HIV感染和受教育程度),采用不同的评分组成方法,开发了三个模型来预测失访。这些预测评分系统的曲线下面积(AUC)在0.71至0.72之间。轻梯度提升技术产生了最佳的预测性能、加权特异性和敏感性。创建了一个用户友好的网络计算器应用程序(https://tbprediction.herokuapp.com/)以促进实施。
我们的全国风险评分可在治疗开始前预测巴西成年人在抗结核治疗期间的失访风险。这是一种潜在的工具,可协助制定决策策略,以指导资源分配、DOT指征并提高结核病治疗依从性。