Department of Clinical Hematology, Hospices Civils de Lyon, Hôpital Lyon Sud, Lyon, France; International Center for Infectiology Research, Inserm U1111, Lyon, France.
Department of Clinical Hematology, Hospices Civils de Lyon, Hôpital Lyon Sud, Lyon, France.
Lancet Digit Health. 2024 May;6(5):e323-e333. doi: 10.1016/S2589-7500(24)00044-X.
Acute leukaemias are life-threatening haematological cancers characterised by the infiltration of transformed immature haematopoietic cells in the blood and bone marrow. Prompt and accurate diagnosis of the three main acute leukaemia subtypes (ie acute lymphocytic leukaemia [ALL], acute myeloid leukaemia [AML], and acute promyelocytic leukaemia [APL]) is of utmost importance to guide initial treatment and prevent early mortality but requires cytological expertise that is not always available. We aimed to benchmark different machine-learning strategies using a custom variable selection algorithm to propose an extreme gradient boosting model to predict leukaemia subtypes on the basis of routine laboratory parameters.
This multicentre model development and validation study was conducted with data from six independent French university hospital databases. Patients aged 18 years or older diagnosed with AML, APL, or ALL in any one of these six hospital databases between March 1, 2012, and Dec 31, 2021, were recruited. 22 routine parameters were collected at the time of initial disease evaluation; variables with more than 25% of missing values in two datasets were not used for model training, leading to the final inclusion of 19 parameters. The performances of the final model were evaluated on internal testing and external validation sets with area under the receiver operating characteristic curves (AUCs), and clinically relevant cutoffs were chosen to guide clinical decision making. The final tool, Artificial Intelligence Prediction of Acute Leukemia (AI-PAL), was developed from this model.
1410 patients diagnosed with AML, APL, or ALL were included. Data quality control showed few missing values for each cohort, with the exception of uric acid and lactate dehydrogenase for the cohort from Hôpital Cochin. 679 patients from Hôpital Lyon Sud and Centre Hospitalier Universitaire de Clermont-Ferrand were split into the training (n=477) and internal testing (n=202) sets. 731 patients from the four other cohorts were used for external validation. Overall AUCs across all validation cohorts were 0·97 (95% CI 0·95-0·99) for APL, 0·90 (0·83-0·97) for ALL, and 0·89 (0·82-0·95) for AML. Cutoffs were then established on the overall cohort of 1410 patients to guide clinical decisions. Confident cutoffs showed two (0·14%) wrong predictions for ALL, four (0·28%) wrong predictions for APL, and three (0·21%) wrong predictions for AML. Use of the overall cutoff greatly reduced the number of missing predictions; diagnosis was proposed for 1375 (97·5%) of 1410 patients for each category, with only a slight increase in wrong predictions. The final model evaluation across both the internal testing and external validation sets showed accuracy of 99·5% for ALL diagnosis, 98·8% for AML diagnosis, and 99·7% for APL diagnosis in the confident model and accuracy of 87·9% for ALL diagnosis, 86·3% for AML diagnosis, and 96·1% for APL diagnosis in the overall model.
AI-PAL allowed for accurate diagnosis of the three main acute leukaemia subtypes. Based on ten simple laboratory parameters, its broad availability could help guide initial therapies in a context where cytological expertise is lacking, such as in low-income countries.
None.
急性白血病是危及生命的血液系统恶性肿瘤,其特征是不成熟的造血细胞在血液和骨髓中浸润。准确快速地诊断三种主要的急性白血病亚型(即急性淋巴细胞白血病[ALL]、急性髓细胞白血病[AML]和急性早幼粒细胞白血病[APL])对于指导初始治疗和预防早期死亡至关重要,但需要细胞学专业知识,而这种专业知识并不总是能够获得。我们旨在使用定制的变量选择算法对不同的机器学习策略进行基准测试,提出一种极端梯度提升模型,根据常规实验室参数预测白血病亚型。
这项多中心模型开发和验证研究纳入了来自法国六所大学附属医院的六个独立数据库的数据。在 2012 年 3 月 1 日至 2021 年 12 月 31 日期间,在这六家医院中任何一家被诊断为 AML、APL 或 ALL 的年龄在 18 岁及以上的患者均被纳入研究。在初始疾病评估时收集了 22 个常规参数;在两个数据集中有超过 25%缺失值的变量未用于模型训练,因此最终纳入了 19 个参数。使用内部测试和外部验证集的受试者工作特征曲线(AUC)评估最终模型的性能,并选择了有临床意义的截断值来指导临床决策。最终工具,即人工智能预测急性白血病(AI-PAL),就是从这个模型中开发出来的。
共纳入了 1410 名被诊断为 AML、APL 或 ALL 的患者。数据质量控制显示,每个队列的缺失值都很少,除了 Hôpital Cochin 队列中的尿酸和乳酸脱氢酶。里昂南大学医院和克莱蒙费朗中心医院的 679 名患者被分为训练集(n=477)和内部测试集(n=202)。其他四个队列的 731 名患者用于外部验证。所有验证队列的总体 AUC 为 APL 0·97(95%CI 0·95-0·99)、ALL 0·90(0·83-0·97)和 AML 0·89(0·82-0·95)。然后在 1410 名患者的总队列中建立了截断值以指导临床决策。置信截断值显示 ALL 有 2 个(0·14%)错误预测,APL 有 4 个(0·28%)错误预测,AML 有 3 个(0·21%)错误预测。使用总体截断值大大减少了错误预测的数量;对于每个类别,1410 名患者中的 1375 名(97.5%)都提出了诊断建议,仅略有增加错误预测。在内部测试和外部验证集的最终模型评估中,ALL 诊断的准确率为 99.5%,AML 诊断的准确率为 98.8%,APL 诊断的准确率为 99.7%,在置信模型中,ALL 诊断的准确率为 87.9%,AML 诊断的准确率为 86.3%,APL 诊断的准确率为 96.1%。
AI-PAL 能够准确诊断三种主要的急性白血病亚型。基于十个简单的实验室参数,其广泛的可用性可以帮助指导缺乏细胞学专业知识的情况下的初始治疗,例如在低收入国家。
无。