Institute for Geoinformatics, University of Münster, Münster, Germany.
Institute for Computer Science, University of Münster, Münster, Germany.
Blood Adv. 2024 Jan 9;8(1):70-79. doi: 10.1182/bloodadvances.2023011076.
The detection of genetic aberrations is crucial for early therapy decisions in acute myeloid leukemia (AML) and recommended for all patients. Because genetic testing is expensive and time consuming, a need remains for cost-effective, fast, and broadly accessible tests to predict these aberrations in this aggressive malignancy. Here, we developed a novel fully automated end-to-end deep learning pipeline to predict genetic aberrations directly from single-cell images from scans of conventionally stained bone marrow smears already on the day of diagnosis. We used this pipeline to compile a multiterabyte data set of >2 000 000 single-cell images from diagnostic samples of 408 patients with AML. These images were then used to train convolutional neural networks for the prediction of various therapy-relevant genetic alterations. Moreover, we created a temporal test cohort data set of >444 000 single-cell images from further 71 patients with AML. We show that the models from our pipeline can significantly predict these subgroups with high areas under the curve of the receiver operating characteristic. Potential genotype-phenotype links were visualized with 2 different strategies. Our pipeline holds the potential to be used as a fast and inexpensive automated tool to screen patients with AML for therapy-relevant genetic aberrations directly from routine, conventionally stained bone marrow smears already on the day of diagnosis. It also creates a foundation to develop similar approaches for other bone marrow disorders in the future.
检测遗传异常对于急性髓细胞白血病(AML)的早期治疗决策至关重要,建议所有患者进行检测。由于基因检测既昂贵又耗时,因此仍然需要具有成本效益、快速且广泛适用的检测方法,以便在这种侵袭性恶性肿瘤中预测这些异常。在这里,我们开发了一种新颖的全自动端到端深度学习管道,可直接从常规染色骨髓涂片扫描的单细胞图像中预测遗传异常,这些图像是在诊断当天获得的。我们使用该管道编译了一个包含超过 200 万个单细胞图像的多 TB 数据集,这些图像来自 408 名 AML 患者的诊断样本。然后,我们使用这些图像来训练卷积神经网络,以预测各种与治疗相关的遗传改变。此外,我们还创建了一个包含来自另外 71 名 AML 患者的>444000 个单细胞图像的临时测试队列数据集。我们表明,我们管道中的模型可以非常显著地预测这些亚组,其接收者操作特征曲线下的面积很高。通过 2 种不同的策略可视化了潜在的基因型-表型联系。我们的管道有可能成为一种快速且廉价的自动化工具,直接从诊断当天常规染色的骨髓涂片筛查 AML 患者的治疗相关遗传异常。它还为未来开发其他骨髓疾病的类似方法奠定了基础。