Virgolin Marco, Wang Ziyuan, Alderliesten Tanja, Bosman Peter A N
Centrum Wiskunde and Informatica, Life Sciences and Health Group, Amsterdam, The Netherlands.
Amsterdam UMC, University of Amsterdam, Department of Radiation Oncology, Amsterdam, The Netherlands.
J Med Imaging (Bellingham). 2020 Jul;7(4):046501. doi: 10.1117/1.JMI.7.4.046501. Epub 2020 Jul 30.
Current phantoms used for the dose reconstruction of long-term childhood cancer survivors lack individualization. We design a method to predict highly individualized abdominal three-dimensional (3-D) phantoms automatically. We train machine learning (ML) models to map (2-D) patient features to 3-D organ-at-risk (OAR) metrics upon a database of 60 pediatric abdominal computed tomographies with liver and spleen segmentations. Next, we use the models in an automatic pipeline that outputs a personalized phantom given the patient's features, by assembling 3-D imaging from the database. A step to improve phantom realism (i.e., avoid OAR overlap) is included. We compare five ML algorithms, in terms of predicting OAR left-right (LR), anterior-posterior (AP), inferior-superior (IS) positions, and surface Dice-Sørensen coefficient (sDSC). Furthermore, two existing human-designed phantom construction criteria and two additional control methods are investigated for comparison. Different ML algorithms result in similar test mean absolute errors: for liver LR, IS, and spleen AP, IS; for liver AP and spleen LR; for abdomen sDSC; and to 65% for liver and spleen sDSC. One ML algorithm (GP-GOMEA) significantly performs the best for 6/9 metrics. The control methods and the human-designed criteria in particular perform generally worse, sometimes substantially ( error for spleen IS, sDSC for liver). The automatic step to improve realism generally results in limited metric accuracy loss, but fails in one case (out of 60). Our ML-based pipeline leads to phantoms that are significantly and substantially more individualized than currently used human-designed criteria.
目前用于长期儿童癌症幸存者剂量重建的体模缺乏个性化。我们设计了一种方法来自动预测高度个性化的腹部三维(3-D)体模。我们在一个包含60例儿科腹部计算机断层扫描以及肝脏和脾脏分割的数据库上训练机器学习(ML)模型,以将(2-D)患者特征映射到三维危及器官(OAR)指标。接下来,我们在一个自动流程中使用这些模型,通过从数据库中组装三维图像,根据患者特征输出个性化体模。其中包括一个提高体模逼真度的步骤(即避免OAR重叠)。我们比较了五种ML算法在预测OAR左右(LR)、前后(AP)、上下(IS)位置以及表面骰子-索伦森系数(sDSC)方面的表现。此外,还研究了两种现有的人工设计体模构建标准和两种额外的对照方法以作比较。不同的ML算法导致相似的测试平均绝对误差:对于肝脏的LR、IS以及脾脏的AP、IS;对于肝脏的AP和脾脏的LR;对于腹部的sDSC;对于肝脏和脾脏的sDSC为65%。一种ML算法(GP-GOMEA)在9个指标中的6个上表现显著最佳。对照方法和人工设计的标准总体表现通常较差,有时差距很大(脾脏IS的误差,肝脏sDSC的误差)。提高逼真度的自动步骤通常导致有限的指标精度损失,但在一个案例中(共60个案例)失败。我们基于ML的流程生成的体模比目前使用人工设计的标准生成的体模在个性化程度上有显著且实质性的提高。