Zhang Zhendong, Criscuolo Edward Robert, Hao Yao, McKeown Trevor, Yang Deshan
Department of Radiation Oncology, Duke University, Durham, North Carolina, USA.
Department of Radiation Oncology, Washington University School of Medicine, St. Louis, Missouri, USA.
Med Phys. 2025 Jan;52(1):703-715. doi: 10.1002/mp.17507. Epub 2024 Nov 6.
Evaluating deformable image registration (DIR) algorithms is vital for enhancing algorithm performance and gaining clinical acceptance. However, there is a notable lack of dependable DIR benchmark datasets for assessing DIR performance except for lung images. To address this gap, we aim to introduce our comprehensive liver computed tomography (CT) DIR landmark dataset library. This dataset is designed for efficient and quantitative evaluation of various DIR methods for liver CTs, paving the way for more accurate and reliable image registration techniques.
Forty CT liver image pairs were acquired from several publicly available image archives and authors' institutions under institutional review board (IRB) approval. The images were processed with a semi-automatic procedure to generate landmark pairs: (1) for each case, liver vessels were automatically segmented on one image; (2) landmarks were automatically detected at vessel bifurcations; (3) corresponding landmarks in the second image were placed using two deformable image registration methods to avoid algorithm-specific biases; (4) a comprehensive validation process based on quantitative evaluation and manual assessment was applied to reject outliers and ensure the landmarks' positional accuracy. This workflow resulted in an average of ∼56 landmark pairs per image pair, comprising a total of 2220 landmarks for 40 cases. The general landmarking accuracy of this procedure was evaluated using digital phantoms and manual landmark placement. The landmark pair target registration errors (TRE) on digital phantoms were 0.37 ± 0.26 and 0.55 ± 0.34 mm respectively for the two selected DIR algorithms used in our workflow, with 97% of landmark pairs having TREs below 1.5 mm. The distances from the calculated landmarks to the averaged manual placement were 1.27 ± 0.79 mm.
All data, including image files and landmark information, are publicly available at Zenodo (https://zenodo.org/records/13738577). Instructions for using our data can be found on our GitHub page at https://github.com/deshanyang/Liver-DIR-QA.
The landmark dataset generated in this work is the first collection of large-scale liver CT DIR landmarks prepared on real patient images. This dataset can provide researchers with a dense set of ground truth benchmarks for the quantitative evaluation of DIR algorithms within the liver.
评估可变形图像配准(DIR)算法对于提高算法性能和获得临床认可至关重要。然而,除了肺部图像外,用于评估DIR性能的可靠DIR基准数据集明显不足。为了填补这一空白,我们旨在引入我们全面的肝脏计算机断层扫描(CT)DIR地标数据集库。该数据集旨在对肝脏CT的各种DIR方法进行高效且定量的评估,为更准确可靠的图像配准技术铺平道路。
在机构审查委员会(IRB)批准下,从几个公开可用的图像存档和作者所在机构获取了40对CT肝脏图像。图像通过半自动程序进行处理以生成地标对:(1)对于每个病例,在一张图像上自动分割肝脏血管;(2)在血管分叉处自动检测地标;(3)使用两种可变形图像配准方法在第二张图像中放置相应地标,以避免特定于算法的偏差;(4)应用基于定量评估和人工评估的全面验证过程以排除异常值并确保地标的位置准确性。此工作流程导致每对图像平均约有56个地标对,40个病例总共2220个地标。使用数字模型和人工地标放置评估了此过程的一般地标准确性。对于我们工作流程中使用的两种选定DIR算法,数字模型上的地标对目标配准误差(TRE)分别为0.37±0.26和0.55±0.34毫米,97%的地标对TRE低于1.5毫米。计算出的地标到平均人工放置位置的距离为1.27±0.79毫米。
所有数据,包括图像文件和地标信息,均可在Zenodo(https://zenodo.org/records/13738577)上公开获取。使用我们数据的说明可在我们的GitHub页面https://github.com/deshanyang/Liver-DIR-QA上找到。
本研究中生成的地标数据集是首个基于真实患者图像准备的大规模肝脏CT DIR地标的集合。该数据集可为研究人员提供一组密集的地面真值基准,用于肝脏内DIR算法的定量评估。