Kiser Kendall J, Ahmed Sara, Stieb Sonja, Mohamed Abdallah S R, Elhalawani Hesham, Park Peter Y S, Doyle Nathan S, Wang Brandon J, Barman Arko, Li Zhao, Zheng W Jim, Fuller Clifton D, Giancardo Luca
John P. and Kathrine G. McGovern Medical School, Houston, TX, USA.
Center for Precision Health, UTHealth School of Biomedical Informatics, Houston, TX, USA.
Med Phys. 2020 Nov;47(11):5941-5952. doi: 10.1002/mp.14424. Epub 2020 Aug 28.
This manuscript describes a dataset of thoracic cavity segmentations and discrete pleural effusion segmentations we have annotated on 402 computed tomography (CT) scans acquired from patients with non-small cell lung cancer. The segmentation of these anatomic regions precedes fundamental tasks in image analysis pipelines such as lung structure segmentation, lesion detection, and radiomics feature extraction. Bilateral thoracic cavity volumes and pleural effusion volumes were manually segmented on CT scans acquired from The Cancer Imaging Archive "NSCLC Radiomics" data collection. Four hundred and two thoracic segmentations were first generated automatically by a U-Net based algorithm trained on chest CTs without cancer, manually corrected by a medical student to include the complete thoracic cavity (normal, pathologic, and atelectatic lung parenchyma, lung hilum, pleural effusion, fibrosis, nodules, tumor, and other anatomic anomalies), and revised by a radiation oncologist or a radiologist. Seventy-eight pleural effusions were manually segmented by a medical student and revised by a radiologist or radiation oncologist. Interobserver agreement between the radiation oncologist and radiologist corrections was acceptable. All expert-vetted segmentations are publicly available in NIfTI format through The Cancer Imaging Archive at https://doi.org/10.7937/tcia.2020.6c7y-gq39. Tabular data detailing clinical and technical metadata linked to segmentation cases are also available. Thoracic cavity segmentations will be valuable for developing image analysis pipelines on pathologic lungs - where current automated algorithms struggle most. In conjunction with gross tumor volume segmentations already available from "NSCLC Radiomics," pleural effusion segmentations may be valuable for investigating radiomics profile differences between effusion and primary tumor or training algorithms to discriminate between them.
本手稿描述了一个数据集,该数据集包含我们对从非小细胞肺癌患者获取的402份计算机断层扫描(CT)图像上标注的胸腔分割和离散胸腔积液分割。这些解剖区域的分割是图像分析流程中诸如肺结构分割、病变检测和放射组学特征提取等基础任务的前提。双侧胸腔体积和胸腔积液体积是在从癌症影像存档库“NSCLC放射组学”数据集中获取的CT扫描图像上手动分割的。首先通过基于U-Net算法在无癌胸部CT上进行训练自动生成402个胸腔分割,然后由一名医学生进行手动校正以纳入完整的胸腔(正常、病理和肺不张的肺实质、肺门、胸腔积液、纤维化、结节、肿瘤及其他解剖异常),最后由放射肿瘤学家或放射科医生进行修订。78例胸腔积液由一名医学生手动分割,并由放射科医生或放射肿瘤学家进行修订。放射肿瘤学家和放射科医生校正之间的观察者间一致性是可接受的。所有经过专家审核的分割结果以NIfTI格式通过癌症影像存档库在https://doi.org/10.7937/tcia.2020.6c7y-gq39上公开提供。还提供了详细的表格数据,这些数据详细说明了与分割病例相关的临床和技术元数据。胸腔分割对于在病理性肺部开发图像分析流程将具有重要价值,而目前的自动化算法在病理性肺部面临的挑战最大。结合“NSCLC放射组学”中已有的大体肿瘤体积分割,胸腔积液分割对于研究积液与原发性肿瘤之间的放射组学特征差异或训练算法以区分它们可能具有重要价值。