Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei, 106, Taiwan; Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei, 106, Taiwan; Translational Imaging Research Center, Taipei Medical University Hospital, Taipei, 110, Taiwan.
International Master/Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei, 110, Taiwan; Orthopedic and Trauma Department, Cho Ray Hospital, Ho Chi Minh City, 70000, Viet Nam.
Comput Biol Med. 2021 May;132:104320. doi: 10.1016/j.compbiomed.2021.104320. Epub 2021 Mar 9.
In the field of glioma, transcriptome subtypes have been considered as an important diagnostic and prognostic biomarker that may help improve the treatment efficacy. However, existing identification methods of transcriptome subtypes are limited due to the relatively long detection period, the unattainability of tumor specimens via biopsy or surgery, and the fleeting nature of intralesional heterogeneity. In search of a superior model over previous ones, this study evaluated the efficiency of eXtreme Gradient Boosting (XGBoost)-based radiomics model to classify transcriptome subtypes in glioblastoma patients.
This retrospective study retrieved patients from TCGA-GBM and IvyGAP cohorts with pathologically diagnosed glioblastoma, and separated them into different transcriptome subtypes groups. GBM patients were then segmented into three different regions of MRI: enhancement of the tumor core (ET), non-enhancing portion of the tumor core (NET), and peritumoral edema (ED). We subsequently used handcrafted radiomics features (n = 704) from multimodality MRI and two-level feature selection techniques (Spearman correlation and F-score tests) in order to find the features that could be relevant.
After the feature selection approach, we identified 13 radiomics features that were the most meaningful ones that can be used to reach the optimal results. With these features, our XGBoost model reached the predictive accuracies of 70.9%, 73.3%, 88.4%, and 88.4% for classical, mesenchymal, neural, and proneural subtypes, respectively. Our model performance has been improved in comparison with the other models as well as previous works on the same dataset.
The use of XGBoost and two-level feature selection analysis (Spearman correlation and F-score) could be expected as a potential combination for classifying transcriptome subtypes with high performance and might raise public attention for further research on radiomics-based GBM models.
在神经胶质瘤领域,转录组亚型被认为是一种重要的诊断和预后生物标志物,可以帮助提高治疗效果。然而,由于检测周期较长、通过活检或手术获得肿瘤标本不可行以及肿瘤内异质性转瞬即逝等原因,现有的转录组亚型识别方法存在一定的局限性。为了寻求优于以往方法的模型,本研究评估了基于极端梯度提升(XGBoost)的放射组学模型在胶质母细胞瘤患者中分类转录组亚型的效率。
本回顾性研究从 TCGA-GBM 和 IvyGAP 队列中检索了经病理诊断为胶质母细胞瘤的患者,并将其分为不同的转录组亚型组。然后,将 GBM 患者分为 MRI 的三个不同区域:肿瘤核心增强区(ET)、肿瘤核心非增强区(NET)和瘤周水肿区(ED)。我们随后使用多模态 MRI 的手工制作的放射组学特征(n=704)和两级特征选择技术(Spearman 相关性和 F 分数检验)来寻找可能相关的特征。
经过特征选择方法,我们确定了 13 个最有意义的放射组学特征,可以用来达到最佳结果。使用这些特征,我们的 XGBoost 模型对经典、间质、神经和前神经亚型的预测准确率分别达到了 70.9%、73.3%、88.4%和 88.4%。与其他模型以及同一数据集上的先前工作相比,我们的模型性能有所提高。
XGBoost 和两级特征选择分析(Spearman 相关性和 F 分数)的使用有望成为一种潜在的组合,用于高性能地分类转录组亚型,并可能引起人们对基于放射组学的 GBM 模型的进一步研究的关注。