National Research Council of Italy - Institute for High Performance Computing and Networking (ICAR), Via P. Castellino 111, 80131 Naples, Italy.
National Research Council of Italy - Institute of Molecular Bioimaging and Physiology (IBFM), Via F.lli Cervi 93, 20090 Segrate, MI, Italy.
Artif Intell Med. 2017 Sep;81:41-53. doi: 10.1016/j.artmed.2017.03.004. Epub 2017 Mar 18.
Patients under radiotherapy for head-and-neck cancer often suffer of long-term xerostomia, and/or consistent shrinkage of parotid glands. In order to avoid these drawbacks, adaptive therapy can be planned for patients at risk, if the prediction is obtained timely, before or during the early phase of treatment. Artificial intelligence can address the problem, by learning from examples and building classification models. In particular, fuzzy logic has shown its suitability for medical applications, in order to manage uncertain data, and to build transparent rule-based classifiers. In previous works, clinical, dosimetric and image-based features were considered separately, to find different possible predictors of parotid shrinkage. On the other hand, a few works reported possible image-based predictors of xerostomia, while the combination of different types of features has been little addressed.
This paper proposes the application of a novel machine learning approach, based on both statistics and fuzzy logic, aimed at the classification of patients at risk of i) parotid gland shrinkage and ii) 12-months xerostomia. Both problems are addressed with the aim of individuating predictors and models to classify respective outcomes.
Knowledge is extracted from a real dataset of radiotherapy patients, by means of a recently developed method named Likelihood-Fuzzy Analysis, based on the representation of statistical information by fuzzy rule-based models. This method enables to manage heterogeneous variables and missing data, and to obtain interpretable fuzzy models presenting good generalization power (thus high performance), and to measure classification confidence. Numerous features are extracted to characterize patients, coming from different sources, i.e. clinical features, dosimetric parameters, and radiomics-based measures obtained by texture analysis of Computed Tomography images. A learning approach based on the composition of simple models in a more complicated one allows to consider the features separately, in order to identify predictors and models to use when only some data source is available, and obtaining more accurate results when more information can be combined.
Regarding parotid shrinkage, a number of good predictors is detected, some already known and confirmed here, and some others found here, in particular among radiomics-based features. A number of models are also designed, some using single features and others involving models composition to improve classification accuracy. In particular, the best model to be used at the initial treatment stage, and another one applicable at the half treatment stage are identified. Regarding 12-months toxicity, some possible predictors are detected, in particular among radiomics-based features. Moreover, the relation between final parotid shrinkage rate and 12-months xerostomia is evaluated. The method is compared to the naïve Bayes classifier, which reveals similar results in terms of classification accuracy and best predictors. The interpretable fuzzy rule-based models are explicitly presented, and the dependence between predictors and outcome is explained, thus furnishing in some cases helpful insights about the considered problems.
Thanks to the performance and interpretability of the fuzzy classification method employed, predictors of both parotid shrinkage and xerostomia are detected, and their influence on each outcome is revealed. Moreover, models for predicting parotid shrinkage at initial and half radiotherapy stages are found.
接受头颈部癌症放射治疗的患者常患有长期口干症和/或腮腺持续萎缩。为了避免这些缺点,如果能够及时预测,在治疗早期或早期就可以为有风险的患者计划适应性治疗。人工智能可以通过学习示例和构建分类模型来解决这个问题。特别是,模糊逻辑已显示出其在医疗应用中的适用性,以便管理不确定的数据,并构建基于规则的透明分类器。在之前的工作中,已经分别考虑了临床、剂量学和基于图像的特征,以寻找不同的可能的腮腺收缩预测因子。另一方面,已有一些报告可能与口干相关的基于图像的预测因子,而很少涉及不同类型特征的组合。
本文提出了一种新的机器学习方法的应用,该方法基于统计学和模糊逻辑,旨在对 i)腮腺收缩和 ii)12 个月口干风险的患者进行分类。这两个问题都旨在确定预测因子和模型,以分类各自的结果。
通过一种名为似然模糊分析的新方法从放射治疗患者的真实数据集提取知识,该方法基于模糊规则基模型表示统计信息。该方法能够管理异构变量和缺失数据,并获得可解释的模糊模型,具有良好的泛化能力(因此具有高性能),并可测量分类置信度。提取了大量特征来描述患者,这些特征来自不同的来源,即临床特征、剂量学参数以及通过计算机断层扫描图像的纹理分析获得的放射组学测量值。基于在更复杂的模型中组合简单模型的学习方法,可以分别考虑特征,以识别当只有一些数据源可用时使用的预测因子和模型,并在可以组合更多信息时获得更准确的结果。
关于腮腺收缩,检测到一些良好的预测因子,其中一些已经在此处得到证实,还有一些是在此处发现的,特别是在放射组学特征中。还设计了一些模型,其中一些使用单个特征,另一些则涉及模型组合以提高分类准确性。特别是,确定了在初始治疗阶段使用的最佳模型,以及在半治疗阶段适用的另一个模型。关于 12 个月毒性,检测到一些可能的预测因子,特别是在放射组学特征中。此外,还评估了最终腮腺收缩率与 12 个月口干之间的关系。该方法与朴素贝叶斯分类器进行了比较,后者在分类准确性和最佳预测因子方面显示出相似的结果。明确呈现了可解释的模糊规则基模型,并解释了预测因子与结果之间的关系,从而在某些情况下提供了有关所考虑问题的有用见解。
由于所采用的模糊分类方法的性能和可解释性,检测到了腮腺收缩和口干的预测因子,并揭示了它们对每个结果的影响。此外,还找到了预测初始和半放射治疗阶段腮腺收缩的模型。