Department of Radiation Oncology, University of Michigan, Ann Arbor, MI, 48103, USA.
Department of Radiation Oncology, Virginia Commonwealth University, Richmond, VA, 23298, USA.
Med Phys. 2019 Apr;46(4):1914-1921. doi: 10.1002/mp.13433. Epub 2019 Mar 4.
Developing automated methods to identify task-driven quality assurance (QA) procedures is key toward increasing safety, efficacy, and efficiency. We investigate the use of machine learning (ML) methods for possible visualization, automation, and targeting of QA, and assess its performance using multi-institutional data.
To enable automated analysis of QA data given its higher dimensional nature, we used nonlinear kernel mapping with support vector data description (SVDD) driven approaches. Instead of using labeled data as in typical support vector machine (SVM) applications, which requires exhaustive annotation, we applied a clustering extension of SVDD, which identifies the minimal enclosing hypersphere in the feature space defined by a kernel function separating normal operations from possible failures (i.e., outliers). In our case, QA test data are mapped by a Gaussian kernel to a higher dimensional feature space and then the minimal enclosing sphere was identified. This sphere, when mapped back to the input data space along the principal components, can separate the data into several components, each enclosing a separate cluster of QA points that could be used to evaluate tolerance boundaries and test reliability. We evaluated this approach for gantry sag, radiation field shift, and [multileaf collimator (MLC)] offset data acquired using electronic portal imaging devices (EPID), as representative examples.
Data from eight LINACS and seven institutions (n = 119) were collected. A standardized EPID image of a phantom with fiducials provided deviation estimates between the radiation field and phantom center at four cardinal gantry angles. Deviation measurements in the horizontal direction (0°, 180°) were used to determine the gantry sag and deviations in the vertical direction (90°, 270°) were used to determine the field shift. These measurements were fed into the SVDD clustering algorithm with varying hypersphere radii (Gaussian widths). For gantry sag analysis, two clusters were identified one of which contained 2.5% of the outliers and also exceeded the 1 mm tolerance set by TG-142. In the case of field shifts, SVM clustering identified two distinct classes of measurements primarily driven by variations in the second principal component at 270°. Results from MLC analysis identified one outlier cluster (0.34%) along Leaf offset Constancy (LoC) axis that coincided with TG-142 limits.
Machine learning methods based on SVDD clustering are promising for developing automated QA tools and providing insights into their reliability and reproducibility.
开发用于识别以任务为驱动的质量保证(QA)程序的自动化方法是提高安全性、有效性和效率的关键。我们研究了使用机器学习(ML)方法进行 QA 的可视化、自动化和靶向的可能性,并使用多机构数据评估其性能。
为了能够对 QA 数据进行自动化分析,考虑到其具有更高的维度性质,我们使用非线性核映射和支持向量数据描述(SVDD)驱动方法。与典型的支持向量机(SVM)应用程序中使用标记数据不同,这需要详尽的注释,我们应用了 SVDD 的聚类扩展,该扩展在由核函数定义的特征空间中识别最小包围超球面,该核函数将正常操作与可能的故障(即异常值)分开。在我们的情况下,QA 测试数据通过高斯核映射到更高维的特征空间,然后确定最小包围球。这个球,沿着主成分映射回输入数据空间,可以将数据分成几个部分,每个部分都包含一个单独的 QA 点簇,可以用来评估容差边界和测试可靠性。我们使用电子门户成像设备(EPID)获取的龙门架下垂、辐射场移位和[多叶准直器(MLC)]偏移数据评估了这种方法,作为代表性示例。
收集了来自八个 LINAC 和七个机构的数据(n=119)。带有基准的模体的标准化 EPID 图像提供了在四个基本龙门角度下辐射场和模体中心之间的偏差估计。水平方向(0°,180°)的偏差测量用于确定龙门架下垂,垂直方向(90°,270°)的偏差测量用于确定场位移。这些测量值被馈入具有不同超球半径(高斯宽度)的 SVDD 聚类算法。对于龙门架下垂分析,确定了两个聚类,其中一个聚类包含 2.5%的异常值,并且也超过了 TG-142 设置的 1 毫米容差。在场位移的情况下,SVM 聚类主要由第二个主成分在 270°的变化驱动,确定了两个不同的测量类别。MLC 分析的结果确定了一个离群点聚类(0.34%)沿叶偏移恒定性(LoC)轴,与 TG-142 限制一致。
基于 SVDD 聚类的机器学习方法对于开发自动化 QA 工具并深入了解其可靠性和可重复性具有广阔的前景。