Department of Physics and Astronomy, University of Calgary, 2500 University Dr NW, Calgary, Alberta, T2N 1N4, Canada. Department of Medical Physics, Tom Baker Cancer Centre, 1331 29 St NW, Calgary, Alberta, T2N 4N2, Canada. Author to whom any correspondence should be addressed.
Phys Med Biol. 2020 Mar 6;65(5):055014. doi: 10.1088/1361-6560/ab6e54.
Algorithm benchmarking and characterization are an important part of algorithm development and validation prior to clinical implementation. However, benchmarking may be limited to a small collection of test cases because establishing 'ground-truth' references is resource-intensive. This study proposes a framework for selecting test cases to assess algorithm and workflow equivalence. Effective test case selection may minimize the number of ground-truth comparisons required to establish robust and clinically relevant benchmarking and characterization results. To demonstrate the proposed framework, we clustered differences between two independent workflows estimating during-treatment dose objective violations for 15 head and neck cancer patients (15 planning CTs, 105 on-unit CBCTs). Each workflow used a different deformable image registration algorithm to estimate inter-fractional anatomy and contour changes. The Hopkins statistic tested whether workflow output was inherently clustered, and k-medoid clustering formalized cluster assignment. Further statistical analyses verified the relevance of clusters to algorithm output. Data at cluster centers ('medoids') were considered candidate test cases representative of workflow-relevant algorithm differences. The framework indicated that differences in estimated dose objective violations were naturally grouped (Hopkins = 0.75, providing 90% confidence). K-medoid clustering identified five clusters that stratified workflow differences (MANOVA: p < 0.001) in estimated parotid gland D50%, spinal cord/brainstem Dmax, and high-dose CTV coverage dose violations (Kendall's tau: p < 0.05). The systematic algorithm differences driving these workflow discrepancies were, respectively: parotid gland volumes (ANOVA: p < 0.001), external contour deformations (t-test: p = 0.022), and CTV-to-PTV margins (t-test: p = 0.009). Five candidate test cases were verified as representative of the five clusters.
The framework successfully clustered workflow outputs and identified five test cases representative of clinically relevant algorithm discrepancies. This approach may improve the allocation of resources during the benchmarking and characterization process and the applicability of results to clinical data.
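The statistical core of the framework, testing for clustering tendency with the Hopkins statistic and then selecting cluster centers ('medoids') as candidate test cases, can be sketched in Python. This is a minimal NumPy-only illustration, not the authors' implementation; the synthetic data, sample sizes, and the simple alternating (PAM-style) k-medoids update are assumptions for demonstration.

```python
import numpy as np

def hopkins(X, m=None, seed=None):
    """Hopkins statistic for clustering tendency.
    Values near 1 suggest clustered data; ~0.5 suggests spatial randomness."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    m = m if m is not None else max(1, n // 10)
    # u: nearest-data-point distances from m uniform points in the bounding box
    U = rng.uniform(X.min(axis=0), X.max(axis=0), size=(m, d))
    u = np.linalg.norm(U[:, None, :] - X[None, :, :], axis=2).min(axis=1)
    # w: nearest-neighbour distances from m sampled data points (self excluded)
    idx = rng.choice(n, size=m, replace=False)
    D = np.linalg.norm(X[idx][:, None, :] - X[None, :, :], axis=2)
    D[np.arange(m), idx] = np.inf  # mask each sampled point's zero self-distance
    w = D.min(axis=1)
    return u.sum() / (u.sum() + w.sum())

def k_medoids(X, k, n_iter=100, seed=None):
    """Simple alternating k-medoids; returns medoid indices and cluster labels."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)  # pairwise distances
    medoids = rng.choice(n, size=k, replace=False)
    for _ in range(n_iter):
        labels = D[:, medoids].argmin(axis=1)  # assign points to nearest medoid
        new_medoids = medoids.copy()
        for c in range(k):
            members = np.where(labels == c)[0]
            if members.size:  # new medoid: member minimizing in-cluster distance sum
                new_medoids[c] = members[D[np.ix_(members, members)].sum(axis=0).argmin()]
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    return medoids, D[:, medoids].argmin(axis=1)

# Synthetic, clearly clustered data (an assumption for illustration only)
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.1, (30, 2)), rng.normal(5, 0.1, (30, 2))])
print(hopkins(X, seed=1))               # near 1: data show clustering tendency
medoids, labels = k_medoids(X, k=2, seed=1)
# The rows X[medoids] would serve as the candidate test cases.
```

In the study's setting, the rows of `X` would instead be per-fraction workflow-difference features (e.g. differences in estimated dose objective violations), and the medoids returned would nominate the representative test cases for ground-truth comparison.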