Mayo Charles S, Mierzwa Michelle, Moran Jean M, Matuszak Martha M, Wilkie Joel, Sun Grace, Yao John, Weyburn Grant, Anderson Carlos J, Owen Dawn, Rao Arvind
Department of Radiation Oncology, University of Michigan, Ann Arbor, Michigan.
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan.
Adv Radiat Oncol. 2020 Jan 12;5(6):1296-1304. doi: 10.1016/j.adro.2019.12.007. eCollection 2020 Nov-Dec.
We combined clinical practice changes, standardizations, and technology to automate aggregation, integration, and harmonization of comprehensive patient data from the multiple source systems used in clinical practice into a big data analytics resource system (BDARS). We then developed novel artificial intelligence algorithms, coupled with the BDARS, to identify structure dose volume histograms (DVH) metrics associated with dysphagia.
From the BDARS harmonized data of ≥22,000 patients, we identified 132 patients recently treated for head and neck cancer who also demonstrated dysphagia scores that worsened from base line to a maximum grade ≥2. We developed a method that used both physical and biologically corrected (α/β = 2.5) DVH curves to test both absolute and percentage volume based DVH metrics. Combining a statistical categorization algorithm with machine learning (SCA-ML) provided more extensive detailing of response threshold evidence than either approach alone. A sensitivity guided, minimum input, machine learning (ML) model was iteratively constructed to identify the key structure DVH metric thresholds.
Seven swallowing structures producing 738 candidate DVH metrics were ranked for association with dysphagia using SCA-ML scoring. Structures included superior pharyngeal constrictor (SPC), inferior pharyngeal constrictor (IPC), larynx, and esophagus. Bilateral parotid and submandibular gland (SG) structures were categorized by relative mean dose (eg, SG_high, SG_low) as a dose versus tumor centric analog to contra and ipsilateral designations. Structure DVH metrics with high SCA-ML scores included the following: SPC: D20% (equivalent dose [EQD2] Gy) ≥47.7; SPC: D25% (Gy) ≥50.4; IPC: D35% (Gy) ≥61.7; parotid_low: D60% (Gy) ≥13.2; and SG_high: D35% (Gy) ≥61.7. Larynx: D25% (Gy) ≥21.2 and SG_low: D45% ≥28.2 had high SCA-ML scores but were segmented on less than 90% of plans. A model based on SPC: D20% (EQD2 Gy) alone had sensitivity and area under the curve of 0.88 ± 0.13 and 0.74 ± 0.17, respectively.
This study provides practical demonstration of combining big data with artificial intelligence to increase volume of evidence in clinical learning paradigms.
我们结合临床实践变革、标准化和技术,将临床实践中使用的多个源系统的综合患者数据进行自动化聚合、整合与协调,纳入一个大数据分析资源系统(BDARS)。然后,我们开发了新颖的人工智能算法,并与BDARS相结合,以识别与吞咽困难相关的结构剂量体积直方图(DVH)指标。
从BDARS中≥22,000名患者的协调数据中,我们识别出132名近期接受头颈癌治疗的患者,他们的吞咽困难评分从基线恶化至最高等级≥2。我们开发了一种方法,使用物理和生物校正(α/β = 2.5)的DVH曲线来测试基于绝对体积和百分比体积的DVH指标。将统计分类算法与机器学习(SCA-ML)相结合,比单独使用任何一种方法都能提供更广泛的反应阈值证据细节。迭代构建了一个敏感性引导、最小输入的机器学习(ML)模型,以识别关键结构DVH指标阈值。
使用SCA-ML评分对产生738个候选DVH指标的7个吞咽结构与吞咽困难的相关性进行了排名。这些结构包括咽上缩肌(SPC)、咽下缩肌(IPC)、喉和食管。双侧腮腺和下颌下腺(SG)结构根据相对平均剂量(例如,SG_high、SG_low)进行分类,作为与肿瘤中心剂量相对的类似对侧和同侧的指定。SCA-ML评分高的结构DVH指标包括:SPC:D20%(等效剂量[EQD2]Gy)≥47.7;SPC:D25%(Gy)≥50.4;IPC:D35%(Gy)≥61.7;腮腺_low:D60%(Gy)≥I3.2;以及SG_high:D35%(Gy)≥61.7。喉:D25%(Gy)≥21.2和SG_low:D45%≥28.2的SCA-ML评分高,但在不到90%的计划中被分割。仅基于SPC:D20%(EQD2 Gy)的模型的敏感性和曲线下面积分别为0.88±0.13和0.74±0.17。
本研究提供了将大数据与人工智能相结合以增加临床学习范式中证据量的实际例证。