Department of Radiology, Second Affiliated Hospital of PLA Naval Medical University, Shanghai, China.
Shanghai Aitrox Technology Corporation Limited, Shanghai, China.
J Appl Clin Med Phys. 2023 Nov;24(11):e14171. doi: 10.1002/acm2.14171. Epub 2023 Oct 2.
To construct and evaluate the performance of a machine learning-based low dose computed tomography (LDCT)-derived parametric response mapping (PRM) model for predicting pulmonary function test (PFT) results.
A total of 615 subjects from a community-based screening population (40-74 years old) with PFT parameters, including the ratio of the first second forced expiratory volume to forced vital capacity (FEV1/FVC), the percentage of forced expiratory volume in the one second predicted (FEV1%), and registered inspiration-to-expiration chest CT scanning were enrolled retrospectively. Subjects were classified into a normal, high risk, and COPD group based on PFT. Data of 72 PRM-derived quantitative parameters were collected, including volume and volume percentage of emphysema, functional-small airways disease, and normal lung tissue. A machine-learning with random forest regression model and a multilayer perceptron (MLP) model were constructed and tested on PFT prediction, which was followed by evaluation of classification performance based on the PFT predictions.
The machine-learning model based on PRM parameters showed better performance for predicting PFT than MLP, with a coefficient of determination (R ) of 0.749 and 0.792 for FEV1/FVC and FEV1%, respectively. The Mean Squared Errors (MSE) for FEV1/FVC and FEV1% are 0.0030 and 0.0097 for the random forest model, respectively. The Root Mean Squared Errors (RMSE) for FEV1/FVC and FEV1% are 0.055 and 0.098, respectively. The sensitivity, specificity, and accuracy for differentiating between the normal group and high-risk group were 34/40 (85%), 65/72 (90%), and 99/112 (88%), respectively. For differentiating between the non-COPD group and COPD group, the sensitivity, specificity, and accuracy were 8/9 (89%), 112/112 (100%), 120/121 (99%), respectively.
The machine learning-based random forest model predicts PFT results in a community screening population based on PRM, and it identifies high risk COPD from normal populations with high sensitivity and reliably predicts of high-risk COPD.
构建并评估基于机器学习的低剂量计算机断层扫描(LDCT)衍生的参数映射(PRM)模型在预测肺功能测试(PFT)结果方面的性能。
回顾性纳入了来自社区筛查人群(40-74 岁)的 615 名 PFT 参数、包括一秒率与用力肺活量比值(FEV1/FVC)、一秒用力呼气容积预计值百分比(FEV1%)以及登记吸气末到呼气末胸部 CT 扫描的患者。根据 PFT 将患者分为正常、高危和 COPD 组。收集了 72 个 PRM 衍生的定量参数的数据,包括肺气肿、功能性小气道疾病和正常肺组织的体积和体积百分比。构建并测试了基于随机森林回归模型和多层感知器(MLP)模型的机器学习模型,用于预测 PFT,并根据 PFT 预测结果评估分类性能。
基于 PRM 参数的机器学习模型在预测 PFT 方面的性能优于 MLP,FEV1/FVC 和 FEV1%的决定系数(R)分别为 0.749 和 0.792。随机森林模型的均方误差(MSE)分别为 FEV1/FVC 和 FEV1%的 0.0030 和 0.0097。FEV1/FVC 和 FEV1%的均方根误差(RMSE)分别为 0.055 和 0.098。正常组与高危组之间的鉴别灵敏度、特异度和准确度分别为 34/40(85%)、65/72(90%)和 99/112(88%)。在非 COPD 组和 COPD 组之间的鉴别中,灵敏度、特异度和准确度分别为 8/9(89%)、112/112(100%)和 120/121(99%)。
基于机器学习的随机森林模型基于 PRM 预测社区筛查人群的 PFT 结果,并以高灵敏度识别出正常人群中的高危 COPD,可靠地预测高危 COPD。