Horng Steven, Liao Ruizhi, Wang Xin, Dalal Sandeep, Golland Polina, Berkowitz Seth J
Department of Radiology, Beth Israel Deaconess Medical Center, Harvard Medical School, 330 Brookline Ave, Boston, MA 02215 (S.H., S.J.B.); Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Mass (R.L., P.G.); and Clinical Informatics Solutions and Services, Philips Research, Cambridge, Mass (X.W., S.D.).
S.H. (e-mail:
Radiol Artif Intell. 2021 Jan 6;3(2):e190228. doi: 10.1148/ryai.2021190228. eCollection 2021 Mar.
To develop a machine learning model to classify the severity grades of pulmonary edema on chest radiographs.
In this retrospective study, 369 071 chest radiographs and associated radiology reports from 64 581 patients (mean age, 51.71 years; 54.51% women) from the MIMIC-CXR chest radiograph dataset were included. This dataset was split into patients with and without congestive heart failure (CHF). Pulmonary edema severity labels from the associated radiology reports were extracted from patients with CHF as four different ordinal levels: 0, no edema; 1, vascular congestion; 2, interstitial edema; and 3, alveolar edema. Deep learning models were developed using two approaches: a semisupervised model using a variational autoencoder and a pretrained supervised learning model using a dense neural network. Receiver operating characteristic curve analysis was performed on both models.
The area under the receiver operating characteristic curve (AUC) for differentiating alveolar edema from no edema was 0.99 for the semisupervised model and 0.87 for the pretrained models. Performance of the algorithm was inversely related to the difficulty in categorizing milder states of pulmonary edema (shown as AUCs for semisupervised model and pretrained model, respectively): 2 versus 0, 0.88 and 0.81; 1 versus 0, 0.79 and 0.66; 3 versus 1, 0.93 and 0.82; 2 versus 1, 0.69 and 0.73; and 3 versus 2, 0.88 and 0.63.
Deep learning models were trained on a large chest radiograph dataset and could grade the severity of pulmonary edema on chest radiographs with high performance.See also the commentary by Auffermann in this issue.© RSNA, 2021.
开发一种机器学习模型,用于对胸部X光片上的肺水肿严重程度进行分级。
在这项回顾性研究中,纳入了来自MIMIC-CXR胸部X光片数据集的64581例患者(平均年龄51.71岁;54.51%为女性)的369071张胸部X光片及相关放射学报告。该数据集被分为有和没有充血性心力衰竭(CHF)的患者。从CHF患者的相关放射学报告中提取肺水肿严重程度标签,分为四个不同的序数级别:0,无水肿;1,血管充血;2,间质性水肿;3,肺泡性水肿。使用两种方法开发深度学习模型:一种是使用变分自编码器的半监督模型,另一种是使用密集神经网络的预训练监督学习模型。对这两种模型都进行了受试者操作特征曲线分析。
半监督模型区分肺泡性水肿与无水肿的受试者操作特征曲线下面积(AUC)为0.99,预训练模型为0.87。算法的性能与对较轻程度肺水肿状态进行分类的难度呈负相关(分别显示为半监督模型和预训练模型的AUC):2级与0级,分别为0.88和0.81;1级与0级,分别为0.79和0.66;3级与1级,分别为0.93和0.82;2级与1级,分别为0.69和0.73;3级与2级,分别为0.88和0.63。
深度学习模型在一个大型胸部X光片数据集上进行了训练,能够对胸部X光片上的肺水肿严重程度进行高性能分级。另见本期奥弗曼的评论。©RSNA,2021。