1Department of Neurosurgery, Clinical Neuroscience Center, University Hospital Zurich, University of Zurich, Switzerland.
2Department of Neurosurgery, Tuscany School of Neurosurgery, University of Firenze, Italy; and.
Neurosurg Focus. 2018 Nov 1;45(5):E12. doi: 10.3171/2018.8.FOCUS18243.
OBJECTIVEGross-total resection (GTR) is often the primary surgical goal in transsphenoidal surgery for pituitary adenoma. Existing classifications are effective at predicting GTR but are often hampered by limited discriminatory ability in moderate cases and by poor interrater agreement. Deep learning, a subset of machine learning, has recently established itself as highly effective in forecasting medical outcomes. In this pilot study, the authors aimed to evaluate the utility of using deep learning to predict GTR after transsphenoidal surgery for pituitary adenoma.METHODSData from a prospective registry were used. The authors trained a deep neural network to predict GTR from 16 preoperatively available radiological and procedural variables. Class imbalance adjustment, cross-validation, and random dropout were applied to prevent overfitting and ensure robustness of the predictive model. The authors subsequently compared the deep learning model to a conventional logistic regression model and to the Knosp classification as a gold standard.RESULTSOverall, 140 patients who underwent endoscopic transsphenoidal surgery were included. GTR was achieved in 95 patients (68%), with a mean extent of resection of 96.8% ± 10.6%. Intraoperative high-field MRI was used in 116 (83%) procedures. The deep learning model achieved excellent area under the curve (AUC; 0.96), accuracy (91%), sensitivity (94%), and specificity (89%). This represents an improvement in comparison with the Knosp classification (AUC: 0.87, accuracy: 81%, sensitivity: 92%, specificity: 70%) and a statistically significant improvement in comparison with logistic regression (AUC: 0.86, accuracy: 82%, sensitivity: 81%, specificity: 83%) (all p < 0.001).CONCLUSIONSIn this pilot study, the authors demonstrated the utility of applying deep learning to preoperatively predict the likelihood of GTR with excellent performance. Further training and validation in a prospective multicentric cohort will enable the development of an easy-to-use interface for use in clinical practice.
目的
在经蝶窦手术治疗垂体腺瘤中,大体全切除(GTR)通常是主要的手术目标。现有的分类方法在预测 GTR 方面非常有效,但在中度病例中往往受到判别能力有限的限制,并且组间一致性较差。深度学习作为机器学习的一个分支,最近已被证明在预测医疗结果方面非常有效。在这项初步研究中,作者旨在评估使用深度学习预测经蝶窦手术治疗垂体腺瘤后 GTR 的效用。
方法
使用前瞻性登记处的数据。作者训练了一个深度神经网络,从 16 个术前可获得的影像学和手术变量预测 GTR。应用类别不平衡调整、交叉验证和随机丢弃来防止过拟合并确保预测模型的稳健性。作者随后将深度学习模型与传统的逻辑回归模型和 Knosp 分类进行了比较,以作为金标准。
结果
共有 140 名接受内镜经蝶窦手术的患者入组。95 例(68%)患者实现了 GTR,平均切除程度为 96.8%±10.6%。116 例(83%)手术中使用了术中高场 MRI。深度学习模型获得了优异的曲线下面积(AUC;0.96)、准确性(91%)、敏感性(94%)和特异性(89%)。与 Knosp 分类(AUC:0.87,准确性:81%,敏感性:92%,特异性:70%)相比,这是一种改进,与逻辑回归相比,这是一种统计学上的显著改进(AUC:0.86,准确性:82%,敏感性:81%,特异性:83%)(均 p<0.001)。
结论
在这项初步研究中,作者证明了应用深度学习技术术前预测 GTR 可能性的效用,具有出色的性能。在一项前瞻性多中心队列中进一步培训和验证,将使开发易于在临床实践中使用的界面成为可能。