Suppr超能文献

预测垂体手术结果的机器学习模型:报告质量与当前证据的系统评价

Machine Learning Models to Forecast Outcomes of Pituitary Surgery: A Systematic Review in Quality of Reporting and Current Evidence.

作者信息

Rech Matheus M, de Macedo Filho Leonardo, White Alexandra J, Perez-Vega Carlos, Samson Susan L, Chaichana Kaisorn L, Olomu Osarenoma U, Quinones-Hinojosa Alfredo, Almeida Joao Paulo

机构信息

Department of Neurosurgery, University of Caxias do Sul, Caxias do Sul 95070-560, RS, Brazil.

Department of Neurosurgery, Mayo Clinic Florida, Jacksonville, FL 32224, USA.

出版信息

Brain Sci. 2023 Mar 15;13(3):495. doi: 10.3390/brainsci13030495.

Abstract

BACKGROUND

The complex nature and heterogeneity involving pituitary surgery results have increased interest in machine learning (ML) applications for prediction of outcomes over the last decade. This study aims to systematically review the characteristics of ML models involving pituitary surgery outcome prediction and assess their reporting quality.

METHODS

We searched the PubMed, Scopus, and Web of Knowledge databases for publications on the use of ML to predict pituitary surgery outcomes. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) to assess report quality. Our search strategy was based on the terms "artificial intelligence", "machine learning", and "pituitary".

RESULTS

20 studies were included in this review. The principal models reported in each article were post-surgical endocrine outcomes ( = 10), tumor management ( = 3), and intra- and postoperative complications ( = 7). Overall, the included studies adhered to a median of 65% (IQR = 60-72%) of TRIPOD criteria, ranging from 43% to 83%. The median reported AUC was 0.84 (IQR = 0.80-0.91). The most popular algorithms were support vector machine ( = 5) and random forest ( = 5). Only two studies reported external validation and adherence to any reporting guideline. Calibration methods were not reported in 15 studies. No model achieved the phase of actual clinical applicability.

CONCLUSION

Applications of ML in the prediction of pituitary outcomes are still nascent, as evidenced by the lack of any model validated for clinical practice. Although studies have demonstrated promising results, greater transparency in model development and reporting is needed to enable their use in clinical practice. Further adherence to reporting guidelines can help increase AI's real-world utility and improve clinical practice.

摘要

背景

垂体手术结果的复杂性和异质性使得在过去十年中,机器学习(ML)在预测手术结果方面的应用越来越受到关注。本研究旨在系统回顾涉及垂体手术结果预测的ML模型的特征,并评估其报告质量。

方法

我们在PubMed、Scopus和Web of Knowledge数据库中搜索关于使用ML预测垂体手术结果的出版物。我们使用个体预后或诊断多变量预测模型的透明报告(TRIPOD)来评估报告质量。我们的搜索策略基于“人工智能”、“机器学习”和“垂体”等术语。

结果

本综述纳入了20项研究。每篇文章中报告的主要模型为术后内分泌结果(n = 10)、肿瘤管理(n = 3)以及术中和术后并发症(n = 7)。总体而言,纳入的研究符合TRIPOD标准的中位数为65%(IQR = 60 - 72%),范围为43%至83%。报告的AUC中位数为0.84(IQR = 0.80 - 0.91)。最常用的算法是支持向量机(n = 5)和随机森林(n = 5)。只有两项研究报告了外部验证以及对任何报告指南的遵循情况。15项研究未报告校准方法。没有模型达到实际临床应用阶段。

结论

ML在垂体结果预测中的应用仍处于起步阶段,缺乏任何经过临床实践验证的模型就证明了这一点。尽管研究已显示出有前景的结果,但模型开发和报告需要更高的透明度,以便能够在临床实践中使用。进一步遵循报告指南有助于提高人工智能在现实世界中的实用性并改善临床实践。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e5d4/10046799/586476845e63/brainsci-13-00495-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验