Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK; NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, United Kingdom.
Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK.
J Clin Epidemiol. 2021 Oct;138:60-72. doi: 10.1016/j.jclinepi.2021.06.024. Epub 2021 Jun 29.
Evaluate the completeness of reporting of prognostic prediction models developed using machine learning methods in the field of oncology.
We conducted a systematic review, searching the MEDLINE and Embase databases between 01/01/2019 and 05/09/2019, for non-imaging studies developing a prognostic clinical prediction model using machine learning methods (as defined by primary study authors) in oncology. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement to assess the reporting quality of included publications. We described overall reporting adherence of included publications and by each section of TRIPOD.
Sixty-two publications met the inclusion criteria. 48 were development studies and 14 were development with validation studies. 152 models were developed across all publications. Median adherence to TRIPOD reporting items was 41% [range: 10%-67%] and at least 50% adherence was found in 19% (n=12/62) of publications. Adherence was lower in development only studies (median: 38% [range: 10%-67%]); and higher in development with validation studies (median: 49% [range: 33%-59%]).
Reporting of clinical prediction models using machine learning in oncology is poor and needs urgent improvement, so readers and stakeholders can appraise the study methods, understand study findings, and reduce research waste.
评估使用机器学习方法在肿瘤学领域开发的预后预测模型的报告完整性。
我们进行了一项系统评价,在 2019 年 1 月 1 日至 2019 年 5 月 9 日期间,检索了 MEDLINE 和 Embase 数据库,以寻找使用机器学习方法(由主要研究作者定义)开发预后临床预测模型的非影像学研究。我们使用透明报告个体化预后或诊断的多变量预测模型(TRIPOD)声明来评估纳入出版物的报告质量。我们描述了纳入出版物的总体报告一致性以及 TRIPOD 各部分的报告一致性。
共有 62 篇出版物符合纳入标准。其中 48 篇为开发研究,14 篇为开发与验证研究。所有出版物共开发了 152 个模型。TRIPOD 报告项目的中位数依从率为 41%[范围:10%-67%],至少有 50%的依从率在 19%(n=12/62)的出版物中发现。仅在开发研究中,依从率较低(中位数:38%[范围:10%-67%]);在开发与验证研究中,依从率较高(中位数:49%[范围:33%-59%])。
在肿瘤学中使用机器学习报告临床预测模型的报告情况较差,急需改进,以便读者和利益相关者可以评估研究方法、理解研究结果,并减少研究浪费。