Pijls Bart G
Department of Orthopaedics, Leiden University Medical Center, Leiden, the Netherlands.
J Orthop. 2023 Nov 22;48:103-106. doi: 10.1016/j.jor.2023.11.051. eCollection 2024 Feb.
Machine learning assisted systematic reviewing may help to reduce the work burden in systematic reviews. The aim of this study is therefore to determine by a non-developer the performance of machine learning assisted systematic reviewing on previously published orthopaedic reviews in retrieving relevant papers.
Active learning for Systematic Reviews (ASReview) was tested against the results from three previously published systematic reviews in the field of orthopaedics with 20 iterations for each review. The reviews covered easy, intermediate and advanced scenarios. The outcomes of interest were the percentage work saved at 95% recall (WSS@95), the percentage work saved at 100% recall (WSS@100) and the percentage of relevant references identified after having screened the first 10% of the records (RRF@10). Means and corresponding [95% confidence intervals] were calculated.
The WSS@95 was respectively 72 [71-74], 72 [72-73] and 50 [50-51] for the easy, intermediate and advanced scenarios. The WSS@100 was respectively 72 [71-73], 62 [61-63] and 37 [36-38] for the easy, intermediate and advanced scenarios. The RRF@10 was respectively 79 [78-81], 70 [69-71] and 58 [56-60] for the easy, intermediate and advanced scenarios.
Machine learning assisted systematic reviewing was efficient in retrieving relevant papers for systematic review in orthopaedics. The majority of relevant papers were identified after screening only 10% of the papers. All relevant papers were identified after screening 30%-40% of the total papers meaning that 60%-70% of the work can potentially be saved.
机器学习辅助的系统评价可能有助于减轻系统评价的工作负担。因此,本研究的目的是由非开发者确定机器学习辅助的系统评价在检索先前发表的骨科综述相关论文方面的性能。
针对骨科领域先前发表的三篇系统评价结果,对系统评价主动学习(ASReview)进行了测试,每次评价进行20次迭代。这些评价涵盖了简单、中等和复杂场景。感兴趣的结果是在召回率为95%时节省的工作量百分比(WSS@95)、召回率为100%时节省的工作量百分比(WSS@100)以及在筛选前10%的记录后识别出的相关参考文献百分比(RRF@10)。计算了平均值和相应的[95%置信区间]。
在简单、中等和复杂场景中,WSS@95分别为72[71-74]、72[72-73]和50[50-51]。在简单、中等和复杂场景中,WSS@100分别为72[71-73]、62[61-63]和37[36-38]。在简单、中等和复杂场景中,RRF@10分别为79[78-81]、70[69-71]和58[56-60]。
机器学习辅助的系统评价在检索骨科系统评价的相关论文方面是有效的。仅筛选10%的论文后就识别出了大部分相关论文。筛选30%-40%的总论文后识别出了所有相关论文,这意味着有可能节省60%-70%的工作量。