• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习可实现泌尿科系统评价和荟萃分析的自动化筛选。

Machine learning enables automated screening for systematic reviews and meta-analysis in urology.

机构信息

Department of Urology and Urological Surgery, University Medical Center Mannheim, University of Heidelberg, Theodor-Kutzer-Ufer 1-3, 68167, Mannheim, Germany.

Department of Urology, University of Leipzig, Leipzig, Germany.

出版信息

World J Urol. 2024 Jul 10;42(1):396. doi: 10.1007/s00345-024-05078-y.

DOI:10.1007/s00345-024-05078-y
PMID:38985296
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11236840/
Abstract

PURPOSE

To investigate and implement semiautomated screening for meta-analyses (MA) in urology under consideration of class imbalance.

METHODS

Machine learning algorithms were trained on data from three MA with detailed information of the screening process. Different methods to account for class imbalance (Sampling (up- and downsampling, weighting and cost-sensitive learning), thresholding) were implemented in different machine learning (ML) algorithms (Random Forest, Logistic Regression with Elastic Net Regularization, Support Vector Machines). Models were optimized for sensitivity. Besides metrics such as specificity, receiver operating curves, total missed studies, and work saved over sampling were calculated.

RESULTS

During training, models trained after downsampling achieved the best results consistently among all algorithms. Computing time ranged between 251 and 5834 s. However, when evaluated on the final test data set, the weighting approach performed best. In addition, thresholding helped to improve results as compared to the standard of 0.5. However, due to heterogeneity of results no clear recommendation can be made for a universal sample size. Misses of relevant studies were 0 for the optimized models except for one review.

CONCLUSION

It will be necessary to design a holistic methodology that implements the presented methods in a practical manner, but also takes into account other algorithms and the most sophisticated methods for text preprocessing. In addition, the different methods of a cost-sensitive learning approach can be the subject of further investigations.

摘要

目的

在考虑类别不平衡的情况下,研究并实施泌尿外科半自动荟萃分析(MA)筛选。

方法

在具有详细筛选过程信息的三个 MA 数据上对机器学习算法进行训练。实现了不同的方法来解决类别不平衡问题(采样(上采样和下采样、加权和代价敏感学习)、阈值),并在不同的机器学习(ML)算法(随机森林、具有弹性网正则化的逻辑回归、支持向量机)中实现。对模型进行了优化以提高敏感性。除了特异性、接收者操作曲线、总漏检研究和过采样节省的工作等指标外,还计算了其他指标。

结果

在训练过程中,在所有算法中,经过下采样后训练的模型始终能取得最佳效果。计算时间在 251 到 5834 秒之间。然而,当在最终的测试数据集上进行评估时,加权方法的效果最佳。此外,与 0.5 的标准相比,阈值有助于提高结果。但是,由于结果的异质性,无法为通用的样本量推荐一种明确的方法。除了一篇综述外,优化后的模型都没有遗漏相关的研究。

结论

需要设计一种整体方法,以实际的方式实现所提出的方法,同时还考虑其他算法和最复杂的文本预处理方法。此外,代价敏感学习方法的不同方法可以进一步研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c8/11236840/528202f09993/345_2024_5078_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c8/11236840/d94a0b7192d4/345_2024_5078_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c8/11236840/e4e0731acc5d/345_2024_5078_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c8/11236840/528202f09993/345_2024_5078_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c8/11236840/d94a0b7192d4/345_2024_5078_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c8/11236840/e4e0731acc5d/345_2024_5078_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/44c8/11236840/528202f09993/345_2024_5078_Fig3_HTML.jpg

相似文献

1
Machine learning enables automated screening for systematic reviews and meta-analysis in urology.机器学习可实现泌尿科系统评价和荟萃分析的自动化筛选。
World J Urol. 2024 Jul 10;42(1):396. doi: 10.1007/s00345-024-05078-y.
2
Aligning text mining and machine learning algorithms with best practices for study selection in systematic literature reviews.将文本挖掘和机器学习算法与系统文献综述中的研究选择最佳实践相结合。
Syst Rev. 2020 Dec 13;9(1):293. doi: 10.1186/s13643-020-01520-5.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Screening PubMed abstracts: is class imbalance always a challenge to machine learning?筛选PubMed摘要:类别不平衡对机器学习而言始终是一项挑战吗?
Syst Rev. 2019 Dec 6;8(1):317. doi: 10.1186/s13643-019-1245-8.
5
In-depth evaluation of machine learning methods for semi-automating article screening in a systematic review of mechanistic literature.在对机制文献的系统评价中,对用于半自动文章筛选的机器学习方法进行深入评估。
Res Synth Methods. 2023 Mar;14(2):156-172. doi: 10.1002/jrsm.1589. Epub 2022 Jul 23.
6
Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.老年人日常对话中的社会怀旧:使用自然语言处理和机器学习的自动检测。
J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133.
7
Evaluating machine learning algorithms to Predict 30-day Unplanned REadmission (PURE) in Urology patients.评估机器学习算法预测泌尿外科患者 30 天非计划性再入院(PURE)
BMC Med Inform Decis Mak. 2023 Jun 13;23(1):108. doi: 10.1186/s12911-023-02200-9.
8
Comparison of an Ensemble of Machine Learning Models and the BERT Language Model for Analysis of Text Descriptions of Brain CT Reports to Determine the Presence of Intracranial Hemorrhage.基于机器学习模型集成与 BERT 语言模型的脑 CT 报告文本描述分析用于判断颅内出血的比较研究
Sovrem Tekhnologii Med. 2024;16(1):27-34. doi: 10.17691/stm2024.16.1.03. Epub 2024 Feb 28.
9
Decoding semi-automated title-abstract screening: findings from a convenience sample of reviews.解码半自动标题-摘要筛选:来自便利样本综述的研究结果。
Syst Rev. 2020 Nov 27;9(1):272. doi: 10.1186/s13643-020-01528-x.
10
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

本文引用的文献

1
Can large language models replace humans in systematic reviews? Evaluating GPT-4's efficacy in screening and extracting data from peer-reviewed and grey literature in multiple languages.大型语言模型能否在系统评价中取代人类?评估 GPT-4 从多种语言的同行评议文献和灰色文献中进行筛选和提取数据的效果。
Res Synth Methods. 2024 Jul;15(4):616-626. doi: 10.1002/jrsm.1715. Epub 2024 Mar 14.
2
Automated Paper Screening for Clinical Reviews Using Large Language Models: Data Analysis Study.使用大型语言模型对临床综述进行自动化论文筛选:数据分析研究。
J Med Internet Res. 2024 Jan 12;26:e48996. doi: 10.2196/48996.
3
Framework for a living systematic review and meta-analysis for the surgical treatment of bladder cancer: introducing EVIglance to urology.
膀胱癌手术治疗的实时系统评价与荟萃分析框架:向泌尿外科引入EVIglance
Int J Surg Protoc. 2023 Sep 18;27(2):9-15. doi: 10.1097/SP9.0000000000000008. eCollection 2023 Oct.
4
Text mining to support abstract screening for knowledge syntheses: a semi-automated workflow.文本挖掘支持知识综合的摘要筛选:一种半自动化工作流程。
Syst Rev. 2021 May 26;10(1):156. doi: 10.1186/s13643-021-01700-x.
5
Impact of perioperative blood transfusions on oncologic outcomes after radical cystectomy: A systematic review and meta-analysis of comparative studies.根治性膀胱切除术围手术期输血对肿瘤学结局的影响:系统评价和比较研究的荟萃分析。
Surg Oncol. 2021 Sep;38:101592. doi: 10.1016/j.suronc.2021.101592. Epub 2021 May 5.
6
Radiomics in Renal Cell Carcinoma-A Systematic Review and Meta-Analysis.肾细胞癌中的放射组学——一项系统综述与荟萃分析
Cancers (Basel). 2021 Mar 17;13(6):1348. doi: 10.3390/cancers13061348.
7
Machine learning for identifying relevant publications in updates of systematic reviews of diagnostic test studies.用于在诊断试验研究系统评价更新中识别相关出版物的机器学习方法。
Res Synth Methods. 2021 Jul;12(4):506-515. doi: 10.1002/jrsm.1486. Epub 2021 Mar 28.
8
Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry.利用PROSPERO注册库的数据,分析对医学干预措施进行系统评价所需的时间和人员。
BMJ Open. 2017 Feb 27;7(2):e012545. doi: 10.1136/bmjopen-2016-012545.
9
Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement.系统评价与Meta分析的首选报告项目:PRISMA声明。
BMJ. 2009 Jul 21;339:b2535. doi: 10.1136/bmj.b2535.
10
Post hoc choice of cut points introduced bias to diagnostic research.事后选择切点给诊断研究带来了偏倚。
J Clin Epidemiol. 2006 Aug;59(8):798-801. doi: 10.1016/j.jclinepi.2005.11.025. Epub 2006 May 26.