• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一项涉及数千名参赛者的公开竞赛未能为新的诊断测试准确性系统评价构建有用的抽象分类器。

An open competition involving thousands of competitors failed to construct useful abstract classifiers for new diagnostic test accuracy systematic reviews.

机构信息

Department of Internal Medicine, Kyoto Min-iren Asukai Hospital, Kyoto, Japan.

Scientific Research Works Peer Support Group (SRWS-PSG), Osaka, Japan.

出版信息

Res Synth Methods. 2023 Sep;14(5):707-717. doi: 10.1002/jrsm.1649. Epub 2023 Jun 20.

DOI:10.1002/jrsm.1649
PMID:37337729
Abstract

There are currently no abstract classifiers, which can be used for new diagnostic test accuracy (DTA) systematic reviews to select primary DTA study abstracts from database searches. Our goal was to develop machine-learning-based abstract classifiers for new DTA systematic reviews through an open competition. We prepared a dataset of abstracts obtained through database searches from 11 reviews in different clinical areas. As the reference standard, we used the abstract lists that required manual full-text review. We randomly splitted the datasets into a train set, a public test set, and a private test set. Competition participants used the training set to develop classifiers and validated their classifiers using the public test set. The classifiers were refined based on the performance of the public test set. They could submit as many times as they wanted during the competition. Finally, we used the private test set to rank the submitted classifiers. To reduce false exclusions, we used the Fbeta measure with a beta set to seven for evaluating classifiers. After the competition, we conducted the external validation using a dataset from a cardiology DTA review. We received 13,774 submissions from 1429 teams or persons over 4 months. The top-honored classifier achieved a Fbeta score of 0.4036 and a recall of 0.2352 in the external validation. In conclusion, we were unable to develop an abstract classifier with sufficient recall for immediate application to new DTA systematic reviews. Further studies are needed to update and validate classifiers with datasets from other clinical areas.

摘要

目前没有抽象分类器可用于新的诊断测试准确性(DTA)系统评价,以从数据库搜索中选择主要的 DTA 研究摘要。我们的目标是通过公开竞赛开发基于机器学习的新 DTA 系统评价的摘要分类器。我们准备了一个数据集,其中包括通过来自 11 个不同临床领域的综述的数据库搜索获得的摘要。作为参考标准,我们使用需要手动全文审查的摘要列表。我们将数据集随机分为训练集、公共测试集和私有测试集。竞赛参与者使用训练集开发分类器,并使用公共测试集验证其分类器。根据公共测试集的性能对分类器进行了改进。他们可以在竞赛期间多次提交。最后,我们使用私有测试集对提交的分类器进行排名。为了减少错误排除,我们使用了 Fbeta 度量,其中 beta 设置为 7,用于评估分类器。竞赛结束后,我们使用来自心脏病学 DTA 综述的数据集进行了外部验证。在 4 个月的时间里,我们收到了来自 1429 个团队或个人的 13774 次提交。在外部验证中,排名最高的分类器获得了 0.4036 的 Fbeta 分数和 0.2352 的召回率。总之,我们无法开发出具有足够召回率的摘要分类器,以立即应用于新的 DTA 系统评价。需要进一步的研究来使用来自其他临床领域的数据集更新和验证分类器。

相似文献

1
An open competition involving thousands of competitors failed to construct useful abstract classifiers for new diagnostic test accuracy systematic reviews.一项涉及数千名参赛者的公开竞赛未能为新的诊断测试准确性系统评价构建有用的抽象分类器。
Res Synth Methods. 2023 Sep;14(5):707-717. doi: 10.1002/jrsm.1649. Epub 2023 Jun 20.
2
Search strategies to identify diagnostic accuracy studies in MEDLINE and EMBASE.在MEDLINE和EMBASE中识别诊断准确性研究的检索策略。
Cochrane Database Syst Rev. 2013 Sep 11;2013(9):MR000022. doi: 10.1002/14651858.MR000022.pub3.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
5
Increased workload for systematic review literature searches of diagnostic tests compared with treatments: challenges and opportunities.与治疗相比,诊断测试系统评价文献检索的工作量增加:挑战与机遇。
JMIR Med Inform. 2014 May 27;2(1):e11. doi: 10.2196/medinform.3037.
6
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
7
Applying machine classifiers to update searches: Analysis from two case studies.应用机器分类器更新搜索:来自两个案例研究的分析。
Res Synth Methods. 2022 Jan;13(1):121-133. doi: 10.1002/jrsm.1537. Epub 2021 Nov 25.
8
Measuring quality of reporting in systematic reviews of diagnostic test accuracy studies in medical imaging: comparison of PRISMA-DTA and PRISMA.医学影像诊断试验准确性研究系统评价中报告质量的衡量:PRISMA-DTA与PRISMA的比较
Ultrasound Obstet Gynecol. 2023 Feb;61(2):257-266. doi: 10.1002/uog.26043. Epub 2023 Jan 12.
9
Completeness of Reporting of Systematic Reviews of Diagnostic Test Accuracy Based on the PRISMA-DTA Reporting Guideline.基于 PRISMA-DTA 报告准则的诊断性测试准确性系统评价报告的完整性。
Clin Chem. 2019 Feb;65(2):291-301. doi: 10.1373/clinchem.2018.292987. Epub 2018 Sep 20.
10
Automated screening of research studies for systematic reviews using study characteristics.利用研究特征进行系统综述的研究自动筛选
Syst Rev. 2018 Apr 25;7(1):64. doi: 10.1186/s13643-018-0724-7.