• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于系统评价的重复记录检测自动化:Deduplicator。

Automation of duplicate record detection for systematic reviews: Deduplicator.

机构信息

Institute for Evidence-Based Healthcare, Bond University, Gold Coast, Australia.

出版信息

Syst Rev. 2024 Aug 2;13(1):206. doi: 10.1186/s13643-024-02619-9.

DOI:10.1186/s13643-024-02619-9
PMID:39095913
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11295717/
Abstract

BACKGROUND

To describe the algorithm and investigate the efficacy of a novel systematic review automation tool "the Deduplicator" to remove duplicate records from a multi-database systematic review search.

METHODS

We constructed and tested the efficacy of the Deduplicator tool by using 10 previous Cochrane systematic review search results to compare the Deduplicator's 'balanced' algorithm to a semi-manual EndNote method. Two researchers each performed deduplication on the 10 libraries of search results. For five of those libraries, one researcher used the Deduplicator, while the other performed semi-manual deduplication with EndNote. They then switched methods for the remaining five libraries. In addition to this analysis, comparison between the three different Deduplicator algorithms ('balanced', 'focused' and 'relaxed') was performed on two datasets of previously deduplicated search results.

RESULTS

Before deduplication, the mean library size for the 10 systematic reviews was 1962 records. When using the Deduplicator, the mean time to deduplicate was 5 min per 1000 records compared to 15 min with EndNote. The mean error rate with Deduplicator was 1.8 errors per 1000 records in comparison to 3.1 with EndNote. Evaluation of the different Deduplicator algorithms found that the 'balanced' algorithm had the highest mean F1 score of 0.9647. The 'focused' algorithm had the highest mean accuracy of 0.9798 and the highest recall of 0.9757. The 'relaxed' algorithm had the highest mean precision of 0.9896.

CONCLUSIONS

This demonstrates that using the Deduplicator for duplicate record detection reduces the time taken to deduplicate, while maintaining or improving accuracy compared to using a semi-manual EndNote method. However, further research should be performed comparing more deduplication methods to establish relative performance of the Deduplicator against other deduplication methods.

摘要

背景

描述一种新的系统综述自动化工具“去重器”的算法,并研究其从多数据库系统综述检索中去除重复记录的效果。

方法

我们构建并测试了去重器工具的功效,使用 10 项先前的 Cochrane 系统综述检索结果来比较去重器的“平衡”算法与半手动 EndNote 方法。两位研究人员分别对 10 个检索结果库进行去重。对于其中 5 个库,一位研究人员使用去重器,另一位使用 EndNote 进行半手动去重。然后,他们切换方法处理其余 5 个库。除了此分析之外,还在两个先前去重的检索结果数据集上比较了三种不同的去重器算法(“平衡”、“聚焦”和“宽松”)。

结果

在去重之前,这 10 项系统综述的平均库大小为 1962 条记录。使用去重器时,每 1000 条记录去重的平均时间为 5 分钟,而使用 EndNote 的时间为 15 分钟。使用去重器的平均错误率为每 1000 条记录 1.8 个错误,而使用 EndNote 的错误率为 3.1 个错误。评估不同的去重器算法发现,“平衡”算法的平均 F1 得分为 0.9647,最高。“聚焦”算法的准确率最高,为 0.9798,召回率最高,为 0.9757。“宽松”算法的精度最高,为 0.9896。

结论

这表明,使用去重器进行重复记录检测可以减少去重所需的时间,同时与使用半手动 EndNote 方法相比,保持或提高准确性。然而,应进行进一步的研究,比较更多的去重方法,以确定去重器相对于其他去重方法的相对性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f391/11295717/92ba9ee938b1/13643_2024_2619_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f391/11295717/289a8734f7ef/13643_2024_2619_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f391/11295717/fdde86d5935a/13643_2024_2619_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f391/11295717/92ba9ee938b1/13643_2024_2619_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f391/11295717/289a8734f7ef/13643_2024_2619_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f391/11295717/fdde86d5935a/13643_2024_2619_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f391/11295717/92ba9ee938b1/13643_2024_2619_Fig3_HTML.jpg

相似文献

1
Automation of duplicate record detection for systematic reviews: Deduplicator.用于系统评价的重复记录检测自动化:Deduplicator。
Syst Rev. 2024 Aug 2;13(1):206. doi: 10.1186/s13643-024-02619-9.
2
The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews.自动化系统搜索去重器 (ASySD):一种快速、开源、可互操作的工具,用于去除生物医学系统评价中的重复引文。
BMC Biol. 2023 Sep 7;21(1):189. doi: 10.1186/s12915-023-01686-z.
3
Better duplicate detection for systematic reviewers: evaluation of Systematic Review Assistant-Deduplication Module.为系统评价者提供更好的重复检测:系统评价助手-重复数据删除模块的评估
Syst Rev. 2015 Jan 14;4(1):6. doi: 10.1186/2046-4053-4-6.
4
Considerations for conducting systematic reviews: A follow-up study to evaluate the performance of various automated methods for reference de-duplication.考虑进行系统评价:评估各种自动参考文献去重方法性能的后续研究。
Res Synth Methods. 2024 Nov;15(6):896-904. doi: 10.1002/jrsm.1736. Epub 2024 Jul 25.
5
Reducing systematic review burden using Deduklick: a novel, automated, reliable, and explainable deduplication algorithm to foster medical research.利用 Deduklick 减少系统综述负担:一种新颖、自动化、可靠且可解释的去重算法,以促进医学研究。
Syst Rev. 2022 Aug 17;11(1):172. doi: 10.1186/s13643-022-02045-9.
6
Deduplicating records in systematic reviews: there are free, accurate automated ways to do so.在系统评价中去除重复记录:有免费、准确的自动化方法来做到这一点。
J Clin Epidemiol. 2022 Dec;152:110-115. doi: 10.1016/j.jclinepi.2022.10.009. Epub 2022 Oct 12.
7
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
8
Rule-based deduplication of article records from bibliographic databases.基于规则对书目数据库中的文章记录进行重复数据删除。
Database (Oxford). 2014 Jan 16;2014:bat086. doi: 10.1093/database/bat086. Print 2014.
9
Machine learning reduced workload with minimal risk of missing studies: development and evaluation of a randomized controlled trial classifier for Cochrane Reviews.机器学习减少了工作量,同时最小化了漏检研究的风险:一项用于 Cochrane 综述的随机对照试验分类器的开发和评估。
J Clin Epidemiol. 2021 May;133:140-151. doi: 10.1016/j.jclinepi.2020.11.003. Epub 2020 Nov 7.
10
Enhancing recall in automated record screening: A resampling algorithm.增强自动化记录筛选中的召回率:一种重抽样算法。
Res Synth Methods. 2024 May;15(3):372-383. doi: 10.1002/jrsm.1690. Epub 2024 Jan 7.

引用本文的文献

1
Mechanisms of Virtual Reality-Based Relaxation in Older Adults: A Scoping Review.老年人基于虚拟现实的放松机制:一项范围综述。
J Clin Med. 2025 Aug 29;14(17):6126. doi: 10.3390/jcm14176126.
2
The use of implementation mapping in healthcare settings: a scoping review.实施映射在医疗环境中的应用:一项范围综述。
Front Public Health. 2025 Jul 16;13:1603178. doi: 10.3389/fpubh.2025.1603178. eCollection 2025.
3
Failure of Passive Immune Transfer in Neonatal Beef Calves: A Scoping Review.新生肉牛犊被动免疫转移失败:一项综述

本文引用的文献

1
The 2-week systematic review (2weekSR) method was successfully blind-replicated by another team: a case study.另一个团队成功地对为期两周的系统评价(2weekSR)方法进行了盲法复制:一项案例研究。
J Clin Epidemiol. 2024 Jan;165:111197. doi: 10.1016/j.jclinepi.2023.10.013. Epub 2023 Oct 23.
2
We extended the 2-week systematic review (2weekSR) methodology to larger, more complex systematic reviews: A case series.我们将 2 周系统综述(2weekSR)方法扩展到更大、更复杂的系统综述:病例系列。
J Clin Epidemiol. 2023 May;157:112-119. doi: 10.1016/j.jclinepi.2023.03.007. Epub 2023 Mar 8.
3
Deduplicating records in systematic reviews: there are free, accurate automated ways to do so.
Animals (Basel). 2025 Jul 14;15(14):2072. doi: 10.3390/ani15142072.
4
Family Physicians' Perceived Needs Regarding Their Mental Health and Wellbeing in Infectious Catastrophic Events: A Mixed Studies Literature Review.家庭医生在传染性灾难事件中对自身心理健康和幸福的感知需求:一项混合研究文献综述
J Prim Care Community Health. 2025 Jan-Dec;16:21501319251356557. doi: 10.1177/21501319251356557. Epub 2025 Jul 23.
5
Prognostic models for predicting patient arrivals in emergency departments: an updated systematic review and research agenda.预测急诊科患者就诊情况的预后模型:最新系统评价与研究议程
BMC Emerg Med. 2025 Jul 1;25(1):106. doi: 10.1186/s12873-025-01250-8.
6
Comparative Efficacy of Minoxidil and 5-Alpha Reductase Inhibitors Monotherapy for Male Pattern Hair Loss: Network Meta-Analysis Study of Current Empirical Evidence.米诺地尔与5α还原酶抑制剂单药治疗男性型脱发的疗效比较:当前实证证据的网状Meta分析研究
J Cosmet Dermatol. 2025 Jul;24(7):e70320. doi: 10.1111/jocd.70320.
7
Leishmaniases in Ethiopia: a scoping review.埃塞俄比亚的利什曼病:一项范围综述
BMJ Open. 2025 Jun 19;15(6):e100284. doi: 10.1136/bmjopen-2025-100284.
8
Fecal microbiota transplantation from patients into animals to establish human microbiota-associated animal models: a scoping review.将患者的粪便微生物群移植到动物体内以建立人类微生物群相关动物模型:一项范围综述
J Transl Med. 2025 Jun 17;23(1):662. doi: 10.1186/s12967-025-06645-6.
9
Postoperative physiotherapy interventions in hospitalized adults undergoing pulmonary resection surgery. A protocol for a scoping review.接受肺切除手术的住院成人的术后物理治疗干预措施。一项范围综述方案。
MethodsX. 2025 May 1;14:103349. doi: 10.1016/j.mex.2025.103349. eCollection 2025 Jun.
10
Iron deficiency in patients with cardiogenic shock: protocol for a scoping review.心源性休克患者的缺铁:一项范围综述方案
BMJ Open. 2025 Apr 19;15(4):e092891. doi: 10.1136/bmjopen-2024-092891.
在系统评价中去除重复记录:有免费、准确的自动化方法来做到这一点。
J Clin Epidemiol. 2022 Dec;152:110-115. doi: 10.1016/j.jclinepi.2022.10.009. Epub 2022 Oct 12.
4
Reducing systematic review burden using Deduklick: a novel, automated, reliable, and explainable deduplication algorithm to foster medical research.利用 Deduklick 减少系统综述负担:一种新颖、自动化、可靠且可解释的去重算法,以促进医学研究。
Syst Rev. 2022 Aug 17;11(1):172. doi: 10.1186/s13643-022-02045-9.
5
Push versus gravity for intermittent bolus gavage tube feeding of preterm and low birth weight infants.轻推与重力在早产儿和低出生体重儿间歇性推注管饲中的比较。
Cochrane Database Syst Rev. 2021 Aug 4;8(8):CD005249. doi: 10.1002/14651858.CD005249.pub3.
6
Interventions for cutaneous disease in systemic lupus erythematosus.治疗系统性红斑狼疮皮肤病变的干预措施。
Cochrane Database Syst Rev. 2021 Mar 9;3(3):CD007478. doi: 10.1002/14651858.CD007478.pub2.
7
Considerations for conducting systematic reviews: evaluating the performance of different methods for de-duplicating references.考虑进行系统评价时:评估不同参考文献去重方法的性能。
Syst Rev. 2021 Jan 23;10(1):38. doi: 10.1186/s13643-021-01583-y.
8
Interventions for frostbite injuries.冻伤损伤的干预措施。
Cochrane Database Syst Rev. 2020 Dec 20;12(12):CD012980. doi: 10.1002/14651858.CD012980.pub2.
9
Number of embryos for transfer following in vitro fertilisation or intra-cytoplasmic sperm injection.体外受精或卵胞浆内单精子注射后移植的胚胎数量。
Cochrane Database Syst Rev. 2020 Aug 21;8(8):CD003416. doi: 10.1002/14651858.CD003416.pub5.
10
Beta-blockers for congestive heart failure in children.用于儿童充血性心力衰竭的β受体阻滞剂。
Cochrane Database Syst Rev. 2020 Jul 23;7(7):CD007037. doi: 10.1002/14651858.CD007037.pub4.