利用 Deduklick 减少系统综述负担：一种新颖、自动化、可靠且可解释的去重算法，以促进医学研究。

Reducing systematic review burden using Deduklick: a novel, automated, reliable, and explainable deduplication algorithm to foster medical research.

机构信息

Risklick AG, Spin-Off, University of Bern, Bern, Switzerland.

CTU Bern, University of Bern, Bern, Switzerland.

出版信息

Syst Rev. 2022 Aug 17;11(1):172. doi: 10.1186/s13643-022-02045-9.

DOI:10.1186/s13643-022-02045-9

PMID:35978441

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9382798/

Abstract

BACKGROUND

Identifying and removing reference duplicates when conducting systematic reviews (SRs) remain a major, time-consuming issue for authors who manually check for duplicates using built-in features in citation managers. To address issues related to manual deduplication, we developed an automated, efficient, and rapid artificial intelligence-based algorithm named Deduklick. Deduklick combines natural language processing algorithms with a set of rules created by expert information specialists.

METHODS

Deduklick's deduplication uses a multistep algorithm of data normalization, calculates a similarity score, and identifies unique and duplicate references based on metadata fields, such as title, authors, journal, DOI, year, issue, volume, and page number range. We measured and compared Deduklick's capacity to accurately detect duplicates with the information specialists' standard, manual duplicate removal process using EndNote on eight existing heterogeneous datasets. Using a sensitivity analysis, we manually cross-compared the efficiency and noise of both methods.

DISCUSSION

Deduklick achieved average recall of 99.51%, average precision of 100.00%, and average F1 score of 99.75%. In contrast, the manual deduplication process achieved average recall of 88.65%, average precision of 99.95%, and average F1 score of 91.98%. Deduklick achieved equal to higher expert-level performance on duplicate removal. It also preserved high metadata quality and drastically reduced time spent on analysis. Deduklick represents an efficient, transparent, ergonomic, and time-saving solution for identifying and removing duplicates in SRs searches. Deduklick could therefore simplify SRs production and represent important advantages for scientists, including saving time, increasing accuracy, reducing costs, and contributing to quality SRs.

摘要

背景

在进行系统评价（SR）时，识别和去除参考文献重复仍然是作者手动使用引文管理器内置功能检查重复的主要耗时问题。为了解决与手动去重相关的问题，我们开发了一种名为 Deduklick 的自动化、高效、快速的基于人工智能的算法。Deduklick 将自然语言处理算法与一组由专家信息专家创建的规则相结合。

方法

Deduklick 的去重使用数据归一化的多步算法，计算相似度得分，并根据元数据字段（如标题、作者、期刊、DOI、年份、问题、卷和页码范围）识别唯一和重复的参考文献。我们使用 EndNote 在八个现有的异构数据集上测量和比较了 Deduklick 准确检测重复的能力与信息专家的标准、手动重复去除过程。使用敏感性分析，我们手动交叉比较了两种方法的效率和噪声。

讨论

Deduklick 的平均召回率为 99.51%，平均精度为 100.00%，平均 F1 分数为 99.75%。相比之下，手动去重过程的平均召回率为 88.65%，平均精度为 99.95%，平均 F1 分数为 91.98%。Deduklick 在去除重复方面达到了与专家水平相当甚至更高的性能。它还保持了较高的元数据质量，并大大减少了分析所花费的时间。Deduklick 为识别和去除 SR 搜索中的重复提供了一种高效、透明、符合人体工程学且节省时间的解决方案。因此，Deduklick 可以简化 SR 的制作，并为科学家们带来重要的优势，包括节省时间、提高准确性、降低成本和有助于制作高质量的 SR。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b42/9382798/59ce5be4dfd1/13643_2022_2045_Fig1_HTML.jpg

相似文献

Reducing systematic review burden using Deduklick: a novel, automated, reliable, and explainable deduplication algorithm to foster medical research.利用 Deduklick 减少系统综述负担：一种新颖、自动化、可靠且可解释的去重算法，以促进医学研究。

Syst Rev. 2022 Aug 17;11(1):172. doi: 10.1186/s13643-022-02045-9.

Better duplicate detection for systematic reviewers: evaluation of Systematic Review Assistant-Deduplication Module.为系统评价者提供更好的重复检测：系统评价助手-重复数据删除模块的评估

Syst Rev. 2015 Jan 14;4(1):6. doi: 10.1186/2046-4053-4-6.

The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews.自动化系统搜索去重器 (ASySD)：一种快速、开源、可互操作的工具，用于去除生物医学系统评价中的重复引文。

BMC Biol. 2023 Sep 7;21(1):189. doi: 10.1186/s12915-023-01686-z.

Automation of duplicate record detection for systematic reviews: Deduplicator.用于系统评价的重复记录检测自动化：Deduplicator。

Syst Rev. 2024 Aug 2;13(1):206. doi: 10.1186/s13643-024-02619-9.

Considerations for conducting systematic reviews: A follow-up study to evaluate the performance of various automated methods for reference de-duplication.考虑进行系统评价：评估各种自动参考文献去重方法性能的后续研究。

Res Synth Methods. 2024 Nov;15(6):896-904. doi: 10.1002/jrsm.1736. Epub 2024 Jul 25.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Deduplicating records in systematic reviews: there are free, accurate automated ways to do so.在系统评价中去除重复记录：有免费、准确的自动化方法来做到这一点。

J Clin Epidemiol. 2022 Dec;152:110-115. doi: 10.1016/j.jclinepi.2022.10.009. Epub 2022 Oct 12.

Considerations for conducting systematic reviews: evaluating the performance of different methods for de-duplicating references.考虑进行系统评价时：评估不同参考文献去重方法的性能。

Syst Rev. 2021 Jan 23;10(1):38. doi: 10.1186/s13643-021-01583-y.

Artificial intelligence in systematic reviews: promising when appropriately used.系统评价中的人工智能：恰当使用时前景广阔。

BMJ Open. 2023 Jul 7;13(7):e072254. doi: 10.1136/bmjopen-2023-072254.

Evaluating the relationship between citation set size, team size and screening methods used in systematic reviews: a cross-sectional study.评估系统评价中引文集大小、团队规模和使用的筛选方法之间的关系：一项横断面研究。

BMC Med Res Methodol. 2021 Jul 8;21(1):142. doi: 10.1186/s12874-021-01335-5.

引用本文的文献

Coronary atherosclerosis screening in asymptomatic adults using coronary artery calcium for cardiovascular prevention: a systematic review of randomised controlled trials and prospective cohorts.使用冠状动脉钙化对无症状成年人进行冠状动脉粥样硬化筛查以预防心血管疾病：随机对照试验和前瞻性队列的系统评价

BMJ Open. 2025 Jul 5;15(7):e101472. doi: 10.1136/bmjopen-2025-101472.

Heat as a prognostic factor for the development and progression of diabetes: a systematic review and meta-analysis.热作为糖尿病发生和进展的一个预后因素：一项系统评价和荟萃分析。

Cochrane Database Syst Rev. 2025 Jul 2;7(7):CD016289. doi: 10.1002/14651858.CD016289.

Assessment of digital therapeutics in decentralized clinical trials: A scoping review.分散式临床试验中数字疗法的评估：一项范围综述。

PLOS Digit Health. 2025 Jun 23;4(6):e0000905. doi: 10.1371/journal.pdig.0000905. eCollection 2025 Jun.

Fecal microbiota transplantation from patients into animals to establish human microbiota-associated animal models: a scoping review.将患者的粪便微生物群移植到动物体内以建立人类微生物群相关动物模型：一项范围综述

J Transl Med. 2025 Jun 17;23(1):662. doi: 10.1186/s12967-025-06645-6.

High Risk of Chronic Endometritis in Isthmocele-A Systematic Review and Meta-Analysis.子宫峡部憩室患者慢性子宫内膜炎的高风险——一项系统评价与Meta分析

J Clin Med. 2025 May 22;14(11):3628. doi: 10.3390/jcm14113628.

Effectiveness of SARS-CoV-2 testing strategies: A scoping review.严重急性呼吸综合征冠状病毒2（SARS-CoV-2）检测策略的有效性：一项范围综述。

Cochrane Evid Synth Methods. 2023 Nov 21;1(9):e12030. doi: 10.1002/cesm.12030. eCollection 2023 Nov.

High live birth rates after laparoscopic isthmocele repair in infertility: a systematic review and meta-analysis.腹腔镜下峡部憩室修复术后治疗不孕症的高活产率：一项系统评价和荟萃分析。

Front Endocrinol (Lausanne). 2025 Apr 15;16:1507482. doi: 10.3389/fendo.2025.1507482. eCollection 2025.

Exploring the design and impact of integrated health and social care services for children and young people living in underserved populations: a systematic review.探索为生活在服务不足人群中的儿童和青少年提供的综合健康与社会护理服务的设计与影响：一项系统综述。

BMC Public Health. 2025 Apr 11;25(1):1359. doi: 10.1186/s12889-025-22508-7.

Work smart, not hard: analysis of delays faced by clinical trials investigating spinal fusion using Protocol AI.事半功倍：使用协议人工智能对脊柱融合临床试验所面临延误情况的分析。

Front Surg. 2025 Mar 27;12:1546367. doi: 10.3389/fsurg.2025.1546367. eCollection 2025.

Impact of haematopoietic stem cell transplantation for benign and malignant haematologic and non-haematologic disorders on fertility: a systematic review and meta-analysis.造血干细胞移植治疗良性和恶性血液系统及非血液系统疾病对生育能力的影响：一项系统评价和荟萃分析。

Bone Marrow Transplant. 2025 May;60(5):645-672. doi: 10.1038/s41409-025-02520-6. Epub 2025 Feb 26.

本文引用的文献

BMC Biol. 2023 Sep 7;21(1):189. doi: 10.1186/s12915-023-01686-z.

Technological advances in preclinical meta-research.临床前元研究中的技术进步。

BMJ Open Sci. 2021 Jul 25;5(1):e100131. doi: 10.1136/bmjos-2020-100131. eCollection 2021.

The PRISMA 2020 statement: an updated guideline for reporting systematic reviews.PRISMA 2020 声明：系统评价报告的更新指南。

BMJ. 2021 Mar 29;372:n71. doi: 10.1136/bmj.n71.

PRISMA-S: an extension to the PRISMA Statement for Reporting Literature Searches in Systematic Reviews.PRISMA-S：用于在系统评价中报告文献检索的 PRISMA 声明的扩展。

Syst Rev. 2021 Jan 26;10(1):39. doi: 10.1186/s13643-020-01542-z.

Syst Rev. 2021 Jan 23;10(1):38. doi: 10.1186/s13643-021-01583-y.

The European artificial intelligence strategy: implications and challenges for digital health.欧盟人工智能战略：对数字健康的影响和挑战。

Lancet Digit Health. 2020 Jul;2(7):e376-e379. doi: 10.1016/S2589-7500(20)30112-6. Epub 2020 Jun 23.

A full systematic review was completed in 2 weeks using automation tools: a case study.在两周内使用自动化工具完成了全面的系统回顾：案例研究。

J Clin Epidemiol. 2020 May;121:81-90. doi: 10.1016/j.jclinepi.2020.01.008. Epub 2020 Jan 28.

Glossary for systematic reviews and meta-analyses.系统评价和荟萃分析词汇表。

Int Endod J. 2020 Feb;53(2):232-249. doi: 10.1111/iej.13217. Epub 2019 Nov 25.

revtools: An R package to support article screening for evidence synthesis.revtools：一个支持证据综合文章筛选的 R 包。

Res Synth Methods. 2019 Dec;10(4):606-614. doi: 10.1002/jrsm.1374. Epub 2019 Oct 18.

Reference checking for systematic reviews using Endnote.使用Endnote对系统评价进行参考文献核对。

J Med Libr Assoc. 2018 Oct;106(4):542-546. doi: 10.5195/jmla.2018.489. Epub 2018 Oct 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用 Deduklick 减少系统综述负担：一种新颖、自动化、可靠且可解释的去重算法，以促进医学研究。

Reducing systematic review burden using Deduklick: a novel, automated, reliable, and explainable deduplication algorithm to foster medical research.

机构信息

出版信息

BACKGROUND

METHODS

DISCUSSION

背景

方法

讨论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献