• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

诊断试验准确性系统评价中的数据提取与合成:一个用于自动化和评估该过程的语料库

Data Extraction and Synthesis in Systematic Reviews of Diagnostic Test Accuracy: A Corpus for Automating and Evaluating the Process.

作者信息

Norman Christopher, Leeflang Mariska, Névéol Aurélie

机构信息

LIMSI, CNRS, Université Paris Saclay, F-91405 Orsay.

Academic Medical Center, University of Amsterdam, Amsterdam, the Netherlands.

出版信息

AMIA Annu Symp Proc. 2018 Dec 5;2018:817-826. eCollection 2018.

PMID:30815124
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6371350/
Abstract

BACKGROUND

Systematic reviews are critical for obtaining accurate estimates of diagnostic test accuracy, yet these require extracting information buried in free text articles, an often laborious process.

OBJECTIVE

We create a dataset describing the data extraction and synthesis processes in 63 DTA systematic reviews, and demonstrate its utility by using it to replicate the data synthesis in the original reviews.

METHOD

We construct our dataset using a custom automated extraction pipeline complemented with manual extraction, verification, and post-editing. We evaluate using manual assessment by two annotators and by comparing against data extracted from source files.

RESULTS

The constructed dataset contains 5,848 test results for 1,354 diagnostic tests from 1,738 diagnostic studies. We observe an extraction error rate of 0.06-0.3%.

CONCLUSIONS

This constitutes the first dataset describing the later stages of the DTA systematic review process, and is intended to be useful for automating or evaluating the process.

摘要

背景

系统评价对于获得诊断试验准确性的准确估计至关重要,但这需要从自由文本文章中提取信息,这通常是一个费力的过程。

目的

我们创建了一个数据集,描述了63项诊断准确性系统评价中的数据提取和综合过程,并通过使用该数据集在原始评价中复制数据综合来证明其效用。

方法

我们使用定制的自动提取管道构建数据集,并辅以手动提取、验证和后期编辑。我们通过两名注释者的人工评估以及与从源文件中提取的数据进行比较来进行评估。

结果

构建的数据集包含来自1738项诊断研究的1354项诊断试验的5848个试验结果。我们观察到提取错误率为0.06 - 0.3%。

结论

这构成了第一个描述诊断准确性系统评价过程后期阶段的数据集,旨在用于自动化或评估该过程。

相似文献

1
Data Extraction and Synthesis in Systematic Reviews of Diagnostic Test Accuracy: A Corpus for Automating and Evaluating the Process.诊断试验准确性系统评价中的数据提取与合成:一个用于自动化和评估该过程的语料库
AMIA Annu Symp Proc. 2018 Dec 5;2018:817-826. eCollection 2018.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Measuring quality of reporting in systematic reviews of diagnostic test accuracy studies in medical imaging: comparison of PRISMA-DTA and PRISMA.医学影像诊断试验准确性研究系统评价中报告质量的衡量:PRISMA-DTA与PRISMA的比较
Ultrasound Obstet Gynecol. 2023 Feb;61(2):257-266. doi: 10.1002/uog.26043. Epub 2023 Jan 12.
4
Increased workload for systematic review literature searches of diagnostic tests compared with treatments: challenges and opportunities.与治疗相比,诊断测试系统评价文献检索的工作量增加:挑战与机遇。
JMIR Med Inform. 2014 May 27;2(1):e11. doi: 10.2196/medinform.3037.
5
IPDmada: An R Shiny tool for analyzing and visualizing individual patient data meta-analyses of diagnostic test accuracy.IPDmada:一个用于分析和可视化诊断测试准确性的个体患者数据荟萃分析的 R Shiny 工具。
Res Synth Methods. 2021 Jan;12(1):45-54. doi: 10.1002/jrsm.1444. Epub 2020 Sep 9.
6
Risk of bias assessment of test comparisons was uncommon in comparative accuracy systematic reviews: an overview of reviews.比较准确性系统评价中,对试验比较的偏倚风险评估并不常见:系统评价概述。
J Clin Epidemiol. 2020 Nov;127:167-174. doi: 10.1016/j.jclinepi.2020.08.007. Epub 2020 Aug 13.
7
Search strategies to identify diagnostic accuracy studies in MEDLINE and EMBASE.在MEDLINE和EMBASE中识别诊断准确性研究的检索策略。
Cochrane Database Syst Rev. 2013 Sep 11;2013(9):MR000022. doi: 10.1002/14651858.MR000022.pub3.
8
Meta-analyses of diagnostic test accuracy could not be reproduced.无法重现诊断性测试准确性的荟萃分析。
J Clin Epidemiol. 2020 Nov;127:161-166. doi: 10.1016/j.jclinepi.2020.06.033. Epub 2020 Jul 15.
9
A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy.对诊断试验准确性系统评价中如何检验异质性的方法学综述。
Health Technol Assess. 2005 Mar;9(12):1-113, iii. doi: 10.3310/hta9120.
10
Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies: The PRISMA-DTA Statement.诊断测试准确性研究的系统评价和荟萃分析的首选报告项目:PRISMA-DTA 声明。
JAMA. 2018 Jan 23;319(4):388-396. doi: 10.1001/jama.2017.19163.

引用本文的文献

1
Antiseptics as effective virucidal agents against SARS-CoV-2: Systematic review and Bayesian network meta-analysis.作为对抗新型冠状病毒有效杀病毒剂的防腐剂:系统评价与贝叶斯网络荟萃分析
Jpn Dent Sci Rev. 2025 Dec;61:138-154. doi: 10.1016/j.jdsr.2025.05.001. Epub 2025 Jun 10.
2
Imaging-validated correlates and implications of the pathophysiologic mechanisms of ageing-related cerebral large artery and small vessel diseases: a systematic review and meta-analysis.影像学验证的衰老相关大脑大动脉和小血管疾病病理生理机制的相关性及影响:一项系统评价和荟萃分析
Behav Brain Funct. 2025 Apr 22;21(1):12. doi: 10.1186/s12993-025-00274-1.
3
Toward Automated Data Extraction According to Tabular Data Structure: Cross-sectional Pilot Survey of the Comparative Clinical Literature.迈向基于表格数据结构的自动数据提取:比较临床文献的横断面试点调查
JMIR Form Res. 2021 Nov 24;5(11):e33124. doi: 10.2196/33124.
4
Amplifying Domain Expertise in Clinical Data Pipelines.增强临床数据管道中的领域专业知识。
JMIR Med Inform. 2020 Nov 5;8(11):e19612. doi: 10.2196/19612.
5
Measuring the impact of screening automation on meta-analyses of diagnostic test accuracy.测量筛查自动化对诊断性试验准确性的荟萃分析的影响。
Syst Rev. 2019 Oct 28;8(1):243. doi: 10.1186/s13643-019-1162-x.

本文引用的文献

1
Three Dimensions of Reproducibility in Natural Language Processing.自然语言处理中可重复性的三个维度
LREC Int Conf Lang Resour Eval. 2018 May;2018:156-165.
2
Rapid diagnostic tests for typhoid and paratyphoid (enteric) fever.伤寒和副伤寒(肠道)热的快速诊断检测
Cochrane Database Syst Rev. 2017 May 26;5(5):CD008892. doi: 10.1002/14651858.CD008892.pub2.
3
Extracting PICO Sentences from Clinical Trial Reports using .使用……从临床试验报告中提取PICO句子
J Mach Learn Res. 2016;17.
4
What does research reproducibility mean?研究的可重复性是什么意思?
Sci Transl Med. 2016 Jun 1;8(341):341ps12. doi: 10.1126/scitranslmed.aaf5027.
5
1,500 scientists lift the lid on reproducibility.1500名科学家揭开了可重复性的盖子。
Nature. 2016 May 26;533(7604):452-4. doi: 10.1038/533452a.
6
Increased workload for systematic review literature searches of diagnostic tests compared with treatments: challenges and opportunities.与治疗相比,诊断测试系统评价文献检索的工作量增加:挑战与机遇。
JMIR Med Inform. 2014 May 27;2(1):e11. doi: 10.2196/medinform.3037.
7
Using text mining for study identification in systematic reviews: a systematic review of current approaches.在系统评价中使用文本挖掘进行研究识别:当前方法的系统评价
Syst Rev. 2015 Jan 14;4(1):5. doi: 10.1186/2046-4053-4-5.
8
Systematic review automation technologies.系统评价自动化技术。
Syst Rev. 2014 Jul 9;3:74. doi: 10.1186/2046-4053-3-74.
9
ExaCT: automatic extraction of clinical trial characteristics from journal publications.ExaCT:从期刊出版物中自动提取临床试验特征。
BMC Med Inform Decis Mak. 2010 Sep 28;10:56. doi: 10.1186/1472-6947-10-56.
10
Systematic reviews of diagnostic test accuracy.诊断试验准确性的系统评价。
Ann Intern Med. 2008 Dec 16;149(12):889-97. doi: 10.7326/0003-4819-149-12-200812160-00008.