Using Probabilistic Record Linkage of Structured and Unstructured Data to Identify Duplicate Cases in Spontaneous Adverse Event Reporting Systems.

Authors

Kreimeyer Kory, Menschik David, Winiecki Scott, Paul Wendy, Barash Faith, Woo Emily Jane, Alimchandani Meghna, Arya Deepa, Zinderman Craig, Forshee Richard, Botsis Taxiarchis

Affiliation

Office of Biostatistics and Epidemiology, Center for Biologics Evaluation and Research, US Food and Drug Administration, 10903 New Hampshire Ave, Silver Spring, MD, 20993-0002, USA.

Publication

Drug Saf. 2017 Jul;40(7):571-582. doi: 10.1007/s40264-017-0523-4.

DOI: 10.1007/s40264-017-0523-4
PMID: 28293864

Abstract

INTRODUCTION

Duplicate case reports in spontaneous adverse event reporting systems pose a challenge for medical reviewers to efficiently perform individual and aggregate safety analyses. Duplicate cases can bias data mining by generating spurious signals of disproportional reporting of product-adverse event pairs.
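To make the bias concrete, here is a minimal sketch of how duplicates can inflate a disproportionality statistic. It uses the proportional reporting ratio (PRR) as one common measure; the abstract does not say which measure the authors have in mind, and the function name `prr` and all counts below are invented for illustration.

```python
def prr(a, b, c, d):
    """Proportional reporting ratio from a 2x2 table of report counts.

    a: reports with the product and the event
    b: reports with the product, without the event
    c: reports with other products and the event
    d: reports with other products, without the event
    """
    return (a / (a + b)) / (c / (c + d))


# Hypothetical counts: the product reports the event at the background rate.
baseline = prr(3, 97, 30, 970)

# The same data after three of the product-event reports are each
# submitted a second time (for example by a manufacturer and a clinician).
with_duplicates = prr(6, 97, 30, 970)

print(f"PRR without duplicates: {baseline:.2f}")
print(f"PRR with duplicates:    {with_duplicates:.2f}")
```

In this made-up table, three duplicated reports are enough to roughly double the ratio, which is the kind of spurious signal the introduction describes.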

OBJECTIVE

We have developed a probabilistic record linkage algorithm for identifying duplicate cases in the US Vaccine Adverse Event Reporting System (VAERS) and the US Food and Drug Administration Adverse Event Reporting System (FAERS).

METHODS

In addition to using structured field data, the algorithm incorporates the non-structured narrative text of adverse event reports by examining clinical and temporal information extracted by the Event-based Text-mining of Health Electronic Records system, a natural language processing tool. The final component of the algorithm is a novel duplicate confidence value that is calculated by a rule-based empirical approach that looks for similarities in a number of criteria between two case reports.
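As a rough illustration of probabilistic record linkage on structured fields, the sketch below scores a pair of reports with log-likelihood-ratio weights in the standard Fellegi-Sunter style. It is not the authors' implementation: the field names, the m and u probabilities, and the function `match_weight` are all assumptions, and neither the text-derived features nor the rule-based duplicate confidence value is modelled, because the abstract does not specify them.

```python
import math

# Hypothetical per-field probabilities (assumed, not from the paper):
#   m = P(field agrees | the two reports are true duplicates)
#   u = P(field agrees | the two reports are unrelated)
FIELD_PARAMS = {
    "age": (0.95, 0.02),
    "sex": (0.98, 0.50),
    "onset_date": (0.90, 0.01),
    "product": (0.97, 0.05),
}


def match_weight(report_a, report_b, params=FIELD_PARAMS):
    """Sum agreement and disagreement weights over the compared fields."""
    total = 0.0
    for field, (m, u) in params.items():
        value_a = report_a.get(field)
        value_b = report_b.get(field)
        if value_a is None or value_b is None:
            continue  # a missing value contributes no evidence either way
        if value_a == value_b:
            total += math.log2(m / u)              # agreement weight
        else:
            total += math.log2((1 - m) / (1 - u))  # disagreement weight
    return total


report_1 = {"age": 34, "sex": "F", "onset_date": "2015-03-02", "product": "X"}
report_2 = {"age": 34, "sex": "F", "onset_date": "2015-03-02", "product": "X"}
report_3 = {"age": 61, "sex": "M", "onset_date": "2014-11-20", "product": "Y"}

print(match_weight(report_1, report_2))  # high score: candidate duplicate
print(match_weight(report_1, report_3))  # negative score: likely distinct
```

Pairs whose total weight exceeds a chosen threshold would be flagged as candidate duplicates. Under the same scheme, features extracted from the narrative text could be added as further entries in `FIELD_PARAMS`, though how the paper actually combines them is not stated in the abstract.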

RESULTS

For VAERS, the algorithm identified 77% of known duplicate pairs with a precision (or positive predictive value) of 95%. For FAERS, it identified 13% of known duplicate pairs with a precision of 100%. The textual information did not improve the algorithm's automated classification for VAERS or FAERS. The empirical duplicate confidence value increased performance on both VAERS and FAERS, mainly by reducing the occurrence of false-positives.
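The two figures reported for each system correspond to recall (the share of known duplicate pairs that were found) and precision (the share of flagged pairs that are known duplicates). A minimal sketch of that calculation follows; the function name `precision_recall` and the report identifiers are hypothetical and are not taken from the study data.

```python
def precision_recall(predicted_pairs, known_pairs):
    """Compare flagged duplicate pairs against a reference set of known pairs.

    Each pair is treated as unordered, so (1, 2) and (2, 1) are the same pair.
    """
    predicted = {frozenset(pair) for pair in predicted_pairs}
    known = {frozenset(pair) for pair in known_pairs}
    true_positives = len(predicted & known)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(known) if known else 0.0
    return precision, recall


# Hypothetical report IDs, for illustration only.
known = [(1, 2), (3, 4), (5, 6), (7, 8)]
flagged = [(2, 1), (3, 4), (9, 10)]

precision, recall = precision_recall(flagged, known)
print(f"precision={precision:.2f}, recall={recall:.2f}")
```

Read this way, the FAERS result (13% of known pairs found at 100% precision) describes an algorithm that flags few pairs but is correct on every pair it flags.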

CONCLUSIONS

The algorithm was shown to be effective at identifying pre-linked duplicate VAERS reports. The narrative text was not shown to be a key component in the automated detection evaluation; however, it is essential for supporting the semi-automated approach that is likely to be deployed at the Food and Drug Administration, where medical reviewers will perform some manual review of the most highly ranked reports identified by the algorithm.
