Dupuch Marie, Grabar Natalia
CNRS UMR 8163 STL; Université Lille 1&3, F-59653 Villeneuve d'Ascq, France; INSERM, U872, Paris F-75006, France; Viseo-Objet Direct, 4, Avenue Doyen Louis Weil, F-38000 Grenoble, France.
CNRS UMR 8163 STL; Université Lille 1&3, F-59653 Villeneuve d'Ascq, France.
J Biomed Inform. 2015 Apr;54:174-85. doi: 10.1016/j.jbi.2014.11.007. Epub 2015 Feb 4.
Pharmacovigilance is the activity related to the collection, analysis and prevention of adverse drug reactions (ADRs) induced by drugs or biologics. The detection of adverse drug reactions is performed using statistical algorithms and groupings of ADR terms from the MedDRA (Medical Dictionary for Drug Regulatory Activities) terminology. Standardized MedDRA Queries (SMQs) are the groupings which become a standard for assisting the retrieval and evaluation of MedDRA-coded ADR reports worldwide. Currently 84 SMQs have been created, while several important safety topics are not yet covered. Creation of SMQs is a long and tedious process performed by the experts. It relies on manual analysis of MedDRA in order to find out all the relevant terms to be included in a SMQ. Our objective is to propose an automatic method for assisting the creation of SMQs using the clustering of terms which are semantically similar.
The experimental method relies on a specific semantic resource, and also on the semantic distance algorithms and clustering approaches. We perform several experiments in order to define the optimal parameters.
Our results show that the proposed method can assist the creation of SMQs and make this process faster and systematic. The average performance of the method is precision 59% and recall 26%. The correlation of the results obtained is 0.72 against the medical doctors judgments and 0.78 against the medical coders judgments.
These results and additional evaluation indicate that the generated clusters can be efficiently used for the detection of pharmacovigilance signals, as they provide better signal detection than the existing SMQs.
药物警戒是与收集、分析及预防药物或生物制品引起的药物不良反应(ADR)相关的活动。药物不良反应的检测通过统计算法以及使用来自《药物监管活动医学词典》(MedDRA)术语的ADR术语分组来进行。标准化MedDRA查询(SMQ)是这些分组,它们成为协助全球范围内检索和评估用MedDRA编码的ADR报告的标准。目前已创建了84个SMQ,但一些重要的安全主题尚未涵盖。创建SMQ是一个由专家执行的漫长而繁琐的过程。它依赖于对MedDRA进行人工分析,以便找出要纳入SMQ的所有相关术语。我们的目标是提出一种自动方法,利用语义相似的术语聚类来协助创建SMQ。
实验方法依赖于特定的语义资源,以及语义距离算法和聚类方法。我们进行了多项实验以确定最佳参数。
我们的结果表明,所提出的方法可以协助创建SMQ,并使这个过程更快且更具系统性。该方法的平均性能为精确率59%,召回率26%。所得结果与医生判断的相关性为0.72,与医学编码人员判断的相关性为0.78。
这些结果及进一步评估表明,生成的聚类可有效地用于药物警戒信号的检测,因为它们比现有的SMQ能提供更好的信号检测。