Bousquet Cédric, Souvignet Julien, Sadou Éric, Jaulent Marie-Christine, Declerck Gunnar
Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances en e-Santé, LIMICS, Sorbonne Université, Inserm, Université Paris 13, Paris, France.
Unit of Public Health and Medical Informatics, University of Saint Etienne, Saint Etienne, France.
Front Pharmacol. 2019 Sep 10;10:975. doi: 10.3389/fphar.2019.00975. eCollection 2019.
Formal definitions allow selecting terms (e.g., identifying all terms related to "Infectious disease" using the query "has causative agent organism") and terminological reasoning (e.g., "hepatitis B" is a "hepatitis" and is an "infectious disease"). However, the standard international terminology Medical Dictionary for Regulatory Activities (MedDRA) used for coding adverse drug reactions in pharmacovigilance databases does not beneficiate from such formal definitions. Our objective was to evaluate the potential of reuse of ontological and non-ontological resources for generating such definitions for MedDRA. We developed several methods that collectively allow a semiautomatic semantic enrichment of MedDRA: 1) using MedDRA-to-SNOMED Clinical Terms (SNOMED CT) mappings (available in the Unified Medical Language System metathesaurus or other mapping resources, e.g., the MedDRA preferred term "hepatitis B" is associated to the SNOMED CT concept "type B viral hepatitis") to extract term definitions (e.g., "hepatitis B" is associated with the following properties: has finding site liver structure, has associated morphology inflammation morphology, and has causative agent hepatitis B virus); 2) using MedDRA labels and lexical/syntactic methods for automatic decomposition of complex MedDRA terms (e.g., the MedDRA systems organ class "blood and lymphatic system disorders" is decomposed in blood system disorders and lymphatic system disorders) or automatic suggestions of properties (e.g., the string "cyclic" in preferred term "cyclic neutropenia" leads to the property has clinical course cyclic). The Unified Medical Language System metathesaurus was the main ontological resource reusable for generating formal definitions for MedDRA terms. The non-ontological resources (another mapping resource provided by Nadkarni and Darer in 2010 and MedDRA labels) allowed defining few additional preferred terms. While the Ci4SeR tool helped the curator to define 1,935 terms by suggesting potential supplemental relations based on the parents' and siblings' semantic definition, defining manually all MedDRA terms remains expensive in time. Several ontological and non-ontological resources are available for associating MedDRA terms to SNOMED CT concepts with semantic properties, but providing manual definitions is still necessary. The ontology of adverse events is a possible alternative but does not cover all MedDRA terms either. Perspectives are to implement more efficient techniques to find more logical relations between SNOMED CT and MedDRA in an automated way.
形式定义允许选择术语(例如,使用查询“具有病原体生物体”来识别所有与“传染病”相关的术语)以及进行术语推理(例如,“乙型肝炎”是一种“肝炎”,并且是一种“传染病”)。然而,用于药物警戒数据库中不良药物反应编码的标准国际术语《监管活动医学词典》(MedDRA)并未受益于此类形式定义。我们的目标是评估重用本体论和非本体论资源为MedDRA生成此类定义的潜力。我们开发了几种方法,这些方法共同允许对MedDRA进行半自动语义丰富:1)使用MedDRA到SNOMED临床术语(SNOMED CT)的映射(可在统一医学语言系统元词表或其他映射资源中获取,例如,MedDRA首选术语“乙型肝炎”与SNOMED CT概念“B型病毒性肝炎”相关联)来提取术语定义(例如,“乙型肝炎”与以下属性相关联:具有发现部位肝脏结构、具有相关形态学炎症形态学以及具有病原体乙型肝炎病毒);2)使用MedDRA标签以及词汇/句法方法对复杂的MedDRA术语进行自动分解(例如,MedDRA系统器官类别“血液和淋巴系统疾病”被分解为血液系统疾病和淋巴系统疾病)或自动生成属性建议(例如,首选术语“周期性中性粒细胞减少症”中的字符串“周期性”导致具有临床病程周期性的属性)。统一医学语言系统元词表是可用于为MedDRA术语生成形式定义的主要本体论资源。非本体论资源(Nadkarni和Darer在2010年提供的另一种映射资源以及MedDRA标签)允许定义少量额外的首选术语。虽然Ci4SeR工具通过基于父项和兄弟项的语义定义建议潜在的补充关系帮助编目员定义了1935个术语,但手动定义所有MedDRA术语在时间上仍然成本高昂。有几种本体论和非本体论资源可用于将MedDRA术语与具有语义属性的SNOMED CT概念相关联,但仍然需要提供手动定义。不良事件本体是一种可能的替代方案,但也不能涵盖所有MedDRA术语。未来的方向是实施更有效的技术,以自动方式在SNOMED CT和MedDRA之间找到更多逻辑关系。