Xu Justin, Mazwi Mjaye, Johnson Alistair E W
Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, Ontario, Canada.
Department of Critical Care Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada.
JAMIA Open. 2023 Jul 8;6(3):ooad046. doi: 10.1093/jamiaopen/ooad046. eCollection 2023 Oct.
Standard ontologies are critical for interoperability and multisite analyses of health data. Nevertheless, mapping concepts to ontologies is often done with generic tools and is labor-intensive. Contextualizing candidate concepts within source data is also done in an ad hoc manner.
We present AnnoDash, a flexible dashboard to support annotation of concepts with terms from a given ontology. Text-based similarity is used to identify likely matches, and large language models are used to improve ontology ranking. A convenient interface is provided to visualize observations associated with a concept, supporting the disambiguation of vague concept descriptions. Time-series plots contrast the concept with known clinical measurements. We evaluated the dashboard qualitatively against several ontologies (SNOMED CT, LOINC, etc.) by using MIMIC-IV measurements. The dashboard is web-based and step-by-step instructions for deployment are provided, simplifying usage for nontechnical audiences. The modular code structure enables users to extend upon components, including improving similarity scoring, constructing new plots, or configuring new ontologies.
AnnoDash, an improved clinical terminology annotation tool, can facilitate data harmonizing by promoting mapping of clinical data. AnnoDash is freely available at https://github.com/justin13601/AnnoDash (https://doi.org/10.5281/zenodo.8043943).
标准本体对于健康数据的互操作性和多站点分析至关重要。然而,将概念映射到本体通常使用通用工具,且劳动强度大。在源数据中对候选概念进行情境化处理也是临时进行的。
我们展示了AnnoDash,这是一个灵活的仪表板,用于支持使用给定本体中的术语对概念进行注释。基于文本的相似度用于识别可能的匹配项,大语言模型用于改进本体排名。提供了一个方便的界面来可视化与一个概念相关的观察结果,支持消除模糊概念描述的歧义。时间序列图将该概念与已知的临床测量进行对比。我们通过使用MIMIC-IV测量对仪表板针对几种本体(SNOMED CT、LOINC等)进行了定性评估。该仪表板基于网络,并提供了部署的分步说明,简化了非技术用户的使用。模块化代码结构允许用户扩展组件,包括改进相似度评分、构建新的图表或配置新的本体。
AnnoDash是一种改进的临床术语注释工具,可通过促进临床数据的映射来推动数据协调。AnnoDash可在https://github.com/justin13601/AnnoDash(https://doi.org/10.5281/zenodo.8043943)上免费获取。