SNOMED CT实体链接挑战赛。

SNOMED CT entity linking challenge.

作者信息

Davidson Rory, Hardman Will, Amit Guy, Bilu Yonatan, Della Mea Vincenzo, Galaida Aleksandr, Girshovitz Irena, Kulyabin Mikhail, Horia Popescu Mihai, Roitero Kevin, Sokolov Gleb, Yanover Chen

机构信息

SNOMED International, London W2 6BD, United Kingdom.

Veratai Ltf, Woking GU22 7QW, United Kingdom.

出版信息

J Am Med Inform Assoc. 2025 Sep 1;32(9):1397-1406. doi: 10.1093/jamia/ocaf104.

DOI:10.1093/jamia/ocaf104

PMID:40657868

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12361850/

Abstract

OBJECTIVE

This paper presents the results from a competition challenging participants to develop entity linking models using a subset of annotated MIMIC-IV-Note data and the SNOMED CT Terminology.

MATERIALS AND METHODS

As a basis for this work, a large set of 74 808 annotations was curated across 272 discharge notes spanning 6624 unique clinical concepts. Submissions were evaluated using the mean Intersection-over-Union metric, evaluated at the character level with the 3 best performing solutions awarded a cash prize.

RESULTS

The winning solutions employed contrasting approaches: a dictionary-based method, an encoder-based method, and a decoder-based method.

DISCUSSION

Our analysis reveals that concept frequency in training data significantly impacts model performance, with rare concepts proving particularly challenging. High concept entropy and annotation ambiguity were also associated with decreased performance.

CONCLUSION

Findings from this work suggest that future projects should focus on improving entity linking for rare concepts and developing methods to better leverage contextual information when training examples are scarce.

摘要

目的

本文展示了一场竞赛的结果，该竞赛要求参与者使用带注释的MIMIC-IV-Note数据子集和SNOMED CT术语来开发实体链接模型。

材料与方法

作为这项工作的基础，我们精心整理了一大组74808条注释，涵盖272份出院记录中的6624个独特临床概念。使用平均交并比指标对提交的方案进行评估，在字符级别进行评估，表现最佳的3个解决方案将获得现金奖励。

结果

获胜方案采用了不同的方法：基于字典的方法、基于编码器的方法和基于解码器的方法。

讨论

我们的分析表明，训练数据中的概念频率会显著影响模型性能，罕见概念尤其具有挑战性。高概念熵和注释歧义也与性能下降有关。

结论

这项工作的结果表明，未来的项目应专注于改善罕见概念的实体链接，并在训练示例稀缺时开发更好地利用上下文信息的方法。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

SNOMED CT实体链接挑战赛。

SNOMED CT entity linking challenge.

作者信息

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料与方法

结果

讨论

结论

相似文献

本文引用的文献

SNOMED CT实体链接挑战赛。

SNOMED CT entity linking challenge.

作者信息

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料与方法

结果

讨论

结论

相似文献

本文引用的文献