IBM Research, Yorktown Heights, NY.
AMIA Annu Symp Proc. 2024 Jan 11;2023:484-493. eCollection 2023.
Knowledge of social determinants of health (SDOH), which refer to nonmedical factors influencing health outcomes, can help providers improve patient care. However, SDOH are often documented in unstructured notes, making them more inaccessible. Although previous works have attempted SDOH extraction from clinical notes, most efforts defined SDOH more narrowly and focused on the note's social history (SH) section, where social factors are traditionally documented. Here, we introduce a new SDOH dataset covering a broad range of SDOH content that is annotated over entire notes. We characterize what, where, and how SDOH information is documented in clinical text, present baseline systems using a token classification and generative approach, and investigate whether training only on the SH section can effectively extract SDOH from the entire note. The final dataset, consisting of 2,007 annotations covering 7 open-ended SDOH domains over 500 notes, will be publicly released to encourage further research in this area.
健康的社会决定因素(SDOH)知识,是指影响健康结果的非医疗因素,它可以帮助提供者改善患者护理。然而,SDOH 通常记录在非结构化的笔记中,这使得它们更难以获取。尽管之前的工作已经尝试从临床笔记中提取 SDOH,但大多数努力都将 SDOH 定义得更狭义,并且专注于记录社会因素的传统的笔记的社会历史(SH)部分。在这里,我们引入了一个新的 SDOH 数据集,涵盖了广泛的 SDOH 内容,并对整个笔记进行了注释。我们描述了 SDOH 信息在临床文本中是如何记录的,包括记录的内容、位置和方式,使用基于标记的分类和生成方法展示了基线系统,并研究了仅在 SH 部分进行训练是否可以有效地从整个笔记中提取 SDOH。最终数据集由 2007 个注释组成,涵盖了 500 多个笔记中的 7 个开放的 SDOH 领域,将公开发布以鼓励该领域的进一步研究。