Yang Xi, Lyu Tianchen, Lee Chih-Yin, Bian Jiang, Hogan William R, Wu Yonghui
Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, USA.
Proc (IEEE Int Conf Healthc Inform). 2019 Jun;2019. doi: 10.1109/ICHI.2019.8904544. Epub 2019 Nov 21.
In this study, we examined a deep learning method for de-identifying clinical notes at UF Health in a cross-institute setting. We developed deep learning models using the 2014 i2b2/UTHealth corpus and evaluated their performance on clinical notes collected from UF Health. We compared four pre-trained word embeddings: two from the general domain and two from the clinical domain. We also explored linguistic features (word shape and part-of-speech) to further improve de-identification performance. The experimental results show that the performance of deep learning models trained on the i2b2/UTHealth corpus dropped significantly when they were applied to a corpus from a different institution (UF Health): strict and relaxed F1 scores fell from 0.9547 and 0.9646 to 0.8360 and 0.8870, respectively. Linguistic features, including word shape and part-of-speech, further improved de-identification performance in the cross-institute setting (to strict and relaxed F1 scores of 0.8527 and 0.9052, respectively).
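To make the word-shape feature mentioned in the abstract concrete, here is a minimal sketch in Python. The abstract does not specify how word shapes were computed, so the character classes used here (`A` for uppercase, `a` for lowercase, `d` for digit, other characters kept as-is), the function name `word_shape`, and the `collapse` option are all illustrative assumptions rather than the authors' implementation.

```python
def word_shape(token: str, collapse: bool = False) -> str:
    """Return a coarse orthographic shape for a token.

    Assumed mapping (not taken from the paper):
      uppercase letter -> "A", lowercase letter -> "a", digit -> "d",
      anything else (punctuation, symbols) is kept unchanged.
    With collapse=True, runs of the same symbol are merged into one.
    """
    shape = []
    for ch in token:
        if ch.isupper():
            sym = "A"
        elif ch.islower():
            sym = "a"
        elif ch.isdigit():
            sym = "d"
        else:
            sym = ch
        # Skip repeated symbols when a collapsed shape is requested.
        if collapse and shape and shape[-1] == sym:
            continue
        shape.append(sym)
    return "".join(shape)


if __name__ == "__main__":
    for tok in ["Gainesville", "11/21/2019", "UF"]:
        print(tok, word_shape(tok), word_shape(tok, collapse=True))
```

A shape such as `dd/dd/dddd` exposes the pattern of a date without depending on the specific digits, which is one plausible reason a feature of this kind could help a model transfer to notes from another institution.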