Gero Zelalem, Ho Joyce C
Department of Computer Science, Emory University, Atlanta, USA.
Proc (IEEE Int Conf Healthc Inform). 2021 Aug;2021:83-92. doi: 10.1109/ichi52183.2021.00024. Epub 2021 Oct 15.
There is an increased adoption of electronic health record systems by a variety of hospitals and medical centers. This provides an opportunity to leverage automated computer systems in assisting healthcare workers. One of the least utilized but rich source of patient information is the unstructured clinical text. In this work, we develop CATAN, a chart-aware temporal attention network for learning patient representations from clinical notes. We introduce a novel representation where each note is considered a single unit, like a sentence, and composed of attention-weighted words. The notes in turn are aggregated into a patient representation using a second weighting unit, note attention. Unlike standard attention computations which focus only on the content of the note, we incorporate the chart-time for each note as a constraint for attention calculation. This allows our model to focus on notes closer to the prediction time. Using the MIMIC-III dataset, we empirically show that our patient representation and attention calculation achieves the best performance in comparison with various state-of-the-art baselines for one-year mortality prediction and 30-day hospital readmission. Moreover, the attention weights can be used to offer transparency into our model's predictions.
各种医院和医疗中心对电子健康记录系统的采用率在不断提高。这为利用自动化计算机系统协助医护人员提供了机会。患者信息的一个利用最少但内容丰富的来源是非结构化临床文本。在这项工作中,我们开发了CATAN,一种用于从临床记录中学习患者表征的图表感知时间注意力网络。我们引入了一种新颖的表征,其中每个记录都被视为一个单独的单元,类似于一个句子,并且由注意力加权的单词组成。这些记录又使用第二个加权单元(记录注意力)聚合为患者表征。与仅关注记录内容的标准注意力计算不同,我们将每个记录的图表时间作为注意力计算的约束条件。这使我们的模型能够专注于更接近预测时间的记录。使用MIMIC-III数据集,我们通过实验表明,与各种用于一年死亡率预测和30天再入院的最新基线相比,我们的患者表征和注意力计算取得了最佳性能。此外,注意力权重可用于使我们模型的预测具有透明度。