Shrestha Ingroj, Srinivasan Padmini
University of Iowa, Iowa City, Iowa, United States.
AMIA Annu Symp Proc. 2022 Feb 21;2021:1109-1118. eCollection 2021.
Mental illness, a serious problem across the globe, requires multi-pronged solutions including effective computational models to predict illness. Mental illness diagnosis is complicated by the pronounced sharing of symptoms and mutual pre-dispositions. Set in this context we offer a systematic comparison of seven deep learning and two conventional machine learning models for predicting mental illness from the history of present illness free-text descriptions in patient records. The models tested include a new architecture CB-MH which ranks best for F1 (0.62) while another attention model is best for F2 (0.71). We also explore model decisions using Integrated Gradients interpretability method which we use to identify key influential features. Overall, the majority of true positives have key features appearing in meaningful contexts. False negatives are most challenging with most key features appearing in unclear contexts. False positives are mostly true positives in actuality as supported by a small-scale clinician-based user judgement study.
精神疾病是全球范围内的一个严重问题,需要多方面的解决方案,包括有效的计算模型来预测疾病。精神疾病的诊断因症状的显著重叠和相互易感性而变得复杂。在此背景下,我们对七种深度学习模型和两种传统机器学习模型进行了系统比较,这些模型用于根据患者记录中现病史的自由文本描述来预测精神疾病。测试的模型包括一种新架构CB-MH,其F1得分最高(0.62),而另一种注意力模型在F2方面表现最佳(0.71)。我们还使用集成梯度可解释性方法探索模型决策,该方法用于识别关键影响特征。总体而言,大多数真阳性在有意义的上下文中具有关键特征。假阴性最具挑战性,大多数关键特征出现在不明确的上下文中。根据一项基于小规模临床医生的用户判断研究,误报实际上大多是真阳性。