一种用于预测精神科住院患者常见风险的高度可扩展深度学习语言模型。

A highly scalable deep learning language model for common risks prediction among psychiatric inpatients.

作者信息

Zhu Enzhao, Wang Jiayi, Zhou Guoquan, Li Chunbo, Chen Fazhan, Ju Kang, Chen Liangliang, Yin Yichao, Chen Yi, Zhang Yanping, Zhang Xu, Zhou Xinlin, Wang Zongyuan, Qiu Jianping, Wang Hui, Shi Weizhong, Wang Feng, Wang Dong, Chen Zhihao, Hou Jiaojiao, Li Hui, Ai Zisheng

机构信息

School of Medicine, Tongji University, Shanghai, China.

Shanghai Putuo Mental Health Center, Putuo District, Shanghai, China.

出版信息

BMC Med. 2025 May 28;23(1):308. doi: 10.1186/s12916-025-04150-7.

DOI:10.1186/s12916-025-04150-7

PMID:40437564

Abstract

BACKGROUND

There is a lack of studies exploring the performance of Transformers-based language models in common risks assessment among psychiatric inpatients. We aim to develop a scalable risk assessment model using multidimensional textualized data and test the stability, robustness, and benefit of this approach.

METHODS

In this real-world cohort study, a deep learning language model was developed and validated using first hospitalized cases diagnosed with schizophrenia, bipolar disorder, and depressive disorder between January 2016 and March 2023 in three hospitals. The algorithm was externally validated on an independent testing cohort comprising 1180 patients. A total of 140 features, including first medical records (FMR), laboratory examinations, medical orders, and psychological scales, were assessed for analysis. The outcomes were short- and long-term impulsivity (STI and LTI), risk of suicide (STSS and LTSS), and need of physical restraint (STPR and LTPR) assessed by qualified nurses or clinicians. Analysis was carried out between August 2024 and June 2024. Models with different architectures and input settings were compared with each other. The area under the receiver operating characteristic curve (AUROC) was used to assess the primary performance of models. The clinical utility was determined by the net benefit under Youden's threshold.

RESULTS

Of 7451 patients included in this study, 2982 (47.6%) were male, and the median (interquartile range) age was 42 (28-57) years. The overall incidence of outcomes was 635 (8.5%), 728 (10.5%), 659 (8.8%), 803 (10.8%), 588 (7.9%), and 728 (9.8%) for STPR, LTPR, STSS, LTSS, STI, and LTI, respectively. The multitask semi-structured Transformers-based language (SSTL) model showed more promising AUROCs (STPR: 0.915; LTPR: 0.844; STSS: 0.867; LTSS: 0.879; STI: 0.899; LTI: 0.894) in the prediction of these outcomes than single-tasked or multimodal language models and traditional structured data models. Combining FMR with other data from electronic health records led to significant improvements in the performance and clinical utility of SSTL models based on demographic, diagnosis, laboratory tests, treatment, and psychological scales.

CONCLUSIONS

The SSTL model shows potential advantages in prognostic evaluation. FMR is a strong predictor for common risks prediction and may benefit other tasks in psychiatry with minimum requirements for data and data processing.

摘要

背景

目前缺乏关于基于Transformer的语言模型在精神科住院患者常见风险评估中表现的研究。我们旨在开发一种使用多维文本数据的可扩展风险评估模型，并测试该方法的稳定性、稳健性和益处。

方法

在这项真实世界队列研究中，使用2016年1月至2023年3月期间在三家医院首次住院诊断为精神分裂症、双相情感障碍和抑郁症的病例开发并验证了一种深度学习语言模型。该算法在一个由1180名患者组成的独立测试队列上进行了外部验证。共评估了140个特征，包括首次病历（FMR）、实验室检查、医嘱和心理量表以进行分析。结局指标为合格护士或临床医生评估的短期和长期冲动性（STI和LTI）、自杀风险（STSS和LTSS）以及身体约束需求（STPR和LTPR）。分析于2024年8月至2024年6月进行。比较了具有不同架构和输入设置的模型。使用受试者操作特征曲线下面积（AUROC）来评估模型的主要性能。临床效用由约登指数阈值下的净效益确定。

结果

本研究纳入的7451例患者中，2982例（47.6%）为男性，年龄中位数（四分位间距）为42（28 - 57）岁。STPR、LTPR、STSS、LTSS、STI和LTI结局的总体发生率分别为635例（8.5%）、728例（10.5%）、659例（8.8%）、803例（10.8%）、588例（7.9%）和728例（9.8%）。基于多任务半结构化Transformer的语言（SSTL）模型在预测这些结局方面显示出比单任务或多模态语言模型以及传统结构化数据模型更有前景的AUROC（STPR：0.915；LTPR：0.844；STSS：0.867；LTSS：0.879；STI：0.899；LTI：0.894）。将FMR与电子健康记录中的其他数据相结合，显著提高了基于人口统计学、诊断、实验室检查、治疗和心理量表的SSTL模型的性能和临床效用。

结论

SSTL模型在预后评估中显示出潜在优势。FMR是常见风险预测的有力预测指标，并且在对数据和数据处理要求最低的情况下可能有益于精神病学中的其他任务。

相似文献

A highly scalable deep learning language model for common risks prediction among psychiatric inpatients.一种用于预测精神科住院患者常见风险的高度可扩展深度学习语言模型。

BMC Med. 2025 May 28;23(1):308. doi: 10.1186/s12916-025-04150-7.

Comprehensive Symptom Prediction in Inpatients With Acute Psychiatric Disorders Using Wearable-Based Deep Learning Models: Development and Validation Study.使用基于可穿戴设备的深度学习模型对急性精神障碍住院患者进行全面症状预测：开发和验证研究。

J Med Internet Res. 2024 Nov 13;26:e65994. doi: 10.2196/65994.

Short-term Suicide Risk After Psychiatric Hospital Discharge.精神科出院后的短期自杀风险

JAMA Psychiatry. 2016 Nov 1;73(11):1119-1126. doi: 10.1001/jamapsychiatry.2016.2035.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Predicting Diagnostic Progression to Schizophrenia or Bipolar Disorder via Machine Learning.通过机器学习预测精神分裂症或双相情感障碍的诊断进展

JAMA Psychiatry. 2025 May 1;82(5):459-469. doi: 10.1001/jamapsychiatry.2024.4702.

Right care, first time: a highly personalised and measurement-based care model to manage youth mental health.精准医疗，首次就诊：高度个性化和基于评估的青少年心理健康管理医疗模式。

Med J Aust. 2019 Nov;211 Suppl 9:S3-S46. doi: 10.5694/mja2.50383.

Understanding and predicting suicidality using a combined genomic and clinical risk assessment approach.采用基因组与临床风险评估相结合的方法理解和预测自杀倾向。

Mol Psychiatry. 2015 Nov;20(11):1266-85. doi: 10.1038/mp.2015.112. Epub 2015 Aug 18.

Development and Validation of a Deep Learning Model for Predicting Treatment Response in Patients With Newly Diagnosed Epilepsy.深度学习模型在预测新发癫痫患者治疗反应中的开发与验证

JAMA Neurol. 2022 Oct 1;79(10):986-996. doi: 10.1001/jamaneurol.2022.2514.

Development and Validation of a Dynamic Real-Time Risk Prediction Model for Intensive Care Units Patients Based on Longitudinal Irregular Data: Multicenter Retrospective Study.基于纵向不规则数据的重症监护病房患者动态实时风险预测模型的开发与验证：多中心回顾性研究

J Med Internet Res. 2025 Apr 23;27:e69293. doi: 10.2196/69293.

Projective Technique Testing Approach to the Understanding of Psychological Pain in Suicidal and Non-Suicidal Psychiatric Inpatients.运用投射技术测试方法理解有自杀和无自杀行为的精神科住院患者的心理痛苦。

Int J Environ Res Public Health. 2019 Dec 31;17(1):284. doi: 10.3390/ijerph17010284.

本文引用的文献

Detection of suicidality from medical text using privacy-preserving large language models.使用隐私保护大语言模型从医学文本中检测自杀倾向。

Br J Psychiatry. 2024 Dec;225(6):532-537. doi: 10.1192/bjp.2024.134.

Development and validation of a machine learning model for prediction of type 2 diabetes in patients with mental illness.用于预测精神疾病患者2型糖尿病的机器学习模型的开发与验证

Acta Psychiatr Scand. 2025 Mar;151(3):245-258. doi: 10.1111/acps.13687. Epub 2024 Apr 4.

Prevalence and variability of restrictive care practice use (physical restraint, seclusion and chemical restraint) in adult mental health inpatient settings: A systematic review and meta-analysis.成人精神科住院环境中限制护理实践（身体约束、隔离和药物约束）的使用情况：系统评价和荟萃分析。

J Clin Nurs. 2024 Apr;33(4):1256-1281. doi: 10.1111/jocn.17041. Epub 2024 Feb 2.

Metabolomics on depression: A comparison of clinical and animal research.代谢组学与抑郁症：临床与动物研究的比较。

J Affect Disord. 2024 Mar 15;349:559-568. doi: 10.1016/j.jad.2024.01.053. Epub 2024 Jan 9.

Restraint and Seclusion Practices and Policies in U.S. Forensic Psychiatric Hospitals.美国法医精神病院的约束和隔离实践与政策。

J Am Acad Psychiatry Law. 2023 Dec 8;51(4):566-574. doi: 10.29158/JAAPL.230099-23.

Trajectories through semantic spaces in schizophrenia and the relationship to ripple bursts.精神分裂症语义空间轨迹与涟漪爆发的关系。

Proc Natl Acad Sci U S A. 2023 Oct 17;120(42):e2305290120. doi: 10.1073/pnas.2305290120. Epub 2023 Oct 10.

Natural Language Processing in Psychiatry: A Field at an Inflection Point.精神病学中的自然语言处理：一个处于转折点的领域。

Biol Psychiatry Cogn Neurosci Neuroimaging. 2023 Oct;8(10):979-981. doi: 10.1016/j.bpsc.2023.08.001.

A scoping review of preprocessing methods for unstructured text data to assess data quality.对非结构化文本数据进行预处理以评估数据质量的范围回顾。

Int J Popul Data Sci. 2022 Oct 4;7(1):1757. doi: 10.23889/ijpds.v6i1.1757. eCollection 2022.

Lexical stability of psychiatric clinical notes from electronic health records over a decade.十年间电子健康记录中精神科临床笔记的词汇稳定性

Acta Neuropsychiatr. 2023 Aug 25;37:e16. doi: 10.1017/neu.2023.46.

Suicide risk detection using artificial intelligence: the promise of creating a benchmark dataset for research on the detection of suicide risk.利用人工智能进行自杀风险检测：创建自杀风险检测研究基准数据集的前景。

Front Psychiatry. 2023 Jul 24;14:1186569. doi: 10.3389/fpsyt.2023.1186569. eCollection 2023.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种用于预测精神科住院患者常见风险的高度可扩展深度学习语言模型。

A highly scalable deep learning language model for common risks prediction among psychiatric inpatients.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献