• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测中年人群的自杀行为:英国生物库的机器学习分析。

Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank.

机构信息

West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China.

Med-X Center for Informatics, Sichuan University, Chengdu, China.

出版信息

JMIR Public Health Surveill. 2023 Feb 20;9:e43419. doi: 10.2196/43419.

DOI:10.2196/43419
PMID:36805366
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9989910/
Abstract

BACKGROUND

Suicidal behaviors, including suicide deaths and attempts, are major public health concerns. However, previous suicide models required a huge amount of input features, resulting in limited applicability in clinical practice.

OBJECTIVE

We aimed to construct applicable models (ie, with limited features) for short- and long-term suicidal behavior prediction. We further validated these models among individuals with different genetic risks of suicide.

METHODS

Based on the prospective cohort of UK Biobank, we included 223 (0.06%) eligible cases of suicide attempts or deaths, according to hospital inpatient or death register data within 1 year from baseline and randomly selected 4460 (1.18%) controls (1:20) without such records. We similarly identified 833 (0.22%) cases of suicidal behaviors 1 to 6 years from baseline and 16,660 (4.42%) corresponding controls. Based on 143 input features, mainly including sociodemographic, environmental, and psychosocial factors; medical history; and polygenic risk scores (PRS) for suicidality, we applied a bagged balanced light gradient-boosting machine (LightGBM) with stratified 10-fold cross-validation and grid-search to construct the full prediction models for suicide attempts or deaths within 1 year or between 1 and 6 years. The Shapley Additive Explanations (SHAP) approach was used to quantify the importance of input features, and the top 20 features with the highest SHAP values were selected to train the applicable models. The external validity of the established models was assessed among 50,310 individuals who participated in UK Biobank repeated assessments both overall and by the level of PRS for suicidality.

RESULTS

Individuals with suicidal behaviors were on average 56 years old, with equal sex distribution. The application of these full models in the external validation data set demonstrated good model performance, with the area under the receiver operating characteristic (AUROC) curves of 0.919 and 0.892 within 1 year and between 1 and 6 years, respectively. Importantly, the applicable models with the top 20 most important features showed comparable external-validated performance (AUROC curves of 0.901 and 0.885) as the full models, based on which we found that individuals in the top quintile of predicted risk accounted for 91.7% (n=11) and 80.7% (n=25) of all suicidality cases within 1 year and during 1 to 6 years, respectively. We further obtained comparable prediction accuracy when applying these models to subpopulations with different genetic susceptibilities to suicidality. For example, for the 1-year risk prediction, the AUROC curves were 0.907 and 0.885 for the high (>2nd tertile of PRS) and low (<1st) genetic susceptibilities groups, respectively.

CONCLUSIONS

We established applicable machine learning-based models for predicting both the short- and long-term risk of suicidality with high accuracy across populations of varying genetic risk for suicide, highlighting a cost-effective method of identifying individuals with a high risk of suicidality.

摘要

背景

自杀行为,包括自杀死亡和自杀未遂,是重大的公共卫生问题。然而,之前的自杀模型需要大量的输入特征,因此在临床实践中的适用性有限。

目的

我们旨在构建适用于短期和长期自杀行为预测的模型(即具有有限特征)。我们进一步在具有不同自杀遗传风险的个体中验证了这些模型。

方法

基于英国生物库的前瞻性队列,我们根据医院住院或死亡登记数据,在基线后 1 年内纳入了 223 例(0.06%)自杀未遂或死亡的合格病例,并随机选择了 4460 例(1.18%)无此类记录的对照(1:20)。我们同样确定了 833 例(0.22%)自杀行为 1 至 6 年的病例和 16660 例(4.42%)相应的对照。基于 143 个输入特征,主要包括社会人口统计学、环境和心理社会因素;病史;以及自杀性的多基因风险评分(PRS),我们应用袋装平衡轻梯度提升机(LightGBM)进行分层 10 折交叉验证和网格搜索,构建了 1 年内或 1 至 6 年内自杀未遂或死亡的全预测模型。Shapley Additive Explanations(SHAP)方法用于量化输入特征的重要性,选择前 20 个具有最高 SHAP 值的特征来训练适用模型。我们通过英国生物库的所有参与者和自杀性 PRS 水平的重复评估来评估所建立模型的外部有效性。

结果

自杀行为者的平均年龄为 56 岁,性别分布均衡。在外部验证数据集中应用这些全模型显示出良好的模型性能,在 1 年内和 1 至 6 年内的受试者工作特征(ROC)曲线下面积分别为 0.919 和 0.892。重要的是,基于前 20 个最重要特征的适用模型表现出与全模型相当的外部验证性能(ROC 曲线下面积分别为 0.901 和 0.885),基于此,我们发现预测风险最高的五分位数人群中,分别有 91.7%(n=11)和 80.7%(n=25)的人在 1 年内和 1 至 6 年内发生自杀。我们还在具有不同自杀遗传易感性的亚人群中应用这些模型时获得了可比的预测准确性。例如,对于 1 年风险预测,高(PRS>第 2 tertile)和低(PRS<第 1 tertile)遗传易感性组的 ROC 曲线分别为 0.907 和 0.885。

结论

我们建立了基于机器学习的适用模型,用于预测短期和长期自杀风险,在具有不同自杀遗传风险的人群中具有较高的准确性,突出了一种具有成本效益的方法,可以识别自杀风险较高的个体。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a67e/9989910/70b81defba24/publichealth_v9i1e43419_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a67e/9989910/3a00e0c342c4/publichealth_v9i1e43419_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a67e/9989910/fddf3643fc2b/publichealth_v9i1e43419_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a67e/9989910/70b81defba24/publichealth_v9i1e43419_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a67e/9989910/3a00e0c342c4/publichealth_v9i1e43419_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a67e/9989910/fddf3643fc2b/publichealth_v9i1e43419_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a67e/9989910/70b81defba24/publichealth_v9i1e43419_fig3.jpg

相似文献

1
Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank.预测中年人群的自杀行为:英国生物库的机器学习分析。
JMIR Public Health Surveill. 2023 Feb 20;9:e43419. doi: 10.2196/43419.
2
Predicting suicidal behavior in individuals with depression over 50 years of age: Evidence from the UK biobank.预测50岁以上抑郁症患者的自杀行为:来自英国生物银行的证据。
Digit Health. 2024 Oct 13;10:20552076241287450. doi: 10.1177/20552076241287450. eCollection 2024 Jan-Dec.
3
Acoustic Analysis of Speech for Screening for Suicide Risk: Machine Learning Classifiers for Between- and Within-Person Evaluation of Suicidality.言语声学分析用于自杀风险筛查:用于个体间和个体内评估自杀倾向的机器学习分类器。
J Med Internet Res. 2023 Mar 23;25:e45456. doi: 10.2196/45456.
4
Fracture risk prediction in postmenopausal women with traditional and machine learning models in a nationwide, prospective cohort study in Switzerland with validation in the UK Biobank.在瑞士进行的一项全国性前瞻性队列研究中,使用传统和机器学习模型对绝经后妇女进行骨折风险预测,并在英国生物库中进行验证。
J Bone Miner Res. 2024 Aug 21;39(8):1103-1112. doi: 10.1093/jbmr/zjae089.
5
Uncovering Clinical Risk Factors and Predicting Severe COVID-19 Cases Using UK Biobank Data: Machine Learning Approach.利用英国生物库数据揭示临床风险因素并预测严重 COVID-19 病例:机器学习方法。
JMIR Public Health Surveill. 2021 Sep 30;7(9):e29544. doi: 10.2196/29544.
6
Machine Learning-Based Prediction of Suicidality in Adolescents With Allergic Rhinitis: Derivation and Validation in 2 Independent Nationwide Cohorts.基于机器学习的青少年变应性鼻炎自杀倾向预测:在 2 个独立的全国性队列中的推导和验证。
J Med Internet Res. 2024 Feb 14;26:e51473. doi: 10.2196/51473.
7
Association of Genome-Wide Polygenic Scores for Multiple Psychiatric and Common Traits in Preadolescent Youths at Risk of Suicide.有自杀风险的青春期前青少年多种精神疾病及常见性状的全基因组多基因评分关联研究
JAMA Netw Open. 2022 Feb 1;5(2):e2148585. doi: 10.1001/jamanetworkopen.2021.48585.
8
Machine Learning-Based Prediction of Suicidal Thinking in Adolescents by Derivation and Validation in 3 Independent Worldwide Cohorts: Algorithm Development and Validation Study.基于机器学习的青少年自杀思维预测:在 3 个独立的全球队列中的推导和验证:算法开发和验证研究。
J Med Internet Res. 2024 May 17;26:e55913. doi: 10.2196/55913.
9
An explainable predictive model for suicide attempt risk using an ensemble learning and Shapley Additive Explanations (SHAP) approach.一种使用集成学习和夏普利值加法解释(SHAP)方法的自杀未遂风险可解释预测模型。
Asian J Psychiatr. 2023 Jan;79:103316. doi: 10.1016/j.ajp.2022.103316. Epub 2022 Nov 7.
10
Understanding and predicting suicidality using a combined genomic and clinical risk assessment approach.采用基因组与临床风险评估相结合的方法理解和预测自杀倾向。
Mol Psychiatry. 2015 Nov;20(11):1266-85. doi: 10.1038/mp.2015.112. Epub 2015 Aug 18.

引用本文的文献

1
Machine learning algorithms and their predictive accuracy for suicide and self-harm: Systematic review and meta-analysis.机器学习算法及其对自杀和自我伤害的预测准确性:系统评价与荟萃分析。
PLoS Med. 2025 Sep 11;22(9):e1004581. doi: 10.1371/journal.pmed.1004581. eCollection 2025 Sep.
2
Machine learning based identification of suicidal ideation using non-suicidal predictors in a university mental health clinic.在大学心理健康诊所中,基于机器学习利用非自杀性预测因素识别自杀意念。
Sci Rep. 2025 Apr 22;15(1):13843. doi: 10.1038/s41598-025-97387-4.
3
Estimation of Machine Learning-Based Models to Predict Dementia Risk in Patients With Atherosclerotic Cardiovascular Diseases: UK Biobank Study.

本文引用的文献

1
Diet and Risk of Incident Lung Cancer: A Large Prospective Cohort Study in UK Biobank.饮食与肺癌发病风险:英国生物库大型前瞻性队列研究。
Am J Clin Nutr. 2021 Dec 1;114(6):2043-2051. doi: 10.1093/ajcn/nqab298.
2
Permutation-based identification of important biomarkers for complex diseases via machine learning models.基于排列的机器学习模型识别复杂疾病的重要生物标志物。
Nat Commun. 2021 May 21;12(1):3008. doi: 10.1038/s41467-021-22756-2.
3
Identification of Suicide Attempt Risk Factors in a National US Survey Using Machine Learning.
基于机器学习的模型预测动脉粥样硬化性心血管疾病患者痴呆风险的评估:英国生物银行研究
JMIR Aging. 2025 Feb 26;8:e64148. doi: 10.2196/64148.
4
Automatically extracting social determinants of health for suicide: a narrative literature review.自动提取自杀的健康社会决定因素:一项叙述性文献综述。
Npj Ment Health Res. 2024 Nov 6;3(1):51. doi: 10.1038/s44184-024-00087-6.
5
Predicting suicidal behavior in individuals with depression over 50 years of age: Evidence from the UK biobank.预测50岁以上抑郁症患者的自杀行为:来自英国生物银行的证据。
Digit Health. 2024 Oct 13;10:20552076241287450. doi: 10.1177/20552076241287450. eCollection 2024 Jan-Dec.
6
Evidence for the biopsychosocial model of suicide: a review of whole person modeling studies using machine learning.自杀的生物心理社会模型的证据:对使用机器学习的全人建模研究的综述
Front Psychiatry. 2024 Jan 11;14:1294666. doi: 10.3389/fpsyt.2023.1294666. eCollection 2023.
基于机器学习的全美调查中自杀未遂风险因素的识别。
JAMA Psychiatry. 2021 Apr 1;78(4):398-406. doi: 10.1001/jamapsychiatry.2020.4165.
4
LDpred2: better, faster, stronger.LDpred2:更优、更快、更强。
Bioinformatics. 2021 Apr 1;36(22-23):5424-5431. doi: 10.1093/bioinformatics/btaa1029.
5
An Interpretable Prediction Model for Identifying N-Methylguanosine Sites Based on XGBoost and SHAP.一种基于XGBoost和SHAP的用于识别N-甲基鸟苷位点的可解释预测模型。
Mol Ther Nucleic Acids. 2020 Aug 25;22:362-372. doi: 10.1016/j.omtn.2020.08.022. eCollection 2020 Dec 4.
6
Predicting suicide attempt or suicide death following a visit to psychiatric specialty care: A machine learning study using Swedish national registry data.预测精神科专科就诊后自杀未遂或自杀死亡:一项使用瑞典国家登记数据的机器学习研究。
PLoS Med. 2020 Nov 6;17(11):e1003416. doi: 10.1371/journal.pmed.1003416. eCollection 2020 Nov.
7
Suicide rates continue to rise in England and Wales.在英格兰和威尔士,自杀率持续上升。
BMJ. 2020 Sep 3;370:m3431. doi: 10.1136/bmj.m3431.
8
A Novel Multiresolution-Statistical Texture Analysis Architecture: Radiomics-Aided Diagnosis of PDAC Based on Plain CT Images.一种新型多分辨率统计纹理分析架构:基于平扫CT图像的放射组学辅助胰腺癌诊断。
IEEE Trans Med Imaging. 2021 Jan;40(1):12-25. doi: 10.1109/TMI.2020.3021254. Epub 2020 Dec 29.
9
Predicting death by suicide following an emergency department visit for parasuicide with administrative health care system data and machine learning.利用行政医疗保健系统数据和机器学习预测因蓄意自伤前往急诊科就诊后自杀死亡情况。
EClinicalMedicine. 2020 Feb 18;20:100281. doi: 10.1016/j.eclinm.2020.100281. eCollection 2020 Mar.
10
Genome-wide association study of dietary intake in the UK biobank study and its associations with schizophrenia and other traits.基于英国生物库研究的全基因组关联研究饮食摄入及其与精神分裂症和其他特征的关联。
Transl Psychiatry. 2020 Feb 3;10(1):51. doi: 10.1038/s41398-020-0688-y.