Genomics Program, College of Public Health, University of South Florida, Tampa, FL, United States.
Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill and Carolina Population Center, University of North Carolina at Chapel Hill, United States.
J Affect Disord. 2021 Mar 1;282:894-905. doi: 10.1016/j.jad.2020.12.076. Epub 2020 Dec 24.
A range of factors have been identified that contribute to greater incidence, severity, and prolonged course of post-traumatic stress disorder (PTSD), including: comorbid and/or prior psychopathology; social adversity such as low socioeconomic position, perceived discrimination, and isolation; and biological factors such as genomic variation at glucocorticoid receptor regulatory network (GRRN) genes. This complex etiology and clinical course make identification of people at higher risk of PTSD challenging. Here we leverage machine learning (ML) approaches to identify a core set of factors that may together predispose persons to PTSD.
We used multiple ML approaches to assess the relationship among DNA methylation (DNAm) at GRRN genes, prior psychopathology, social adversity, and prospective risk for PTS severity (PTSS).
ML models predicted prospective risk of PTSS with high accuracy. The Gradient Boost approach was the top-performing model with mean absolute error of 0.135, mean square error of 0.047, root mean square error of 0.217, and R of 95.29%. Prior PTSS ranked highest in predicting the prospective risk of PTSS, accounting for >88% of the prediction. The top ranked GRRN CpG site was cg05616442, in AKT1, and the top ranked social adversity feature was loneliness.
Multiple factors including prior PTSS, social adversity, and DNAm play a role in predicting prospective risk of PTSS. ML models identified factors accounting for increased PTSS risk with high accuracy, which may help to target risk factors that reduce the likelihood or course of PTSD, potentially pointing to approaches that can lead to early intervention.
One of the limitations of this study is small sample size.
已确定一系列因素会导致创伤后应激障碍(PTSD)的发生率、严重程度和病程延长,包括:合并症和/或既往精神病理学;社会逆境,如社会经济地位低、感知歧视和孤立;以及生物因素,如糖皮质激素受体调节网络(GRRN)基因的基因组变异。这种复杂的病因和临床病程使得识别 PTSD 风险较高的人群具有挑战性。在这里,我们利用机器学习(ML)方法来确定一组可能共同导致个体易患 PTSD 的核心因素。
我们使用多种 ML 方法来评估 GRRN 基因的 DNA 甲基化(DNAm)、既往精神病理学、社会逆境与 PTSD 严重程度(PTSS)前瞻性风险之间的关系。
ML 模型可以准确预测 PTSS 的前瞻性风险。梯度提升方法是表现最好的模型,平均绝对误差为 0.135,均方误差为 0.047,均方根误差为 0.217,R 为 95.29%。既往 PTSS 在预测 PTSS 的前瞻性风险方面排名最高,占前瞻性风险的>88%。排名最高的 GRRN CpG 位点是 AKT1 中的 cg05616442,排名最高的社会逆境特征是孤独。
包括既往 PTSD、社会逆境和 DNAm 在内的多种因素在预测 PTSD 的前瞻性风险中发挥作用。ML 模型准确识别出导致 PTSD 风险增加的因素,这可能有助于针对降低 PTSD 可能性或病程的风险因素,潜在地指向可以实现早期干预的方法。
本研究的局限性之一是样本量小。