Duarte Martha, Salamanca Mayra, Gonzalez Juan M, Roman Laporte Roberto, Gattamorta Karina, Lopez Martinez Fernando Enrique, Clochesy John, Rincon Acuna Juan Carlos
Keralty Hospital, Miami, FL, USA.
Sanitas Florida, Miami, USA.
Clin Nurs Res. 2024 Jun;33(5):355-369. doi: 10.1177/10547738241252887. Epub 2024 May 27.
Depression is recognized as a significant public health issue in the United States. The National Survey on Drug Use and Health reports that 21.0 million adults aged 18 or older had major depressive disorder in 2020, including 14.8 million experiencing a major depressive episode with severe impairment. The aim is to predict the positivity of Patient Health Questionnaire-2 (PHQ-2) outcomes among patients in primary care settings by analyzing a range of variables, including socioeconomic status, demographic characteristics, and health behaviors, thereby identifying those at increased risk for depression. Employing a machine learning approach, the study utilizes retrospective data from electronic health records across 15 primary care clinics in South Florida to explore the relationship between social determinants of health (SDoH), including area of deprivation index (ADI) and PHQ-2 positivity. The study encompasses 15 primary care clinics located in South Florida, where a diverse patient population receives care. Analysis included 94,572 patient visits; 74,636 records were included in the study. If a zip+4 was not available or an ADI score did not exist, the visit was not included in the final analysis. Screening involved the PHQ-2, assessing depressed mood and anhedonia, with a cutoff >2 indicating positive screening. ADI was used to assess SDoH by matching patients' residential postal codes to ADI national percentiles. Demographics, sexual history, tobacco use, caffeine intake, and community involvement were also evaluated in the study. Over 40 machine learning algorithms were explored for their accuracy in predicting PHQ-2 outcomes, using software tools including Scikit-learn and stats models in Python. Variables were normalized, scored, and then subjected to predictive regression models, with Random Forest showing outstanding performance. Feature engineering and correlation analysis identified ADI, age, education, visit type, coffee intake, and marital status as significant predictors of PHQ-2 positivity. The area under the curve and model accuracies varied across clinics, with specific clinics showing higher predictive accuracy and others ( > .05). The study concludes that the ADI, as a proxy for SDoH, alongside other individual factors, can predict PHQ-2 positivity. Health organizations can use this information to anticipate health needs and resource allocation.
抑郁症在美国被视为一个重大的公共卫生问题。《全国药物使用和健康调查》报告称,2020年有2100万18岁及以上的成年人患有重度抑郁症,其中1480万人经历了伴有严重功能障碍的重度抑郁发作。目的是通过分析一系列变量,包括社会经济地位、人口特征和健康行为,预测初级保健机构中患者的患者健康问卷-2(PHQ-2)结果的阳性情况,从而识别出抑郁症风险增加的人群。该研究采用机器学习方法,利用来自南佛罗里达州15家初级保健诊所电子健康记录的回顾性数据,探讨健康的社会决定因素(SDoH)之间的关系,包括贫困指数(ADI)区域和PHQ-2阳性情况。该研究涵盖了位于南佛罗里达州的15家初级保健诊所,不同的患者群体在那里接受治疗。分析包括94572次患者就诊;74636份记录被纳入研究。如果没有可用的邮政编码加4或不存在ADI分数,则该就诊不包括在最终分析中。筛查采用PHQ-2,评估抑郁情绪和快感缺失,临界值>2表明筛查呈阳性。通过将患者的居住邮政编码与ADI全国百分位数相匹配,使用ADI来评估SDoH。研究中还评估了人口统计学、性史、烟草使用、咖啡因摄入量和社区参与情况。使用包括Python中的Scikit-learn和统计模型在内的软件工具,探索了40多种机器学习算法在预测PHQ-2结果方面的准确性。对变量进行了归一化、评分,然后应用于预测回归模型,随机森林表现出卓越的性能。特征工程和相关性分析确定ADI、年龄、教育程度、就诊类型、咖啡摄入量和婚姻状况是PHQ-2阳性的重要预测因素。曲线下面积和模型准确性在各诊所之间有所不同,特定诊所显示出更高的预测准确性,而其他诊所则不然(>0.05)。该研究得出结论,ADI作为SDoH的替代指标,与其他个体因素一起,可以预测PHQ-2阳性情况。卫生组织可以利用这些信息来预测健康需求和资源分配。