Farina Mateo P, Klopack Eric T, Crimmins Eileen M
Department of Human Development and Family Science and the Population Research Center, University of Texas at Austin, Austin, Texas, United States.
Davis School of Gerontology, University of Southern California Los Angeles California, United States.
J Gerontol A Biol Sci Med Sci. 2025 Jul 24;80(8). doi: 10.1093/gerona/glaf153.
Increasingly, biomarkers are used to understand health and health inequalities among older adults. Combined with advancements in machine-learning approaches, researchers are using predictive algorithms of later life health to identify biomarkers of interest and create biological risk scores. However, these algorithms may select biomarkers that are most important for the majority populations, which, in most population-based samples, would reflect the health and aging of White older adults. Understanding how biomarker selection varies across race/ethnicity across different types of health outcomes is paramount to advancing GeroScience research. We used the 2016 Venous Blood Substudy (VBS) of the Health and Retirement Study. We fit race-stratified boosted decision tree models to predict all-cause mortality, multimorbidity, diabetes, and heart conditions from 54 biomarkers in the 2016 VBS that covered 11 biological systems. We, then, graphed biomarkers that had feature values above .01 for each algorithm to show racial/ethnic differences in biomarker selection. We found more variation in biomarker selection across racial/ethnic groups for all-cause mortality. We found little variation in biomarker selection for heart conditions and diabetes. There was some variation for multimorbidity but with substantial overlap across racial/ethnic groups. Although machine-learning approaches for developing biological risk scores and identifying biomarkers linked to later life health will yield additional insight into aging processes in human populations, researchers must consider how these approaches may differ across race/ethnicity for different types of health conditions and its potential implications for GeroScience research.
生物标志物越来越多地被用于了解老年人的健康状况和健康不平等现象。结合机器学习方法的进步,研究人员正在使用晚年健康的预测算法来识别感兴趣的生物标志物并创建生物风险评分。然而,这些算法可能会选择对大多数人群最重要的生物标志物,而在大多数基于人群的样本中,这将反映白人老年人的健康和衰老情况。了解生物标志物的选择在不同种族/族裔以及不同类型健康结果之间如何变化,对于推进老年科学研究至关重要。我们使用了健康与退休研究的2016年静脉血子研究(VBS)。我们拟合了按种族分层的增强决策树模型,以根据2016年VBS中涵盖11个生物系统的54种生物标志物来预测全因死亡率、多种疾病并存、糖尿病和心脏病。然后,我们绘制了每种算法中特征值高于0.01的生物标志物,以显示生物标志物选择中的种族/族裔差异。我们发现,在全因死亡率方面,不同种族/族裔群体在生物标志物选择上的差异更大。我们发现,在心脏病和糖尿病的生物标志物选择上差异较小。在多种疾病并存方面存在一些差异,但不同种族/族裔群体之间有很大重叠。尽管用于开发生物风险评分和识别与晚年健康相关的生物标志物的机器学习方法将为人群衰老过程带来更多见解,但研究人员必须考虑这些方法在不同种族/族裔以及不同类型健康状况下可能存在的差异及其对老年科学研究的潜在影响。