Amazon.com, LLC, Washington, D. C, United States of America.
Amazon UK Services Ltd., London, United Kingdom.
PLoS One. 2024 Oct 9;19(10):e0308922. doi: 10.1371/journal.pone.0308922. eCollection 2024.
This study aims to demonstrate that demographics combined with biometrics can be used to predict obesity related chronic disease risk and produce a health risk score that outperforms body mass index (BMI)-the most commonly used biomarker for obesity. We propose training an ensemble of small neural networks to fuse demographics and biometrics inputs. The categorical outputs of the networks are then turned into a multi-dimensional risk map, which associates diverse inputs with stratified, output health risk. Our ensemble model is optimized and validated on disjoint subsets of nationally representative data (N~100,000) from the National Health and Nutrition Examination Survey (NHANES). To broaden applicability of the proposed method, we consider only non-invasive inputs that can be easily measured through modern devices. Our results show that: (a) neural networks can predict individual conditions (e.g., diabetes, hypertension) or the union of multiple (e.g., nine) health conditions; (b) Softmax model outputs can be used to stratify individual- or any-condition risk; (c) ensembles of neural networks improve generalizability; (d) multiple-input models outperform BMI (e.g., 75.1% area under the receiver operator curve for eight-input, any-condition models compared to 64.2% for BMI); (e) small neural networks are as effective as larger ones for the inference tasks considered; the proposed models are small enough that they can be expressed as human-readable equations, and they can be adapted to clinical settings to identify high-risk, undiagnosed populations.
本研究旨在证明,人口统计学特征与生物特征相结合可用于预测与肥胖相关的慢性疾病风险,并生成健康风险评分,其表现优于目前最常用的肥胖生物标志物——体重指数 (BMI)。我们提出训练一组小型神经网络来融合人口统计学特征和生物特征输入。然后,将网络的分类输出转化为多维风险图,将各种输入与分层的输出健康风险相关联。我们的集成模型在来自国家健康和营养检查调查 (NHANES) 的全国代表性数据 (N~100,000) 的不相交子集上进行了优化和验证。为了拓宽所提出方法的适用性,我们仅考虑可以通过现代设备轻松测量的非侵入性输入。我们的结果表明:(a) 神经网络可以预测个体疾病(例如,糖尿病、高血压)或多种疾病(例如,九种)的综合情况;(b) Softmax 模型输出可用于分层个体或任何疾病的风险;(c) 神经网络集成可提高泛化能力;(d) 多输入模型优于 BMI(例如,对于八输入、任何疾病模型,其受试者工作特征曲线下面积为 75.1%,而 BMI 为 64.2%);(e) 对于所考虑的推理任务,小型神经网络与大型神经网络一样有效;所提出的模型足够小,可以用人类可读的方程表示,并且可以适应临床环境,以识别高风险、未确诊的人群。