Lama Lara, Wilhelmsson Oskar, Norlander Erik, Gustafsson Lars, Lager Anton, Tynelius Per, Wärvik Lars, Östenson Claes-Göran
CGI Inc., Stockholm, Sweden.
Region Stockholm, Center for Epidemiological Research, Stockholm, Sweden.
Heliyon. 2021 Jun 25;7(7):e07419. doi: 10.1016/j.heliyon.2021.e07419. eCollection 2021 Jul.
To study if machine learning methodology can be used to detect persons with increased type 2 diabetes or prediabetes risk among people without known abnormal glucose regulation.
Machine learning and interpretable machine learning models were applied on research data from Stockholm Diabetes Preventive Program, including more than 8000 people initially with normal glucose tolerance or prediabetes to determine high and low risk features for further impairment in glucose tolerance at follow-up 10 and 20 years later.
The features with the highest importance on the outcome were body mass index, waist-hip ratio, age, systolic and diastolic blood pressure, and diabetes heredity. High values of these features as well as diabetes heredity conferred increased risk of type 2 diabetes. . The machine learning model was used to generate individual, comprehensible risk profiles, where the diabetes risk was obtained for each person in the data set. Features with the largest increasing or decreasing effects on the risk were determined.
The primary application of this machine learning model is to predict individual type 2 diabetes risk in people without diagnosed diabetes, and to which features the risk relates. However, since most features affecting diabetes risk also play a role for metabolic control in diabetes, e.g. body mass index, diet composition, tobacco use, and stress, the tool can possibly also be used in diabetes care to develop more individualized, easily accessible health care plans to be utilized when encountering the patients.
研究机器学习方法是否可用于在葡萄糖调节无已知异常的人群中检测2型糖尿病风险增加或糖尿病前期的个体。
将机器学习和可解释机器学习模型应用于斯德哥尔摩糖尿病预防项目的研究数据,该数据包括8000多名最初糖耐量正常或处于糖尿病前期的人群,以确定10年和20年后随访时糖耐量进一步受损的高风险和低风险特征。
对结果影响最重要的特征是体重指数、腰臀比、年龄、收缩压和舒张压以及糖尿病遗传因素。这些特征的高值以及糖尿病遗传因素会增加2型糖尿病的风险。使用机器学习模型生成个体可理解的风险概况,从而获得数据集中每个人的糖尿病风险。确定了对风险影响最大的增加或降低的特征。
该机器学习模型的主要应用是预测未诊断糖尿病个体的2型糖尿病风险以及风险与哪些特征相关。然而,由于大多数影响糖尿病风险的特征在糖尿病的代谢控制中也起作用,例如体重指数、饮食组成、吸烟和压力,该工具也可能用于糖尿病护理,以制定更个性化、易于获取的医疗保健计划,在接诊患者时使用。